Visual attention dehazing network with multi-level features refinement and fusion Dual self-attention with co-attention networks for visual question answering