Abstract: Perceptual ambiguity and task conflict limit multi-task robotic manipulation via imitation learning. We propose a framework combining a Language-Conditioned Visual Representation (LCVR) ...