Does Human-Like Contextual Object Recognition Emerge from Language Supervision and Language-Guided Inference? | Synapse