Multi-modal Auto-Encoders as Joint Estimators for Robotics Scene Understanding | Synapse