What Makes Training Multi-Modal Classification Networks Hard? | Synapse