From Simple to Complex Scenes: Learning Robust Feature Representations for Accurate Human Parsing | Synapse