Towards Explainable 3D Grounded Visual Question Answering: A New Benchmark and Strong Baseline | Synapse