Transformer-Based Video-Structure Multi-Instance Learning for Whole Slide Image Classification