Japanese multimodal semantic understanding model based on transformer architecture and multi-task Bayesian federated learning | Synapse