Towards a Multi-array Architecture for Accelerating Large-scale Matrix Multiplication on FPGAs | Synapse