cCUDA: Effective Co-Scheduling of Concurrent Kernels on GPUs | Synapse