Pulse nav.journalClub Debates activos Tendencias Explorar Investigadores

Join discussions, follow papers, and never miss your next session.

Download on theApp Store

© Synapse Social LLC, 2026

Política de privacidad

Inicio Explorar nav.journalClub Tendencias

⌘+K

Supporting Very Large Models using Automatic Dataflow Graph Partitioning | Synapse

March 22, 2019Open Access

Supporting Very Large Models using Automatic Dataflow Graph Partitioning

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

This paper presents Tofu, a system that partitions very large DNN models across multiple GPU devices to reduce per-GPU memory footprint. Tofu is designed to partition a dataflow graph of fine-grained tensor operators used by platforms like MXNet and TensorFlow. In order to automatically partition each operator, we propose to describe the semantics of an operator in a simple language inspired by Halide. To optimally partition different operators in a dataflow graph, Tofu uses a recursive search algorithm that minimizes the total communication cost. Our experiments on an 8-GPU machine show that Tofu enables the training of very large CNN and RNN models. It also achieves 25% - 400% speedup over alternative approaches to train very large models.

Preguntar a la IA

Me gusta

Guardar

Compartir

Ver artículo completo

Preguntar a la IA

Me gusta

Guardar

Compartir

Ver artículo completo

Cite This Study

Wang et al. (Fri,) studied this question.

synapsesocial.com/papers/6a1967c5ff42a97fac581f1a https://doi.org/https://doi.org/10.1145/3302424.3303953

Also Consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

1Binarized Neural Networks2016 · 924 citations
2SUMMA: scalable universal matrix multiplication algorithm1997 · 485 citations
3Data optimization: Allocation of arrays to reduce communication on SIMD machines1990 · 208 citations
4Halide2013 · 896 citations
5Reduction of cache coherence overhead by compiler data layout and loop transformation2006 · 54 citations