Pulse nav.journalClub Debates activos Tendencias Explorar Investigadores

Join discussions, follow papers, and never miss your next session.

Download on theApp Store

Inicio Explorar nav.journalClub Tendencias

⌘+K

© Synapse Social LLC, 2026

Política de privacidad

TinyLlama: An Open-Source Small Language Model | Synapse

January 4, 2024Open Access

TinyLlama: An Open-Source Small Language Model

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

We present TinyLlama, a compact 1.1B language model pretrained on around 1 trillion tokens for approximately 3 epochs. Building on the architecture and tokenizer of Llama 2, TinyLlama leverages various advances contributed by the open-source community (e.g., FlashAttention and Lit-GPT), achieving better computational efficiency. Despite its relatively small size, TinyLlama demonstrates remarkable performance in a series of downstream tasks. It significantly outperforms existing open-source language models with comparable sizes. Our model checkpoints and code are publicly available on GitHub at https://github.com/jzhang38/TinyLlama.

Preguntar a la IA

Me gusta

Guardar

Compartir

Ver artículo completo

Preguntar a la IA

Me gusta

Guardar

Compartir

Ver artículo completo

Cite This Study

Zhang et al. (Thu,) studied this question.

synapsesocial.com/papers/6a1c1d724ebd09f3dfa97661 https://doi.org/https://doi.org/10.48550/arxiv.2401.02385