Language Models

  • Grok-2 [Blog]

    • xAI

    • Grok-2 Beta was released on 2024/08/13.

  • Gemma 2: Improving Open Language Models at a Practical Size (arXiv:2408.00118) [arXiv] [Code]

  • The Llama 3 Herd of Models (arXiv:2407.21783) [arXiv] [Blog] [Code]

  • Mixtral 8x7B (arXiv:2401.04088) [arXiv] [Blog] [Code]

  • Llama 2: Open Foundation and Fine-Tuned Chat Models (arXiv 2307.09288) [Paper] [Homepage]

    • Meta AI

    • Llama 2

    • Released with a permissive community license and is available for commercial use.

  • LLaMA: Open and Efficient Foundation Language Models (arXiv 2302.13971) [Paper] [Code]

    • Meta AI

    • 6.7B, 13B, 32.5B, 65.2B

    • Open-access

  • PaLM: Scaling Language Modeling with Pathways (JMLR 2023) [Paper] [PaLM API]

    • 540B; open access to PaLM APIs in March 2023.

  • BLOOM: A 176B-Parameter Open-Access Multilingual Language Model (arXiv 2211.05100) [Paper] [Model] [Blog]

    • 176B

    • open-access

  • OPT: Open Pre-trained Transformer Language Models (arXiv: 2205.01068) [Paper] [Code]

    • Meta AI

    • Range from 125M to 175B parameters.

    • Open-access

Last updated