Resources

Explore the data and resources used in LatamGPT

New resource

LatamGPT 1.0 Model

LatamGPT-SFT-1.0 is the first version of the Latin American language model, based on Llama 3.1 with 70 billion parameters and trained with Continued Pretraining (CPT) and Supervised Fine-Tuning (SFT) using regional data.

Download the model directly from Hugging Face.

View on Hugging Face

Model available on Hugging Face

LatamGPTCPT + SFT v1.0
70B
Parameters
CPT + SFT
Training
Base modelLlama 3.1

Trueque Benchmark

Trueque is a human-reviewed collaborative evaluation benchmark for measuring LLM performance on questions about Latin American knowledge and culture.

Explore 500 curated questions on history, culture, geography, and gastronomy from 20 Latin American countries.

View on Hugging Face

Dataset available on Hugging Face

Trueque BenchmarkBeta v0.1
500
Questions
20
Countries
LicenseApache 2.0
LanguagesES / PT

CHOCLO

CHOCLO is a benchmark specialized in Latin American cultural knowledge to evaluate how well language models understand and represent the culture of the region.

Over 100,000 rows with questions on geography, fauna, flora, traditions, gastronomy, and public figures from 18 countries, with three difficulty levels.

View on Hugging Face

Dataset available on Hugging Face

CHOCLO
104K+
Rows
18
Countries
LicenseMIT
Categories7

Copuchat - Contribute Data

Copuchat is an experimental application built on GPT 4.1, by OpenAI, that simulates real conversations with users from Latin America and the Caribbean to improve the alignment of future versions of LatamGPT.

Help improve LatamGPT and participate in anonymous conversations that will be useful for training the model.

Open Copuchat

Participate in conversations to contribute to LatamGPT training

CopuChat
Interactive chat interface