OLMo
Allen AI's fully open LLM pipeline: weights, training code, data mixes, and evaluation. A flagship release for research transparency.
Why it is included
A rare end-to-end reproducible LLM release, valuable for scientific and governance studies.
Best for
Researchers auditing training-data influence or retraining models from scratch.
Strengths
- Open data + code
- Reproducibility
- Academic rigor
Limitations
- Model sizes trail the largest frontier releases
Good alternatives
BLOOM · GPT-NeoX
Related tools
AI & Machine Learning
PyTorch
Deep learning framework with strong research-to-production paths.
Hugging Face Transformers
State-of-the-art pretrained models for PyTorch, TensorFlow, and JAX.
GPT-NeoX
EleutherAI framework and 20B-class models for training large autoregressive LMs with 3D parallelism—Apache-2.0 training stack.
Axolotl
YAML-configured fine-tuning for LLMs: LoRA, QLoRA, FSDP, and many architectures on top of Hugging Face trainers.
Unsloth
Optimized fine-tuning library claiming 2× faster LoRA/QLoRA with less VRAM via custom kernels and Hugging Face compatibility.
BLOOM
BigScience's 176B-parameter multilingual causal LM, a landmark collaborative open training run on the Jean Zay supercomputer (weights released under the BigScience Responsible AI License).
