Llema is a generative AI model fine-tuned for the mathematics domain. It was fine-tuned using the Proof-Pile-2 dataset, which combines scientific papers with other mathematics datasets. The researchers have provided public access to the models, dataset, code to encourage future research around the topic of AI and mathematics.
A growing observatory of examples of how open data from official sources and generative artificial intelligence (AI) are intersecting across domains and geographies.
Share your project for inclusion. We seek to learn from generative AI initiatives that use open government and research data across a Spectrum of Scenarios. More information on each scenario can be found in our report: A Fourth Wave of Open Data? Exploring the Spectrum of Scenarios for Open Data and Generative AI.