Observatory of Examples of How Open Data and Generative AI Intersect

A growing observatory of examples of how open data from official sources and generative artificial intelligence (AI) are intersecting across domains and geographies.

Share your project for inclusion. We seek to learn from generative AI initiatives that use open government and research data across a Spectrum of Scenarios. More information on each scenario can be found in our report: A Fourth Wave of Open Data? Exploring the Spectrum of Scenarios for Open Data and Generative AI.

CroissantLLM
Developed by researchers at various European and American universities as well as private technology companies, CroissantLLM is a large language model built to support English-French language queries. The model is trained on both web-scraped data and open government data from France. The initiative aims to improve LLMs' ability to analyze non-English data.

Region

emea

Sector

academia, private_sector

Scenario

inference_and_insight_generation

Start Date

2024

Location: France