Observatory of Examples of How Open Data and Generative AI Intersect

A growing observatory of examples of how open data from official sources and generative artificial intelligence (AI) are intersecting across domains and geographies.

Share your project for inclusion. We seek to learn from generative AI initiatives that use open government and research data across a Spectrum of Scenarios. More information on each scenario can be found in our report: A Fourth Wave of Open Data? Exploring the Spectrum of Scenarios for Open Data and Generative AI.

GPT-SW3

GPT-SW3 is an open-source generative AI model developed collaboratively by AI Sweden, RISE, and the Wallenberg AI, Autonomous Systems and Software Program (WASP WARA). Its training data included Wikipedia, Wikimedia, and the Norwegian Colossal Corpus, an open dataset comprising texts from government publications, parliamentary records, newspapers, literature, and public reports. GPT-SW3 is designed for natural language processing tasks, including content generation, translation, and digital assistant functions, in Nordic languages such as Swedish, Norwegian, Danish, and Icelandic.

Region

EMEA

Sector

public_sector, academia, non-profit

Scenario

pre-training

Start Date

2023

Location: Sweden