Observatory of Examples of How Open Data and Generative AI Intersect

A growing observatory of examples of how open data from official sources and generative artificial intelligence (AI) are intersecting across domains and geographies.

Share your project for inclusion. We seek to learn from generative AI initiatives that use open government and research data across a Spectrum of Scenarios. More information on each scenario can be found in our report, "A Fourth Wave of Open Data? Exploring the Spectrum of Scenarios for Open Data and Generative AI."

Statistics Canada
Statistics Canada ran a pilot program to generate synthetic data for training purposes. The team created synthetic datasets from census data that include sensitive information. These datasets were used in two hackathons, on the condition that they not be shared publicly. Organizers highlighted that the synthetic datasets preserved the analytical usefulness of the original data while minimizing the risk of revealing sensitive information, and hackathon participants used them successfully for training purposes.

Region: North America
Sector: Public sector
Scenario: Data augmentation
Start Date: 2023

Location: Canada