Observatory of Examples of How Open Data and Generative AI Intersect

A growing observatory of examples of how open data from official sources and generative artificial intelligence (AI) are intersecting across domains and geographies.

Share your project for inclusion. We seek to learn from generative AI initiatives that use open government and research data across a Spectrum of Scenarios. More information on each scenario can be found in our report: A Fourth Wave of Open Data? Exploring the Spectrum of Scenarios for Open Data and Generative AI.

Generating a Fully Synthetic Human Services Dataset
This report, produced by researchers at the Urban Institute in collaboration with Allegheny County partners, describes the process of creating a synthetic version of the countys 2021 human services dataset. The synthetic data aims to replicate statistical properties of the confidential data while protecting individual privacy, enabling wider access to detailed human services information. The document covers the data synthesis methodology, evaluation of data quality and privacy risks, and the challenges of balancing utility and confidentiality in synthetic administrative data.

Region

north_america

Sector

public_sector

Scenario

data_augmentation

Start Date

2023

Location: United States