Data Disquiet: Concerns about the Governance of Data for Generative AI

CIGI Paper No. 290

Susan Ariel Aaronson

March 18, 2024

The growing popularity of large language models (LLMs) has raised concerns about their accuracy. These chatbots can be used to provide information, but it may be tainted by errors or made-up or false information (hallucinations) caused by problematic data sets or incorrect assumptions made by the model. The questionable results produced by chatbots has led to growing disquiet among users, developers and policy makers. The author argues that policy makers need to develop a systemic approach to address these concerns. The current piecemeal approach does not reflect the complexity of LLMs or the magnitude of the data upon which they are based, therefore, the author recommends incentivizing greater transparency and accountability around data-set development.

About the Author

Susan Ariel Aaronson

Susan Ariel Aaronson is a CIGI senior fellow, research professor of international affairs at George Washington University (GWU) and co-principal investigator with the NSF-NIST Institute for Trustworthy AI in Law & Society, where she leads research on data and AI governance.

Data Disquiet: Concerns about the Governance of Data for Generative AI

CIGI Paper No. 290

About the Author

Recommended

DeepSeek and China’s AI Innovation in US-China Tech Competition

Digital Governance in China: Trends in Generative AI and Digital Assets

Ghana’s Pathway to AI Governance and Its Implications for Africa

AI, Innovation and the Public Good: A New Policy Playbook

Advancing Multi-stakeholderism for Global Governance of the Internet and AI

Measuring and Visualizing AI (grounding decisions in data with Nestor Maslej)

Canada Is a Signatory to the First Global Treaty on AI: Why That Matters

Artificial Intelligence and National Defence: A Strategic Foresight Analysis

Generative AI, Democracy and Human Rights

Militarizing AI: How to Catch the Digital Dragon?

How to Predict the Future with Accuracy (throwing darts with Robert de Neufville)

Responsible AI and Civilian Protection in Armed Conflict