Increasingly, AI models are being trained on synthetic data, meaning artificially generated data that replicates real-world experiences. But relying too heavily on synthetic data can lead to fundamental flaws. Womble Bond Dickinson Partner Chris Mammen recently discussed these risks with Lexology, saying, “One illustration shows that if a data set with images of dogs is re-trained on AI outputs, or synthetic data, the most common images—say, golden retrievers—will gradually become over-represented in the data, until all of the outputs are golden retrievers. Then the model starts to lose track.”
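The feedback loop Mammen describes can be sketched with a toy simulation. This is an illustrative assumption, not his actual example: we model each round of re-training on synthetic outputs as slightly over-weighting the already-frequent classes (raising each probability to a power above 1 and renormalizing), and watch the most common breed take over the distribution.

```python
# Toy model of "model collapse" from re-training on synthetic data.
# Assumption (hypothetical, for illustration only): each re-training round
# amplifies frequent classes, modeled as p_i -> p_i**alpha / sum, alpha > 1.

def retrain(dist, alpha=1.2):
    """One round of training on the model's own outputs: frequent classes gain share."""
    raised = {breed: p ** alpha for breed, p in dist.items()}
    total = sum(raised.values())
    return {breed: p / total for breed, p in raised.items()}

# Starting distribution of dog breeds in the training data (hypothetical values).
dist = {"golden_retriever": 0.4, "poodle": 0.3, "beagle": 0.3}

for generation in range(30):
    dist = retrain(dist)

# After repeated rounds, the most common class dominates almost entirely.
print({breed: round(p, 4) for breed, p in dist.items()})
```

With these numbers, after 30 simulated generations the golden retriever share climbs toward 1.0 while the other breeds vanish, mirroring the loss of diversity in the quoted example.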
