
Origin Lab Secures $8M to Bridge the Gap Between Video Games and AI World Models
A new startup called Origin Lab is connecting video game companies with AI labs hungry for physical-world training data — and investors are taking notice.
A New Marketplace for AI Training Data Is Born
As artificial intelligence rapidly evolves beyond text and language, a new generation of AI systems is emerging — ones designed to understand how the physical world operates. These so-called world models aim to power robotics, simulate physical environments, and replicate real-world object behavior. But unlike large language models, which feast on the vast ocean of text available online, world-model developers are struggling to find the right kind of training data.
Enter Origin Lab, a startup with a surprisingly clever solution: tap the video game industry.
What Origin Lab Actually Does
Origin Lab functions as a two-sided marketplace designed to connect AI research labs with video game companies that are sitting on enormous reserves of valuable digital assets. On one end, cutting-edge labs focused on world modeling — such as Yann LeCun's AMI Labs or Fei-Fei Li's World Labs — can purchase high-quality, properly licensed data. On the other end, video game studios can monetize the digital content they've already built, generating a new revenue stream from existing assets.
The company doesn't just pass raw game files back and forth, however. Origin Lab handles the crucial step of transforming video game assets into formats that AI systems can actually use as training data. Depending on the complexity involved, that process might mean running simple rendering jobs or converting hours of in-game walkthrough footage through automated pipelines.
"The AI systems that are being built now need to understand how the physical world works and how things move," said Anne-Margot Rodde, co-CEO and co-founder of Origin Lab. "That data essentially lives in video games."
The Bridge No One Had Built Yet
Rodde and her co-founders — Antoine Gargot and Colin Carrier — recognized that while AI labs had long eyed video game footage as a promising training resource, significant obstacles stood in the way. Licensing complications and inconsistent data quality repeatedly blocked progress. The infrastructure simply didn't exist to make these transactions seamless and trustworthy.
"It became clear that the video game industry was sitting on some incredibly valuable data, but there was no real way or infrastructure to connect AI labs and the video game industry," Rodde explained. "So essentially, we built that bridge."
$8 Million Seed Round Signals Strong Investor Confidence
Origin Lab has just announced a successful $8 million seed funding round led by Lightspeed Ventures. Additional participation came from SV Angel, Eniac, Seven Stars, and FPV, alongside angel investments from notable technology figures including Twitch co-founder Kevin Lin and Cruise founder Kyle Vogt.
The involvement of high-profile backers reflects growing confidence in the commercial opportunity that exists at the intersection of AI development and data supply.
Faraz Fatemi, the Lightspeed partner who spearheaded the investment, pointed to the explosive revenue growth seen by established data companies as a key motivator.
"We've seen how sharp the revenue scaling can be for data vendors that are serving the major labs," Fatemi told TechCrunch. "These are very well-capitalized businesses, and the bottleneck for all of them is data."
The Problem With Existing Video Game Data Sources
The appetite for video game data among AI developers is not new. Amazon has publicly acknowledged interest in leveraging Twitch footage for model training. However, the risks of sourcing such data without proper licensing have become increasingly apparent.
In December 2024, OpenAI faced backlash when early versions of its Sora video-generation model appeared to reproduce recognizable footage from popular games and streaming content — raising suspicions that the model had been trained on Twitch streams without proper authorization. Incidents like this have underscored the urgent need for a legitimate, structured pipeline for acquiring game-derived training data.
Origin Lab's model directly addresses this gap by ensuring that all data transactions are properly licensed and traceable.
Why This Matters for the Broader AI Ecosystem
Origin Lab's early fundraising success is more than just a company milestone — it signals a maturing market for AI infrastructure startups that don't build models themselves but instead supply the essential ingredients that make model development possible.
The trajectory of companies like Scale AI has demonstrated just how lucrative the data supply chain can be. As world-model development accelerates and demand for diverse, high-fidelity physical-world data intensifies, Origin Lab is positioning itself as a critical intermediary in an increasingly competitive AI landscape.
For video game companies, the proposition is equally compelling: decades of meticulously crafted digital environments, physics simulations, and character animations may be worth far more than anyone previously imagined.
