TL;DR

Origin Lab, a new startup, raised $8 million to create a marketplace for video game data, enabling AI labs to access high-quality training assets. The platform helps game companies monetize digital assets while supporting AI development of physical world models.

Origin Lab has secured $8 million in seed funding to develop a marketplace that connects video game companies with AI research labs seeking high-quality data for building physical world models. This funding signals investor confidence in the emerging market for game-derived data to support AI advancements.

The funding round was led by Lightspeed Ventures, with participation from SV Angel, Eniac, Seven Stars, FPV, and angel investors including Twitch co-founder Kevin Lin and Cruise founder Kyle Vogt. Origin Lab’s platform aims to serve as an intermediary, licensing digital assets from video game companies and converting them into usable training data for AI labs focused on physical and world modeling.

According to co-CEO Anne-Margot Rodde, the platform addresses a significant gap: while AI labs need detailed data to understand real-world physics and object movement, sourcing high-quality, licensed data has been challenging due to licensing issues and data quality concerns. Origin Lab’s marketplace intends to streamline this process, facilitating data exchange between game developers and AI researchers.

Industry interest in game footage as training data has grown, especially after events such as OpenAI’s December 2024 revelation that its models trained on Twitch streams, which include gameplay footage. This has highlighted both the potential and the legal complexities of using game content for AI training, making a licensed marketplace a timely solution.

Why It Matters

This development is significant because it addresses a key bottleneck in AI research—access to high-quality, licensed training data. As AI systems increasingly aim to understand and interact with the physical world, the need for realistic, detailed data sources grows. By monetizing digital assets from video games, game companies can generate new revenue streams, while AI labs gain access to the data they need to advance models that could impact robotics, simulation, and autonomous systems.

The success of this funding round underscores the growing market for data vendors serving major AI labs, which are often heavily capitalized but face persistent data sourcing challenges. As the industry moves toward more sophisticated world models, platforms like Origin Lab could become essential infrastructure for AI development.

Amazon

video game digital asset licensing platform

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Background

In recent months, the AI community has shown increasing interest in leveraging video game content for training data, citing its detailed, dynamic environments as ideal for modeling physical interactions. However, licensing issues have limited widespread adoption. The December 2024 controversy involving OpenAI’s use of Twitch streams, which included gameplay footage, underscored the legal and ethical complexities involved. Meanwhile, companies like Amazon have expressed interest in using streaming data for AI training, highlighting a market gap that Origin Lab aims to fill.

Origin Lab’s approach builds on prior efforts to use game footage but seeks to formalize and license the data, creating a marketplace that benefits both game developers and AI researchers. The startup’s founders believe this will accelerate the development of AI systems capable of understanding and interacting with the physical environment more effectively.

“The AI systems that are being built now need to understand how the physical world works and how things move. That data essentially lives in video games.”

— Anne-Margot Rodde, co-CEO and co-founder of Origin Lab

“We’ve seen how sharp the revenue scaling can be for data vendors serving major labs. These are very well-capitalized businesses, and the bottleneck for all of them is data.”

— Faraz Fatemi, partner at Lightspeed Ventures

Amazon

AI training data from video game assets

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

What Remains Unclear

It is still unclear how quickly the marketplace will scale, how many video game companies will participate, or how licensing agreements will be structured. The legal and ethical considerations regarding proprietary game assets and user-generated content remain complex and evolving.

アイ・オー・データ I-O Data GV-USB2 Video/VHS 0.3 inch (8 mm) DVD Dubbing PC Ingestion Video Capture Japanese Manufacturer

アイ・オー・データ I-O Data GV-USB2 Video/VHS 0.3 inch (8 mm) DVD Dubbing PC Ingestion Video Capture Japanese Manufacturer

Features: Easy dubbing! Store your important video tape on DVD. USB Connected Video Capture

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

What’s Next

Next steps include launching the platform, onboarding initial game industry partners, and establishing licensing frameworks. Monitoring how AI labs adopt the marketplace and the volume of data transactions will be key indicators of success. Further funding rounds or industry partnerships may follow as the platform matures.

Grandpa Beck's Games Cover Your Assets Card Game | from The Creators of Skull King | Easy to Learn and Outrageously Fun for Kids, Teens, & Adults | 2-6 Players Ages 7+

Grandpa Beck's Games Cover Your Assets Card Game | from The Creators of Skull King | Easy to Learn and Outrageously Fun for Kids, Teens, & Adults | 2-6 Players Ages 7+

Classic Card Game with the original artwork: Experience the fun and timeless excitement of the original 'Cover Your…

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

How will Origin Lab ensure the legality of the data it licenses?

Origin Lab plans to establish licensing agreements with game companies to ensure legal use of digital assets, but specific legal frameworks are still being developed and clarified.

What types of data will be available on the platform?

The platform aims to offer a range of data, from simple rendered scenes to hours of gameplay walkthrough footage, all converted into formats suitable for training AI models.

Will this impact the revenue of game developers?

Yes, game companies can monetize existing digital assets by licensing them through Origin Lab, creating additional revenue streams from their digital content.

How does this development compare to previous efforts to use game footage for AI training?

Unlike earlier ad hoc or unlicensed uses, Origin Lab’s marketplace aims to formalize and license data, providing a legal and scalable infrastructure for AI research.

You May Also Like

TIL that 32 bit time will run out in 2038, while 64 bit time will run out approximately 292 billion years from now

The 32-bit Unix time will overflow on January 19, 2038, causing potential system failures. 64-bit systems are unaffected for billions of years, but some legacy systems remain vulnerable.

Japan banks eye Anthropic’s Mythos in gearing up cybersecurity drive

Japan’s three major banks are enhancing cybersecurity defenses with Anthropic’s Mythos AI, following concerns over AI-driven vulnerabilities in financial systems.

The first webcam was created because a bunch of computer engineers were too lazy to walk over to the coffee machine.

The first webcam was invented because computer engineers wanted to avoid walking to the coffee machine, according to sources. This innovation changed remote monitoring.

Why Laser Cutters Are Attracting Makers

Unlock the potential of laser cutters and discover how they can transform your creative projects in ways you never imagined.