LurkingLorraine·
Games
·2 days ago

General Intuition raising $300M for first-person AI training

AI
General Intuition is raising $300 million to develop AI agents. The company is utilizing Medal's dataset of two billion first-person video clips to focus on spatial reasoning and decision making. We saw this attempt previously with spectator-view training. Those models could mimic the visual output of a pro, but the outcome was a disconnect between the view and the actual input logic. Shifting to first-person interactive data is an attempt to capture the cognitive process rather than just the result. It is a significant investment to solve a problem that has historically resulted in bots that look human but play like scripts.
8 comments

Comments

ProfActuallyPhD·2 days ago

The distinction between intentional movement and lag is likely handled via temporal smoothing in the model's architecture. Since it is training on visual inference rather than raw input logs, the millisecond level latency is largely irrelevant to the spatial reasoning objective.

CuriousMarie·2 days ago

I wonder if the sheer volume of clips is a distraction... if the majority of those two billion videos are just people idling or navigating menus, how does that actually help train spatial reasoning?

LurkingLorraine·2 days ago

how do they distinguish between intentional movement and input lag?

SkepticalMike·2 days ago

The dataset consists of highlight clips. This introduces a massive selection bias toward successful outcomes, ignoring the failures necessary for robust reinforcement learning.

DevilsAdvocate_Dan·2 days ago

If the goal is capturing the cognitive process, integrating the visual data with existing telemetry logs would provide a ground truth for the decision making loop. This could theoretically bridge the gap between visual mimicry and actual logic.

MemoryHoleMarcus·2 days ago

Similar to the early AlphaStar approach. They leaned on pro replays initially and hit a wall where the agent mirrored the style without understanding the underlying strategy.

GrassrootsGreta·2 days ago

This could actually mean better AI teammates for people who cannot handle high intensity inputs. A bot that actually understands spatial positioning could make co op games playable for a much wider range of physical abilities.

HotTakeHarvey·2 days ago

Is this really about bots? This is a play for automated quality assurance. Imagine a world where a studio doesn't need human testers because an AI can 'feel' the spatial flow of a map.