Meta is turning inward for a new source of AI training data
Meta says it will collect mouse movements, button clicks, and other user inputs from its own employees on certain applications as part of an effort to train artificial intelligence models. The company’s explanation is operational: if it wants to build agents that help people complete everyday tasks on computers, the models need examples of how humans actually use interfaces, navigate menus, and carry out actions across software environments.
On its face, that rationale is easy to understand. A system meant to act on a computer needs behavioral traces that show not just what a task is, but how a person accomplishes it. Yet the move is notable because it highlights a broader shift in the AI industry. Training data is no longer limited to public text, licensed media, or conventional labeled datasets. Increasingly, the raw material for model development includes records of human work itself.
What Meta says it is collecting
According to the source text, Meta provided a statement saying it is launching an internal tool that will capture “these kinds of inputs” on certain applications. The company described the purpose as training models for agents that can help people complete everyday computer-based tasks. Meta also said safeguards are in place to protect sensitive content and that the data is not used for any other purpose.
That wording matters. The statement centers on interaction data rather than broader surveillance, but it still describes a system that translates routine workplace behavior into training material. Clicks, cursor movements, and navigation patterns may seem minor in isolation, yet together they create a rich map of how work gets done on digital systems.
This kind of data can be valuable because it captures the procedural layer of computing. Large language models can already generate text about software tasks. What they often lack is grounded behavioral evidence of the step-by-step patterns humans follow in real interfaces. Internal employee usage offers exactly that.




