Hidden in the Supply Chain

An investigation by Rest of World has revealed that gig workers across Africa who were hired through data-labeling platform Appen to perform routine annotation tasks — identifying objects in images, transcribing audio, categorizing text — were unknowingly contributing to AI systems used by the United States military. The workers, many of whom were paid a few dollars per hour, had no idea that their labor was feeding into defense and intelligence applications.

The revelation exposes a troubling aspect of the AI supply chain: the vast workforce of human annotators whose labor is essential for training machine learning systems is often kept deliberately in the dark about how their work is ultimately used. The disconnect between the people who label the data and the organizations that deploy the resulting AI systems raises serious ethical questions about informed consent, labor practices, and the hidden human infrastructure of military technology.

How Data Labeling for the Military Works

Modern AI systems, particularly those used for image recognition, natural language processing, and decision support, require enormous quantities of labeled training data. Someone must look at thousands of satellite images and draw boxes around vehicles. Someone must listen to hours of audio and transcribe what they hear. Someone must read text and categorize it by topic, sentiment, or intent.

This work is typically outsourced through a chain of intermediaries. A defense contractor might hire a technology company to develop an AI system. That company might subcontract the data labeling to a platform like Appen, which in turn distributes the work to freelancers around the world, many of them in countries where labor costs are a fraction of what they are in the United States or Europe.

At each step in this chain, the ultimate end use of the data becomes more obscured. The gig workers at the bottom of the pyramid see individual tasks — label this image, transcribe this audio clip — without context about the broader system they are helping to build. Appen's terms of service and non-disclosure agreements often prohibit workers from knowing the identity of the end client, let alone the application their work supports.