Google Coral Board Runs Gemma 3 Locally

Google is bringing local AI to a much smaller form factor

Google has introduced a new Coral Board designed to run AI workloads directly on compact devices instead of sending those tasks back to the cloud. The board, revealed at Google I/O, is built around the company’s Coral ecosystem and is aimed at products such as headphones, AR glasses, and smartwatches, where latency, connectivity, and power constraints make on-device processing especially valuable.

The headline claim is straightforward: Google says the board can run its open-source Gemma 3 270M language model entirely locally. That makes the hardware notable less for raw scale than for what it represents. Edge AI has often been limited by fragmented accelerators, tight memory budgets, and the practical difficulty of fitting useful models onto small systems. Google is pitching the Coral Board as a more coherent answer to that problem.

What the hardware includes

At the center of the board is a Synaptics Astra SL2619 chip with a 2 GHz dual-core processor, 2 GB of RAM, and 1 TOPS of compute. Google says the board also includes the Coral NPU, an open-source machine learning unit based on the RISC-V architecture and developed by Google Research.

Those specifications place the device in a class where efficiency and integration matter more than headline performance. Google is not positioning the board as a desktop-class AI machine. It is positioning it as a developer-friendly building block for hardware that needs to interpret audio, vision, or text on the device itself.

Ex-Tesla Optimus scientist unveils European humanoid robot startup

Ex-Tesla Optimus Scientist Launches European Humanoid Robot Startup UMA

Rémi Cadène, former Tesla Optimus scientist, unveils UMA, a Paris-based startup building humanoid robots for European manufacturing, aiming to rival Tesla and Boston Dynamics.

Read article

Why local inference matters

Running Gemma 3 locally means a device can handle some tasks without a persistent cloud connection. That can reduce latency, improve responsiveness, and lower dependence on network reliability. It also changes cost dynamics for some applications, since inference does not have to be routed to a remote service every time a user speaks, gestures, or requests a translation.

Google’s own demonstrations point in that direction. At I/O, the company showed real-time translation, voice-controlled hardware, and a generative music setup in which a YOLOv8 vision model tracked jellyfish movement and translated it into music. All of those examples are meant to show that the board is not just for model demos, but for interactive systems that blend sensors, inference, and output in real time.

An attempt to reduce accelerator fragmentation

One of the more interesting parts of Google’s description is its framing of the board as a fix for fragmentation among AI accelerators. That is a real obstacle for developers trying to build products across categories such as wearables, smart devices, and embedded systems. Models may fit in theory, but deployment often breaks down across incompatible toolchains, hardware quirks, and narrow vendor support.

By pairing an open-source NPU approach with a known model family and public demos, Google appears to be trying to make the edge stack feel more complete. That does not eliminate the hard tradeoffs around thermal limits, battery use, or memory ceilings. But it does suggest a more serious push to turn local AI from a showcase feature into a repeatable development path.

OpenAI and Anthropic Give Away Millions in Compute Credits to Lure Startups

AI giants OpenAI and Anthropic, along with cloud providers, are offering startups free compute credits worth millions to lock them into their ecosystems early, even as IPO pressures mount.

Read article

What this means for developers

The board is expected to ship this summer, though Google has not announced pricing. That leaves open one of the biggest practical questions. Small-device AI hardware has to be cheap enough, accessible enough, and easy enough to program that it can move beyond prototypes.

Still, the product says something important about where edge AI is heading. A year ago, many conversations about AI hardware focused almost entirely on giant clusters and data centers. That part of the market still dominates. But Google’s new board is a reminder that there is another front in the AI buildout: giving small devices enough local intelligence to become more autonomous and more useful without leaning constantly on cloud inference.

What stands out

Gemma 3 270M runs on the board without cloud support.
The hardware targets compact products such as glasses, headphones, and wearables.
Google is framing the board as a way to simplify edge AI development across fragmented accelerator choices.

If the platform is affordable and the software stack is stable, Coral Board could become more than a demo unit. It could mark a practical step toward AI that lives inside everyday devices rather than one that always has to call home.

This article is based on reporting by The Decoder. Read the original article.

Originally published on the-decoder.com