Technology

Google Releases Gemma 4 With Multimodal and On-Device Agentic AI

San Francisco: Google DeepMind has launched Gemma 4, its most powerful family of open AI models to date. The models are built on the same research as Gemini 3 and ship in four sizes under an Apache 2.0 licence, a major shift from previous Gemma terms that limited commercial use.

The four sizes run on everything from smartphones to data centre servers. Smaller edge models work on consumer laptops and Raspberry Pi devices. Larger models target workstations and enterprise deployments. The biggest, a 31B Dense model, currently ranks third on the global Arena AI open-model leaderboard.

All Gemma 4 models can process images and video natively. The smaller edge models also understand audio, enabling speech recognition out of the box. Context windows range from 128,000 tokens for edge models to 256,000 for the larger versions. Every model supports more than 140 languages.

The Apache 2.0 licence is the headline business story. Previous Gemma releases carried restrictions that blocked certain enterprise and commercial uses. The new licence removes those barriers entirely, no usage caps, no risk of access termination, and full commercial freedom for businesses building on the models.

The launch is also a competitive statement. Open-weight models from Chinese labs: Qwen 3.5, DeepSeek V3, and Kimi K2.5, have rapidly closed the gap on Western frontier models. Gemma 4 is Google’s answer. It gives enterprises a domestically built alternative backed by verified benchmarks and an unrestricted licence.

Google partnered with Nvidia, Qualcomm, and MediaTek to optimise Gemma 4 across a wide range of hardware. The models are available today on Hugging Face, Kaggle, Ollama, and Google AI Studio. The two smallest models will also power Gemini Nano 4, Google’s next on-device model for Android arriving on consumer devices later this year.

Demis Hassabis, CEO of Google DeepMind, called Gemma 4 the best open models in the world for their sizes. Gemma has now surpassed 400 million downloads and 100,000 community variants since its first release.

Anurag Shukla

Anurag Shukla is a Senior Journalist with over two decades of experience across television, digital, and print media. He has worked with leading national news organisations and has also served as a Research Officer in the Prime Minister’s Office (PMO), contributing to media research and policy-level content. A former journalism academic, Anurag brings strong editorial depth and a keen understanding of how technology, governance, and society intersect at Tea4Tech.

Recent Posts

Anthropic Reveals Claude Mythos but Withholds Release Over Safety Concerns

San Francisco: Anthropic has unveiled Claude Mythos, its most powerful AI model to date, and…

2 days ago

X Brings Back Voice Notes to X Chat

San Francisco: X has reintroduced Voice Notes to X Chat, allowing users to send audio…

2 days ago

OpenAI Launches USD 100 ChatGPT Pro Plan

San Francisco: OpenAI has introduced a new USD 100/month ChatGPT Pro subscription, offering a mid‑tier…

2 days ago

Meta Unveils Muse Spark As Its First Proprietary AI Model

San Francisco: Meta has launched Muse Spark, its first proprietary AI model and the inaugural…

2 days ago

Aria Networks Raises $125M to Build AI-Native Networking for Data Centres

London: Aria Networks has raised USD 125 million to build networking infrastructure designed specifically for…

2 days ago

Spirit AI Raises $145M as Humanoid Robots Hit Factory Production Lines

BEIJING: Spirit AI has raised $145 million in a new funding, bringing total funding across…

3 days ago