Business

Taalas Raises $169M for Model-Specific AI Inference Chips

TORONTO: Toronto-based chip startup Taalas raises $169 million in funding to develop model specific AI processors . The company develops specialized AI inference chips that print AI model components directly into silicon. Investors include Quiet Capital, Fidelity, and semiconductor veteran Pierre Lamond.

Taalas builds model-specific processors by hardwiring AI model portions onto chips. The approach trades generality for speed and cost efficiency. The custom silicon pairs with large amounts of on-chip SRAM memory. This eliminates external memory access during inference operations and reduces latency.

The company claims its first chip generates 17,000 output tokens per second. This delivers 73 times more performance than NVIDIA’s H200 graphics card. The processor uses one-tenth the power while delivering this performance.

Taalas partners with Taiwan Semiconductor Manufacturing Company to produce chips in approximately two months. The foundry-optimized workflow enables customers to move from model weights to deployable cards rapidly. The company’s HC1 chip uses TSMC’s 6-nanometer process.

The funding round brings Taalas’s total capital raised to over $200 million across three rounds. The company assembles 25 engineers with experience from AMD, Apple, Google, NVIDIA, and Tenstorrent. Taalas was founded in 2023 by Bajic, Lejla Bajic, and Drago Ignjatovic.

It is the bespoke design for each model that gives the Taalas chip its advantage

Ljubisa Bajic CEO of Taalas.

The company’s first product runs the open-source Llama 3.1 8B language model. Taalas plans to launch a chip capable of running a 20-billion-parameter Llama model this summer. The startup targets a cutting-edge processor capable of deploying frontier models by year end.

The announcement comes weeks after NVIDIA’s $20 billion deal to license IP from Groq. The transaction reignites investor interest in specialized AI inference technology. Competitors including Cerebras and Groq focus on custom silicon solutions for inference optimization.

Shobhit Kalra

Shobhit Kalra is the Chief Sub Editor at Tea4Tech, with over 12 years of experience across digital media, digital marketing, and health technology. He is responsible for editorial review, content structuring, and quality control of articles covering software, SaaS products, and developments across the technology ecosystem. || At Tea4Tech, Shobhit oversees content accuracy, clarity, and adherence to editorial standards, ensuring published stories meet the newsroom’s guidelines for originality, sourcing, and consistency.

Recent Posts

Google AI Studio Launches ‘Vibe Coding’ Upgrade with Antigravity Agent

San Francisco: Google AI Studio has launched a completely rebuilt vibe coding experience. It is…

2 days ago

Perplexity Unveils Health Tool with Apple Health & Fitbit Support

San Francisco: Perplexity has recently launched Perplexity Health, a new feature that connects directly to users’ health data from…

2 days ago

Google Reinvents UI Design with AI-Powered Stitch Canvas

San Francisco: Google Labs has relaunched Stitch as a fully AI-native design canvas. Anyone can…

2 days ago

Google Brings Safer App Installation Option to Android

California: Google is introducing a new, safer way for Android users to install apps from outside the…

2 days ago

Cloaked Raises $375M to Bring AI-Powered Privacy Protection to Enterprise

New York: Most security tools solve one problem. A password manager here. A VPN there.…

2 days ago

Google Expands Personal Intelligence Access to All U.S. Users

Washington, DC: Google has made its Personal Intelligence feature free for all users in the United States, instead…

3 days ago