Probably Raises $9M to Build a Hallucination Filter for AI Models

June 16, 2026

AI startup Probably is developing a system designed to catch large language model hallucinations and factual errors before they reach end users, an effort the company says could improve reliability while lowering the cost of deploying AI applications.

The company recently raised $9 million in seed funding from Andreessen Horowitz and is focused on building AI systems that approach the accuracy levels typically associated with deterministic software. Founder Peter Elias said the goal is to stop hallucinations and basic factual mistakes from reaching users and to target accuracy of roughly 99.99%.

Probably’s first product is a data science tool that generates answers from large datasets. Each response includes source citations and an audit trail, features that have become increasingly common across AI-powered products as companies seek to improve transparency and trust.

To reduce errors, the startup built a validation layer that sits between the language model and the user. Initial outputs generated by the model are reviewed by a deterministic validator, which flags responses that do not match the underlying dataset. The company says the system is trained around that validation process and optimized for both speed and accuracy.
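The company has not published implementation details, so the following is only a minimal sketch of the general idea: a deterministic check sits between a model's draft answer and the user, and recomputes the claimed figure directly from the data. The dataset, the `Claim` shape, and the function names are all hypothetical and are not taken from Probably's product.

```python
from dataclasses import dataclass

# Hypothetical dataset: the article does not describe Probably's data model.
SALES = [
    {"region": "north", "revenue": 120.0},
    {"region": "south", "revenue": 80.0},
    {"region": "north", "revenue": 50.0},
]


@dataclass
class Claim:
    """A numeric claim extracted from a model's draft answer (hypothetical shape)."""
    region: str
    total_revenue: float


def recompute_total(rows, region):
    """Deterministically recompute the figure straight from the dataset."""
    return sum(r["revenue"] for r in rows if r["region"] == region)


def validate(claim, rows, tolerance=1e-9):
    """Return (ok, expected); ok is False when the claim disagrees with the data."""
    expected = recompute_total(rows, claim.region)
    return abs(expected - claim.total_revenue) <= tolerance, expected


def guarded_answer(claim, rows):
    """Pass a matching claim through; flag a mismatch instead of showing it."""
    ok, expected = validate(claim, rows)
    if ok:
        return f"Total revenue for {claim.region}: {claim.total_revenue:.2f}"
    return (
        f"FLAGGED: model claimed {claim.total_revenue:.2f}, "
        f"dataset gives {expected:.2f}"
    )


if __name__ == "__main__":
    print(guarded_answer(Claim("north", 170.0), SALES))  # matches the data
    print(guarded_answer(Claim("south", 95.0), SALES))   # contradicts the data
```

In this sketch the validator never asks a second model for an opinion. It recomputes the number from the source rows, so a given claim always gets the same verdict, which is what makes the check deterministic.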

“What we’ve learned in building this is that the better your context-management system, the weaker the model may become,” Elias said.

According to the company, that architecture makes it possible to rely on significantly smaller AI models rather than the latest frontier systems. Elias said the current version operates on a model several generations behind leading offerings, allowing it to run on local hardware while reducing token-related expenses.

Cost management has become an increasingly important consideration as organizations expand AI deployments. Elias argues that the validation framework could be applied beyond data science workflows to sectors where accuracy is critical, including accounting and medical services.

“It seems to me that it’s interesting that large AI research labs have not tried to do this yet,” Elias said. “They’re not incentivized to do so, because they make money when there is more need to fix the model.”

In a separate comment, Elias reiterated that view, saying, “They don’t stand to gain from doing this, because they make money when there’s more need to fix the model.”

The company believes its approach could make AI systems more dependable in high-stakes environments by combining language models with deterministic verification. If successful, the approach could help organizations reduce operating costs while increasing confidence in AI-generated results for tasks where accuracy is essential.

This analysis is based on reporting from Mezha.


This article was generated with AI assistance and reviewed for accuracy and quality.

Last updated: June 16, 2026
