Your AI models are failing in production—Here’s how to fix model selection
The Allen Institute of AI updated its reward model evaluation RewardBench to better reflect real-life scenarios...
Nvidia CEO Jensen Huang sings praises of processor in Nintendo Switch 2
Nvidia CEO Jensen Huang, a key supplier for the hybrid console, sang the praises of the...
Phonely’s new AI agents hit 99% accuracy—and customers can’t tell they’re not human
Phonely, Maitai and Groq achieve breakthrough in AI phone support with sub-second response times and 99.2%...
Epic Games reveals The State of Unreal for 2025
Epic Games unveiled the State of Unreal in a keynote speech by CEO Tim Sweeney at...
What game companies can learn from AI analysis of 1.5M gamer conversations | Creativ Company
Creativ Company is emerging today as a new kind of market intelligence company. It uses AI...
Inside Intuit’s GenOS update: Why prompt optimization and intelligent data cognition are critical to enterprise agentic AI success
Intuit is using advanced genetic algorithms to help with prompt optimizations that could have significant impact...
CockroachDB’s distributed vector indexing tackles the looming AI data explosion enterprises aren’t ready for
Scaling distributed SQL queries needs more performance and efficiency in the agentic AI era. It’s a...
Car and chipmakers form group to develop open in-car connectivity
Automotive car makers, suppliers, semiconductor manufacturers and ecosystem partners announced the formation of the OpenGMSL Association.
Enterprise alert: PostgreSQL just became the database you can’t ignore for AI applications
Analysts provide insight on what the latest acquisition of a PostgreSQL database vendor means for enterprise...
How S&P is using deep web scraping, ensemble learning and Snowflake architecture to collect 5X more data on SMEs
Previously, S&P only had data on about 2 million SMEs, but its AI-powered RiskGauge platform expanded...
Google quietly launches AI Edge Gallery, letting Android phones run AI without the cloud
Google quietly launched AI Edge Gallery, an experimental Android app that runs AI models offline without...
OpenAI’s Sora is now available for FREE to all users through Microsoft Bing Video Creator on mobile
OpenAI‘s Sora was one of the most hyped releases of the AI era, launching in December...
Aethir enables better user acquisition via Instant Play streaming for Doctor Who: Worlds Apart
Aethir provides better computing efficiency with its Instant Play streaming solution for Doctor Who: Worlds Apart.
Model Context Protocol: A promising AI integration layer, but not a standard (yet)
Enterprises should experiment with MCP where it adds value, isolate dependencies and prepare for a multi-protocol...
The future of engineering belongs to those who build with AI, not without it
As we look ahead, the relationship between engineers and AI systems will likely evolve from tool...
Micro Center nerd store fills the Fry’s vacuum with its return to Silicon Valley
Micro Center, an electronics retailer, has opened a store in Silicon Valley in California And so...
QwenLong-L1 solves long-context reasoning challenge that stumps current LLMs
Alibaba's QwenLong-L1 helps LLMs deeply understand long documents, unlocking advanced reasoning for practical enterprise applications.
ElevenLabs debuts Conversational AI 2.0 voice assistants that understand when to pause, speak, and take turns talking
With Conversational AI 2.0, ElevenLabs aims to provide tools and infrastructure for truly intelligent, context-aware enterprise...
Which LLM should you use? Token Monster automatically combines multiple models and tools for you
This architecture lets Token Monster tap into a range of models from different providers without having...
FLUX.1 Kontext enables in-context image generation for enterprise AI pipelines
FLUX.1 Kontext from Black Forest Labs aims to let users edit images multiple times through both...
Emotive voice AI startup Hume launches new EVI 3 model with rapid custom voice creation
While EVI 3’s specific API pricing has not been announced yet (marked as TBA), the pattern...
DeepSeek R1-0528 arrives in powerful open source challenge to OpenAI o3 and Google Gemini 2.5 Pro
Additionally, the model’s hallucination rate has been reduced, contributing to more reliable and consistent output.
Encharge AI unveils EN100 AI accelerator chip with analog memory
EnCharge AI, a startup that raised $144 million to date, announced the EnCharge EN100, an AI...
How Snowflake’s open-source text-to-SQL and Arctic inference models solve enterprise AI’s two biggest deployment headaches
New open-source efforts from Snowflake aim to help solve that unsolved challenges of text-to-SQL and inference...
Peer launches Global Simulation as real-time digital Earth with AI agents
Peer launched Global Simulation, a real-time digital Earth where players use avatars to connect by location...
DanaBot takedown shows how agentic AI cut months of SOC analysis to weeks
Agentic AI played a decisive role in dismantling DanaBot, a Russian malware platform responsible for more...
s3: The new RAG framework that trains search agents with minimal data
S3 decouples RAG search from generation, boosting efficiency and generalization for enterprise LLM applications with minimal...
Mistral launches new code embedding model that outperforms OpenAI and Cohere in real-world retrieval tasks
Mistral's Codestral Embed will help make RAG use cases faster and find duplicate code segments using...
Nvidia CEO takes a shot at U.S. policy cutting off AI chip sales to China
Nvidia CEO Jensen Huang tiptoed into politics with a comment taking a shot at the U.S....
Nvidia beats estimates for Q1 results as revenues rise 69% from a year ago
Nvidia, the AI and graphics chip company driving societal changes with AI, reported revenue for the...
Less is more: Meta study shows shorter reasoning improves AI accuracy by 34%
New research from Meta reveals AI models achieve 34.5% better accuracy with shorter reasoning chains, challenging...
Rumi raises $4.7M to change passive media into interactive AI experiences
Rumi, an AI media company, has raised $4.7 million in a pre-seed funding round to transform...
Akool Live Camera can translate video calls in real time, swap faces, and get live virtual avatars to mimic human movements
Akool Live Camera uses AI to capture human movement and mimic that movement with a generated...
Everyone’s looking to get in on vibe coding — and Google is no different with Stitch, its follow-up to Jules
Google is looking to compete in vibe coding with Stitch, which designs user interfaces (UIs) with...
Spott’s AI-native recruiting platform scores $3.2M to end hiring software chaos
Spott secures $3.2 million in funding to build an all-in-one AI-native recruitment platform that automates workflows...
Anthropic debuts Claude conversational voice mode on mobile that searches your Google Docs, Drive, Calendar
With the rollout of voice mode, Anthropic continues to broaden Claude's functionality and accessibility to all...
Security leaders lose visibility as consultants deploy shadow AI copilots to stay employed
Fearing sweeping layoffs driven by AI and automation, elite consultants and high performers are turning to...
Mistral launches API for building AI agents that run Python, generate images, perform RAG and more
For professionals like the Lead AI Engineer or Senior AI Engineer, the Mistral Agents API represents...
Build a Rocket Boy drops trailer detailing MindsEye’s blend of Grand Theft Auto and AI robot combat
IOI Partners and Build a Rocket Boy dropped a new trailer on MindsEye, a new title...
What Salesforce’s $8B acquisition of Informatica means for enterprise data and AI
Industry analysts explain how Salesforce's $8 billion Informatica acquisition will transform enterprise data management and accelerate...
From disruption to reinvention: How knowledge workers can thrive after AI
We are beginning a cognitive migration: Away from what AI now does well, and toward a...
Google’s ‘world-model’ bet: building the AI operating layer before Microsoft captures the UI
Google doubles down on its ‘world-model’ vision, racing to build an AI operating layer to drive...
Beyond single-model AI: How architectural design drives reliable multi-agent orchestration
Successful AI agents require enterprises to orchestrate interactions, manage shared knowledge and plan for failure.
OpenAI updates Operator to o3, making its $200 monthly ChatGPT Pro subscription more enticing
Operator remains a research preview and is accessible only to ChatGPT Pro users. The Responses API...
The battle to AI-enable the web: NLweb and what enterprises need to know
Microsoft's NLWeb protocol transforms websites into AI-powered apps with conversational interfaces.
The 3 biggest bombshells from this week’s AI extravaganza
Enterprises looking to build with AI should find plenty to look forward to with the announcements...
Why enterprise RAG systems fail: Google study introduces ‘sufficient context’ solution
Google's "sufficient context" helps refine RAG systems, reduce LLM hallucinations, and boost AI reliability for business...
After GPT-4o backlash, researchers benchmark models on moral endorsement—find sycophancy persists across the board
A new benchmark can test how much LLMs become sycophants, and found that GPT-4o was the...
Anthropic faces backlash to Claude 4 Opus behavior that contacts authorities, press if it thinks you’re doing something ‘egregiously immoral’
Bowman later edited his tweet and the following one in a thread to read as follows,...
Anthropic overtakes OpenAI: Claude Opus 4 codes seven hours nonstop, sets record SWE-Bench score and reshapes enterprise AI
Anthropic's Claude Opus 4 outperforms OpenAI's GPT-4.1 with unprecedented seven-hour autonomous coding sessions and record-breaking 72.5%...