Artificial intelligence
-
StepFun Releases StepAudio 2.5 Real-Time: End-to-End Voice Modeling with Roleplay-Specific RLHF and Linguistic Understanding
StepFun, an AI lab based in Shanghai, has released StepAudio 2.5 Realtime. It is a real-time speech modeling language with…
Read More » -
Microsoft Research Releases Webwright: A Terminal-Native Web Agent Framework That Scores 60.1% in Odysseys, Up from Base GPT-5.4’s 33.5%
Most web agents today call the browser one action at a time. The model detects the state of the current…
Read More » -
Tencent Open-Sources Memory for TencentDB agent: A 4-tier local memory pipeline for AI Agents
Tencent released TencentDB Agent Memory, an open source memory system for AI agents. The project runs under the MIT license.…
Read More » -
Perplexity Open-Sources Bumblebee: A Read-Only Provisioning Scanner for Developer Endpoints
Attackers are increasingly targeting packages, editor extensions, and configurations of AI tools on developer machines and not just production systems.…
Read More » -
Microsoft Releases Fara1.5: A Family of Browser Computing Agents (4B/9B/27B) Beyond OpenAI Operator and Gemini 2.5 Computing Online-Mind2Web
Microsoft Research’s AI Frontiers lab has released Fara1.5. It is a family of computer user interface (CUA) models for the…
Read More » -
Cohere Releases Command A+: 218B Sparse MoE Model for Agentic Workflow Running on Two H100 GPUs
Cohere recently released Command A+, as an open source model that streamlines enterprise agent workflows. Available under the Apache 2.0…
Read More » -
VLA Models: Training Data Requirements Defined
The transition from chatbots to robots that follow natural language commands goes through one class of models. VLA models —…
Read More » -
Technology often creates jobs for young, skilled workers. Will AI do the same? | MIT News
At any given time, technology does two things in employment: It replaces traditional jobs, and it creates new lines of…
Read More » -
Meet Turbovec: A Rust Vector Index with Python Bindings, and Built on Google’s TurboQuant Algorithm
Vector Search supports multiple regression generation augmented (RAG) pipelines. At scale, it’s expensive. Storing 10 million embedded documents in float32…
Read More » -
Google Launches Gemini 3.5 Flash at I/O 2026: A Faster and Cheaper Model for AI Agents and Coding
Google recently released Gemini 3.5 Flash at Google I/O May, 2026. The first version of Gemini 3.5. The series combines…
Read More »