In 2026, AI agents excel at tasks like customer service, but struggle with complex inquiries. MIT and Harvard researchers improved AI's ability to ask questions through a "Battleship" game, leading to significant gains in performance and efficiency.
Amazon SageMaker AI now supports Fundamental's NEXUS model for accurate tabular data predictions in days. NEXUS offers deterministic results, native tabular understanding, and non-sequential reasoning for structured data analysis.
NVIDIA AI team releases Cosmos 3, a unified model for physical AI. Combines physical reasoning, world generation, and action generation for robotics and autonomous vehicles.
Researchers from MIT and the MIT-IBM Computing Research Lab developed ChartNet, a dataset and series of open-source models that outperform commercial AI models in tasks like chart interpretation. This breakthrough could empower small firms with limited budgets to leverage AI for business trend analysis and scientific figure interpretation.
Amazon Bedrock enables generative AI for 100,000+ organizations worldwide, offering comprehensive capabilities for bold innovation. Introducing Amazon Bedrock Ops Alert, a proactive monitoring solution for sustainable operational management of AI workloads, empowering teams to drive real business impact.
Google DeepMind released Gemma 4 12B, an encoder-free multimodal model for text, images, audio, and video. The model runs on a laptop with 16 GB of RAM, bridging the gap between edge-friendly and larger variants, with open-source weights available for download.
Deep Learning AMI and AWS Deep Learning Containers now support SOCI snapshotter and index for efficient container image management. SOCI's lazy loading reduces network bandwidth usage and improves container startup times, benefiting organizations managing large container images in cloud environments.
Alibaba releases Qwen3. 7-Plus, a multimodal agent model on Bailian platform with image and video understanding. It offers deep reasoning and self-programming capabilities, positioning it for long-running tasks.
Amazon Nova Forge allows users to build customized language models that blend proprietary data with curated datasets, preventing catastrophic forgetting and improving domain performance without degrading general capabilities. The tool helps navigate the challenges of hyperparameter tuning for domain-specific tasks, avoiding expensive failures and ensuring the right balance between stability and...
TinyFish introduces BigSet, an open-source multi-agent system that turns natural-language descriptions into structured web datasets. BigSet automates schema inference, data gathering, deduplication, and scheduled updates for efficient dataset creation.
Amazon Nova 2 Lite offers a cost-effective object detection solution with no training needed. Easily deploy with Amazon Bedrock, AWS Lambda, and API Gateway for various industries.
Linear regression model demo showcases 46% accuracy on training data, outperforming baseline predictions. Galaxy Science Fiction, known for stunning cover art, featured renowned space artist Chesley Bonestell.
JetBrains released Mellum2, a specialized AI model for software engineering with 12B parameters. It uses a Mixture-of-Experts architecture and goes through extensive pre-training and post-training stages for various tasks.
Amazon Bedrock AgentCore payments, in partnership with Coinbase and Stripe (Privy), allows agents to access paid resources on behalf of end users. AgentCore addresses risks like runaway spending and lack of end user consent in autonomous payment systems.
NVIDIA introduces RTX Spark PCs for personal agents at GTC Taipei, with new AI compute and memory capabilities. Partnership with Microsoft brings secure on-device agents to Windows, along with updates for Hermes Agent and OpenClaw.