Researchers from MIT and the MIT-IBM Computing Research Lab developed ChartNet, a dataset and series of open-source models that outperform commercial AI models in tasks like chart interpretation. This breakthrough could empower small firms with limited budgets to leverage AI for business trend analysis and scientific figure interpretation.
Google DeepMind released Gemma 4 12B, an encoder-free multimodal model for text, images, audio, and video. The model runs on a laptop with 16 GB of RAM, bridging the gap between edge-friendly and larger variants, with open-source weights available for download.
Tod Machover, music tech pioneer at MIT Media Lab, awarded George Peabody Medal for groundbreaking work in AI and participatory opera. Known as a musical visionary, Machover expands music's boundaries and potential for all.
NVIDIA AI team releases Cosmos 3, a unified model for physical AI. Combines physical reasoning, world generation, and action generation for robotics and autonomous vehicles.
Linear regression model demo showcases 46% accuracy on training data, outperforming baseline predictions. Galaxy Science Fiction, known for stunning cover art, featured renowned space artist Chesley Bonestell.
TinyFish introduces BigSet, an open-source multi-agent system that turns natural-language descriptions into structured web datasets. BigSet automates schema inference, data gathering, deduplication, and scheduled updates for efficient dataset creation.
Amazon Nova Forge allows users to build customized language models that blend proprietary data with curated datasets, preventing catastrophic forgetting and improving domain performance without degrading general capabilities. The tool helps navigate the challenges of hyperparameter tuning for domain-specific tasks, avoiding expensive failures and ensuring the right balance between stability and...
Amazon Nova 2 Lite offers a cost-effective object detection solution with no training needed. Easily deploy with Amazon Bedrock, AWS Lambda, and API Gateway for various industries.
JetBrains released Mellum2, a specialized AI model for software engineering with 12B parameters. It uses a Mixture-of-Experts architecture and goes through extensive pre-training and post-training stages for various tasks.
Alibaba releases Qwen3. 7-Plus, a multimodal agent model on Bailian platform with image and video understanding. It offers deep reasoning and self-programming capabilities, positioning it for long-running tasks.
OpenAI's GPT-5.5, GPT-5.4, and Codex now available on Amazon Bedrock for advanced AI applications. High-performance inference engine for complex tasks.
NVIDIA introduces RTX Spark PCs for personal agents at GTC Taipei, with new AI compute and memory capabilities. Partnership with Microsoft brings secure on-device agents to Windows, along with updates for Hermes Agent and OpenClaw.
Kernel ridge regression (KRR) and support vector regression (SVR) are machine learning techniques that can be combined to create a sparse KRR model approximating an SVR model. This hybrid approach offers the benefits of KRR's large dataset handling and SVR's efficiency in model storage, demonstrating high predictive accuracy in a demo using the scikit KernelRidge module.
A new paper introduces 'Parallax,' a parameterized Local Linear Attention mechanism that enhances efficiency without cutting compute. Parallax simplifies and improves the LLA framework, making it more efficient and easier to implement, with the potential to scale to LLM pretraining and codesign with Muon.
Amazon Bedrock AgentCore payments, in partnership with Coinbase and Stripe (Privy), allows agents to access paid resources on behalf of end users. AgentCore addresses risks like runaway spending and lack of end user consent in autonomous payment systems.