AI/ML News

Stay updated with the latest news and articles on artificial intelligence and machine learning

$1B bet: LeCun's world models vs LLM's empire

Yann LeCun is taking a bold step with his new startup AMI, working to create “world models” that understand the physical world, reason about causality, and develop true common sense. This approach directly challenges today’s dominant paradigm, suggesting that scaling LLMs alone may never achieve human-level intelligence.

Inside the AI brain: memory vs. reasoning

Recent research has revealed that AI language models store memory and reasoning in entirely separate neural circuits, showing that machines “think” and “remember” in different ways. This discovery leads the way to creating AI systems that can forget sensitive data while preserving their intelligence.

Mamba-3 – the next evolution in language modeling

Mamba-3 - state-space model that redefines how AI thinks, learns, and understands language. By improving context tracking, information processing, and response generation, Mamba-3 sets a new standard for performance and inference speed, beyond traditional transformer models.

UAVs master high-precision tasks mid-air

FlyingToolbox is a drone system capable of docking and exchanging tools mid-air, even in turbulent airflow. This technology enables precise multi-stage operations: from maintenance and high-altitude construction to emergency response missions.

Hidden bias in large language models

MIT scientists explored a critical flaw in AI language models called position bias, where models favor information at the beginning and end of text while ignoring the middle. Their research reveals this bias is rooted not only in the training data, but also in the architecture of the models themselves.

Phi-4 – small models, big results

Microsoft’s Phi-4 family is a new generation of compact language models built for complex tasks like math, coding, and planning – often outperforming larger systems. Trained with advanced techniques and curated data, they offer strong reasoning while staying efficient for low-latency use.

Midjourney V7: Faster, smarter, more realistic

Midjourney has launched V7, its most powerful AI image model yet, featuring smarter prompts and real-time personalization. With a redesigned architecture, V7 delivers improved object coherence, enhanced texture realism, and introduces Draft Mode for rapid, cost-efficient image iteration.

Super-Turing AI: Learning like the human mind

A new advanced neural system that mimics the brain’s learning processes promises to create faster, more efficient, and energy-saving AI. By leveraging Hebbian learning and spike-timing-dependent plasticity, this innovation could enhance AI performance while significantly reducing environmental and economic costs.

From text to 3D: the magic behind Edify 3D by NVIDIA

Edify 3D by NVIDIA creates high-quality 3D models in under 2 minutes using AI. By combining multi-view diffusion models and Transformers, it offers fast, accurate, and scalable 3D generation from text or images, making it a perfect solution for gaming, animation, and design industries.

Microsoft launched the Phi-4 model with fully open weights

Microsoft has launched the Phi-4 model with open weights under the MIT license, offering researchers and developers unprecedented flexibility. With 14 billion parameters, Phi-4 outperforms its counterparts in solving mathematical problems and multitasking, ensuring efficient work with limited resources.

Controversial science: AI and Nobel Prizes

The 2024 Nobel Prizes in physics and chemistry have set a precedent for acknowledging AI’s contributions to science. While some may question the fit between AI and traditional disciplines, others see this as a necessary step toward recognizing the interdisciplinary nature of modern research.

MIT's MAIA: an automated agent for interpreting AI models

MAIA can interpret neural networks by conducting experiments and refining its analysis, enhancing understanding of AI models. This agent can identify neuron activities, remove irrelevant features, and detect biases, making AI systems safer and more transparent.

From barks to words: AI decodes dog vocalizations

AI learnt to decode dog barks, identifying playful versus aggressive barks, as well as the dog’s age, sex, and breed. Originally trained on human speech, AI models have achieved impressive accuracy, offering significant advancements in animal care and communication research.

Llama 3: the latest advances in LLM

Llama 3, Meta AI's latest advancement, boasts unmatched language understanding, enhancing its capacity for complex tasks. With expanded vocabulary and advanced safety features, the model ensures improved performance and versatility.

Does GPT-4 Pass the Turing Test?

In 1950, British scientist Alan Turing proposed a test to determine whether machines can think. To date, no artificial intelligence has yet successfully passed it. Will ChatGPT be the first?

Tracking every pixel: motion estimation with OmniMotion

The latest motion estimation method can extract long-term motion trajectories for every pixel in a frame, even in the case of fast movements and complex scenes. Learn more about the exciting technology and the future of motion analysis in this article about OmniMotion.

TalkToModel: Interface for Understanding ML Models

TalkToModel is an innovative system for enabling open conversations with ML models. This platform allows users to not only understand, but also communicate with ML models in natural language, as well as receive explanations of their predictions and operating processes.

New AI Model Creates 3D Objects and Characters for Virtual Game Worlds

During the last decade, one of the biggest issues in the gaming industry is the explosive growth of the AAA video games production cost. Studios are always on the look-up for technologies that could help bring down the cost of game development. Recent advances in the neural image generation models bring some hope that the realization of this dream may be not so far away.

Philosophers vs Transformers: Neural net impersonates a famous cognitive scientist

Can computers think? Can AI models be conscious? These and similar questions often pop up in discussions of recent AI progress, achieved by natural language models GPT-3, LAMDA and other transformers. They are nonetheless still controversial and on the brink of a paradox, because there are usually many hidden assumptions and misconceptions about how the brain works and what thinking means. There is no other way, but to explicitly reveal these assumptions and then explore how the human information processing could be replicated by machines.

Old photo restoration using neural networks

Now you won’t surprise anyone with filters that improve the quality of photos. But the restoration of old portraits still leaves much to be desired. Older photos tend to be too blurry, so normal image sharpening methods won't work on them.

No Language Left Behind

Facebook has released the NLLB project (No Language Left Behind). The main feature of this development is the coverage of more than two hundred languages, including rare languages ​​of African and Australian peoples. In addition, Facebook has applied a new approach to the machine learning model, where the translation is carried out directly from one language to another, without intermediate translation into English.