AI/ML News

Stay updated with the latest news and articles on artificial intelligence and machine learning

AI showdown: GPT‑5.3-Codex vs Claude Opus 4.6

OpenAI and Anthropic are redefining the capabilities of AI, introducing models that tackle complex tasks from coding to multi-step knowledge work. With features like agentic collaboration, long-context reasoning, and autonomous problem-solving, these upgrades showcase AI’s potential as an intelligent digital collaborator across professional workflows.

A look under the hood of Codex

OpenAI has shared new insights into how its AI coding assistant Codex works, revealing how it combines powerful language models with automated tools to write, test, and modify code. The explanation highlights both the growing power of AI in software development and the careful design needed to keep these systems fast, safe, and reliable.

Inside the AI brain: memory vs. reasoning

Recent research has revealed that AI language models store memory and reasoning in entirely separate neural circuits, showing that machines “think” and “remember” in different ways. This discovery paves the way for AI systems that can forget sensitive data while preserving their intelligence.

When machines start building their own minds

Now AI can autonomously generate the “brains” of robots, creating a fully functional drone control system 20 times faster than humans. The experiment with generative AI models like ChatGPT, Gemini, and Claude reveals both the potential and current limits of machines building machines.

OpenAI released its most capable open models

OpenAI's newest open-weight models, gpt-oss-120b and gpt-oss-20b, bring advanced reasoning and 128K-token context windows – all under the Apache 2.0 license. With support for local deployment and optimization for consumer hardware, these models mark a major shift toward transparent and decentralized AI.

Does AI struggle with its confidence?

New research reveals that LLMs like GPT-4o and Gemma 3 often stick to their initial answers even when wrong – yet quickly lose confidence when challenged. This surprising mix of overconfidence and self-doubt mirrors human cognitive biases and raises concerns about AI reliability.

AI’s hallucination problem is getting worse

The most advanced AI models from tech giants like OpenAI and DeepSeek are generating false information at unprecedented rates – and no one knows exactly why. Due to this surge in AI “hallucinations”, the reliability of AI across critical fields is being called into question.

GPT-4.5 – a leap forward in AI capabilities

GPT-4.5, OpenAI's most advanced AI yet, features improved natural language understanding, enhanced emotional intelligence, and more intuitive conversations. It excels in writing, brainstorming, and problem-solving while minimizing AI hallucinations for more reliable results.

Microsoft launched the Phi-4 model with fully open weights

Microsoft has launched the Phi-4 model with open weights under the MIT license, offering researchers and developers unprecedented flexibility. With 14 billion parameters, Phi-4 outperforms its counterparts in solving mathematical problems and multitasking, while running efficiently on limited resources.

Alibaba vs. OpenAI: Can a new model outperform ChatGPT?

Alibaba's new AI model, QwQ-32B-Preview, challenges ChatGPT with its impressive math and logic skills, outperforming competitors on key benchmarks. Released under an open license, it offers advanced reasoning capabilities but still struggles with tasks requiring strong common-sense understanding.

A new era of multimodal AI with GPT-4o

During its Spring Update event, OpenAI presented GPT-4o – an omnimodel that integrates text, audio, and image processing, allowing it to work faster and more efficiently than ever before.

Google’s Gemini AI is going to surpass ChatGPT

Gemini AI, a groundbreaking NLP model, is set to surpass existing benchmarks. With its multimodal prowess, scalability across various domains, and integration potential within Google's ecosystem, Gemini AI represents a significant leap in AI technology.

Does GPT-4 Pass the Turing Test?

In 1950, British scientist Alan Turing proposed a test to determine whether machines can think. To date, no artificial intelligence has successfully passed it. Will ChatGPT be the first?