OpenAI and Anthropic are redefining the capabilities of AI, introducing models that tackle complex tasks from coding to multi-step knowledge work. With features like agentic collaboration, long-context reasoning, and autonomous problem-solving, these upgrades showcase AI’s potential as an intelligent digital collaborator across professional workflows.
OpenAI has shared new insights into how its AI coding assistant Codex works, revealing how it combines powerful language models with automated tools to write, test, and modify code. The explanation highlights both the growing power of AI in software development and the careful design needed to keep these systems fast, safe, and reliable.
Recent research has revealed that AI language models store memory and reasoning in entirely separate neural circuits, showing that machines “think” and “remember” in different ways. This discovery leads the way to creating AI systems that can forget sensitive data while preserving their intelligence.
Now AI can autonomously generate the “brains” of robots, creating a fully functional drone control system 20 times faster than humans. The experiment with generative AI models like ChatGPT, Gemini, and Claude reveals both the potential and current limits of machines building machines.
OpenAI's newest open-weight models gpt-oss-120b and gpt-oss-20b bring advanced reasoning and 128K-token context windows – all under Apache 2.0 license. With support for local deployment and optimization for consumer hardware, these models mark a major shift toward transparent and decentralized AI.
New research reveals that LLMs like GPT-4o and Gemma 3 often stick to their initial answers even when wrong – yet quickly lose confidence when challenged. This surprising mix of overconfidence and self-doubt mirrors human cognitive biases and raises concerns about AI reliability.
The most advanced AI models from tech giants like OpenAI and DeepSeek are generating false information at unprecedented rates – and no one knows exactly why. Due to this surge in AI “hallucinations”, the reliability of AI across critical fields is being called into question.
GPT-4.5, OpenAI's most advanced AI yet, features improved natural language understanding, enhanced emotional intelligence, and more intuitive conversations. It excels in writing, brainstorming, and problem-solving while minimizing AI hallucinations for more reliable results.
Microsoft has launched the Phi-4 model with open weights under the MIT license, offering researchers and developers unprecedented flexibility. With 14 billion parameters, Phi-4 outperforms its counterparts in solving mathematical problems and multitasking, ensuring efficient work with limited resources.
Alibaba's new AI model, QwQ-32B-Preview, challenges ChatGPT with its impressive math and logic skills, outperforming competitors on key benchmarks. Released under an open license, it offers advanced reasoning capabilities but still struggles with tasks requiring strong common-sense understanding.
During the Spring Update event OpenAI’s presented GPT-4о – the unique omnimodel that integrates text, audio and image processing, allowing it to work faster and more efficiently than ever before.
Deep active learning blends conventional neural network training with strategic data sample selection. This innovative approach results in enhanced model performance, efficiency, and accuracy across a wide array of applications.
A groundbreaking NLP model Gemini AI is set to surpass existing benchmarks. With its multimodal prowess, scalability across various domains, and integration potential within Google's ecosystem, Gemini AI represents a significant leap in AI technology.
In 1950, British scientist Alan Turing proposed a test to determine whether machines can think. To date, no artificial intelligence has yet successfully passed it. Will ChatGPT be the first?
OpenAI had an impressive DevDay introducing new features. Let's dive into the world of innovation and explore new horizons in the landscape of artificial intelligence. Find out about all the new amazing possibilities in our article!
Scholars has developed DetectGPT that can distinguish AI-generated text from human-written text 95% of the time for popular open source LLMs.
Meta AI launched LLaMA, a collection of foundation language models that can compete with or even outperform the best existing models such as GPT-3, Chinchilla and PaLM.
“We find that DALL·E also allows for control over the viewpoint of a scene and the 3D style in which a scene is rendered” OpenAI explains. Produced images can range from illustrations to objects, and also adjusted real-world pictures.