AI/ML News

Stay updated with the latest news and articles on artificial intelligence and machine learning

Alibaba vs. OpenAI: Can a new model outperform ChatGPT?

Alibaba's new AI model, QwQ-32B-Preview, challenges ChatGPT with its impressive math and logic skills, outperforming competitors on key benchmarks. Released under an open license, it offers advanced reasoning capabilities but still struggles with tasks requiring strong common-sense understanding.

AI can control a computer just like a human

Anthropic has introduced Claude 3.5 Sonnet, a new AI model capable of controlling a computer similarly to a human. The model uses screenshots of the desktop to navigate applications and perform tasks such as clicking, typing, and gathering information.

Stable Diffusion 3.5 opens new doors in digital art

Stable Diffusion 3.5, the latest release from Stability AI, introduces three powerful model variants that deliver enhanced image quality, speed, and accessibility for consumer hardware. The models are free for non-commercial use, and integrate advanced safety features to prevent misuse.

Movie Gen – the future of AI video generation

Meta has unveiled Movie Gen, an AI-powered tool that creates high-definition videos with synchronized sound from simple text prompts. The model provides advanced video creation and editing features, offering users enhanced control over content generation.

Google releases major updates for Gemini models

With price cuts, increased rate limits, and faster output, new Gemini models by Google make advanced AI more accessible for developers worldwide. They boost speed, reduce costs, and enhance performance across a wide range of text, code, and multimodal tasks.

Will Ideogram 2.0 overtake MidJourney?

The latest text-to-image model from Ideogram AI introduces significant advancements that could challenge the dominance of established players like MidJourney and Leonardo AI. New features are already available, including multiple distinct styles, enhanced realism, and advanced prompting tools.

A new era of multimodal AI with GPT-4o

During its Spring Update event, OpenAI presented GPT-4o, an omnimodel that integrates text, audio, and image processing, allowing it to work faster and more efficiently than its predecessors.

Llama 3: the latest advances in LLM

Llama 3, Meta AI's latest advancement, boasts unmatched language understanding, enhancing its capacity for complex tasks. With expanded vocabulary and advanced safety features, the model ensures improved performance and versatility.

Efficient fact-checking in LLMs like ChatGPT with SAFE

Google DeepMind has developed a new method for evaluating long-form factuality in large language models: the Search-Augmented Factuality Evaluator (SAFE). The AI fact-checking tool has demonstrated impressive accuracy rates, outperforming human fact-checkers.

Stable Diffusion 3 – next-gen AI image generator

Stability AI presented the latest advancement in image generative AI models – Stable Diffusion 3. Its expanded parameter range and diffusion transformer architecture ensure smooth generation of complex, high-quality images and accurate text-to-visual translation.

Google introduces Gemma – a new open-source model

Drawing inspiration from its predecessor Gemini, Gemma is focused on openness and accessibility, offering versatile models suitable for various devices and frameworks. The model marks a significant step towards democratizing AI while emphasizing its responsible development and transparency.

Google’s Gemini AI is going to surpass ChatGPT

Gemini AI, a groundbreaking NLP model, is set to surpass existing benchmarks. With its multimodal prowess, scalability across various domains, and integration potential within Google's ecosystem, Gemini AI represents a significant leap in AI technology.

A memristor-based Bayesian machine

A group of researchers has used memristors to build a Bayesian machine, an AI approach that performs computations based on Bayes' theorem. It is significantly more energy-efficient than existing hardware solutions and could be used for safety-critical applications.

Benefits of Look to Speak

Look to Speak is designed to help those with motor function impairments and speech difficulties to communicate more easily. The app lets people use their eyes to select pre-written phrases and have them spoken out loud.

How sound can model the world

MIT researchers have developed a machine-learning technique that precisely collects and models the underlying acoustics of a location from just a limited number of sound recordings.

New AI Model Creates 3D Objects and Characters for Virtual Game Worlds

Over the last decade, one of the biggest issues in the gaming industry has been the explosive growth of AAA video game production costs. Studios are always on the lookout for technologies that could help bring down the cost of game development. Recent advances in neural image generation models offer some hope that this goal may not be far off.

Philosophers vs Transformers: Neural net impersonates a famous cognitive scientist

Can computers think? Can AI models be conscious? These and similar questions often come up in discussions of the recent AI progress achieved by natural language models such as GPT-3, LaMDA, and other transformers. They remain controversial and border on paradox, because they usually rest on hidden assumptions and misconceptions about how the brain works and what thinking means. The only way forward is to state these assumptions explicitly and then explore how human information processing could be replicated by machines.

Old photo restoration using neural networks

Filters that improve photo quality no longer surprise anyone, but the restoration of old portraits still leaves much to be desired. Older photos tend to be too blurry, so ordinary image sharpening methods do not work on them.

No Language Left Behind

Facebook has released the NLLB (No Language Left Behind) project. Its main feature is coverage of more than two hundred languages, including rare languages of African and Australian peoples. In addition, Facebook has applied a new approach to the machine learning model in which translation is carried out directly from one language to another, without intermediate translation into English.

Photorealistic clothing animation for avatars

Animated avatars have long been a part of our lives, but realistic modeling of clothing animation remains an open challenge.

On the one hand, modern physical modeling techniques can generate realistic clothing geometry at interactive speed. On the other hand, modeling a photorealistic appearance usually requires physical rendering, which is too expensive for interactive applications.

Rediscovering celestial mechanics with machine learning

A group of scientists using machine learning "rediscovered" the law of universal gravitation.

To do this, they trained a "graph neural network" to simulate the dynamics of the Sun, planets and large moons of the solar system from 30 years of observations. Then they used symbolic regression to discover the analytical expression for the force law implicitly learned by the neural network.

On the way to protect the planet: how analytics can support sustainability

The Nature Conservancy rethought its marketing strategy through digital transformation with the help of SAS Customer Intelligence 360. As a result, this international environmental nonprofit had its best year ever for membership revenue, which directly advances its mission of creating a more sustainable future.

Automation Can Replace over 1.4 Million Jobs

“Employers and employees alike need to change their perspective. The future of work is already here and the introduction of technology does not affect work in a uniform way. We must acknowledge where it supplements existing work and invest in a targeted reskilling approach that recognises the new roles technology is creating and ensures human and machine labour complement one another.”

Practices for Creating an AI Serving Engine

AI-powered engines review and analyze information in the knowledge base, handle model deployment, and monitor performance. They introduce a new approach in which apps can take advantage of artificial intelligence to enhance operational effectiveness and help address different business challenges.

Using Artificial Intelligence to Analyze Vehicle Occupants

“Over the last decade, Affectiva has continuously pursued new patents as we have pioneered and advanced the fields of Emotion AI and Human Perception AI. The breadth and depth of our patent portfolio reflect our commitment to pushing the boundaries of computer vision, machine learning, deep learning and AI at the edge; and, is a testament to our leadership in defining the many creative and diverse applications of Human Perception AI that are shaping industries today and in the future.”

How AI Can Protect Your Digital Life

With social media usage rising in recent years and more of our lives being lived online, we need to develop ways to reduce threats, ensure our safety, and remove interactions that cause concern. Artificial intelligence (AI), driven by advances in machine learning, plays an important role in contemporary life and is essential to how today's social media networks function.

Development of Artificial Intelligence

Computer systems have become highly capable. The earliest machines not only helped people solve complex mathematical problems but also stored large amounts of information. Today, computers operate complex equipment and systems to prevent human error.

Designing Soft Robots Which Can Sense

Traditional rigid robots are unsuited to a wide range of tasks. Soft robots, by contrast, can interact with people more safely and access narrow spaces more easily. However, to complete their tasks, robots need to know the exact position of their body parts. That is a complex problem for a soft-bodied robot that can deform in a nearly infinite number of ways.