New research focuses on enhancing computer vision technologies by incorporating physics-based awareness into data-driven techniques. This hybrid AI-powered computer vision empowers machinery to intelligently perceive, interact, and respond to real-time environments.
Through the groundbreaking "curious replay" method, AI agents gain the ability to self-reflect and learn from novel experiences, resulting in a substantial enhancement of their adaptability and performance.
A team at Stanford has developed Sophia, a new approach that optimizes the pretraining of LLMs. Using two key techniques, it could help researchers train LLMs in half the time, reducing costs and making training affordable for small organizations and academic groups.
A new AI system uses non-invasive fMRI scanner data to translate thoughts into continuous text. Given the success rates it achieves in converting the content of human thoughts, the semantic decoder opens up new possibilities for enhancing communication.
The European Space Agency is developing a sample retrieval system using neural networks, aiming to collect and transport samples from Mars. The challenging mission of returning samples gathered by the Perseverance rover is considered crucial for unlocking the mysteries of the Red Planet.
Generative AI is revolutionizing the world of gaming by transforming virtual characters and enhancing their conversational skills. The NVIDIA Avatar Cloud Engine (ACE) for Games empowers developers to infuse intelligence into NPCs, reshaping gaming experiences and pushing the boundaries of what is possible.
Researchers are working on a more effective way to train machines for uncertain, real-world situations. A new algorithm will decide when a “student” machine should follow its teacher, and when it should learn on its own.
A new architecture aims to overcome the existing limitations of neural networks and symbolic AI. The developed model already demonstrates high effectiveness in solving logical problems and provides a promising framework for integrating different AI paradigms.
The meminductor joins the previously discovered memristors and memcapacitors in a line of circuit elements that can store and recall previous current or voltage values.
Solar cells based on hybrid organic-inorganic perovskites are a rapidly developing area of alternative energy. These molecules initiated the development of a new class of photovoltaic devices – perovskite solar cells.
The researchers used a diverse set of simple image generation programs to create a dataset for training a computer vision model. This approach can improve the performance of image classification models trained on synthetic data.
Researchers developed a new approach to motion modeling using relative position change. They evaluated the ability of deep neural network architectures to model motion using motion recognition and prediction tasks.
Researchers designed a new AI algorithm that visualizes data clusters and other macroscopic features so that they are as distinct, easy to observe, and human-understandable as possible.
Scholars have developed DetectGPT, which can distinguish AI-generated text from human-written text 95% of the time for popular open-source LLMs.
Researchers have recently created a new neuromorphic computing system supporting deep belief neural networks (DBNs) - a generative and graphical class of deep learning models.
A team of scientists has developed a machine learning solution to forecast amine emissions from carbon-capture plants using experimental data from a stress test performed at an actual plant in Germany.
Scientists have developed the first bio-realistic artificial neuron that can effectively interact with real biological neurons.
Scientists presented a smart bionic finger that can create 3D maps of the internal structure of materials by touching their exterior surface.
The wireless soft e-skin can both detect and transmit the sense of touch, and form a sensory network, which opens up great possibilities for improving interactive sensory communication.
Meta AI launched LLaMA, a collection of foundation language models that can compete with or even outperform the best existing models such as GPT-3, Chinchilla and PaLM.
MusicLM is a new music generation AI that creates high-quality music from textual descriptions, much as DALL-E generates images from text.
Scientists from the University of Michigan conducted a study of robot behavior strategies to restore trust between a bot and a human. Can such strategies fully restore trust and how effective are they after repeated errors?
A group of researchers have created a Bayesian machine, an AI approach that performs computations based on Bayes' theorem, using memristors. It is significantly more energy-efficient than existing hardware solutions, and could be used for safety-critical applications.
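For readers unfamiliar with the rule such a machine is built around, here is a minimal software sketch of Bayes' theorem for a single binary hypothesis. It is only an illustration of the arithmetic, not the researchers' memristor-based design; the function name `posterior` and the fault/alarm numbers are hypothetical.

```python
def posterior(prior, p_evidence_given_h, p_evidence_given_not_h):
    """Return P(H | E) for a binary hypothesis H using Bayes' theorem.

    P(H | E) = P(E | H) * P(H) / P(E), where
    P(E) = P(E | H) * P(H) + P(E | not H) * (1 - P(H)).
    """
    numerator = p_evidence_given_h * prior
    evidence = numerator + p_evidence_given_not_h * (1.0 - prior)
    return numerator / evidence


# Hypothetical numbers: a fault has a 1% prior probability, an alarm fires
# 90% of the time when the fault is present and 5% of the time when it is not.
p_fault = posterior(prior=0.01, p_evidence_given_h=0.9, p_evidence_given_not_h=0.05)
print(f"P(fault | alarm) = {p_fault:.3f}")
```

With these assumed numbers the posterior is roughly 15%, which shows why a rare event remains fairly unlikely even after a reasonably reliable alarm.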
Using advances in artificial intelligence, engineers at the University of Colorado Boulder are working on a new type of walking cane for people who are blind or visually impaired.
Tel Aviv University researchers have achieved a technological-biological breakthrough: in response to the presence of an odor, the new biological sensor sends data that the robot is able to detect and interpret.
Text-to-speech models usually require significantly longer training samples, while VALL-E creates a much more natural-sounding synthetic voice from just a few seconds of audio.
Researchers from Stanford University developed a new type of stretchable biocompatible material that gets sprayed on the back of the hand and can recognize its movements.
Point·E is a new system for text-conditional synthesis of 3D point clouds that first generates synthetic views and then generates colored point clouds conditioned on these views.
Self-driving cars have long been considered the next-generation mode of transportation. To enable autonomous navigation of such vehicles, numerous different technologies need to be implemented.
New research from the Pacific Northwest National Laboratory uses machine learning, data analysis and artificial intelligence to identify potential nuclear threats.
Researchers have discovered new ways for retailers to use AI in conjunction with in-store cameras to better understand consumer behavior and adapt store layouts to maximize sales.
Decoding speech based on brain activity has been a long-established goal of neuroscientists and clinicians. Nowadays, Meta is working on an AI model that can decode speech from noninvasive recordings of brain activity to help people after traumatic brain injury.
Look to Speak is designed to help those with motor function impairments and speech difficulties to communicate more easily. The app lets people use their eyes to select pre-written phrases and have them spoken out loud.
MIT researchers have developed a machine-learning technique that precisely collects and models the underlying acoustics of a location from just a limited number of sound recordings.
By 2050, humanity will have to almost double the global food supply to make sure that everyone on the planet has enough food. With climate change accelerating, water resources dwindling, and arable land eroding, doing that sustainably will be a huge challenge.
Over the last decade, one of the biggest issues in the gaming industry has been the explosive growth in the production cost of AAA video games. Studios are always on the lookout for technologies that could help bring down the cost of game development. Recent advances in neural image generation models bring some hope that the realization of this dream may not be so far away.
Can computers think? Can AI models be conscious? These and similar questions often pop up in discussions of recent AI progress achieved by natural language models such as GPT-3, LaMDA and other transformers. They are nonetheless still controversial and on the brink of a paradox, because there are usually many hidden assumptions and misconceptions about how the brain works and what thinking means. There is no other way but to explicitly reveal these assumptions and then explore how human information processing could be replicated by machines.
Filters that improve the quality of photos no longer surprise anyone. But the restoration of old portraits still leaves much to be desired. Older photos tend to be too blurry, so normal image sharpening methods won't work on them.
Facebook has released the NLLB project (No Language Left Behind). The main feature of this development is the coverage of more than two hundred languages, including rare languages of African and Australian peoples. In addition, Facebook has applied a new approach to the machine learning model, where the translation is carried out directly from one language to another, without intermediate translation into English.
Animated avatars have long been a part of our lives. But realistic modeling of clothing animation has remained an open challenge.
On the one hand, modern physical modeling techniques can generate realistic clothing geometry at interactive speed. On the other hand, modeling a photorealistic appearance usually requires physical rendering, which is too expensive for interactive applications.
A group of scientists using machine learning "rediscovered" the law of universal gravitation.
To do this, they trained a "graph neural network" to simulate the dynamics of the Sun, planets and large moons of the solar system from 30 years of observations. Then they used symbolic regression to discover the analytical expression for the force law implicitly learned by the neural network.
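The idea of recovering an analytical force law from data can be illustrated with a much simpler stand-in. The sketch below is not the researchers' graph neural network or their symbolic regression step: it assumes the force has the form F = k · m1 · m2 · r^p, generates noise-free synthetic samples from Newton's law, and recovers k and p with an ordinary least-squares fit in log space. All function names and value ranges are hypothetical.

```python
import math
import random

G = 6.674e-11  # gravitational constant in SI units


def newton_force(m1, m2, r):
    """Ground-truth force used only to generate the synthetic data."""
    return G * m1 * m2 / r**2


def make_samples(n=200, seed=0):
    """Generate (m1, m2, r, F) tuples with arbitrary, assumed value ranges."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        m1 = rng.uniform(1e22, 1e30)
        m2 = rng.uniform(1e22, 1e30)
        r = rng.uniform(1e8, 1e12)
        samples.append((m1, m2, r, newton_force(m1, m2, r)))
    return samples


def fit_power_law(samples):
    """Fit F / (m1 * m2) = k * r**p by least squares on logs; return (k, p)."""
    xs = [math.log(r) for (_, _, r, _) in samples]
    ys = [math.log(f / (m1 * m2)) for (m1, m2, _, f) in samples]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    p = sxy / sxx                         # slope of the log-log line = exponent
    k = math.exp(mean_y - p * mean_x)     # intercept gives the constant
    return k, p


k, p = fit_power_law(make_samples())
print(f"fitted exponent: {p:.3f}, fitted constant: {k:.3e}")
```

Because the functional form is fixed in advance, this only estimates two parameters; symbolic regression, as described above, searches over candidate expressions as well, which is a considerably harder problem.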