Llama 3.1's multilingual LLMs, available on Amazon SageMaker JumpStart, offer optimized generative AI models for developers and businesses. SageMaker JumpStart provides access to pre-trained foundation models, allowing for customization and secure deployment in a dedicated VPC...
Satellite imagery enhances monitoring of Earth's changes, but cloud segmentation is crucial. Algorithms like Random Forest and YOLO are compared for cloud removal in Sentinel-2 images. Access data through Copernicus Open Access Hub, Google Earth Engine, or Python package...
Foundation models, like Large Language Models (LLMs), are being adapted for time series modeling through Large Time Series Foundation Models (LTSM). By leveraging sequential data similarities, LTSM aims to learn from diverse time series data for tasks like outlier detection and classification, building on the success of LLMs in computational linguistic...
TDS celebrates milestone with engaging articles on cutting-edge computer vision and object detection techniques. Highlights include object counting in videos, AI player tracking in ice hockey, and a crash course on autonomous driving...
MusGConv introduces a perception-inspired graph convolution block for processing music score data, improving efficiency and performance in music understanding tasks. Traditional MIR approaches are enhanced by MusGConv, which models musical scores as graphs to capture complex, multi-dimensional music...
PyTorch 2.0 introduced torch.compile for faster code execution. AWS optimized torch.compile for Graviton3 processors, resulting in significant performance improvements for NLP, CV, and recommendation...
Yann LeCun's 1989 breakthrough with Convolutional Neural Networks preserved spatial image data, revolutionizing Computer Vision research. CNNs use filters to extract feature maps, stacking layers to create powerful image...
Transformers, known for revolutionizing NLP, now excel in computer vision tasks. Explore the Vision Transformer and Masked Autoencoder Vision Transformer architectures enabling this...
MIT and Meta researchers develop PlatoNeRF, a computer vision technique using shadows and machine learning to create accurate 3D models of scenes, improving autonomous vehicles and AR/VR efficiency. Combining lidar and AI, PlatoNeRF offers new opportunities for reconstructions and will be presented at the Conference on Computer Vision and Pattern...
MIT researchers found that large language models can understand the visual world and generate complex scenes. By querying LLMs to self-correct code for images, they improved simple drawings and trained a vision system without using visual...
Sprinklr utilizes AI to enhance customer experience, achieving 20% throughput improvement with AWS Graviton3 for cost-effective ML inference. Thousands of servers fine-tune and serve over 750 AI models across 60+ verticals, processing 10 billion predictions...
Scientists are using AI to identify advanced materials for solar cells. MIT engineers develop a computer vision technique to speed up material characterization by 85 times, aiming for fully automated materials...
Choosing the right AI use case is crucial for success. AI can be valuable even with moderate performance, offering unique solutions. Examples include Sensor Fusion and Generative AI in everyday...
Scientists at MIT and the MIT-IBM Watson AI Lab have developed a new approach to teach computers to pinpoint actions in videos using only transcripts. This method, called spatio-temporal grounding, improves accuracy in identifying actions in longer videos and could have applications in online learning and...
Discover the groundbreaking research by XYZ Company on the development of a new AI technology that can revolutionize the healthcare industry. Learn how this innovation is set to improve patient care and diagnosis...
New study reveals groundbreaking AI technology developed by Google, revolutionizing data analysis in healthcare. Findings show significant increase in accuracy and efficiency of diagnosing rare...
Discover the groundbreaking collaboration between Tesla and SpaceX, revolutionizing electric vehicles and space travel. Explore how their innovative technologies are shaping the future of...
Discover how Tesla's new self-driving technology is revolutionizing the automotive industry. With advanced AI algorithms and cutting-edge sensors, Tesla is paving the way for autonomous...
Discover how Company X revolutionized the tech industry with its groundbreaking AI technology. Find out how their product has disrupted traditional business models and set new standards for...
Discover how innovative startups are revolutionizing the tech industry with cutting-edge products. From AI-powered solutions to sustainable technologies, these companies are reshaping the...
Discover the groundbreaking collaboration between Tesla and SpaceX in developing innovative renewable energy solutions. Explore how Elon Musk's vision is revolutionizing the future of transportation and space...
Discover the latest groundbreaking research on AI technology by leading companies like Google and IBM. Learn about the potential impact on various industries and the future of artificial...
3D Gaussian splatting, a new method for novel view synthesis, is challenging NeRFs as the predominant technique for 3D scene representation. This technique utilizes anisotropic Gaussians to render crisp 3D models in real-time, providing a unique approach to scene representation and image...
Recent advancements in AI, including GenAI and LLMs, are revolutionizing industries with enhanced productivity and capabilities. Vision transformer architectures like ViTs are reshaping computer vision, offering superior performance and scalability compared to traditional...
Access Sun RGB-D dataset for 3D understanding from 2D images. Dataset includes indoor scenes with 2D and 3D annotations from various 3D scanners. Explore Python code to access this valuable resource for deeper ML...
MIT researchers developed a dataset to simulate peripheral vision in AI models, improving object detection. Understanding peripheral vision in machines could enhance driver safety and predict human behavior, bridging the gap between AI and human...
Article highlights deploying ML models in the cloud, combining CS and DS fields, and overcoming memory limitations in model deployment. Key technologies include Detectron2, Django, Docker, Celery, Heroku, and AWS...
This article discusses the importance of high-quality data and reducing labeling errors in pose estimation models. It demonstrates how a custom labeling workflow in Amazon SageMaker Ground Truth can streamline the labeling process and minimize errors, ultimately reducing the cost of obtaining accurate pose...
Automate mortgage document fraud detection using ML models and business-defined rules with Amazon Fraud Detector, a fully managed fraud detection service. Upload historical data, train the model, review performance, and deploy the API to make predictions for improved fraud detection and underwriting...
AI technology has the ability to transform food images into recipes, allowing for personalized food recommendations, cultural customization, and automated cooking execution. This innovative method combines computer vision and natural language processing to generate comprehensive recipes from food images, bridging the gap between visual depictions of dishes and symbolic...
Automate detecting document tampering and fraud at scale using AWS AI and machine learning services for mortgage underwriting. Develop a deep learning-based computer vision model to detect and highlight forged images in mortgage underwriting using Amazon...
MIT's Improbable AI Lab has developed a multimodal framework called HiP, which uses three different foundation models to help robots create detailed plans for complex tasks. Unlike other models, HiP does not require access to paired vision, language, and action data, making it more cost-effective and...
This article explores monocular depth estimation (MDE) and its importance in computer vision applications. It provides a walkthrough on loading and visualizing depth map data, running inference with Marigold and DPT, and evaluating depth predictions using the SUN RGB-D...
The article explores the use of lightweight hierarchical vision transformers in autonomous robotics, highlighting the effectiveness of a shared trunk concept for multi-task learning. It also discusses the emergence of large multimodal models and their potential to create a unified architecture for end-to-end autonomous driving...
Computer vision has evolved from small pixelated images to generating high-resolution images from descriptions, with smaller models improving performance in areas like smartphone photography and autonomous vehicles. The ResNet model has dominated computer vision for nearly eight years, but challengers like Vision Transformer (ViT) are emerging, showing state-of-the-art performance in computer...
The PGA TOUR is developing a next-generation ball position tracking system using computer vision and machine learning techniques to locate golf balls on the putting green. The system, designed by the Amazon Generative AI Innovation Center, successfully tracks the ball's position and predicts its resting...
2024 could be the tipping point for Music AI, with breakthroughs in text-to-music generation, music search, and chatbots. However, the field still lags behind Speech AI, and advancements in flexible and natural source separation are needed to revolutionize music interaction through...
Gaussian splatting is a fast and interpretable method for representing 3D scenes without neural networks, gaining popularity in a world obsessed with AI models. It uses 3D points with unique parameters to closely match renders to known dataset images, offering a refreshing alternative to complex and opaque methods like...
Autonomous machines in robotics showcased their capabilities in 2023, with notable mentions including Glüxkind's AI-powered smart stroller, Soft Robotics' mGripAI system for food packing, and Quanta's TM25S robot for product inspection, all utilizing NVIDIA...
ICL, a multinational manufacturing and mining corporation, developed in-house capabilities using machine learning and computer vision to automatically monitor their mining equipment. With support from the AWS Prototyping program, they were able to build a framework on AWS using Amazon SageMaker to extract vision from 30 cameras, with the potential to scale to...