Unlocking Performance: Benchmarking and Optimizing Endpoint Deployment in Amazon SageMaker JumpStart
This article explores the complex relationship between latency and throughput when deploying large language models (LLMs) using Amazon SageMaker JumpStart. Benchmarking of LLMs like Llama 2, Falcon, and Mistral variants reveals the impact of model architecture, serving configurations, instance type hardware, and concurrent requests on performance.
OpenAI and Common Sense Media have joined forces to create AI guidelines and educational materials for parents, educators, and teens, including family-friendly GPTs in OpenAI's GPT store. Common Sense Media aims to help teens and families safely utilize AI, expanding from their reviews of films and TV shows to AI assistants.
MIT neuroscientists have discovered that sentences with unusual grammar or unexpected meaning generate stronger responses in the brain's language processing centers, while straightforward sentences barely engage these regions. The researchers used an artificial language network to predict the brain's response to different sentences.
Developers of open world video games and analytics managers both face the challenge of balancing exploration and exploitation. To solve this tension, they can build alternative paths, offer knowledge management systems, foster online communities, and make continuous improvements. Salespeople, like gamers, have main quests in the form of specific metrics they need to track, so creating simple an...
MIT Policy Hackathon brings together students and professionals from around the world to tackle societal challenges using generative AI tools like ChatGPT. Winning team "Ctrl+Alt+Defeat" focuses on addressing the eviction crisis in the US.
Atacama Biomaterials, a startup combining architecture, machine learning, and chemical engineering, develops eco-friendly materials with multiple applications. Their technology allows for the creation of data and material libraries using AI and ML, producing regionally sourced, compostable plastics and packaging.
This article explores methods for creating fine-tuning datasets to generate Cypher queries from text, utilizing large language models (LLMs) and a predefined graph schema. The author also mentions an ongoing project that aims to develop a comprehensive fine-tuning dataset using a human-in-the-loop approach.
The MIT Abdul Latif Jameel Clinic for Machine Learning in Health discussed whether the "black box" decision-making process of AI models should be fully explained for FDA approval. The event also highlighted the need for education, data availability, and collaboration between regulators and medical professionals in the regulation of AI in health.
MIT PhD students are using game theory to improve the accuracy and dependability of natural language models, aiming to align the model's confidence with its accuracy. By recasting language generation as a two-player game, they have developed a system that encourages truthful and reliable answers while reducing hallucinations.
Researchers at MIT and IBM have developed a new method called "physics-enhanced deep surrogate" (PEDS) that combines a low-fidelity physics simulator with a neural network generator to create data-driven surrogate models for complex physical systems. The PEDS method is affordable, efficient, and reduces the training data needed by at least a factor of 100 while achieving a target error of 5 per...
MIT's Improbable AI Lab has developed a multimodal framework called HiP, which uses three different foundation models to help robots create detailed plans for complex tasks. Unlike other models, HiP does not require access to paired vision, language, and action data, making it more cost-effective and transparent.
MIT researchers have developed an automated interpretability agent (AIA) that uses AI models to explain the behavior of neural networks, offering intuitive descriptions and code reproductions. The AIA actively participates in hypothesis formation, experimental testing, and iterative learning, refining its understanding of other systems in real time.
The aviation industry has a fatality risk of 0.11, making it one of the safest modes of transportation. MIT scientists are looking to aviation as a model for regulating AI in healthcare to ensure marginalized patients are not harmed by biased AI models.
MIT scientists have developed two machine-learning models, the "PRISM" neural network and a logistic regression model, for early detection of pancreatic cancer. These models outperformed current methods, detecting 35% of cases compared to the standard 10% detection rate.
Google Brain introduced Transformer in 2017, a flexible architecture that outperformed existing deep learning approaches, and is now used in models like BERT and GPT. GPT, a decoder model, uses a language modeling task to generate new sequences, and follows a two-stage framework of pre-training and fine-tuning.