Latest from MIT: New model offers a way to speed up drug discovery

Huge libraries of drug compounds may hold potential treatments for a variety of diseases, such as cancer or heart disease. Ideally, scientists would like to experimentally test each of these compounds against all possible targets, but doing that kind of screen is prohibitively time-consuming. In recent years, researchers have begun using computational methods to screen…

Latest from MIT: MIT researchers make language models scalable self-learners

Socrates once said: “It is not the size of a thing, but the quality that truly matters. For it is in the nature of substance, not its volume, that true value is found.” Does size always matter for large language models (LLMs)? In a technological landscape bedazzled by LLMs taking center stage, a team of…

Latest from Google AI – Evaluating speech synthesis in many languages with SQuId

Posted by Thibault Sellam, Research Scientist, Google. Previously, we presented the 1,000 languages initiative and the Universal Speech Model with the goal of making speech and language technologies available to billions of users around the world. Part of this commitment involves developing high-quality speech synthesis technologies, which build upon projects such as VDTTS and AudioLM,…

Latest from MIT Tech Review – Google DeepMind’s game-playing AI just found another way to make code faster

DeepMind’s run of discoveries in fundamental computer science continues. Last year the company used a version of its game-playing AI AlphaZero to find new ways to speed up the calculation of a crucial piece of math at the heart of many different kinds of code, beating a 50-year-old record. Now it has pulled the same…

Latest from MIT: Q&A: Gabriela Sá Pessoa on Brazilian politics, human rights in the Amazon, and AI

Gabriela Sá Pessoa is a journalist passionate about the intersection of human rights and climate change. She came to MIT from The Washington Post, where she worked from her home country of Brazil as a news researcher reporting on the Amazon, human rights violations, and environmental crimes. Before that, she held roles at two of…

Latest from Google AI – Visual captions: Using large language models to augment video conferences with dynamic visuals

Posted by Ruofei Du, Research Scientist, and Alex Olwal, Senior Staff Research Scientist, Google Augmented Reality. Recent advances in video conferencing have significantly improved remote video communication through features like live captioning and noise cancellation. However, there are various situations where dynamic visual augmentation would be useful to better convey complex and nuanced information. For…

Latest from MIT Tech Review – To avoid AI doom, learn from nuclear safety

This story originally appeared in The Algorithm, our weekly newsletter on AI. Ok, doomer. For the past few weeks, the AI discourse has been dominated by a loud group of experts who think there is a very real possibility we could develop an artificial-intelligence…

Latest from MIT: Scaling audio-visual learning without labels

Researchers from MIT, the MIT-IBM Watson AI Lab, IBM Research, and elsewhere have developed a new technique for analyzing unlabeled audio and visual data that could improve the performance of machine-learning models used in applications like speech recognition and object detection. The work, for the first time, combines two architectures of self-supervised learning, contrastive learning…

Latest from MIT Tech Review – EmTech Next is happening June 13-15

EmTech Next, MIT Technology Review’s signature digital transformation conference, is June 13-15, 2023. This year’s event looks at the game-changing power of generative AI, the technology, and the legal implications of generated content. Leaders from OpenAI, Google, Meta, NVIDIA, and more are expected to discuss the future of AI.

Latest from Google AI – AVFormer: Injecting vision into frozen speech models for zero-shot AV-ASR

Posted by Arsha Nagrani and Paul Hongsuck Seo, Research Scientists, Google Research. Automatic speech recognition (ASR) is a well-established technology that is widely adopted for various applications such as conference calls, streamed video transcription and voice commands. While the challenges for this technology are centered around noisy audio inputs, the visual stream in multimodal videos…