Latest from MIT : Q&A: Gabriela Sá Pessoa on Brazilian politics, human rights in the Amazon, and AI

Gabriela Sá Pessoa is a journalist passionate about the intersection of human rights and climate change. She came to MIT from The Washington Post, where she worked from her home country of Brazil as a news researcher reporting on the Amazon, human rights violations, and environmental crimes. Before that, she held roles at two of…

Latest from Google AI – Visual captions: Using large language models to augment video conferences with dynamic visuals

Posted by Ruofei Du, Research Scientist, and Alex Olwal, Senior Staff Research Scientist, Google Augmented Reality Recent advances in video conferencing have significantly improved remote video communication through features like live captioning and noise cancellation. However, there are various situations where dynamic visual augmentation would be useful to better convey complex and nuanced information. For…

Latest from MIT Tech Review – To avoid AI doom, learn from nuclear safety

This story originally appeared in The Algorithm, our weekly newsletter on AI. To get stories like this in your inbox first, sign up here. Ok, doomer. For the past few weeks, the AI discourse has been dominated by a loud group of experts who think there is a very real possibility we could develop an artificial-intelligence…

Latest from MIT : Scaling audio-visual learning without labels

Researchers from MIT, the MIT-IBM Watson AI Lab, IBM Research, and elsewhere have developed a new technique for analyzing unlabeled audio and visual data that could improve the performance of machine-learning models used in applications like speech recognition and object detection. The work, for the first time, combines two architectures of self-supervised learning, contrastive learning…

Latest from MIT Tech Review – EmTech Next is happening June 13-15

EmTech Next, MIT Technology Review’s signature digital transformation conference, is June 13-15, 2023. This year’s event looks at the game-changing power of generative AI, the technology, and the legal implications of generated content. Leaders from OpenAI, Google, Meta, NVIDIA, and more are expected to discuss the future of AI. Join online June 13-15, 2023

Latest from Google AI – AVFormer: Injecting vision into frozen speech models for zero-shot AV-ASR

Posted by Arsha Nagrani and Paul Hongsuck Seo, Research Scientists, Google Research Automatic speech recognition (ASR) is a well-established technology that is widely adopted for various applications such as conference calls, streamed video transcription and voice commands. While the challenges for this technology are centered around noisy audio inputs, the visual stream in multimodal videos…

Latest from MIT Tech Review – Welcome to the new surreal. How AI-generated video is changing film.

The Frost nails its uncanny, disconcerting vibe in its first few shots. Vast icy mountains, a makeshift camp of military-style tents, a group of people huddled around a fire, barking dogs. It’s familiar stuff, yet weird enough to plant a growing seed of dread. There’s something wrong here. “Pass me the tail,” someone says. Cut…

Latest from Google AI – Retrieval-augmented visual-language pre-training

Posted by Ziniu Hu, Student Researcher, and Alireza Fathi, Research Scientist, Google Research, Perception Team Large-scale models, such as T5, GPT-3, PaLM, Flamingo and PaLI, have demonstrated the ability to store substantial amounts of knowledge when scaled to tens of billions of parameters and trained on large text and image datasets. These models achieve state-of-the-art…

Latest from Google AI – Large sequence models for software development activities

Posted by Petros Maniatis and Daniel Tarlow, Research Scientists, Google Software isn’t created in one dramatic step. It improves bit by bit, one little step at a time — editing, running unit tests, fixing build errors, addressing code reviews, editing some more, appeasing linters, and fixing more errors — until finally it becomes good enough…