Latest from MIT Tech Review – Meta’s new AI models can recognize and produce speech for more than 1,000 languages

Meta has built AI models that can recognize and produce speech for more than 1,000 languages—a tenfold increase on what’s currently available. It’s a significant step toward preserving languages that are at risk of disappearing, the company says. Meta is releasing its models to the public via the code hosting service GitHub. It claims that…

Latest from MIT : Researchers use AI to identify similar materials in images

A robot manipulating objects while, say, working in a kitchen, will benefit from understanding which items are composed of the same materials. With this knowledge, the robot would know to exert a similar amount of force whether it picks up a small pat of butter from a shadowy corner of the counter or an entire…

Latest from MIT : Using data to write songs for progress

A three-year recipient of MIT’s Emerson Classical Vocal Scholarships, senior Ananya Gurumurthy recalls getting ready to step onto the Carnegie Hall stage to sing a Mozart opera that she once sang with the New York All-State Choir. The choir conductor reminded her to articulate her words and to engage her diaphragm. “If you don’t project…

Latest from Google AI – Making ML models differentially private: Best practices and open challenges

Posted by Natalia Ponomareva and Alex Kurakin, Staff Software Engineers, Google Research Large machine learning (ML) models are ubiquitous in modern applications: from spam filters to recommender systems and virtual assistants. These models achieve remarkable performance partially due to the abundance of available training data. However, these data can sometimes contain private information, including personal…

Latest from Google AI – Sparse video tubes for joint video and image vision transformers

Posted by AJ Piergiovanni and Anelia Angelova, Research Scientists, Google Video understanding is a challenging problem that requires reasoning about both spatial information (e.g., for objects in a scene, including their locations and relations) and temporal information for activities or events shown in a video. There are many video understanding applications and tasks, such as…

Latest from MIT : Is medicine ready for AI? Doctors, computer scientists, and policymakers are cautiously optimistic

The advent of generative artificial intelligence models like ChatGPT has prompted renewed calls for AI in health care, and its support base only appears to be broadening. The second annual MIT-MGB AI Cures Conference, hosted on April 24 by the Abdul Latif Jameel Clinic for Machine Learning in Health (Jameel Clinic), saw its attendance nearly…

Latest from MIT : A better way to study ocean currents

To study ocean currents, scientists release GPS-tagged buoys in the ocean and record their velocities to reconstruct the currents that transport them. These buoy data are also used to identify “divergences,” which are areas where water rises up from below the surface or sinks beneath it. By accurately predicting currents and pinpointing divergences, scientists can…

Latest from MIT : An AI challenge only humans can solve

The Dark Ages were not entirely dark. Advances in agriculture and building technology increased Medieval wealth and led to a wave of cathedral construction in Europe. However, it was a time of profound inequality. Elites captured virtually all economic gains. In Britain, as Canterbury Cathedral soared upward, peasants had no net increase in wealth between…