Latest from Google AI – Visual language maps for robot navigation

Posted by Oier Mees, PhD Student, University of Freiburg, and Andy Zeng, Research Scientist, Robotics at Google People are excellent navigators of the physical world, due in part to their remarkable ability to build cognitive maps that form the basis of spatial memory — from localizing landmarks at varying ontological levels (like a book on…

Latest from MIT Tech Review – These new tools let you see for yourself how biased AI image models are

Popular AI image-generating systems notoriously tend to amplify harmful biases and stereotypes. But just how big a problem is it? You can now see for yourself using interactive new online tools. (Spoiler alert: it’s big.) The tools, built by researchers at AI startup Hugging Face and Leipzig University and detailed in a non-peer-reviewed paper, allow…

Latest from MIT Tech Review – The bearable mediocrity of Baidu’s ChatGPT competitor

China Report is MIT Technology Review’s newsletter about technology developments in China. Sign up to receive it in your inbox every Tuesday. Did you stay up late last week to watch the release of Ernie Bot, the first Chinese rival to ChatGPT? It felt like the most anticipated event in China’s tech world so far this year,…

Latest from MIT : Learning to grow machine-learning models

It’s no secret that OpenAI’s ChatGPT has some incredible capabilities — for instance, the chatbot can write poetry that resembles Shakespearean sonnets or debug code for a computer program. These abilities are made possible by the massive machine-learning model that ChatGPT is built upon. Researchers have found that when these types of models become large…

Latest from MIT Tech Review – Google just launched Bard, its answer to ChatGPT—and it wants you to make it better

Google has launched Bard, the search giant’s answer to OpenAI’s ChatGPT and Microsoft’s Bing Chat. Unlike Bing Chat, Bard does not look up search results—all the information it returns is generated by the model itself. But it is still designed to help users brainstorm and answer queries. Google wants Bard to become an integral part…

Latest from MIT Tech Review – How AI experts are using GPT-4

WOW, last week was intense. Several leading AI companies had major product releases. Google said it was giving developers access to its AI language models, and AI startup Anthropic unveiled its AI assistant Claude. But one announcement outshined them all: OpenAI’s new multimodal large language model, GPT-4. My colleague William Douglas Heaven got an exclusive preview. Read about…

Latest from MIT : Detailed images from space offer clearer picture of drought effects on plants

“MIT is a place where dreams come true,” says César Terrer, an assistant professor in the Department of Civil and Environmental Engineering. Here at MIT, Terrer says he’s given the resources needed to explore ideas he finds most exciting, and at the top of his list is climate science. In particular, he is interested in…

Latest from MIT Tech Review – Language models might be able to self-correct biases—if you ask them

Large language models are infamous for spewing toxic biases, thanks to the reams of awful human-produced content they get trained on.  But if the models are large enough, and humans have helped train them, then they may be able to self-correct for some of these biases. Remarkably, all we have to do is ask. That’s…

Latest from Google AI – Vid2Seq: a pretrained visual language model for describing multi-event videos

Posted by Antoine Yang, Student Researcher, and Arsha Nagrani, Research Scientist, Google Research, Perception team Videos have become an increasingly important part of our daily lives, spanning fields such as entertainment, education, and communication. Understanding the content of videos, however, is a challenging task as videos often contain multiple events occurring at different time scales….