Latest from MIT Tech Review – AI’s progress isn’t the same as creating human intelligence in machines

The term “artificial intelligence” really has two meanings. AI refers both to the fundamental scientific quest to build human intelligence into computers and to the work of modeling massive amounts of data. These two endeavors are very different, both in their ambitions and in the amount of progress they have made in recent years.

Scientific AI, the quest to both construct and understand human-level intelligence, is one of the most profound challenges in all of science; it dates back to the 1950s and is likely to continue for many decades.

Data-centric AI, on the other hand, began in earnest in the 1970s with the invention of methods for automatically constructing “decision trees” and has exploded in popularity over the last decade with the resounding success of neural networks (now dubbed “deep learning”). Data-centric artificial intelligence has also been called “narrow AI” or “weak AI,” but the rapid progress over the last decade or so has demonstrated its power.

Deep-learning methods, coupled with massive training data sets plus unprecedented computational power, have delivered success on a broad range of narrow tasks from speech recognition to game playing and more. The artificial-intelligence methods build predictive models that grow increasingly accurate through a compute-intensive iterative process. In previous years, the need for human-labeled data to train the AI models has been a major bottleneck in achieving success. But recently, research and development focus has shifted to ways in which the necessary labels can be created automatically, based on the internal structure of the data.

The GPT-3 language model released by OpenAI in 2020 exemplifies both the potential and the challenges of this approach. GPT-3 was trained on billions of sentences. It automatically generates highly plausible text, and even sensibly answers questions on a broad range of topics, mimicking the same language that a person might use.

But GPT-3 suffers from several problems that researchers are working to address. It’s often inconsistent—you can get contradictory answers to the same question. Second, GPT-3 is prone to “hallucinations”: when asked who the president of the United States was in 1492, it will happily conjure up an answer. Third, GPT-3 is an expensive model to train and expensive to run. Fourth, GPT-3 is opaque—it’s difficult to understand why it drew a particular conclusion. Finally, since GPT-3 parrots the contents of its training data, which is drawn from the web, it often spews out toxic content, including sexism, racism, xenophobia, and more. In essence, GPT-3 cannot be trusted.

Despite these challenges, researchers are investigating multi-modal versions of GPT-3 (such as DALL-E2), which create realistic images from natural-language requests. AI developers are also considering how to use these insights in robots that interact with the physical world. And AI is increasingly being applied to biology, chemistry, and other scientific disciplines to glean insights from the massive data and complexities in those fields.

The bulk of the rapid progress today is in this data-centric AI, and the work of this year’s 35 Innovators Under 35 winners is no exception. While data-centric AI is powerful, it has key limitations: the systems are still designed and framed by humans. A few years ago, I wrote an article for MIT Technology Review called “How to know if artificial intelligence is about to destroy civilization.” I argued that successfully formulating problems remains a distinctly human capability. Pablo Picasso famously said, “Computers are useless. They only give you answers.”

We continue to anticipate the distant day when AI systems can formulate good questions—and shed more light on the fundamental scientific challenge of understanding and constructing human-level intelligence.

Oren Etzioni is CEO of the Allen Institute for AI and a judge for this year’s 35 Innovators competition.

Latest from MIT : 3 Questions: Should we label AI systems like we do prescription drugs?

AI systems are increasingly being deployed in safety-critical health care situations. Yet these models sometimes hallucinate incorrect information, make biased predictions, or fail for unexpected reasons, which could have serious consequences for patients and clinicians. In a commentary article published today in Nature Computational Science, MIT Associate Professor Marzyeh Ghassemi and Boston University Associate Professor Elaine…

Artificial Intelligence

Latest from MIT Tech Review – Inside the tedious effort to tally AI’s energy appetite

After working on it for months, my colleague Casey Crownhart and I finally saw our story on AI’s energy and emissions burden go live last week. The initial goal sounded simple: Calculate how much energy is used each time we interact with a chatbot, and then tally that up to understand why everyone from leaders…

Artificial Intelligence

Latest from MIT Tech Review – AI is changing how we study bird migration

A small songbird soars above Ithaca, New York, on a September night. He is one of 4 billion birds, a great annual river of feathered migration across North America. Midair, he lets out what ornithologists call a nocturnal flight call to communicate with his flock. It’s the briefest of signals, barely 50 milliseconds long, emitted…

Artificial Intelligence

Latest from Google AI – MediaPipe FaceStylizer: On-device real-time few-shot face stylization

Posted by Haolin Jia, Software Engineer, and Qifei Wang, Senior Software Engineer, Core ML In recent years, we have witnessed rising interest across consumers and researchers in integrated augmented reality (AR) experiences using real-time face feature generation and editing functions in mobile applications, including short videos, virtual reality, and gaming. As a result, there is…

Artificial Intelligence

Latest from IBM Developer : A Python Flask audio search application

Note: This code pattern uses Watson Discovery V1 and will not work with Discovery V2. However, you can still use it to learn the Discovery features. Future plans include updating the code pattern to work with Discovery V2. Summary This code pattern explains how to create an application that you can use to search for…

Artificial Intelligence

Latest from MIT : How to build AI scaling laws for efficient LLM training and budget maximization

When researchers are building large language models (LLMs), they aim to maximize performance under a particular computational and financial budget. Since training a model can amount to millions of dollars, developers need to be judicious with cost-impacting decisions about, for instance, the model architecture, optimizers, and training datasets before committing to a model. To anticipate…

Similar Posts