Latest from IBM Developer : A Python Flask audio search application

Note: This code pattern uses Watson Discovery V1 and will not work with Discovery V2. However, you can still use it to learn the Discovery features. Future plans include updating the code pattern to work with Discovery V2.

Summary

This code pattern explains how to create an application that you can use to search for a topic within video and audio files.

Description

While listening to a podcast or to video or audio files of courses, you often want to jump directly to the topic rather than listening to extraneous information. However, finding the topics and keywords in the entire recording can be challenging.

In this code pattern, create an application that you can use to search within the video or audio files. With the app, not only can you search, but you can also highlight the text where the search string or topic occurs in the file. The code pattern performs a natural language query search in audio files, and returns the results with the proper timeframe where your search topic is being discussed. This example uses an IBM® Watson Machine Learning introduction video to illustrate the process.

When you have completed the code pattern, you understand how to:

Prepare audio and video data and perform chunking to break it into smaller chunks to work with
Work with the Watson Speech to Text service through API calls to convert audio or video to text
Work with the Watson Discovery service through API calls to perform a search on text chunks
Create a Python Flask application and deploy it on IBM Cloud.

Flow

The user uploads the video or audio file on the UI.
The video or audio file is processed with the moviepy and pydub Python libraries, and is chunked to create smaller chunks to work with.
The user interacts with the Watson Speech to Text service through the provided application UI. The audio chunks are converted into text chunks with Watson Speech to Text.
The text chunks are uploaded on Watson Discovery by calling Watson Discovery APIs with Python SDKs.
The user performs a search query using Watson Discovery.
The results are shown on the UI.

Instructions

Get detailed steps in the readme file. Those steps show how to:

Clone the GitHub repository.
Create the Watson Speech to Text service.
Create a Watson Discovery instance.
Run the application locally.

Artificial Intelligence

Latest from MIT Tech Review – 2021 was the year of monster AI models

It’s been a year of supersized AI models. When OpenAI released GPT-3, in June 2020, the neural network’s apparent grasp of language was uncanny. It could generate convincing sentences, converse with humans, and even autocomplete code. GPT-3 was also monstrous in scale—larger than any other neural network ever built. It kicked off a whole new trend in…

Artificial Intelligence

Latest from MIT Tech Review – The White House just unveiled a new AI Bill of Rights

The White House wants Americans to know: The age of AI accountability is coming. US President Biden has today unveiled a new AI Bill of Rights, which outlines five protections Americans should have in the AI age. Biden has called for stronger privacy protections and for tech companies to stop collecting data in the past….

Artificial Intelligence

O’Reilly Media – MCP: What It Is and Why It Matters—Part 4

This is the last of four parts in this series. Part 1 can be found here, Part 2 here, and Part 3 here. 9. Future Directions and Wishlist for MCP The trajectory of MCP and AI tool integration is exciting, and there are clear areas where the community and companies are pushing things forward. Here…

Artificial Intelligence

Latest from MIT : Closing the design-to-manufacturing gap for optical devices

Photolithography involves manipulating light to precisely etch features onto a surface, and is commonly used to fabricate computer chips and optical devices like lenses. But tiny deviations during the manufacturing process often cause these devices to fall short of their designers’ intentions. To help close this design-to-manufacturing gap, researchers from MIT and the Chinese University…

Artificial Intelligence

O’Reilly Media – Context Serialization

In a recent edition of The Sequence Engineering newsletter, “Why Did MCP Win?,” the authors point to context serialization and exchange as a reason—perhaps the most important reason—why everyone’s talking about the Model Context Protocol. I was puzzled by this—I’ve read a lot of technical and semitechnical posts about MCP and haven’t seen context serialization…

Artificial Intelligence

Latest from MIT Tech Review – The tech industry can’t agree on what open source AI means. That’s a problem.

Suddenly, “open source” is the latest buzzword in AI circles. Meta has pledged to create open-source artificial general intelligence. And Elon Musk is suing OpenAI over its lack of open-source AI models. Meanwhile, a growing number of tech leaders and companies are setting themselves up as open-source champions. But there’s a fundamental problem—no one can…

Summary

Description

Flow

Instructions

Similar Posts