Latest from IBM Developer : A Python Flask audio search application

Note: This code pattern uses Watson Discovery V1 and will not work with Discovery V2. However, you can still use it to learn the Discovery features. Future plans include updating the code pattern to work with Discovery V2.

Summary

This code pattern explains how to create an application that you can use to search for a topic within video and audio files.

Description

While listening to a podcast or to video or audio files of courses, you often want to jump directly to the topic rather than listening to extraneous information. However, finding the topics and keywords in the entire recording can be challenging.

In this code pattern, create an application that you can use to search within the video or audio files. With the app, not only can you search, but you can also highlight the text where the search string or topic occurs in the file. The code pattern performs a natural language query search in audio files, and returns the results with the proper timeframe where your search topic is being discussed. This example uses an IBM® Watson Machine Learning introduction video to illustrate the process.

When you have completed the code pattern, you understand how to:

Prepare audio and video data and perform chunking to break it into smaller chunks to work with
Work with the Watson Speech to Text service through API calls to convert audio or video to text
Work with the Watson Discovery service through API calls to perform a search on text chunks
Create a Python Flask application and deploy it on IBM Cloud.

Flow

The user uploads the video or audio file on the UI.
The video or audio file is processed with the moviepy and pydub Python libraries, and is chunked to create smaller chunks to work with.
The user interacts with the Watson Speech to Text service through the provided application UI. The audio chunks are converted into text chunks with Watson Speech to Text.
The text chunks are uploaded on Watson Discovery by calling Watson Discovery APIs with Python SDKs.
The user performs a search query using Watson Discovery.
The results are shown on the UI.

Instructions

Get detailed steps in the readme file. Those steps show how to:

Clone the GitHub repository.
Create the Watson Speech to Text service.
Create a Watson Discovery instance.
Run the application locally.

Artificial Intelligence

Latest from IBM Developer : Build a framework that connects WhatsApp to Watson services

Summary To enable mobile users to leverage IBM Watson® services through a messenger app, complete this developer code pattern and build a framework that can act as an intermediator in connecting Watson services to WhatsApp Messenger. Description There are currently 2.4 billion users on WhatsApp, and the number keeps climbing. For medium and large businesses,…

Artificial Intelligence

Latest from MIT Tech Review – You need to talk to your kid about AI. Here are 6 things you should say.

In the past year, kids, teachers, and parents have had a crash course in artificial intelligence, thanks to the wildly popular AI chatbot ChatGPT. In a knee-jerk reaction, some schools, such as the New York City public schools, banned the technology—only to cancel the ban months later. Now that many adults have caught up with…

Artificial Intelligence

Latest from MIT : Making AI-generated code more accurate in any language

Programmers can now use large language models (LLMs) to generate computer code more quickly. However, this only makes programmers’ lives easier if that code follows the rules of the programming language and doesn’t cause a computer to crash. Some methods exist for ensuring LLMs conform to the rules of whatever language they are generating text…

Artificial Intelligence

Latest from MIT : 3 Questions: Amar Gupta on an integrated approach to enhanced health-care delivery

Covid-19 was somewhat of a metaverse itself. Many of our domains turned digital — with much attention toward one emerging space: virtual care. The pandemic exacerbated the difficulties of providing appropriate medical board oversight to ensure proper standard of services for patients. MIT researcher and former professor Amar Gupta explores through his research on how…

Artificial Intelligence

Latest from MIT : MIT researchers combine deep learning and physics to fix motion-corrupted MRI scans

Compared to other imaging modalities like X-rays or CT scans, MRI scans provide high-quality soft tissue contrast. Unfortunately, MRI is highly sensitive to motion, with even the smallest of movements resulting in image artifacts. These artifacts put patients at risk of misdiagnoses or inappropriate treatment when critical details are obscured from the physician. But researchers…

Artificial Intelligence

Latest from MIT Tech Review – These two new AI benchmarks could help make models less biased

A new pair of AI benchmarks could help developers reduce bias in AI models, potentially making them fairer and less likely to cause harm. The research, from a team based at Stanford, was posted to the arXiv preprint server in early February. The researchers were inspired to look into the problem of bias after witnessing…

Summary

Description

Flow

Instructions

Similar Posts