DeepLearning.AI Launches Multimodal RAG: Chat with Videos Course with Intel

  • Last updated September 12, 2024
  • In AI News

Participants will now be able to create an interactive chat system using the BridgeTower model, a multimodal transformer developed by Intel and Microsoft Research.

DeepLearning.AI recently introduced a new course, ‘Multimodal RAG: Chat with Videos’, offered by founder and CEO Andrew Ng in collaboration with Intel Corporation and taught by Vasudev Lal, Principal AI Research Scientist at Intel Labs.

The course focuses on building a system that gives answers grounded in video content. Participants will create an interactive chat system using the BridgeTower model, a multimodal transformer developed by Intel and Microsoft Research.


The course aims to help participants generate joint embeddings from video content and store them in a vector database, build a Retrieval-Augmented Generation (RAG) pipeline that fetches relevant video data, and use Large Vision-Language Models (LVLMs) to answer questions from both text and image inputs.
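To make the retrieval step concrete, here is a minimal Python sketch of the general idea, not the course's actual code. It assumes each video segment has already been turned into a joint image-text embedding; the vectors below are hand-written placeholders rather than BridgeTower output. The names `VideoSegment`, `InMemoryVectorStore` and `build_lvlm_request`, the frame paths, and the request shape are all hypothetical, and the brute-force list search stands in for a real vector database.

```python
import math
from dataclasses import dataclass


@dataclass
class VideoSegment:
    """One retrievable unit: a frame reference plus its transcript text."""
    frame_path: str   # hypothetical path to an extracted frame
    transcript: str   # caption or transcript aligned with that frame
    embedding: list   # joint image-text vector (placeholder values here)


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class InMemoryVectorStore:
    """Toy stand-in for a vector database: brute-force similarity search."""

    def __init__(self):
        self.segments = []

    def add(self, segment):
        self.segments.append(segment)

    def search(self, query_embedding, k=1):
        # Rank every stored segment by similarity to the query, best first.
        ranked = sorted(
            self.segments,
            key=lambda s: cosine(query_embedding, s.embedding),
            reverse=True,
        )
        return ranked[:k]


def build_lvlm_request(question, segment):
    """Pair the question with the retrieved frame and transcript.

    The returned dict is an illustrative shape, not a real LVLM API payload.
    """
    return {
        "image": segment.frame_path,
        "prompt": f"Transcript: {segment.transcript}\nQuestion: {question}",
    }


store = InMemoryVectorStore()
store.add(VideoSegment("frames/0001.jpg",
                       "An astronaut floats outside the station.",
                       [0.9, 0.1, 0.0]))
store.add(VideoSegment("frames/0002.jpg",
                       "The crew eats dinner together.",
                       [0.1, 0.9, 0.2]))

# In a real pipeline the query vector would come from the same embedding
# model as the stored segments; here it is hand-written for illustration.
query_embedding = [0.8, 0.2, 0.1]
best = store.search(query_embedding, k=1)[0]
request = build_lvlm_request("What is the astronaut doing?", best)
print(request["image"])
```

The sketch keeps the frame and its transcript together in one record, so whatever is retrieved can be handed to a vision-language model as both an image and a text prompt.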

By the …
