Skip to content

A Multimodal RAG Pipeline with LangChain & Unstructured Library

Notifications You must be signed in to change notification settings

Mercytopsy/Multimodal-RAG-Pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Multimodal RAG Pipeline with LangChain

This project demonstrates how to build a Multimodal RAG (Retrieval-Augmented Generation) pipeline using LangChain to process PDF documents (text + tables + images) with the Unstructured library.

✨ Features

  • Extract text, tables and images from PDFs using `Unstructured. '
  • Retriever-augmented generation (RAG) pipeline for multimodal content.
  • LangChain integration for document processing and retrieval
  • Question-answering to process, find, and answer questions about documents.

🚀 Quick Start

  1. Prerequisites
  • Python 3.10+
  1. Clone repository
  1. Install dependencies
  • pip install -r requirements.txt

Watch the video

About

A Multimodal RAG Pipeline with LangChain & Unstructured Library

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published