🤖 AI Assistant with RAG (Retrieval-Augmented Generation)

A powerful Streamlit chatbot that combines Google's Gemini API with RAG capabilities to provide intelligent responses based on your uploaded content. Supports multimodal inputs including text, images, audio, video, and web content.

✨ Features

📁 File Processing

Documents: PDF, Word (.docx), text files
Images: JPG, PNG, GIF, WebP with AI vision analysis
Audio: MP3, WAV, M4A with automatic transcription
Video: MP4, AVI, MOV with audio extraction and transcription
Web Content: Extract content from any webpage
YouTube: Automatic transcript extraction from YouTube videos

🧠 AI Capabilities

Multimodal Chat: Text + image queries using Gemini Vision
RAG System: Intelligent context retrieval from your knowledge base
Semantic Search: Find relevant information across all uploaded content
Chat Memory: Persistent conversation history
Source Attribution: See which documents informed each response

🗄️ Vector Storage Options

ChromaDB: Local vector database (default)
Pinecone: Cloud vector database (optional, requires API key)

🚀 Quick Start

1. Installation

# Clone the repository
git clone <repository-url>
cd streamlit-gemini-rag-chatbot

# Install dependencies
pip install -r requirements.txt

2. Configuration

# Copy the environment template
cp .env.example .env

# Edit .env and add your API keys
GOOGLE_API_KEY=your_gemini_api_key_here
PINECONE_API_KEY=your_pinecone_api_key_here  # Optional
PINECONE_ENVIRONMENT=your_pinecone_environment_here  # Optional

3. Get API Keys

Required: Google Gemini API

Go to Google AI Studio
Create a new API key
Add it to your .env file

Optional: Pinecone API

Sign up at Pinecone
Create a new project and get your API key
Add it to your .env file

4. Run the Application

streamlit run app.py

📖 How to Use

Upload Content

Files: Use the sidebar to upload documents, images, audio, or video files
Web URLs: Enter any webpage URL to extract and index its content
YouTube: Paste YouTube URLs to get automatic transcripts

Chat with Your AI

Ask questions about your uploaded content
Upload images directly in the chat for multimodal queries
Get responses with source attribution
View chat history and export conversations

Example Queries

"Summarize the key points from the uploaded PDF"
"What does this image show?" (with image upload)
"Compare the information from different documents"
"What are the main topics discussed in the video?"

🏗️ Architecture

├── app.py                 # Main Streamlit application
├── config/
│   └── settings.py        # Configuration and settings
├── utils/
│   ├── gemini_client.py   # Gemini API integration
│   ├── vector_store.py    # Vector database operations
│   ├── file_processor.py  # File processing and content extraction
│   └── chat_memory.py     # Chat history management
├── components/
│   ├── sidebar.py         # Sidebar UI components
│   └── chat_interface.py  # Chat interface components
└── requirements.txt       # Python dependencies

🔧 Advanced Configuration

Vector Store Selection

ChromaDB (Default): Local storage, no API key required
Pinecone: Cloud storage, requires API key but offers better scalability

File Processing Settings

Edit config/settings.py to customize:

Maximum file size limits
Text chunking parameters
Supported file types
Embedding dimensions

RAG Parameters

Chunk Size: How text is split for processing
Top-K Retrieval: Number of relevant chunks to retrieve
Embedding Model: Sentence transformer model for embeddings

🛠️ Troubleshooting

Common Issues

API Key Errors
- Ensure your Gemini API key is valid and has sufficient quota
- Check that the key is properly set in the .env file
File Processing Errors
- Large files may take time to process
- Some video formats may require additional codecs
Memory Issues
- For large files, consider increasing chunk size
- Use Pinecone for better scalability with large datasets

Performance Tips

Optimize File Sizes: Compress large video/audio files before upload
Use Specific Queries: More specific questions yield better results
Regular Cleanup: Clear chat history periodically for better performance

📝 Development

Adding New File Types

Update SUPPORTED_FILE_TYPES in config/settings.py
Add processing logic in utils/file_processor.py
Test with sample files

Custom Embedding Models

Modify the generate_embeddings method in utils/gemini_client.py
Update the embedding dimension in settings
Rebuild your vector database

🤝 Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Google Gemini API for multimodal AI capabilities
Streamlit for the amazing web framework
ChromaDB and Pinecone for vector storage solutions
OpenAI Whisper for audio transcription
All the open-source libraries that make this possible

📞 Support

If you encounter any issues or have questions:

Check the troubleshooting section above
Review the configuration settings
Open an issue on GitHub with detailed information about your problem

Happy chatting with your AI assistant! 🚀

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🤖 AI Assistant with RAG (Retrieval-Augmented Generation)

✨ Features

📁 File Processing

🧠 AI Capabilities

🗄️ Vector Storage Options

🚀 Quick Start

1. Installation

2. Configuration

3. Get API Keys

Required: Google Gemini API

Optional: Pinecone API

4. Run the Application

📖 How to Use

Upload Content

Chat with Your AI

Example Queries

🏗️ Architecture

🔧 Advanced Configuration

Vector Store Selection

File Processing Settings

RAG Parameters

🛠️ Troubleshooting

Common Issues

Performance Tips

📝 Development

Adding New File Types

Custom Embedding Models

🤝 Contributing

📄 License

🙏 Acknowledgments

📞 Support

About

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
components		components
config		config
utils		utils
.env.example		.env.example
.gitattributes		.gitattributes
README.md		README.md
app.py		app.py
main.py		main.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

🤖 AI Assistant with RAG (Retrieval-Augmented Generation)

✨ Features

📁 File Processing

🧠 AI Capabilities

🗄️ Vector Storage Options

🚀 Quick Start

1. Installation

2. Configuration

3. Get API Keys

Required: Google Gemini API

Optional: Pinecone API

4. Run the Application

📖 How to Use

Upload Content

Chat with Your AI

Example Queries

🏗️ Architecture

🔧 Advanced Configuration

Vector Store Selection

File Processing Settings

RAG Parameters

🛠️ Troubleshooting

Common Issues

Performance Tips

📝 Development

Adding New File Types

Custom Embedding Models

🤝 Contributing

📄 License

🙏 Acknowledgments

📞 Support

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages