You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
🌐 ChunkCanvas.cc is a document-processing GUI designed for RAG workflows. It allows you to parse PDFs, process images, transcribe audio/video, and extract data from Excel/CSV files using either local engines (Ollama, vLLM, Docling) or cloud providers (OpenRouter). From there, you can chunk content with configurable LangChain's RecursiveCharacterTextSplitter, edit parsed text and chunks directly in the UI, generate embeddings with your preferred provider, and ingest (text, embeddings and metadata) into the selected vector database of choice.
demo.webm
Document Support
Format
Built-in
Docling
Ollama
vLLM
OpenRouter
PDF (text)
✅
✅
PDF (vision)
✅
✅
✅
✅
Images
✅
✅
✅
Audio
✅
✅
✅
Video
✅
✅
✅
Excel / CSV
✅
Embeddings & Vector Databases
Embeddings Provider
Voyage AI
☁️
Cohere
☁️
OpenRouter
☁️
Ollama
🖥️
vLLM
🖥️
Vector Database
Pinecone
☁️
ChromaDB
☁️ 🖥️
MongoDB Atlas
☁️
FAISS
🖥️
Quick Start (Docker)
git clone https://github.com/Rayen023/chunkcanvas.git
cd chunkcanvas/
docker compose up -d
cd backend
uv sync
uv run uvicorn app.faiss_server:app --reload --port 8010
# separate terminal for Docling:
uv run uvicorn app.docling_server:app --reload --port 8020
ChunkCanvas.cc is a document-processing GUI designed for RAG workflows. Parse PDFs, images, audio, and Excel files, chunk and edit content, generate embeddings, and ingest data into vector databases using local or cloud engines, all in one streamlined, configurable interface.