This project provides a Python client and pipeline scripts for the document ingestion workflow in RAGFlow, an open-source Retrieval-Augmented Generation (RAG) engine. It takes a collection of ...
Standalone implementation of RAGFlow's backend pipeline that transforms raw documents into searchable, embedded chunks through parsing, chunking, embedding, tokenization, and indexing. This project ...
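The five stages named above (parsing, chunking, embedding, tokenization, indexing) can be sketched end to end as follows. This is a minimal illustration using only the standard library, not RAGFlow's or this project's actual code: every name here (`Chunk`, `parse`, `chunk`, `embed`, `tokenize`, `ingest`) is hypothetical, the embedding is a toy hashed bag-of-words vector standing in for a real embedding model, and the index is a plain in-memory dictionary rather than a search backend.

```python
import hashlib
import re
from dataclasses import dataclass, field


@dataclass
class Chunk:
    """One searchable unit of a document (hypothetical structure)."""
    doc_id: str
    text: str
    vector: list = field(default_factory=list)
    tokens: list = field(default_factory=list)


def parse(raw: bytes) -> str:
    """Parsing: decode raw bytes and collapse runs of whitespace."""
    return " ".join(raw.decode("utf-8", errors="replace").split())


def chunk(doc_id: str, text: str, max_words: int = 8) -> list:
    """Chunking: split text into fixed-size word windows."""
    words = text.split()
    return [
        Chunk(doc_id, " ".join(words[i:i + max_words]))
        for i in range(0, len(words), max_words)
    ]


def tokenize(text: str) -> list:
    """Tokenization: lowercase alphanumeric terms for keyword lookup."""
    return re.findall(r"[a-z0-9]+", text.lower())


def embed(text: str, dim: int = 16) -> list:
    """Embedding: toy stand-in that counts tokens per hash bucket.

    A real pipeline would call an embedding model here (assumption).
    """
    vec = [0.0] * dim
    for tok in tokenize(text):
        bucket = int(hashlib.sha256(tok.encode("utf-8")).hexdigest(), 16) % dim
        vec[bucket] += 1.0
    return vec


def ingest(docs: dict) -> dict:
    """Indexing: run every stage and build a token -> chunks mapping."""
    index = {}
    for doc_id, raw in docs.items():
        for c in chunk(doc_id, parse(raw)):
            c.vector = embed(c.text)
            c.tokens = tokenize(c.text)
            for tok in set(c.tokens):
                index.setdefault(tok, []).append(c)
    return index
```

Calling `ingest({"a.txt": b"..."})` returns a dictionary keyed by token, so `index.get("searchable", [])` lists the chunks containing that term, each carrying its vector for a later similarity step. Keeping each stage as a separate function makes it possible to swap in a real parser, embedding model, or index store without touching the others.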