kreuzberg (kreuzberg-dev/kreuzberg) is an open-source AI project on GitHub. Repository summary: A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, and TypeScript (Node/Bun/Wasm/Deno), or use via CLI, REST API, or MCP server. Its focus areas include MCP and tool-calling integration, retrieval-augmented generation, developer-centric engineering workflows, and image and vision workflows. It is suitable for extension, integration, and iterative delivery in real workflows.
License
Other
Stars
8,268
Homepage
https://kreuzberg.dev/
Features
- Core capability: A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, and TypeScript (Node/Bun/Wasm/Deno), or use via CLI, REST API, or MCP server.
- Provides MCP or tool-calling integration
- Supports vector retrieval and retrieval-augmented reasoning
- Built for engineering integration through language bindings, a CLI, a REST API, and an MCP server
- Supports image and vision workflows: extracts text and metadata from images, and embedded images from documents
- Repository: kreuzberg-dev/kreuzberg
Use Cases
- Connects external systems into agent workflows
- Builds enterprise knowledge Q&A and document retrieval systems
- Supports AI engineering build-and-iterate workflows for dev teams
- Extracts text and metadata from images for vision workflows
- Builds internal AI workflow prototypes with kreuzberg
- Validates kreuzberg in production-like engineering scenarios
FAQ
Teams should first define integration boundaries and call patterns, then map the repository's capabilities onto concrete interfaces, parameters, and access rules. GitHub repository: https://github.com/kreuzberg-dev/kreuzberg. The repository has about 8,268 stars. License: Other.
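One way to pin down an integration boundary of this kind is a thin interface that application code depends on, so the extraction backend behind it can be swapped. The sketch below is illustrative only: `DocumentExtractor`, `ExtractionResult`, `PlainTextExtractor`, and `index_document` are hypothetical names and not part of kreuzberg's API, and the stand-in backend does nothing more than decode UTF-8 text.

```python
from dataclasses import dataclass, field
from typing import Protocol


@dataclass
class ExtractionResult:
    """Hypothetical result type: extracted text plus free-form metadata."""
    text: str
    metadata: dict[str, str] = field(default_factory=dict)


class DocumentExtractor(Protocol):
    """Hypothetical boundary that application code is written against."""

    def extract(self, data: bytes, mime_type: str) -> ExtractionResult: ...


class PlainTextExtractor:
    """Stand-in backend for this sketch; a real adapter would call
    the extraction library at this point instead."""

    SUPPORTED = {"text/plain"}

    def extract(self, data: bytes, mime_type: str) -> ExtractionResult:
        # Access rule: reject formats this backend is not allowed to handle.
        if mime_type not in self.SUPPORTED:
            raise ValueError(f"unsupported MIME type: {mime_type}")
        text = data.decode("utf-8")
        return ExtractionResult(text=text, metadata={"mime_type": mime_type})


def index_document(
    extractor: DocumentExtractor, data: bytes, mime_type: str
) -> list[str]:
    """Depends only on the interface, not on a concrete backend."""
    result = extractor.extract(data, mime_type)
    # Split into non-empty paragraphs as a simple unit for later retrieval.
    return [p.strip() for p in result.text.split("\n\n") if p.strip()]
```

Calling `index_document(PlainTextExtractor(), b"first\n\nsecond", "text/plain")` returns the two paragraphs as separate chunks. Replacing the stand-in with an adapter around a real extraction library would leave `index_document` unchanged, which is the point of fixing the boundary first.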
It usually works as an execution component or capability layer. Common deployment fits include connecting external systems into agent workflows, building enterprise knowledge Q&A and document retrieval systems, and supporting AI engineering build-and-iterate workflows for dev teams.