Unstract
LLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows
Why it is included
Unstract is an open-source, no-code platform that uses LLMs to extract data from unstructured documents, with a stated focus on high accuracy. Extraction workflows can be deployed as APIs or as ETL pipelines.
Best for
Users evaluating vetted FOSS alternatives for information processing.
Strengths
- ~6,522 GitHub stars (per upstream list)
- Open source
Limitations
- Verify license, platform support, and security posture for your environment.
Good alternatives
Related tools
AI & Machine Learning
OpenClaw
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
AI & Machine Learning
Osaurus
Own your AI. The native macOS harness for AI agents -- any model, persistent memory, autonomous execution, cryptographic identity. Built in Swift. Fully offline. Open source.
AI & Machine Learning
tgpt
AI Chatbots in terminal for free
AI & Machine Learning
Everywhere
Context-aware AI assistant for your desktop. Ready to respond intelligently, seamlessly integrating multiple LLMs and MCP tools.
AI & Machine Learning
DeepChat
🐬DeepChat - A smart assistant that connects powerful AI to your personal world
AI & Machine Learning
olmOCR
Toolkit for linearizing PDFs for LLM datasets/training