OpenContracts
Basic Information
OpenContracts is a free, open-source document analytics platform, licensed under GPL-3, that ingests, analyzes and annotates unstructured documents, primarily PDFs and text formats. It provides an enterprise-oriented workspace and APIs to manage document corpora, extract layout-aware text blocks, generate vector embeddings, and run LLM-backed queries. Its parsing layer is pluggable and its analyzers run as microservices, so teams can add new ingestion engines, custom parsers, embedders and thumbnail generators. It also includes a web-based human annotation interface, tooling for bulk data extraction across many documents, and integrations with vector search tooling such as a Django-backed pgvector store and LlamaIndex wrappers. Documentation and quickstart guides are provided, and the project can be deployed locally or in containers.
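To make the idea of a pluggable parsing layer more concrete, here is a minimal Python sketch of a parser registry keyed by MIME type. It is illustrative only and is not the OpenContracts API: `TextBlock`, `register_parser`, `parse_plain_text` and `parse_document` are hypothetical names, and the bounding boxes are placeholders rather than real layout coordinates.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class TextBlock:
    """A layout-aware chunk of text: page number plus bounding box (hypothetical type)."""
    page: int
    bbox: tuple  # (x0, y0, x1, y1); placeholder coordinates in this sketch
    text: str


# Assumed parser signature: raw file bytes in, text blocks out.
Parser = Callable[[bytes], List[TextBlock]]

# Registry keyed by MIME type; a new ingestion engine adds itself here.
_PARSERS: Dict[str, Parser] = {}


def register_parser(mime_type: str):
    """Decorator that registers a parser function for one MIME type."""
    def decorator(func: Parser) -> Parser:
        _PARSERS[mime_type] = func
        return func
    return decorator


@register_parser("text/plain")
def parse_plain_text(data: bytes) -> List[TextBlock]:
    # Treat each non-empty line as one block on page 1.
    lines = data.decode("utf-8").splitlines()
    return [
        TextBlock(page=1, bbox=(0, i, 0, i + 1), text=line)
        for i, line in enumerate(lines)
        if line.strip()
    ]


def parse_document(data: bytes, mime_type: str) -> List[TextBlock]:
    """Dispatch to whichever parser is registered for the MIME type."""
    try:
        parser = _PARSERS[mime_type]
    except KeyError:
        raise ValueError(f"no parser registered for {mime_type!r}")
    return parser(data)
```

Under these assumptions, supporting a new format means writing one function and decorating it with `register_parser`; the calling code in `parse_document` does not change. An embedder or thumbnail generator could be plugged in through the same kind of registry.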