Launch kit · sifter.run
Open-source document intelligence for extracting structured records from unstructured files

Sifter

Turn messy document folders into queryable structured databases.

Tagline

Turn messy folders into databases

Documents as a queryable database layer

Stop building brittle extraction templates

Open-source document intelligence for real chaos

1

Sifter is a document database layer, not a document search tool.

The page explicitly contrasts structured aggregation with RAG similarity search and emphasizes filter/aggregate/query semantics. This is a strong category-creation angle because the product behaves like a database for files, not a retrieval layer.

2

The anti-template extractor for real-world document chaos.

The strongest recurring proof point is that Sifter works across layout changes, supplier variations, and mixed document formats without per-layout configuration. That directly attacks the brittle template-extraction market.

3

Open-source document intelligence for teams that can’t trust black-box SaaS.

MIT license, self-hosting, BYOK (bring your own key), and no vendor lock-in are prominent on the page. This angle will resonate with engineering-led buyers, privacy-conscious teams, and companies building workflows on top of extracted document data.
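
To make the first angle concrete, here is a minimal sketch of what "filter/aggregate/query semantics" means once documents have become rows. It uses only Python's standard sqlite3 module. It is not Sifter's API, and the table, column names, file names, and values are all hypothetical stand-ins for extracted invoice fields.

```python
import sqlite3

# Hypothetical rows standing in for fields extracted from invoice files.
# Column names and values are illustrative, not Sifter's schema or output.
ROWS = [
    ("acme-001.pdf", "Acme", "2024-01-15", 1200.00),
    ("acme-002.pdf", "Acme", "2024-02-03", 800.50),
    ("globex-17.pdf", "Globex", "2024-01-20", 430.00),
]


def total_by_vendor(rows, since):
    """Sum invoice totals per vendor for invoices dated on or after `since`."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE invoices ("
        "source_file TEXT, vendor TEXT, invoice_date TEXT, total REAL)"
    )
    conn.executemany("INSERT INTO invoices VALUES (?, ?, ?, ?)", rows)
    # Filter by date, then aggregate by vendor: an exact answer over every row,
    # which a top-k similarity search over text chunks is not designed to give.
    cur = conn.execute(
        "SELECT vendor, SUM(total) FROM invoices "
        "WHERE invoice_date >= ? GROUP BY vendor ORDER BY vendor",
        (since,),
    )
    result = dict(cur.fetchall())
    conn.close()
    return result


totals = total_by_vendor(ROWS, "2024-01-01")
print(totals)
```

The source_file column hints at how each row could still point back to the document it came from, which is the role citations play in the copy below.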

Announcement

Your PDFs are a dark database. Sifter turns messy folders into queryable structured rows. No templates. No per-layout rules. Just schema-driven extraction, citations, filters, aggregates, dashboards, API, SDK, and MCP. Ship it on your docs.

Announcement

RAG was built for retrieval. Sifter was built for documents that need rows. Invoices, receipts, contracts, resumes, scans. Extract fields with natural language schemas. Query them like a database. Export them like data. Dashboard them like metrics.

Build-in-public

I kept seeing the same failure: teams would dump PDFs into a vector store and then ask why totals, dates, and vendor names were wrong. Because search is not structure. Sifter makes document collections queryable. Think database, not chatbot.
