About

A semantic filesystem, built in the open, that you can fully own.

StrataFS is the answer to a question we kept asking ourselves: why does every "AI search" tool want to send our files to someone else's servers?

Why we're building this

Companies have decades of files. They live in laptops, in S3 buckets, in SharePoint sites, in Jira projects, in a Drive folder someone created in 2018. Knowing what's in them — and getting an AI agent to use that context — used to require either expensive SaaS or weeks of glue code.

We thought you should be able to point one binary at all of that, get a ranked search in 100 ms, and hand the same engine to your AI agent through the Model Context Protocol. No vendor in the path. No file contents leaving the machine. So we built it.

What's in the box today

Project status

What's next

In flight or imminent:

Further out: distributed read replicas, OCR for images and scanned PDFs, multimodal embeddings.

Who's behind it

StrataFS is a project of Neul Labs, with development led by Dipankar Sarkar. The project is fully open source under the MIT license. Contributions, bug reports, and design discussions are welcome through the GitHub repository.

The name

"Strata" — layers of sediment. "FS" — filesystem. The picture is of passive storage as geological strata, and StrataFS as the survey that reveals what's actually in each layer. Less metaphorically: every file type is a stratum we know how to read.

Index your storage. Search it like a human.

MIT-licensed. Multi-storage. Native MCP server for AI agents. Local, S3, GCS, Azure, SharePoint, Drive and Jira.