Trover — Plug. Ask. Find.

The Expedition

You know it’s in there.
Somewhere.

Every knowledge worker has the same problem. Years of work product buried across backup drives, Time Machine snapshots, NAS boxes, and cloud exports. You know you wrote that board deck. You just can’t find it.

🗃

Scattered Across Everything

Time Machine snapshots. NTFS drives from your old Windows laptop. ext4 volumes from a Linux server. NAS shares. Google Takeout. iCloud exports. OneDrive dumps. Your career is spread across a dozen devices, three operating systems, and five cloud services.

🔎

Your OS Can’t Find It

Spotlight doesn’t index backup drives. Windows Search ignores external volumes. Linux locate only knows what’s mounted right now. No OS search tool scores relevance, understands context, or searches across filesystem boundaries.

🔐

Locked Inside Black Boxes

ZIP archives, RAR files, 7z bundles, compressed tarballs, encrypted disk images, Outlook PSTs, Apple Mail archives. Your most important data is trapped inside containers that search tools won’t even open. Trover spelunks in.

🧠

AI Needs the Data First

You can’t ask an AI about documents it’s never seen. Someone has to scan every drive, crack open every archive, extract the text, deduplicate the copies, and organize it all before a single question can be answered. That’s the job.

The Method

Three steps. That’s it.

From dusty drive to answered question in minutes.

Plug In

Connect any drive. Time Machine, external HDD, NAS, cloud export. Trover scans and scores every document by your definition of importance.

Ask

“What did I present to the board in Q1 2025?” Natural language queries across your entire document history. Powered by RAG.

Find

Get answers with sources. The exact file, the exact slide, the exact paragraph. Deduplicated, scored, and cited.

The Treasure Map

What nobody else does.

🔗

No Boundaries

Hard drives, cloud storage, email archives, compressed bundles, proprietary formats—Trover crosses every boundary. No vendor lock-in, no iron curtains between platforms. Your data is yours, and Trover finds it wherever it hides, deduplicates it across every source, and gives you one clean picture.

🌐

Every Format. Every Medium.

Documents, images, audio, video, and email archives. Trover extracts text and tables from PPTX, DOCX, PDF, and XLSX. Reads EXIF and OCR from photos. Transcribes audio and video locally via Whisper. Parses email threads from Apple Mail, Gmail, and Outlook. Everything gets chunked, embedded, and made searchable.

🔒

Local-First. Cloud When You Choose.

By default, everything runs on your machine—scanning, embeddings, deduplication, extraction. Connect cloud sources when you want to, not because you have to. You control what stays local and what gets indexed from the cloud. No forced uploads, no telemetry, no surprises.

⚖

Relevance Scoring

Every file gets scored 0–100 by a 4-tier keyword engine you control. A lawyer weights “deposition.” A PM weights “roadmap.” An MRO tech weights “airworthiness directive.” Year bonuses, system path penalties, and multi-match detection surface what matters and bury what doesn’t.

⚙

Config-Driven

One YAML file controls everything: local drives, cloud sources, keywords, categories, scoring rules, media settings, transcription models, and output structure. Customize for your domain—legal, aviation, consulting, research—in minutes.

⚡

Crash-Safe & Incremental

Power failure, drive disconnect, laptop sleep—Trover picks up exactly where it stopped. SQLite WAL mode means zero data loss. Only new or changed files get processed on re-runs. Scan 40,000 files today, plug in the same drive next month, and Trover skips what it already knows.

Beyond Documents

Your photos. Your recordings.
Your meetings. Found.

Ten years of family photos on an old drive. A keynote recording from 2019. A deposition on a thumb drive. Trover doesn’t just find them—it understands what’s inside.

📷

Image Intelligence

Plug in a drive and find every photo your family took in 2016. Trover reads EXIF data—dates, GPS coordinates, camera model—and runs OCR on screenshots, whiteboards, and scanned documents to make the text inside searchable.

Supports: JPG, PNG, HEIC, TIFF, GIF, WebP, BMP, SVG
Extraction: EXIF metadata • GPS geolocation • OCR text recognition

🎧

Audio Transcription

“What did the CEO say in that all-hands?” Trover transcribes audio files locally using Whisper—no cloud, no API costs—and indexes every word. Search your voice memos, podcast recordings, and meeting audio by what was said.

Supports: MP3, WAV, M4A, AAC, FLAC, OGG, WMA
Extraction: Local Whisper transcription • Timestamped segments • Codec metadata

🎥

Video Transcription

Training videos, Zoom recordings, conference talks, family milestones. Trover extracts the audio track, transcribes it locally, and makes the entire recording searchable. “Find the part where we discussed pricing.”

Supports: MP4, MOV, AVI, MKV, WMV, WebM, M4V
Extraction: Audio extraction via ffmpeg • Local Whisper transcription • Duration & resolution metadata

Connect Everything

Local drives. Cloud storage.
Email archives. One pipeline.

Trover doesn’t care where your data lives. Plug in a hard drive or connect a cloud account—every file flows through the same scan, score, dedup, extract, and index pipeline.

✉

Email Intelligence

“What did I email the board in March 2021?” Trover parses your email archives—Apple Mail, Gmail exports, Outlook PSTs, Thunderbird—reconstructs threads, extracts attachments, and makes decades of correspondence searchable.

Formats: Apple Mail (.emlx) • Gmail Takeout (.mbox) • Outlook (.pst/.ost) • Thunderbird • Generic .eml
Features: Thread reconstruction • Attachment extraction • Contact graph • Date-range queries

🔌

Cloud Storage

Connect your Google Drive, Dropbox, OneDrive, iCloud, or Box account and Trover treats it as another drive. No manual exports, no downloads. OAuth read-only access, scan on demand, dedup against your local files automatically.

Integrations: Google Drive • Dropbox • OneDrive • iCloud Drive • Box
Access: OAuth 2.0 • Read-only • Revocable anytime • No data modification

🏗

Enterprise Infrastructure

For teams with data in cloud infrastructure—S3 buckets, GCS objects, Azure Blob containers. Trover connects via IAM or service credentials and indexes everything the same way. Plus Slack exports and Gmail API for live email access.

Platforms: AWS S3 • Google Cloud Storage • Azure Blob • Slack • Gmail API
Auth: IAM credentials • OAuth 2.0 • Service accounts • Encrypted token storage

Cross-Source Deduplication

The same board deck on Google Drive, an old backup HDD, and a Gmail attachment from 2021—Trover resolves all three to a single canonical entry with full provenance. That’s the hybrid model: local and cloud, unified.

For Builders

Building something? Plug in. Ask. Find.

Trover isn’t just a desktop tool—it’s data infrastructure. Your scored, deduplicated document corpus, ready to query from whatever you’re building.

MCP Server · 18 Tools

RAG queries, semantic search, file browsing, collection management, indexing control, and more—directly from Claude, Cursor, or any MCP-compatible AI tool. 5 resources, 4 prompt templates, zero-config discovery via .mcp.json. Works standalone—no FastAPI needed.

REST API · 25+ Endpoints

High-performance FastAPI with full search, scoring, retrieval, deduplication, job management, and config control. 6 integration patterns out of the box: Zapier, Slack, multi-agent, CLI, Python SDK, and cURL. 4 deployment modes including Docker and Tauri desktop.

Integrates with all major cloud platforms—with a proprietary hybrid model for local and cloud data intelligence, backup, and retrieval.

AWS S3

Google Cloud

Azure Blob

Dropbox

Google Drive

OneDrive

iCloud

Slack

Zapier

Docker

FastAPI

Tauri

Builder early access available →

Your files are
buried treasure.

You know it’s in there.
Somewhere.

Scattered Across Everything

Your OS Can’t Find It

Locked Inside Black Boxes

AI Needs the Data First

Three steps. That’s it.

What nobody else does.

No Boundaries

Every Format. Every Medium.

Local-First. Cloud When You Choose.

Relevance Scoring

Config-Driven

Crash-Safe & Incremental

Your photos. Your recordings.
Your meetings. Found.

Image Intelligence

Audio Transcription

Video Transcription

Local drives. Cloud storage.
Email archives. One pipeline.

Email Intelligence

Cloud Storage

Enterprise Infrastructure

Cross-Source Deduplication

Building something? Plug in. Ask. Find.

MCP Server · 18 Tools

REST API · 25+ Endpoints

The expedition starts soon.

Your files are buried treasure.

You know it’s in there.Somewhere.

Scattered Across Everything

Your OS Can’t Find It

Locked Inside Black Boxes

AI Needs the Data First

Three steps. That’s it.

What nobody else does.

No Boundaries

Every Format. Every Medium.

Local-First. Cloud When You Choose.

Relevance Scoring

Config-Driven

Crash-Safe & Incremental

Your photos. Your recordings.Your meetings. Found.

Image Intelligence

Audio Transcription

Video Transcription

Local drives. Cloud storage.Email archives. One pipeline.

Email Intelligence

Cloud Storage

Enterprise Infrastructure

Cross-Source Deduplication

Building something? Plug in. Ask. Find.

MCP Server · 18 Tools

REST API · 25+ Endpoints

The expedition starts soon.

Get Demo Access + Join Beta

You're on the list.

Your files are
buried treasure.

You know it’s in there.
Somewhere.

Your photos. Your recordings.
Your meetings. Found.

Local drives. Cloud storage.
Email archives. One pipeline.