DataGator

Portable, forensic-capable indexing and deduplication for filesystems, archives, containers, and raw devices.


What Makes DataGator Unique?

🐊 DataGator Core vs Pro

DataGator Core is built for power users, researchers, and forensic tinkerers who want full control using CLI tools and containers.
Pro adds real-time indexing, dashboards, syncing, and team workflows, built for compliance, operations, and distributed environments.

| Feature | Core (Open Source) | Pro (Enterprise) |
|---|---|---|
| 🧠 Cryptographic FS Indexing | ✅ | ✅ |
| 🔁 Deduplication | Planned (external) | ✅ Chunk-level + overlay |
| 📂 Archive + Disk Image Support | ✅ | ✅ + auto extraction |
| 🧾 Snapshots & Diffs | Manual only | ✅ Visual + timed |
| 📊 HTML Dashboards | ✅ Static export | ✅ Interactive + live |
| 🔐 ACL + Metadata Comparison | ✅ | ✅ + extended tagging |
| 🌐 Web UI | 🚫 | ✅ |
| 📡 P2P Segment Sync | 🚫 | ✅ Gossip protocol |
| 🛠 Watch Mode (live indexing) | 🚫 | ✅ |
| 📝 Audit-Ready Compliance Reports | Manual | ✅ Exportable + signed |
| 🔒 Encryption | BYO only | ✅ Integrated |
| 💼 Support | Community | ✅ SLA + direct access |

Use Cases Across Domains

Comparison: DataGator vs the World

Get Started in 3 Steps

All you need is a bind-mount and Docker or Podman.

docker run --rm \
  -v /your/data:/mnt:ro \
  -v ~/.DataGator.id:/etc/DataGator.id:ro \
  datagator scan /mnt
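
Because the same command line works under either runtime, a small wrapper can pick whichever one is installed. This is only a sketch: the function names `pick_runtime` and `datagator_scan` are hypothetical and not part of DataGator, while the image name and mounts are copied from the command above.

```shell
# Hypothetical helper, not shipped with DataGator: pick a container runtime.
pick_runtime() {
  local rt
  for rt in docker podman; do
    if command -v "$rt" >/dev/null 2>&1; then
      printf '%s\n' "$rt"
      return 0
    fi
  done
  echo "neither docker nor podman found in PATH" >&2
  return 1
}

# Run the quick-start scan against a chosen data directory (read-only mount).
datagator_scan() {
  local data_dir=$1 runtime
  runtime=$(pick_runtime) || return 1
  "$runtime" run --rm \
    -v "$data_dir:/mnt:ro" \
    -v "$HOME/.DataGator.id:/etc/DataGator.id:ro" \
    datagator scan /mnt
}
```

With these functions sourced, `datagator_scan /your/data` would behave like the command above, using Docker when present and Podman otherwise.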

Want to detect changes over time?

./drift_detector.sh
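
The internals of `drift_detector.sh` are not shown here. As a rough illustration of the general idea only, drift can be detected by hashing every file into a manifest at two points in time and comparing the manifests. The function names `make_manifest` and `report_drift` are hypothetical, and the sketch assumes `sha256sum` is available.

```shell
# Illustrative only: this is NOT the contents of drift_detector.sh.
# make_manifest DIR: print "<sha256>  <relative path>" for every regular file.
make_manifest() {
  ( cd "$1" && find . -type f -exec sha256sum {} + | LC_ALL=C sort -k 2 )
}

# report_drift OLD NEW: compare two manifest files and label each changed path.
# A SHA-256 hex digest is 64 characters, so the path starts at column 67.
# File names containing backslashes or newlines are not handled.
report_drift() {
  awk '
    { hash = substr($0, 1, 64); path = substr($0, 67) }
    FILENAME == ARGV[1] { old[path] = hash; next }   # old manifest: remember hashes
    { seen[path] = 1 }                               # new manifest from here on
    !(path in old)      { print "added    " path; next }
    old[path] != hash   { print "modified " path }
    END { for (p in old) if (!(p in seen)) print "removed  " p }
  ' "$1" "$2" | LC_ALL=C sort
}
```

A typical use would be to run `make_manifest /your/data > before.manifest`, repeat the command later into `after.manifest`, and then call `report_drift before.manifest after.manifest`.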

Need a web report?

open __postgresql/dashboard/index.html    # macOS; on Linux, use xdg-open instead

How DataGator Sees Your Filesystem

DataGator doesn't just index file paths; it models your storage with cryptographic and structural fidelity.

Test-Driven from Day One

Every core shell or AWK module in DataGator is paired with a unit test. No test, no merge. Run everything at once:

find scripts -name '*_test.sh' -exec bash {} \;

This catches regressions across updates, formats, and edge cases before they are merged.
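
One caveat with the one-liner: `find ... -exec ... \;` does not pass the exit status of the commands it runs back to the caller, so a failing test does not make the overall command fail. A minimal sketch of a runner that does is shown here, assuming each `*_test.sh` signals failure through a non-zero exit status. The function name `run_all_tests` is hypothetical.

```shell
# Hypothetical runner: execute every *_test.sh under a directory (default:
# scripts) and return non-zero if any of them fails.
# Assumes test script paths contain no newlines.
run_all_tests() {
  find "${1:-scripts}" -name '*_test.sh' | LC_ALL=C sort | {
    failed=0
    total=0
    while IFS= read -r t; do
      total=$((total + 1))
      # stdin is redirected so a test cannot swallow the list of remaining scripts
      if bash "$t" </dev/null; then
        echo "PASS $t"
      else
        echo "FAIL $t"
        failed=$((failed + 1))
      fi
    done
    echo "$failed of $total test scripts failed"
    [ "$failed" -eq 0 ]
  }
}
```

Calling `run_all_tests` from the repository root in a CI job would then fail the job whenever any test script fails.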

Our Philosophy

“We do not store files; we store truth.
Names are not paths; they are sequences of intention.”
-- DataGator Design Memo #001

Edge Cases and Real-World Gotchas

Who Is DataGator For?

What DataGator Replaces

CLI Command Overview

| Command | Description | Key Flags |
|---|---|---|
| scan | Ingest volumes from scan.yaml into indexed DB | --volume, --path |
| condense | Deduplicate, archive, and compress old sessions | --older-than, --archive-out |
| export | Export sessions or host views as JSON/YAML | --session, --out |
| doctor | Run diagnostics on system, mountpoints, and schemas | --details, --json |

Built for Compliance and Courtroom

Beyond Indexing: What Sets DataGator Apart

AI Dataset Integrity & Model Provenance

Extend DataGator with Custom Parsers

Want to analyze a weird format or ancient archive? You can build your own extractor.
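
The extractor interface is not described here, so the following is only a sketch under a stated assumption: that an extractor takes an archive path and prints one `<sha256>  <member path>` line per regular file. That contract and the function name `extract_tar_manifest` are hypothetical, and tar stands in for whatever format you actually need to parse.

```shell
# Hypothetical extractor contract (an assumption, not DataGator's plugin API):
# given a tar archive, print "<sha256>  <member path>" for each file member.
# Re-reads the archive once per member, so it suits small archives only.
extract_tar_manifest() {
  local archive=$1 member
  tar -tf "$archive" | while IFS= read -r member; do
    case $member in
      */) continue ;;   # skip directory entries
    esac
    printf '%s  %s\n' \
      "$(tar -xOf "$archive" "$member" | sha256sum | cut -d ' ' -f 1)" \
      "$member"
  done
}
```

Hashing members as they stream out of the archive avoids unpacking anything to disk, which keeps the source media untouched.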

Future & Expansion Plans