Installation¶
This guide covers installing Data Miner and its dependencies.
Prerequisites¶
- Python 3.12+
- PostgreSQL 15+ (local or remote)
- FFmpeg (for video processing)
Quick Install¶
# Clone repository
git clone https://github.com/tycoai/data_miner.git
cd data_miner
# Install with uv (recommended)
uv pip install -e .
# Or with pip
pip install -e .
PostgreSQL Setup¶
Option 1: Docker Compose (Recommended)¶
The included docker-compose.yaml starts PostgreSQL on port 5432 with default credentials. It also includes:
- Loki - Log aggregation at
http://localhost:3100 - Grafana - Log visualization at
http://localhost:3000 - Adminer - Database admin UI at
http://localhost:8880
# Fix permissions for Loki and Grafana (first time only)
sudo chown -R 10001:10001 data/loki
sudo chown -R 472:472 data/grafana
Option 2: Local PostgreSQL¶
Environment Configuration¶
Copy the example environment file:
Edit .env with your settings:
# Database URL (default works with Docker Compose)
DATABASE_URL=postgresql://postgres:postgres@localhost:5432/data_miner
# Hugging Face token (optional, for private models)
# Get from: https://huggingface.co/settings/tokens
HF_TOKEN=your_token_here
# Debug mode (disables heartbeat - for development only)
# DATA_MINER_DEBUG=1
Initialize Database¶
GPU Setup (Optional)¶
For ML inference, ensure CUDA is available:
The pipeline automatically falls back to CPU if CUDA is unavailable.
Verify Installation¶
Next Steps¶
- Configuration - Set up your pipeline config
- Quickstart - Run your first pipeline