Configuration
config.yaml
yaml
# Subreddits to monitor
subreddits:
- productivity
- Entrepreneur
- smallbusiness
- freelance
- SaaS
- startups
# Crawl schedule (cron syntax)
schedule: "0 8 * * *" # Daily at 8 AM UTC
# How far back on first crawl (days)
initial_lookback_days: 7
# Minimum score to push to sheet
min_relevance_score: 0.6
# Google Sheet ID (from URL)
google_sheet_id: "your-sheet-id"
# Sheet tab name
sheet_name: "Reddit Leads"Google Sheet Columns
| Column | Content |
|---|---|
| A: Date | Crawl date |
| B: Subreddit | r/subreddit |
| C: Title | Post title |
| D: Body Preview | First 200 chars |
| E: Author | u/username |
| F: URL | Reddit link |
| G: Comments | Comment count |
| H: Score | Relevance (0-1) |
| I: Reason | AI explanation |
| J: Recommended Comment | Suggested reply |
| K: Status | Your tracking (manual) |
CLI Commands
bash
# Manual crawl
python -m app.cli crawl
# Crawl specific subreddit
python -m app.cli crawl --subreddit productivity
# Test connections
python -m app.cli test-reddit
python -m app.cli test-sheetsCustomizing AI Classification
Edit app/prompts/classify.txt to adjust what posts are considered relevant.