Get Capcat running in 5 minutes with this streamlined setup guide.
The new wrapper system handles all setup automatically:
# Navigate to Capcat directory
cd "Capcat/Application"
# Everything is handled automatically - just run capcat!
./capcat list sources
If you prefer manual control:
# Navigate to Capcat directory
cd "Capcat/Application"
# Create virtual environment (if it does not exist yet)
python3 -m venv venv
# Activate virtual environment
source venv/bin/activate
# Install dependencies
pip install -r requirements.txt
# Run capcat directly
python capcat.py list sources
# Test wrapper system
./capcat list sources
# Should show 25+ sources across 4 categories
# Output: "Tech (4), News (5), Science (3), etc."
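The manual steps above can be made safe to re-run. The following is a minimal sketch, assuming the `venv/` directory sits in the current directory as in the manual steps; `venv_ready` and `ensure_venv` are hypothetical helper names, not part of Capcat.

```shell
# Sketch: create the virtual environment only when it is missing.
# venv_ready and ensure_venv are hypothetical helpers, not Capcat commands.

venv_ready() {
    # A venv created by `python3 -m venv` has bin/activate on Unix-like systems.
    [ -f "$1/bin/activate" ]
}

ensure_venv() {
    local dir="${1:-venv}"
    if venv_ready "$dir"; then
        echo "reusing $dir"
    else
        python3 -m venv "$dir" || return 1
        echo "created $dir"
    fi
}

# Usage (run from Capcat/Application):
#   ensure_venv venv && source venv/bin/activate && pip install -r requirements.txt
```

Checking for `bin/activate` rather than just the directory avoids treating a half-created `venv/` as usable.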
Launch the interactive menu for guided workflows:
./capcat catch
What would you like me to do?
> Catch articles from a bundle of sources
  Catch articles from a list of sources
  Catch from a single source
  Catch a single article by URL
  Manage Sources (add/remove/configure)
  Exit
For comprehensive interactive mode documentation, see the Interactive Mode Guide.
# Download a single article
./capcat single https://example.com/article
# With media files
./capcat single https://bbc.com/news/technology --media
# Tech news bundle (Hacker News + Lobsters + InfoQ)
./capcat bundle tech --count 10
# General news bundle (BBC + CNN + Reuters)
./capcat bundle news --count 15 --media
# Specific sources
./capcat fetch hn,bbc --count 20
# List all available sources
./capcat list sources
# List predefined bundles
./capcat list bundles
# Save detailed logs to file (includes all debug information)
./capcat -L capcat.log bundle tech --count 10
# Verbose console output + file logging
./capcat -V -L debug.log fetch hn --count 15
# Timestamped log files
./capcat -L logs/news-$(date +%Y%m%d-%H%M%S).log bundle news --count 10
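The guide does not say whether Capcat creates a missing log directory, so a cautious script can create it before building the timestamped name. A minimal sketch, where `log_path` is a hypothetical helper and the `logs/` directory and name pattern follow the example above:

```shell
# Sketch: build a timestamped log path and make sure its directory exists.
# log_path is a hypothetical helper, not a Capcat command.
log_path() {
    local prefix="$1" dir="${2:-logs}"
    # Create the directory up front; harmless if it already exists.
    mkdir -p "$dir" || return 1
    printf '%s/%s-%s.log\n' "$dir" "$prefix" "$(date +%Y%m%d-%H%M%S)"
}

# Usage:
#   ./capcat -L "$(log_path news)" bundle news --count 10
```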
# These sources use YAML configuration (no coding required)
./capcat fetch iq,euronews,straitstimes --count 5
# These sources have custom Python implementations
./capcat fetch hn,bbc,techcrunch --count 5
# Images only (default)
./capcat bundle tech --count 5
# All media types (images + videos + documents)
./capcat bundle tech --count 5 --media
# Run comprehensive source test
python test_comprehensive_sources.py
# Quick individual test
./capcat fetch hn --count 3
./capcat list sources
Capcat includes automatic protection against problematic sites:
# Example: consumed.today attempted to download 471 images (103MB)
# Automatically blocked: "LINK_AGGREGATOR detected"
# Protection saved: 103MB of unwanted downloads
# Without --media flag: Standard protection limits apply
./capcat single https://example.com/article
# With --media flag: Bypass limits for legitimate sites (up to 500MB)
./capcat single https://example.com/article --media
# Note: --media flag ignored for blocked aggregator sites
Capcat uses a two-layer wrapper system for reliability:
capcat (bash wrapper) -> run_capcat.py (Python wrapper) -> capcat.py
# Primary method (recommended)
./capcat command args
# Alternative method (direct Python)
python3 run_capcat.py command args
# Manual method (requires venv activation)
source venv/bin/activate && python capcat.py command args
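The three launch methods above form a natural fallback order. The sketch below picks the first one that looks available in a given directory. `pick_launcher` is a hypothetical helper, and it only prints the command prefix rather than running anything.

```shell
# Sketch: choose a launch method in the order listed above.
# pick_launcher is a hypothetical helper, not part of Capcat.
pick_launcher() {
    local dir="${1:-.}"
    if [ -x "$dir/capcat" ]; then
        # Primary method: the executable wrapper script.
        echo "./capcat"
    elif [ -f "$dir/run_capcat.py" ]; then
        # Alternative: call the Python wrapper directly.
        echo "python3 run_capcat.py"
    elif [ -f "$dir/capcat.py" ] && [ -f "$dir/venv/bin/activate" ]; then
        # Manual method: needs an existing virtual environment.
        echo "source venv/bin/activate && python capcat.py"
    else
        echo "no Capcat entry point found in $dir" >&2
        return 1
    fi
}

# Usage (run from Capcat/Application):
#   pick_launcher .
```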
# If bash wrapper fails, use Python wrapper directly
python3 run_capcat.py list sources
# Check wrapper system health
./capcat --help
# Let wrapper handle dependencies automatically
./capcat list sources
# Or manually activate environment (advanced users)
source venv/bin/activate
# Remove and recreate (wrapper will rebuild)
rm -rf venv
./capcat list sources
# Some sources may have anti-bot protection (normal)
# Success rate of 90% (14-16/25) is expected
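To see which sources are currently failing, a short loop can fetch a few articles from each and tally the results. This is a sketch under one loud assumption: that the command exits non-zero when a fetch fails, which this guide does not state. `smoke_test` is a hypothetical helper.

```shell
# Sketch: fetch 3 articles per source and count how many runs succeed.
# Assumes a failed fetch returns a non-zero exit status (not confirmed here).
smoke_test() {
    local runner="$1" sources="$2" ok=0 total=0 src
    # Split the comma-separated source list into words.
    for src in $(echo "$sources" | tr ',' ' '); do
        total=$((total + 1))
        if "$runner" fetch "$src" --count 3 >/dev/null 2>&1; then
            ok=$((ok + 1))
        else
            echo "failed: $src" >&2
        fi
    done
    echo "$ok/$total sources succeeded"
}

# Usage:
#   smoke_test ./capcat hn,bbc,iq
```

Failed source names go to stderr, so the summary line on stdout stays easy to capture in a script.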
Start with a bundle: bundle tech, bundle news
Use --count 5 to test new sources
Use -L logfile.log for troubleshooting and debugging