Archive Articles with Confidence. Share without Limits.

A command-line tool that tackles content preservation through ethical scraping.

$ ./capcat bundle tech --count 30

Fetching from 3 sources in bundle 'tech'
◯ STARTING HACKER NEWS (30 ITEMS)
◯ STARTING LOBSTERS (30 ITEMS)
◯ STARTING IEEE SPECTRUM (30 ITEMS)

✓ 90 articles saved successfully
Saved to: ../News/news_2025-11-24/

Defined with natural language processing. Built with large language models.
A Minimum Viable Product.

The Problem: Information Overload and Inefficient Recall

Browser Inefficiencies

Twenty tabs open while researching. Days later, you can't remember which article contained the specific information you need.

Lost Context

Bookmarks give you link lists but no context. Search history shows URLs but not content summaries.

Disappearing Content

A Harvard Law School study shows that 25% of links become inaccessible. When content disappears, the recall problem compounds.

Two Complete Interfaces, One Powerful Backend

Capcat offers two modes optimized for different workflows, both sharing a unified backend for consistent, reliable results.

Command-Line Mode

Fast, scriptable automation for power users. Perfect for daily routines, cron jobs, and integration with existing workflows.

Interactive Menu (TUI)

Visual, guided exploration for discovering sources and testing workflows. No memorization required: all options are visible at once.

Bulk RSS Fetching

Archive from multiple sources simultaneously. Use predefined bundles (tech, news, science, AI) or create custom selections.
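To make the idea of count-limited RSS fetching concrete, here is a minimal sketch using only the Python standard library. It is not Capcat's implementation: the function name `parse_rss`, the `count` parameter (mirroring the `--count` flag shown earlier), and the sample feed are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical RSS 2.0 feed used only for this illustration.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example Feed</title>
    <item><title>First post</title><link>https://example.com/1</link></item>
    <item><title>Second post</title><link>https://example.com/2</link></item>
    <item><title>Third post</title><link>https://example.com/3</link></item>
  </channel>
</rss>"""


def parse_rss(xml_text: str, count: int) -> list[tuple[str, str]]:
    """Return up to `count` (title, link) pairs from an RSS 2.0 document."""
    root = ET.fromstring(xml_text)
    items: list[tuple[str, str]] = []
    for item in root.iter("item"):
        if len(items) >= count:
            break
        title = item.findtext("title", default="").strip()
        link = item.findtext("link", default="").strip()
        if link:  # skip entries that have no link to archive
            items.append((title, link))
    return items


if __name__ == "__main__":
    for title, link in parse_rss(SAMPLE_FEED, count=2):
        print(f"{title}: {link}")
```

A real fetcher would download the feed over HTTP first; the sketch starts from the XML text so it can run offline.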

Local Markdown Storage

Permanent archives in Markdown format. Integrate seamlessly with Obsidian, Notion, or any note-taking system.

HTML Generation

Optional HTML output with customizable themes. Color-coded sources, visual hierarchy, shareable archives.

Offline Accessibility

Once fetched, content remains available locally, with no dependency on live websites or internet connectivity.

How Capcat Works

Choose Your Interface

Start with CLI for speed or TUI for visual exploration. Both provide complete functionality.

Select Sources

Pick from 11 configured sources (Hacker News, BBC, Guardian, Nature, etc.) or use predefined bundles.

Parallel Fetching

Articles download simultaneously from all sources. 3× faster than sequential processing.
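One common way to fetch from several sources at once is a thread pool. The sketch below shows the general pattern, not Capcat's actual code: `fetch_all` and `fake_fetch` are hypothetical names, and `fake_fetch` stands in for a real network call so the example runs offline.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable


def fetch_all(
    sources: list[str], fetch: Callable[[str], list[str]]
) -> dict[str, list[str]]:
    """Run `fetch` for every source concurrently and group results by source."""
    if not sources:
        return {}
    with ThreadPoolExecutor(max_workers=len(sources)) as pool:
        # pool.map preserves input order, so results line up with sources.
        results = list(pool.map(fetch, sources))
    return dict(zip(sources, results))


def fake_fetch(source: str) -> list[str]:
    """Stand-in for a network call; returns placeholder article titles."""
    return [f"{source} article {i}" for i in range(1, 4)]


if __name__ == "__main__":
    fetched = fetch_all(["Hacker News", "Lobsters", "IEEE Spectrum"], fake_fetch)
    for source, articles in fetched.items():
        print(source, len(articles))
```

Threads suit this workload because the time is spent waiting on the network rather than on computation. The actual speedup depends on the number of sources and how quickly each one responds.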

Organized Storage

Automatic date-based folder structure. Markdown files with front matter, images preserved, optional HTML.
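As a rough sketch of what date-based storage with front matter can look like: the folder name follows the `news_2025-11-24` pattern from the sample output above, while the front matter fields (`title`, `source`, `url`), the function names, and the file naming are assumptions for illustration, not Capcat's documented layout.

```python
import json
import tempfile
from datetime import date
from pathlib import Path


def archive_dir(base: Path, day: date) -> Path:
    """Build a folder path such as base/news_2025-11-24."""
    return base / f"news_{day.isoformat()}"


def render_markdown(title: str, source: str, url: str, body: str) -> str:
    """Render an article as Markdown with a small front matter block."""
    front = "\n".join(
        [
            "---",
            f"title: {json.dumps(title)}",  # JSON quoting keeps quotes in titles safe
            f"source: {json.dumps(source)}",
            f"url: {url}",
            "---",
        ]
    )
    return f"{front}\n\n# {title}\n\n{body}\n"


def save_article(base: Path, day: date, slug: str, text: str) -> Path:
    """Write the article into the dated folder, creating it if needed."""
    folder = archive_dir(base, day)
    folder.mkdir(parents=True, exist_ok=True)
    path = folder / f"{slug}.md"
    path.write_text(text, encoding="utf-8")
    return path


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        text = render_markdown(
            "Example", "Hacker News", "https://example.com/1", "Body text."
        )
        print(save_article(Path(tmp), date(2025, 11, 24), "example", text))
```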

Search & Recall

Local searchability across your entire archive. Visual scanning with HTML themes.
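Because the archive is plain Markdown on disk, any text search tool works on it. The following is a minimal sketch of a case-insensitive search across an archive folder; `search_archive` is a hypothetical helper written for this page, not a Capcat command.

```python
import tempfile
from pathlib import Path


def search_archive(root: Path, term: str) -> list[tuple[Path, int, str]]:
    """Return (file, line number, line) for every line containing `term`."""
    needle = term.lower()
    hits: list[tuple[Path, int, str]] = []
    for path in sorted(root.rglob("*.md")):
        text = path.read_text(encoding="utf-8", errors="replace")
        for number, line in enumerate(text.splitlines(), start=1):
            if needle in line.lower():
                hits.append((path, number, line.strip()))
    return hits


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        folder = Path(tmp) / "news_2025-11-24"
        folder.mkdir()
        (folder / "example.md").write_text(
            "# Example\n\nLink rot is real.\n", encoding="utf-8"
        )
        for path, number, line in search_archive(Path(tmp), "link rot"):
            print(f"{path.name}:{number}: {line}")
```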

Why Two Modes?

TUI Mode: Discovery & Learning

Visual source browsing
Checkbox multi-selection
Guided RSS source wizard
Confirmation summaries
Multi-level TUI menu

Best for: Ease of use, one-off tasks, testing new sources.

CLI Mode: Automation & Speed

Fast commands from muscle memory
Shell aliases and scripts
Custom article counts
Verbose debugging flags
Exit codes for error handling

Best for: Daily automation, power users, developers integrating with existing workflows.
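For scripted use, a wrapper can run the CLI and react to its exit status. The sketch below assumes the usual convention that a zero status means success and anything else means failure; the specific codes Capcat returns are not listed here, so treat that as an assumption. `run_command` is a hypothetical helper, and the command line is the one from the example at the top of this page.

```python
import subprocess
import sys


def run_command(cmd: list[str]) -> int:
    """Run a command and return its exit status (127 if it cannot be found)."""
    try:
        completed = subprocess.run(cmd, check=False)
    except FileNotFoundError:
        # Mirror the shell convention for "command not found".
        return 127
    return completed.returncode


def main() -> int:
    # Assumes the script is started from the cloned repository root.
    code = run_command(["./capcat", "bundle", "tech", "--count", "30"])
    if code != 0:
        print(f"capcat exited with status {code}", file=sys.stderr)
    return code


if __name__ == "__main__":
    sys.exit(main())
```

Returning the status from `main` lets a cron job or a calling script detect the failure in turn.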

Tutorials & Documentation

Tutorials for mastering both CLI and interactive menu modes.

Sources Ready to Archive

Preconfigured sources across technology, news, science, and AI. You can also add your own RSS feeds.

Tech Pro

Hacker News
Lobsters

Tech

IEEE Spectrum
Mashable

News

BBC News
The Guardian

Science

Nature
Scientific American

Ready to Start Archiving?

1. Clone Repository

git clone https://github.com/stayukasabov/capcat.git

2. Setup Dependencies

./scripts/fix_dependencies.sh

3. Start Archiving

./capcat catch
