community web

Article Extractor

Extract clean article text and metadata from any web page with heuristics for paywalls and author detection.

58/100 240 GitHub stars Updated Feb 4, 2026

#scraping#readability#articles

What it does

Extract clean article text and metadata from any web page with heuristics for paywalls and author detection.

Same corner

Up-to-date library documentation for Cursor and Claude prompts — never get hallucinated APIs from stale training data.

Microsoft's official browser automation via structured accessibility snapshots — no screenshots required, deterministic results.

Powerful web scraping and search for LLM clients with markdown conversion, structured extraction, and full-site crawl.

Coming soon

The submission portal opens next sprint. For now, ship to GitHub and tag claude-skill.

Free PDF

The full Skills Bank, organized by category, with install commands and 1-line verdicts.