← ClaudeAtlas

html-to-mdlisted

Convert HTML documents to clean Markdown via pandoc
fabioc-aloha/Alex_ACT_Edition · ★ 0 · Data & Documents · score 73
Install: claude install-skill fabioc-aloha/Alex_ACT_Edition
# Html To Md Convert HTML documents to clean Markdown. Strips inline styles, scripts, and tracking pixels while preserving semantic structure. ## Quick Start ```bash node .github/skills/html-to-md/scripts/html-to-md.cjs page.html page.md ``` ## What's preserved - Headings, paragraphs, lists, blockquotes - Tables (when structure is regular) - Links and inline code - Image references (URLs kept as-is) - Emphasis (bold, italic, strikethrough) ## What's dropped - Inline `style` attributes - `<script>` and `<style>` blocks - Tracking pixels and analytics tags - Most `<div>`/`<span>` wrappers (semantic content preserved) ## Optional flags | Flag | Effect | |---|---| | `--download-images` | Fetch referenced images to a local `images/` folder | | `--wrap N` | Line wrap width (default: 80) | ## Post-conversion - Run [lint-clean-markdown](../lint-clean-markdown/SKILL.md) over the output to fix heading hierarchy and list spacing. - HTML often has multiple `<h1>` tags; Markdown wants exactly one. ## Related - [docx-to-md](../docx-to-md/SKILL.md) — Word source - [lint-clean-markdown](../lint-clean-markdown/SKILL.md) — clean up the result ## Would Revise If Revisit this skill by **2026-08-26** (90 days) or sooner if any of the following fires: pandoc upstream changes html-to-md behavior in a way that breaks the documented flag semantics; the `--download-images` flow fails on a real source the user runs through it; or `lint-clean-markdown` post-processing stops being the righ