pdflux-pdf2markdown

Convert unstructured documents into LLM-ready structured data. Supports PDF, Word, PPT, and images; extracts paragraphs, formulas, tables, charts, and other elements in one step; generates up to 8 levels of headings; and outputs Markdown organized in reading order. Useful for field extraction, comparison and validation, knowledge retrieval, and intelligent Q&A.
PaodingAI/skills · ★ 16 · Data & Documents · score 71

Install: claude install-skill PaodingAI/skills

# PDFlux-PDF2Markdown

Run a JavaScript workflow that submits a single local file to the `pdflux` synchronous API through PDRouter (`POST /openapi/{serviceCode}/file/markdown`) and prints the response result in one step. This skill only targets the latest OpenAPI flow and does not support deprecated legacy routes.

## Installation

```bash
npx skills add PaodingAI/skills
```

## Usage

```bash
node skills/pdflux-pdf2markdown/scripts/upload_to_markdown.js <local-file-path> [output-markdown-path]
```

## Execution Constraints

- You must invoke `scripts/upload_to_markdown.js` directly. Do not reimplement the API flow yourself.
- The behavior contract below explains what the script does, what it outputs, and when to use it. It is not a manual checklist for the model to imitate step by step.
- Even if the task is only to extract tables, read fields, inspect body text, or prepare input for later scripts, you must run this script first and continue from the generated Markdown.
- Only inspect or modify the script implementation when the script itself is unavailable, failing, or needs a fix. Do not bypass it during normal use.

## When to Use

- Use this skill when the user wants to parse a document, retrieve specific document content, or extract tables from a document.
- Use this skill when the user says things like "convert to Markdown", "output Markdown", "export Markdown", or "extract Markdown", and return the Markdown content directly.
- When later work depends on the document c