oma-pdf
Convert PDF files to Markdown using opendataloader-pdf. Extracts text, tables, headings, lists, and images with correct reading order. Use for PDF parsing, PDF to Markdown conversion, document extraction, and AI-ready data preparation.
Quality Score: 93/100
Details
- Author: first-fluke
- Repository: first-fluke/oh-my-agent
- Created: 4 months ago
- Last Updated: today
- Language: TypeScript
- License: MIT
Similar Skills
Semantically similar based on skill content, not just the same category
pdf-to-md
Convert any PDF (or DOCX/PPTX/XLSX/image) to clean Markdown. For scientific papers, produce the canonical paper-to-md bundle (Markdown plus section_audit.json and article.json) using the remote OCR API when an OCR key is available, or LiteParse v2 locally when it is not. For any non-paper PDF, defer to a fast, local, no-API-key LiteParse v2 conversion. Use when turning a PDF or manuscript into Markdown, extracting article structure, or preparing input for csag-extraction.
office-to-md
Convert Office documents (Word, Excel, PowerPoint, PDF) to Markdown format. ONLY use this skill when the user explicitly requests to CONVERT, TRANSFORM or PARSE a specific office file into Markdown. Do NOT trigger for general questions, documentation reading, or discussions about files.
ocr-and-documents
Extract text from PDFs and scanned documents. Use web_extract for remote URLs, pymupdf for local text-based PDFs, marker-pdf for OCR/scanned docs. For DOCX use python-docx, for PPTX see the powerpoint skill.
pdf-folder-to-markdown
Bulk-convert every PDF in a folder (and its subfolders) to Markdown using a generated Python script, and optionally generate a companion script to delete the original PDFs afterwards. Triggers on requests like 'convert all PDFs in this folder to markdown', 'bulk PDF to MD', 'extract text from every PDF in <folder>', 'make markdown files from these PDFs', 'turn all PDFs into .md', or follow-ons like 'now delete the PDFs' or 'clean up non-markdown files'. Use whenever the user wants to process a folder full of PDFs (often with per-item subfolders) into Markdown.
oma-hwp
Convert HWP / HWPX / HWPML files to Markdown using kordoc. Extracts text, headings, tables, lists, images, footnotes, and hyperlinks. Use for Korean word processor files (Hangul), government documents, and AI-ready data preparation.