phd-deepread

Solid

Guided workflow for processing academic PDFs into structured literature notes using a Text-First decision tree (PyMuPDF + Tesseract OCR) and Claude-assisted analysis. Suited to literature review and note-taking in Obsidian.

Data & Documents | 47 stars | 2 forks | Updated today | MIT

Quality Score: 86/100

Stars (20%): 56
Recency (20%): 100
Frontmatter (20%): 70
Documentation (15%): 100
Issue Health (10%): 80
License (10%): 100
Description (5%): 100

Skill Content

# PhD Deep Read Workflow

This skill implements a sophisticated **PhD Deep Read Workflow** that processes academic PDFs into structured literature notes for Obsidian using a hybrid decision-tree approach. The workflow combines:

1. **Text-First PDF Extraction**: Decision-tree using PyMuPDF (fast text extraction) for searchable PDFs and Tesseract OCR fallback for scanned/complex pages
2. **Structured Note Generation**: Template-driven generation of comprehensive academic literature notes
3. **Critical Thinking Canvas**: JSON Canvas files with 9 interconnected nodes for deep critical analysis
4. **Workflow Automation**: Scripts to automate the 4-stage pipeline

## When to Use This Skill

Activate when the user:

- Wants to process academic PDFs into structured literature notes
- Needs to extract text from PDFs with complex layouts or tables
- Wants to generate Obsidian-compatible notes with YAML frontmatter and Dataview callouts
- Needs critical thinking canvases for deep analysis of papers
- Has a collection of PDFs to process in batch

## Commands

The skill provides these main commands:

- `setup`: Setup environment and install required tools (PyMuPDF, Tesseract OCR)
- `extract`: Extract text and images from PDFs using Text-First decision tree (PyMuPDF + Tesseract OCR)
- `generate`: Generate structured literature notes using .clauderules template
- `canvas`: Create JSON Canvas files for critical thinking with 9 interconnected nodes
- `run`: Run full workflow automation (extra...
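The Text-First decision tree described in the skill content can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the function names `choose_text` and `extract_pdf`, the `MIN_CHARS` threshold, and the 300 dpi render setting are all assumptions made for this example. The idea is to accept PyMuPDF's embedded text when a page yields enough of it, and to render the page and run Tesseract only when it does not.

```python
from typing import Callable, List, Tuple

# Hypothetical threshold: pages with fewer extracted characters are treated
# as scanned or image-only and sent to OCR. The real skill may use another rule.
MIN_CHARS = 50


def choose_text(
    native_text: str,
    ocr: Callable[[], str],
    min_chars: int = MIN_CHARS,
) -> Tuple[str, str]:
    """Decide per page: keep the embedded text, or fall back to OCR.

    `ocr` is a zero-argument callable so the expensive OCR step only runs
    when the embedded text is too short.
    """
    cleaned = native_text.strip()
    if len(cleaned) >= min_chars:
        return "text", cleaned
    return "ocr", ocr().strip()


def extract_pdf(path: str, min_chars: int = MIN_CHARS, dpi: int = 300) -> List[Tuple[str, str]]:
    """Apply the decision to every page of a PDF (assumed wiring).

    Requires PyMuPDF, pytesseract, Pillow, and a Tesseract binary on PATH.
    Imports are local so the decision logic above works without them.
    """
    import io

    import fitz  # PyMuPDF
    import pytesseract
    from PIL import Image

    results = []
    with fitz.open(path) as doc:
        for page in doc:
            def ocr_page(page=page) -> str:
                # Render the page to a PNG in memory, then OCR the image.
                pix = page.get_pixmap(dpi=dpi)
                image = Image.open(io.BytesIO(pix.tobytes("png")))
                return pytesseract.image_to_string(image)

            results.append(choose_text(page.get_text(), ocr_page, min_chars))
    return results
```

Calling `extract_pdf("paper.pdf")` (a placeholder file name) would return one `(method, text)` pair per page, where `method` is `"text"` or `"ocr"`. Passing the OCR step as a callable keeps the fast path cheap for searchable PDFs.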

Details

Author: heleninsights-dot
Repository: heleninsights-dot/phd-deepread-workflow
Created: 2 months ago
Last Updated: today
Language: Python
License: MIT

Similar Skills

Semantically similar based on skill content — not just same category

Data & Documents Solid

markdrop

Professional AI skill and usage instructions for the Markdrop package, a Python tool for converting PDFs to Markdown/HTML with AI-powered image/table descriptions.

204 Updated 2 months ago
shoryasethia
Data & Documents Featured

pdf

Use this skill whenever the user wants to do anything with PDF files. This includes reading or extracting text/tables from PDFs, combining or merging multiple PDFs into one, splitting PDFs apart, rotating pages, adding watermarks, creating new PDFs, filling PDF forms, encrypting/decrypting PDFs, extracting images, and OCR on scanned PDFs to make them searchable. If the user mentions a .pdf file or asks to produce one, use this skill.

14,116 Updated today
eigent-ai
AI & Automation Listed

notes-processor

Process and organize notes.md using the LNO (Leverage/Neutral/Overhead) framework for systematic task prioritization. Use when: (1) adding action items to notes.md, (2) organizing daily tasks, (3) categorizing work by ROI potential, (4) carrying forward incomplete tasks to new date sections, (5) routing brain-specific notes, (6) user requests "update notes" or "process my notes", or (7) synchronizing action items from CLAUDE.md files to daily tracking. Applies Shreyas Doshi's ROI-based prioritization framework to maintain strategic focus in daily execution.

23 Updated 3 months ago
samarv
Data & Documents Solid

writing-skills

Use when creating new skills, editing existing skills, or verifying skills work before deployment

36 Updated 3 weeks ago
tylerwind