document-scanning

Solid

Document discovery, inventory building, and metadata extraction for accessibility audits. Use when scanning folders for Office documents (.docx, .xlsx, .pptx) and PDFs, building file inventories, detecting changes via git diff, or extracting document properties like title, author, and language.

Data & Documents 306 stars 32 forks Updated 3 days ago MIT

Install

View on GitHub

Quality Score: 90/100

Stars 20%
83
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

<!-- CANONICAL SOURCE: .github/skills/document-scanning/SKILL.md -- Edit the canonical source; sync to Gemini via scripts/check-gemini-sync.ps1 --> # Document Scanning ## Supported File Types | Extension | Type | Sub-Agent | |-----------|------|-----------| | .docx | Word document | word-accessibility | | .xlsx | Excel workbook | excel-accessibility | | .pptx | PowerPoint presentation | powerpoint-accessibility | | .pdf | PDF document | pdf-accessibility | ## File Discovery Commands ### PowerShell (Windows) ```powershell # Non-recursive scan Get-ChildItem -Path "<folder>" -File -Include *.docx,*.xlsx,*.pptx,*.pdf # Recursive scan Get-ChildItem -Path "<folder>" -File -Include *.docx,*.xlsx,*.pptx,*.pdf -Recurse | Where-Object { $_.Name -notlike '~$*' -and $_.Name -notlike '*.tmp' -and $_.Name -notlike '*.bak' } | Where-Object { $_.FullName -notmatch '[\\/](\.git|node_modules|__pycache__|\.vscode)[\\/]' } ``` ### Bash (macOS) ```bash # Non-recursive scan find "<folder>" -maxdepth 1 -type f \( -name "*.docx" -o -name "*.xlsx" -o -name "*.pptx" -o -name "*.pdf" \) ! -name "~\$*" # Recursive scan find "<folder>" -type f \( -name "*.docx" -o -name "*.xlsx" -o -name "*.pptx" -o -name "*.pdf" \) \ ! -name "~\$*" ! -name "*.tmp" ! -name "*.bak" \ ! -path "*/.git/*" ! -path "*/node_modules/*" ! -path "*/__pycache__/*" ! -path "*/.vscode/*" ``` ## Delta Detection ### Git-based ```bash # Files changed since last commit git diff --name-only HEAD~1 HEAD -- '*.docx' '*.xls...

Details

Author
Community-Access
Repository
Community-Access/accessibility-agents
Created
3 months ago
Last Updated
3 days ago
Language
JavaScript
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category