detecting-deepfake-audio-in-vishing-attacks

Featured

Detects AI-generated deepfake audio used in voice phishing (vishing) attacks by extracting spectral features (MFCC, spectral centroid, spectral contrast, zero-crossing rate) and classifying samples with machine learning models. Supports batch analysis of audio files, generates confidence scores, and produces forensic reports. Activates for requests involving deepfake voice detection, vishing investigation, AI-generated speech analysis, voice cloning detection, or audio authenticity verification.

AI & Automation 12,642 stars 1468 forks Updated today Apache-2.0

Install

Quality Score: 99/100

Stars 20%

100

Recency 20%

100

Frontmatter 20%

70

Documentation 15%

100

Issue Health 10%

50

License 10%

100

Description 5%

100

Skill Content

# Detecting Deepfake Audio in Vishing Attacks ## When to Use - A suspected vishing call used an AI-cloned executive voice to authorize a wire transfer - Security operations received a voicemail that sounds like the CEO but the tone seems off - Incident response needs to determine whether a recorded phone call contains synthetic speech - Fraud investigation requires forensic proof that audio was AI-generated - Red team exercises use voice cloning and blue team needs detection capability **Do not use** for text-based phishing (email/SMS); use email header analysis or URL detonation tools instead. ## Prerequisites - Python 3.9+ with librosa, numpy, scikit-learn, and scipy installed - Audio samples in WAV, MP3, or FLAC format (mono or stereo, any sample rate) - Reference corpus of known genuine voice samples for the targeted individual (optional but improves accuracy) - FFmpeg installed for audio format conversion (librosa dependency) - Minimum 3 seconds of audio for reliable feature extraction ## Workflow ### Step 1: Audio Preprocessing Normalize and prepare audio samples for feature extraction: ```python import librosa import numpy as np # Load audio, resample to 16kHz mono y, sr = librosa.load("suspect_call.wav", sr=16000, mono=True) # Trim silence from beginning and end y_trimmed, _ = librosa.effects.trim(y, top_db=25) # Normalize amplitude to [-1, 1] y_norm = y_trimmed / np.max(np.abs(y_trimmed)) ``` Audio preprocessing ensures consistent feature extraction acro...

Details

Author: mukul975
Repository: mukul975/Anthropic-Cybersecurity-Skills
Created: 3 months ago
Last Updated: today
Language: Python
License: Apache-2.0

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

conducting-social-engineering-pretext-call

Plan and execute authorized vishing (voice phishing) pretext calls to assess employee susceptibility to social engineering and evaluate security awareness controls.

12,642 Updated today

AI & Automation Featured

detecting-business-email-compromise-with-ai

Deploy AI and NLP-powered detection systems to identify business email compromise attacks by analyzing writing style, behavioral patterns, and contextual anomalies that evade traditional rule-based filters.

12,642 Updated today

AI & Automation Listed

audio-quality-check

Analyze audio recording quality - echo detection, loudness, speech intelligibility, SNR, spectral analysis. Use when the user wants to check a recording's quality, detect echo or duplication in audio files, measure speech clarity, compare original vs processed audio, diagnose why a recording sounds bad, or analyze audio tracks from Blackbox or any call recording app. Triggers on audio quality, recording analysis, echo detection, check recording, sound quality, analyze audio, speech quality, PESQ, STOI, loudness, SNR, audio diagnostics, recording sounds bad, echo in recording, audio duplication.

29 Updated 5 days ago

AI & Automation Featured

performing-steganography-detection

Detect and extract hidden data embedded in images, audio, and other media files using steganalysis tools to uncover covert communication channels.

12,642 Updated today

AI & Automation Listed

voice-auth

Writing voice authentication and AI-smell elimination skill. Activates when the user wants AI-generated writing to sound like them specifically — not generic, not polished-AI, not corporate bland. Analyzes a writing sample to fingerprint sentence length, vocabulary density, structural habits, punctuation patterns, and filler tendencies. Generates and rewrites content matching that fingerprint. Kills AI clichés, passive construction, hollow openers, and corporate hedges. Iterates by adjusting the voice model, not just the output. Use when user says: make this sound like me, remove the AI smell, this doesn't sound like me, write in my voice, match my style, too formal, too polished, sounds like ChatGPT, sounds like AI, rewrite in my tone, voice match, edit for tone, too corporate, my audience will know this is AI, sounds generic, make it human. Do NOT activate for: technical documentation where voice is not relevant, code generation, structured reports with fixed format requirements. First response: "Voice Auth

2 Updated 4 days ago