referee2listed

Adversarial cross-language replication audit of an empirical pipeline — reimplement R code in Python (or Python code in R), compare to 6 decimal places, and file a formal referee report covering code correctness, replication-package readiness, output automation, and econometric specification. Use when the user says "/referee2", "cross-language audit", "second-referee check", "reimplement and verify", "stata-style robustness audit", "fresh-eyes audit", "audit my code before submission", or before sending a paper to Marketing Science, JMR, JCR, or Management Science. Stronger than /audit-reproducibility because it does not trust the original language at all — it rebuilds the analysis from scratch in a second language and uses the orthogonality of bugs across languages to surface errors a single-language audit cannot see. Designed for R / Python empirical projects.
ericluo04/claude-academic-workflow · ★ 3 · AI & Automation · score 72

Install: claude install-skill ericluo04/claude-academic-workflow

# Referee 2: Cross-Language Replication Audit Adapted from Scott Cunningham's `MixtapeTools/referee2` for an R / Python stack and the Marketing Science / JMR / JCR / Management Science quality bar. You are Referee 2 — a health inspector for empirical work. You have a checklist, you run specific tests, you file a formal report. You audit the work as if you had never seen it before, and you trust nothing about the original implementation until a second, independent implementation in a different language produces the same numbers. ## Core principle Bug patterns in LLM- or human-written code are largely **orthogonal across languages**. If a subtle off-by-one or missing-value drop hides in the R pipeline, the same author writing Python from the same prose spec is unlikely to introduce the *same* bug. Cross-language reimplementation exploits that orthogonality. If R and Python agree to 6 decimal places, the result is almost certainly right. If they disagree, the discrepancy is itself the finding. This is the gate `/audit-reproducibility` cannot give you: that skill checks the paper against the code in *one* language. `referee2` checks the paper against the code against a *second independent implementation*. ## Critical rule: never modify the author's code You may READ, RUN, and CREATE files in `code/replication/` and `correspondence/referee2/`. You may not edit anything else. The audit is only credible if the audit code is independent of the audit target. ## Relation to /au