codex-e2e-test

Solid

Run PR-grade real Codex E2E validation through claude-tap, including resume turns, multiple tool calls, optional image input, viewer verification, and screenshot evidence.

Testing & QA · 1,021 stars · 109 forks · Updated today · MIT

Quality Score: 96/100

Stars (20%): 100
Recency (20%): 100
Frontmatter (20%)
Documentation (15%): 100
Issue Health (10%)
License (10%): 100
Description (5%): 100

Skill Content

# Codex E2E Test Skill

Run real end-to-end validation that starts `claude-tap` from local source, connects to the real Codex CLI via OAuth, captures OpenAI Responses API traces, and produces viewer screenshots suitable for PR evidence.

Use this skill for every PR that changes capture, proxying, viewer rendering, session/dashboard behavior, client launch logic, trace ordering, content blocks, tools, token usage, or screenshot/demo assets. If a PR cannot run this flow, state why in the PR and cover the same risk with another real client trace.

## Prerequisites

- `codex` CLI installed (`npm install -g @openai/codex`) and authenticated via OAuth
- Python dev dependencies: `uv sync --extra dev`
- Playwright installed: `uv run playwright install chromium`

Verify OAuth works:

```bash
codex exec "say hello" --dangerously-bypass-approvals-and-sandbox
```

If it fails with token errors, re-authenticate:

```bash
codex auth login
```

## Key Difference from Claude E2E

Codex uses the **OpenAI Responses API** (`/v1/responses`) instead of Anthropic Messages API. With OAuth authentication, the upstream is `https://chatgpt.com/backend-api/codex`, **not** `https://api.openai.com`. The proxy must be told the correct target with `--tap-target`.

## Run a Real Codex E2E Trace

Prefer the resume + multimodal flow below for PR evidence. The simple commands are only smoke tests for checking local setup.

### Simple (single tool call)

```bash
claude-tap --tap-client codex \
  --tap-target h...
```
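The target-selection rule from "Key Difference from Claude E2E" can be sketched as a small shell helper. This is only an illustration, not part of `claude-tap`: the function names `codex_tap_target` and `codex_tap_flags` are hypothetical, and the `api-key` branch is an assumption inferred from the note that OAuth does not use `https://api.openai.com`. Only the `--tap-client` and `--tap-target` flags and the OAuth URL come from the skill text; any remaining arguments of the real command are not reproduced here.

```bash
#!/usr/bin/env bash
# Hypothetical helper: choose the upstream that the proxy should forward to.
# The OAuth URL is taken from the skill text; the "api-key" branch is an
# assumption added for illustration.
codex_tap_target() {
  case "${1:-oauth}" in
    oauth)   printf '%s\n' "https://chatgpt.com/backend-api/codex" ;;
    api-key) printf '%s\n' "https://api.openai.com" ;;
    *)       printf 'unknown auth mode: %s\n' "$1" >&2; return 1 ;;
  esac
}

# Hypothetical helper: print the two proxy flags named in this skill
# for the given auth mode (defaults to OAuth).
codex_tap_flags() {
  local target
  target="$(codex_tap_target "$1")" || return 1
  printf '%s\n' "--tap-client codex --tap-target ${target}"
}
```

Calling `codex_tap_flags oauth` prints the flag pair with the ChatGPT backend URL, which makes it harder to point the proxy at the wrong upstream by accident when switching between authentication modes.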

Details

Author: liaohch3
Repository: liaohch3/claude-tap
Created: 3 months ago
Last Updated: today
Language: Python
License: MIT

Integrates with

OpenAI (AI) · Anthropic (AI) · Playwright (Testing) · npm (DevTools)

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Listed

codex-cli

Run OpenAI Codex CLI for coding tasks and second-opinion audits. Use when a user asks to run/ask/use Codex, says "codex prompt", or wants Claude to delegate a logic/code review to OpenAI models. Covers direct `codex` CLI invocation (exec, review, resume, apply, doctor, mcp), the six reasoning-effort levels (none/minimal/low/medium/high/xhigh), sandbox + dangerous flags, background execution, rate-limit safety, and when to defer to the official OpenAI Codex Claude Code plugin (`codex:rescue`) instead. Preflights with `codex doctor` to read the current default model + surface available updates; never hardcodes model/effort, letting Codex pick its own current best default unless the user explicitly names one.

24 Updated yesterday

georgekhananaev

Testing & QA Solid

real-e2e-test

Run real E2E tests against Claude CLI in pytest and tmux modes

1,021 Updated today

liaohch3

Testing & QA Solid

e2e-test

Run claude-tap end-to-end tests with pytest

1,021 Updated today

liaohch3

AI & Automation Listed

codex

Use when Claude wants a read-only second opinion from OpenAI Codex CLI on: exploring an unfamiliar codebase, reviewing a plan/design .md, or reviewing a PR diff. Codex runs sandboxed read-only (no writes, no prompts, no network).

0 Updated 3 days ago

YoniChechik

Code & Development Listed

codex-review

Have OpenAI Codex review the current branch with documentation research. Use for second-opinion code reviews or when you want cross-AI verification.

41 Updated 2 weeks ago

benjaminshoemaker