Analysis Artifacts Reference
Canonical definitions for raw and derived analysis artifacts.
Core Raw Artifacts
ExplorationHistory
Per-iteration search trace: selected paths, tactic attempts, outcomes, and terminal flags.
- Implementation:
prover/history.py(produced byprover/mcts.py) - Outputs:
*_history.json
MCTSTree
Serialized MCTS state with visit/value data and node topology.
- Implementation:
prover/mcts.py - Outputs:
*_mcts_tree.json
Search Graph (Lean)
Directed graph of explored goals and tactic expansions.
- Implementation:
prover/proof.py(ProofGraph) - Outputs:
*_graph.jsonwithgraph_kind=search_graph
Proof Graph (External Backends)
Backend-derived proof object graph (not interactive search graph).
- Implementation:
atp/proof_objects.py+ backend parsers - Outputs:
*_graph.jsonwithgraph_kind=proof_graph
Search Trace Graph (Proxy)
Trace-derived graph when true interactive search graph is unavailable.
- Outputs:
*_search_trace_graph.json - Required metadata:
trace_source,trace_completeness
Proof Process Trace (Replay)
Backend-native step trace used to derive proxy trace graphs without collapsing everything into the final proof term.
- Implementation:
atp/coq/process_trace.py - Outputs:
*_process_trace.json.gz
Proof Term DAG (ExprDAG)
Semantic proof-term representation for hashing, diffing, and complexity.
- Implementation:
prover/expr.py,prover/adapters/lean.py - Outputs:
*_proof_term.json.gz
Proof Assembly Trace
Stepwise tactic-to-term assembly log.
- Implementation:
prover/assembly.py,prover/adapters/lean.py - Outputs:
*_assembly.json.gz
GoalCache
Maps checkpoint-scoped mvar_id values to signatures and outcome features.
- Implementation:
prover/goal_cache.py,prover/goal_features.py - Outputs:
goal_cache.json.gz
Aggregate Run Artifacts
Summary
Primary run aggregate payload.
- Output:
summary.json.gz - Includes theorem/intervention summaries, aggregates, optional postprocess enrichments
Failure Analysis
Heuristic failure-mode categorization.
- Implementation:
analysis/failures.py - Output:
failure_analysis.json
Corpus Classification Report
Intervention classification over structural and term-level features.
- Implementation:
analysis/corpus.py - Output:
analysis_report.json
Root Goal Similarity
Cross-theorem root-goal TED matrix + clustering.
- Implementation:
analysis/root_goal_similarity.py,analysis/postprocess_metrics.py - Output:
root_goal_similarity.json
External Statement Similarity
Cross-problem statement TED matrix for external corpora.
- Implementation:
analysis/external_statement_similarity.py,analysis/postprocess_metrics.py - Output:
external_statement_similarity.json
GED Matrix Bundle
Pairwise variant distances per theorem and optional cross-theorem aggregations.
- Outputs:
ged_matrix.json,all_ged_matrices.json
Lake Extraction Surfaces
Cross-run lake extraction consumes run artifacts into normalized tables.
- Implementations:
analysis/lake/reconcile.py,analysis/lake/db.py,analysis/lake/extract_basin.py - See operational reference: Run Lake (Cross-Run DuckDB)
Artifact Integrity Invariants
- Artifact capability should be gated by
run_status.jsoncapability flags. - Goal IDs in Lean artifacts are checkpoint-scoped IDs, not raw Lean IDs.
- Backend-specific artifacts must set correct family metadata (
graph_family,graph_backend,graph_provenance, legacygraph_kind,trace_source,trace_completeness).
For exact field-level JSON schemas, use Log File Schemas.