SPECTER Labs
Technical Docs / Contracts

DeepSeek Provider Settings

Runtime settings and reproducibility constraints for DeepSeekTacticProvider.

Model and Loading Contract

  • Model ID: l3lab/ntp-mathlib-context-deepseek-coder-1.3b
  • Runtime: local MLX (mlx_lm)
  • Expected converted model location (default):
    • $SPECTER_ARTIFACT_ROOT/wonton-soup/models/ntp-mathlib-deepseek-1.3b-mlx-bf16

Implementation: prover/providers/deepseek.py (_resolve_model_path).
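A minimal sketch of that resolution logic follows. Only the $SPECTER_ARTIFACT_ROOT environment variable and the default subpath come from this document; the function name, error types, and messages are illustrative, not the actual implementation in _resolve_model_path.

```python
import os
from pathlib import Path

# Default subpath under $SPECTER_ARTIFACT_ROOT, as documented above.
DEFAULT_SUBPATH = "wonton-soup/models/ntp-mathlib-deepseek-1.3b-mlx-bf16"

def resolve_model_path() -> Path:
    """Hypothetical sketch: locate the converted MLX model on disk."""
    root = os.environ.get("SPECTER_ARTIFACT_ROOT")
    if root is None:
        raise RuntimeError(
            "SPECTER_ARTIFACT_ROOT is not set; cannot locate the converted MLX model"
        )
    path = Path(root) / DEFAULT_SUBPATH
    if not path.exists():
        raise FileNotFoundError(f"Converted MLX model not found at {path}")
    return path
```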

Current Decoding Settings

The current code path uses stochastic sampling:

  • sampler: make_sampler(temp=0.6, top_p=0.9)
  • max_new_tokens = 64
  • per-request generation count: min(n, self._num_samples)
  • default num_samples = 10
  • MLX decoding uses batch_generate(...) over repeated prompt copies, not a Python-side serial loop
  • prompt tokens are truncated to the 2048-token window by keeping the prefix and the suffix and dropping the middle when the assembled prompt is too long

Implementation: prover/providers/deepseek.py (_generate_tactics).
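The truncation rule in the last bullet can be sketched as below. The function name and the even prefix/suffix split are assumptions for illustration; only the 2048-token window and the keep-both-ends behavior come from this document.

```python
def truncate_middle(tokens: list[int], max_tokens: int = 2048,
                    prefix_frac: float = 0.5) -> list[int]:
    """Illustrative sketch: when the assembled prompt exceeds the context
    window, keep the leading prefix and trailing suffix and drop the middle.
    The 50/50 split (prefix_frac) is an assumption, not the documented value."""
    if len(tokens) <= max_tokens:
        return tokens
    keep_prefix = int(max_tokens * prefix_frac)
    keep_suffix = max_tokens - keep_prefix
    return tokens[:keep_prefix] + tokens[-keep_suffix:]
```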

Prompt Contract

The provider prompt includes:

  • system instruction block
  • [CTX] theorem/import context plus recent proof steps from the current assembly trace
  • [STATE] pretty goal state
  • [TAC] model completion region

Tactic extraction prefers the text before [/TAC], falling back to the first non-empty line when the terminator is absent.
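That extraction rule can be sketched as follows; the function name is hypothetical, and only the [/TAC] preference and the first-non-empty-line fallback come from this document.

```python
def extract_tactic(completion: str) -> str:
    """Illustrative sketch of tactic extraction from a model completion."""
    # Prefer everything before the [/TAC] terminator, if present.
    if "[/TAC]" in completion:
        return completion.split("[/TAC]", 1)[0].strip()
    # Otherwise fall back to the first non-empty line.
    for line in completion.splitlines():
        if line.strip():
            return line.strip()
    return ""
```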

Caching and Batching Behavior

  • LRU-style in-memory prompt cache (cache_size default 100)
  • batched request drain with batch_max_size = 8
  • latency stats tracked (last_latency_ms, avg_latency_ms)

Caching improves throughput but can reduce effective stochastic variation for repeated prompts, since a cache hit replays previously sampled tactics instead of drawing fresh samples.
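A minimal LRU-style cache with the documented default capacity of 100 could look like the sketch below; the class and method names are assumptions, not the provider's actual API.

```python
from collections import OrderedDict

class PromptCache:
    """Illustrative LRU-style prompt cache (default capacity 100, matching
    the cache_size default described above). Not the provider's real class."""

    def __init__(self, cache_size: int = 100):
        self._store: "OrderedDict[str, list[str]]" = OrderedDict()
        self._size = cache_size

    def get(self, prompt: str):
        if prompt not in self._store:
            return None
        self._store.move_to_end(prompt)  # mark as most recently used
        return self._store[prompt]

    def put(self, prompt: str, tactics: list[str]) -> None:
        self._store[prompt] = tactics
        self._store.move_to_end(prompt)
        while len(self._store) > self._size:
            self._store.popitem(last=False)  # evict least recently used
```

Note that a cache hit returns the tactics sampled on the first request, which is exactly the throughput/diversity trade-off noted above.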

Reproducibility Contract

This provider currently does not expose explicit seed control in its public interface. Reproducibility is therefore bounded by:

  • MLX sampling internals
  • runtime environment
  • prompt cache effects

Practical guidance:

  • For strict reproducibility studies, use deterministic providers or run paired baselines.
  • For basin-style variability studies, record the full run config and do not assume deterministic replay.
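One way to record the run config recommended above is a small metadata snapshot; this helper is purely illustrative and not part of the provider's interface.

```python
import json
import platform
import time

def run_config_record(provider_settings: dict) -> str:
    """Illustrative helper: serialize the run configuration for a
    variability study, explicitly disclaiming deterministic replay."""
    record = {
        "timestamp": time.strftime("%Y-%m-%dT%H:%M:%S"),
        "platform": platform.platform(),
        "python": platform.python_version(),
        "provider_settings": provider_settings,
        "deterministic_replay": False,  # this provider exposes no seed control
    }
    return json.dumps(record, indent=2, sort_keys=True)
```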

Resource Envelope (Operational)

Expected cost drivers:

  • model load latency
  • prompt length and sample count
  • local MLX backend/device performance

Primary knobs affecting cost/variance:

  • num_samples (provider-level)
  • caller n
  • decoding params (temp, top_p, max_new_tokens)

Failure Modes and Mitigations

  • Model path missing:
    • Ensure $SPECTER_ARTIFACT_ROOT is set and converted MLX model exists.
  • Low diversity despite sampling:
    • Check prompt cache hit rate and effective num_samples.
  • Unstable outputs across runs:
    • Treat as stochastic behavior; compare aggregates rather than single-run anecdotes.

Change-Control Checklist

When changing DeepSeek decoding/runtime behavior:

  1. Update prover/providers/deepseek.py.
  2. Update this document.
  3. Update Tactic Provider Options for Tactic Suggestion if selection guidance changes.
  4. Note reproducibility impact in run/report metadata.