Architecture. The harness is a deterministic 2D point-mass simulator (Falcon-9 class), dt = 0.1s, guidance evaluated every tick. Atmosphere: ρ = 1.225·exp(−alt/8500). Drag: 0.5·ρ·v²·CdA. Stage 1: 27t dry, 418t propellant, 7.9 MN, Isp 297s. Stage 2: 5t dry, 111t propellant, 1.45 MN, Isp 348s, 9.5t payload. The model writes only the guidance module: ascent_guidance(t, s) (stack, then stage 2 post-separation) and booster_guidance(t, s) (stage 1 to touchdown). Both return throttle and pitch; ascent also returns a separation flag. Input is a state dict: alt, downrange, vx, vy, speed, pitch, mass, prop_frac, q, aoa, twr, g, dt, droneship_x. Throttle clamps to 0 or [0.05, 1.0]. Pitch rate limited to 4°/s (ascent) and 25°/s (booster).
Pass/fail criteria (hard, no partial credit). Ascent structural limit: breakup if q > 85 kPa, or if q > 15 kPa and q·AoA > 350,000 Pa·deg. Booster reentry limit: breakup if q > 170 kPa. Orbit: alt ≥ 185 km, vx ≥ 7,750 m/s, |vy| ≤ 120 m/s. Landing: touchdown at 350 km ± 1.5 km, |vy| ≤ 10 m/s, |vx| ≤ 4 m/s. Mission success = orbit AND booster landing.
Iteration loop. Analyst: telemetry samples event log → failure report (timeline, proximate cause, physical chain). Director: failure report → numbered, phase-tagged fix directives. Flight Software: prior program directives → patched module. Reviewer: GNC pass (sign errors, missing clamps, AoA violations, throttle floors) → corrected module. One pass = one test flight.
Results.
Flight 1: open-loop pitch schedule; T 68s, q = 37 kPa, AoA = 9.5°, q·AoA exceeded 350k Pa·deg, max-Q breakup.
Flight 2: zero-AoA gravity turn cleared max-Q; separation at 50% stage-1 propellant; stage 2 reached orbital altitude at vx = 3.9 km/s (target 7.75) and fell back; unbraked booster reached q = 171 kPa on reentry.
Flight 3: orbit confirmed T 411s; vacuum-ballistics landing predictor, drag-induced ~15 km range error, booster 14.9 km short (vy = 4 m/s, vx = 66 m/s residual).
Flight 4: drag-aware predictor (forward-integrated CdA), lead-target hoverslam geometry, vectored landing burn upright below 60 m, achieving orbit droneship landing.
Model: moonshot/kimi-k2-7-code.