4/ Here's a demo. We gave it a 4-page PDF with three probit estimators (MLE, Gibbs, MH) and one prompt: "Build the R package from this PDF. Run Monte Carlo. Ship it."
What came back: a complete R package with C /Armadillo backends, 3 estimators, a full test suite, and Monte Carlo results — all verified against R's glm().