Science of Computer Programming is dedicated to the distribution of research results in the areas of software systems development, use and maintenance.
PyBDR (Wu et al.) is a Python toolkit for set-boundary-based reachability analysis to improve formal verification, controller synthesis, and state estimation.
Its modular architecture has building blocks for prototyping new analysis methods.
doi.org/10.1016/j.scico.2026…
LoAT (by Frohn and Giesl) is a model checker that combines acceleration techniques with bounded model checking.
LoAT can compress many execution steps into shortcuts and find deep counterexamples that are challenging for simple BMC.
doi.org/10.1016/j.scico.2026…
Baumeister released RTLola, a stream-based monitoring specification language and software framework for automatically generating runtime monitors.
The paper gives an overview of the framework’s API and implementation and discusses its practical impact.
authors.elsevier.com/a/1n2U6…
Challiol et al. introduce Agile Empathy, a framework for empathy training in early-stage software design combining Design Thinking, Agile values, and experiential learning.
It was explored in two settings with 32 senior CS students using mixed methods.
authors.elsevier.com/a/1n0Cd…
Mansab et al. introduce TNA-CAF, a vulnerability detector that aligns code tokens with AST nodes via constrained cross-attention.
It achieves strong F1/ROC-AUC on 195K C/C++ functions and generalizes well cross-lingually to Python.
authors.elsevier.com/a/1n2ni…
Liu et al. introduce CodeRAG, a retrieval-augmented multimodal framework for software vulnerability detection, using a lightweight Transformer followed by an LLM to handle unclear cases.
It improved F1/recall over baselines on the Devign and Reveal datasets.
authors.elsevier.com/a/1m-2L…
Alwadaeen and Alzamil introduce Optimized Greedy Additional (OGA), a faster test-case prioritization method using lazy heap updates, bitmaps, and similarity filtering.
It achieved large speedups with negligible fault-detection loss on 54 Java projects.
authors.elsevier.com/a/1m~…
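To illustrate the general idea behind greedy "additional" prioritization with lazy heap updates and bitmaps, here is a minimal sketch. It is not the OGA implementation: the function names, the use of Python integers as coverage bitmaps, and the tie-breaking rule are assumptions made for this example, and the similarity filtering step is omitted.

```python
import heapq


def popcount(bits: int) -> int:
    """Number of set bits in a non-negative integer bitmap."""
    return bin(bits).count("1")


def prioritize(coverage: dict) -> list:
    """Order tests by additional coverage using lazy heap updates.

    coverage maps a (hypothetical) test name to an int bitmap in which
    bit i is set when the test covers code element i.
    """
    covered = 0
    # Max-heap via negated gains; stored gains may be stale upper bounds.
    heap = [(-popcount(bits), name) for name, bits in coverage.items()]
    heapq.heapify(heap)
    order = []
    while heap:
        _, name = heapq.heappop(heap)
        # Recompute the gain only for the test at the top of the heap.
        gain = popcount(coverage[name] & ~covered)
        if heap and gain < -heap[0][0]:
            # Stale entry: push back with the fresh gain and retry.
            heapq.heappush(heap, (-gain, name))
            continue
        order.append(name)
        covered |= coverage[name]
    return order


tests = {"t1": 0b1110, "t2": 0b0011, "t3": 0b1000, "t4": 0b0110}
print(prioritize(tests))
```

Because additional coverage can only shrink as more tests are selected, a stale heap entry is always an upper bound, so only the top candidate needs to be re-evaluated at each step.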
Manfred Broy introduces a formalism for nondeterministic distributed systems beyond monotonicity, using timed streams to model feedback loops and concurrency.
The paper also discusses the practical significance and applicability of the formalism.
doi.org/10.1016/j.scico.2026…
Arcaini and Cetinkaya introduce DETOUR, a regression testing tool for ADAS that uses hierarchical clustering to prioritize/select road-structure tests that cause lane departures.
It is demonstrated with an illustrative example in its prioritization mode.
authors.elsevier.com/a/1m~…
Trovato et al. introduce AMBER, a JMH extension that classifies time series to detect steady-state execution and dynamically stop warm-up iterations.
It was evaluated on more than 30 Java projects, demonstrating higher accuracy and shorter test times.
doi.org/10.1016/j.scico.2026…
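As a rough illustration of stopping warm-up once measurements stabilize, the sketch below uses a simple sliding-window coefficient-of-variation check. This is a stand-in heuristic, not AMBER's time-series classifier and not part of the JMH API; the function name, window size, and threshold are assumptions chosen for the example.

```python
from statistics import mean, pstdev


def warmup_end(times, window=5, cv_threshold=0.02):
    """Return the number of iterations after which the last `window`
    measurements have a coefficient of variation at or below the
    threshold, or None if that never happens."""
    for end in range(window, len(times) + 1):
        recent = times[end - window:end]
        avg = mean(recent)
        if avg > 0 and pstdev(recent) / avg <= cv_threshold:
            return end
    return None


# Hypothetical per-iteration timings (ms) that settle at 5.0.
timings = [10.0, 8.0, 6.0, 5.0, 5.0, 5.0, 5.0, 5.0]
print(warmup_end(timings, window=3))
```

A benchmark harness could call such a check after each iteration and end the warm-up phase as soon as it returns a value, instead of running a fixed number of warm-up iterations.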
Leonhard and Stocco introduce PerturbationDrive for perturbation-based testing of ADAS, applying 30 image perturbations.
PerturbationDrive supports both offline evaluation on static datasets and online closed-loop testing in different simulators.
doi.org/10.1016/j.scico.2026…
Chouikh et al. introduce a rule-based co-evolution approach that uses ATL transformations from metamodel differences for semantic preservation in Maude.
It is evaluated on a UML Sequence Diagram, showing that users need fewer repetitive manual adaptations.
authors.elsevier.com/a/1mv73…
Mock et al. introduce Beyond Balance, a loss-weighting framework to address class imbalance in fine-tuning Transformer models for code comment classification.
It consistently improves F1 scores across datasets, showing effective handling of rare classes.
doi.org/10.1016/j.scico.2026…
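To show what loss weighting for class imbalance looks like in general, here is a minimal sketch using inverse-frequency class weights and a weighted cross-entropy. The specific weighting schemes in Beyond Balance are not given here, so the formula, function names, and toy labels below are assumptions for illustration only.

```python
import math
from collections import Counter


def inverse_frequency_weights(labels):
    """Weight each class by n / (k * count), so rare classes weigh more."""
    counts = Counter(labels)
    n, k = len(labels), len(counts)
    return {cls: n / (k * cnt) for cls, cnt in counts.items()}


def weighted_cross_entropy(probs, labels, weights):
    """Weighted mean of -log p(true class) over the examples.

    probs is a list of dicts mapping class -> predicted probability.
    """
    total = sum(weights[y] * -math.log(p[y]) for p, y in zip(probs, labels))
    norm = sum(weights[y] for y in labels)
    return total / norm


# Hypothetical comment categories with one rare class.
labels = ["summary", "summary", "summary", "deprecation"]
weights = inverse_frequency_weights(labels)
probs = [{"summary": 0.5, "deprecation": 0.5} for _ in labels]
print(weights, weighted_cross_entropy(probs, labels, weights))
```

With these weights, a mistake on the rare class contributes more to the loss than a mistake on a frequent class, which is the basic lever a loss-weighting approach pulls during fine-tuning.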
Chondamrongkul et al. introduce RepoAI, a model-agnostic multi-agent framework for automated code refactoring using LLM orchestration and RAG.
It is evaluated on 1000 RefactorBench tasks across multiple models, showing gains over direct generation.
authors.elsevier.com/a/1mtME…
Pizard and Marenco introduce a scenario-based training approach to help learners apply systematic review evidence in software engineering decisions.
The approach is evaluated via an exploratory mixed-methods study across three course iterations.
authors.elsevier.com/a/1msAA…
Ma et al. introduce MutDBD, a mutation-based framework that identifies attacked classes in backdoored DNNs.
It was evaluated across 32 scenarios on CIFAR-10, GTSRB, and ImageNet, showing reduced attack success rates while maintaining model accuracy.
authors.elsevier.com/a/1mtAp…
Kertusha et al. present an SLR on web testing, highlighting trends, tools, and the growing role of AI-driven approaches.
The survey reveals limited industrial adoption and human-centered validation despite increasing use of real-world systems.
doi.org/10.1016/j.scico.2026…
Abe and Hayashi introduce ICCheck, a language-agnostic tool that synchronizes code clones.
It was evaluated across multiple repositories, demonstrating fast (median 0.27 s) detection of overlooked clone inconsistencies in 30 languages.
doi.org/10.1016/j.scico.2026…
Broy introduces a theoretical framework for modeling concurrent distributed systems to handle non-deterministic and non-monotonic behavior.
It shows how timed streams serve as an auxiliary construct for defining concurrent composition with feedback.
sciencedirect.com/science/ar…
Xu et al. introduce change metrics that use runtime differences between pretraining and fine-tuning to identify bugs in fine-tuning.
The authors evaluate the metrics by training 5 learners across 6 datasets and 233 models, showing improved fault diagnosis over baselines.
authors.elsevier.com/a/1mou7…