Science of Computer Programming is dedicated to the distribution of research results in the areas of software systems development, use and maintenance.

Joined January 2021
Photos and videos
PyBDR (Wu et al.) is a Python toolkit for set-boundary-based reachability analysis to improve formal verification, controller synthesis, and state estimation. Its modular architecture has building blocks for prototyping new analysis methods. doi.org/10.1016/j.scico.2026…

1
9
LoAT (by Frohn and Giesl) is a model checker that combines acceleration techniques with bounded model checking. LoAT can compress many execution steps into shortcuts and find deep counterexamples that are challenging for simple BMC. doi.org/10.1016/j.scico.2026…

1
22
Baumeister released RTLola, a stream-based monitoring specification language and software framework for automatically generating runtime monitors. They provide an overview of the framework’s API, implementation, and discuss its practical impact. authors.elsevier.com/a/1n2U6…

1
15
Challiol et al. introduce Agile Empathy, a framework for empathy training in early-stage software design combining Design Thinking, Agile values, and experiential learning. It was explored in two settings with 32 senior CS students using mixed methods. authors.elsevier.com/a/1n0Cd…

2
27
Mansab et al. introduce TNA-CAF, a vulnerability detector that aligns code tokens with AST nodes via constrained cross-attention. It achieves strong F1/ROC-AUC on 195K C/C functions and shows strong cross-lingual generalization to Python. authors.elsevier.com/a/1n2ni…

1
22
Liu et al. introduce CodeRAG, a retrieval-augmented multimodal framework for software vulnerability detection, using a lightweight Transformer followed by an LLM to handle unclear cases. It improved F1/recall over baselines on Devign and Reveal datasets. authors.elsevier.com/a/1m-2L…

1
3
35
Alwadaeen and Alzamil introduce Optimized Greedy Additional (OGA), a faster test-case prioritization method using lazy heap updates, bitmaps, and similarity filtering. It achieved large speedups with negligible fault-detection loss on 54 Java projects authors.elsevier.com/a/1m~…

1
33
Manfred Broy introduces a formalism for nondeterministic distributed systems beyond monotonicity, using timed streams to model feedback loops and concurrency. The paper also discusses the practical significance and applicability of the formalism. doi.org/10.1016/j.scico.2026…

1
25
Arcaini and Cetinkaya introduce DETOUR, a regression testing tool for ADAs that uses hierarchical clustering to prioritize/select road-structure tests that cause lane departures. It is demonstrated with an illustrative example in its prioritization mode. authors.elsevier.com/a/1m~…

1
79
Trovato et al. introduce AMBER, a JMH extension that classifies time series to detect steady-state execution and dynamically stop warm-up iterations. It was evaluated across over 30 Java projects, demonstrating higher accuracy and shorter test times. doi.org/10.1016/j.scico.2026…

2
36
Leonhard and Stocco introduce PerturbationDrive for perturbation-based testing of ADAS, applying 30 image perturbations. PerturbationDrive supports both offline evaluation on static datasets and online closed-loop testing in different simulators. doi.org/10.1016/j.scico.2026…

1
22
Chouikh et al. introduce a rule-based co-evolution approach that uses ATL transformations from metamodel differences for semantic preservation in Maude. It is evaluated on a UML Sequence Diagram, showing reduced repetitive manual adaptations from users. authors.elsevier.com/a/1mv73…

1
33
Mock et al. introduce Beyond Balance, a loss-weighting framework to address class imbalance in fine-tuning Transformer models for code comment classification. It consistently improves F1 scores across datasets, showing effective handling of rare classes. doi.org/10.1016/j.scico.2026…

1
34
Chondamrongkul et al. introduce RepoAI, a model-agnostic multi-agent framework for automated code refactoring using LLM orchestration and RAG. It is evaluated on 1000 RefactorBench tasks across multiple models, showing gains over direct generation. authors.elsevier.com/a/1mtME…

2
65
Pizard and Marenco introduce a scenario-based training approach to help learners apply systematic review evidence in software engineering decisions. This is evaluated via an exploratory mixed-methods study across three course iterations. authors.elsevier.com/a/1msAA…

1
40
Ma et al. introduce MutDBD, a mutation-based framework that identifies attacked classes in backdoored DNNs. It was evaluated across 32 scenarios on CIFAR-10, GTSRB, and ImageNet, showing reduced attack success rates while maintaining model accuracy. authors.elsevier.com/a/1mtAp…

3
57
Kertusha et al. present an SLR on web testing, highlighting trends, tools, and the growing role of AI-driven. The survey reveals limited industrial adoption and human-centered validation despite increasing use of real-world systems. doi.org/10.1016/j.scico.2026…

1
2
81
Abe and Hayashi introduce ICCheck, a language-agnostic tool that synchronizes code clones. It was evaluated across multiple repositories, demonstrating fast (median 0.27s) detection of overlooked clone inconsistencies in 30 languages. doi.org/10.1016/j.scico.2026…

2
75
Broy introduces a theoretical framework for modeling concurrent distributed systems to handle non-deterministic and non-monotonic behavior. It shows how timed streams serve as an auxiliary construct for defining concurrent composition with feedback. sciencedirect.com/science/ar…

1
42
Xu et al. introduce change metrics that use runtime differences between pretraining and fine-tuning to identify bugs in fine-tuning. It evaluates it by training 5 learners across 6 datasets and 233 models, showing improved fault diagnosis over baselines. authors.elsevier.com/a/1mou7…

1
42