Explore projects
-
AVT-SVT / public / ElectrolyteMedia
BSD 3-Clause "New" or "Revised" LicenseModelica framework for the dynamic simulation of aqueous electrolyte systems under consideration of dissociation and phase equilibria
Updated -
-
ACS / Public / Virtualization / cricket
MIT LicenseCricket consists of two parts: A virtualization layer for CUDA applications that allows the isolation of GPU and CPU parts by using Remote Procedure Calls and a checkpoint/restart tool for GPU kernels. Logo by Freepik.com
Updated -
A tool for adding metadata to tables
Updated -
-
Updated
-
-
plotID / plotID_matlab
Apache License 2.0The plotID toolkit supports researchers in tracking and storing relevant data in plots. Plots are labelled with an ID and the corresponding data is stored.
Updated -
Create all documents for your next CRC/SFB in an easy and efficient way.
Updated -
-
Archived 1Updated
-
A collection of tools for slip system analysis. Please cite our review paper and tutorial if you use the code provided in this project.
Updated -
Coscine / research / MetadataTracker
MIT LicenseThis repository contains the application which tries to detect change on data resources and describes them using PROV-O and metadata extraction methods.
Updated -
Updated
-
ISEA / BMS Simulation as MIL
BSD 3-Clause Clear LicenseUpdated -
Updated
-
Title: Capable AI Coding Agents
Author: Robin Rentmeister
Abstract: The use of Large Language Models (LLMs) in software engineering is shifting from single-shot code completion to autonomous, multi-step workflows that operate over large repositories and interact with external tools and APIs. While this transition promises productivity gains, it also introduces concerns for correct and secure results. In contrast to isolated snippet generation, agentic pipelines can amplify small mistakes into cascading failures, exhibit non-deterministic behavior, and produce plausible code that silently violates security or domain constraints.
This survey reviews failure modes in agentic coding, consolidates techniques for improving correctness and reproducibility, and highlights remaining challenges. We organize assurance mechanisms into three complementary layers: (i) operational infrastructure for deterministic execution, provenance, and tracking; (ii) cognitive architectures that couples generation with executable checks and iterative repair; and (iii) interaction paradigms for human and multi-agent interaction, policy enforcement, and governance. Drawing on evidence from execution-based benchmarks and safety evaluations, we summarize which mechanisms yield consistent improvements and where concerns persist. We conclude by outlining open challenges toward scalable, long-horizon robustness and verifiable reliability for AI coding agents.
UpdatedUpdated