Alvaro Cardenas
· ProfessorVerifiedUniversity of California, Santa Cruz · Electrical and Computer Engineering
Active 2003–2026
Research signals
Five dimensions sourced from public faculty / publication signals. Sign in to compare against your own profile and see your match score.
Research topics
- Computer Science
- Computer Security
- Artificial Intelligence
- Embedded system
- Business
- Software engineering
- Operating system
- Computer network
- Risk analysis (engineering)
- Programming language
- Mathematics
- Engineering
- Real-time computing
- Algorithm
Selected publications
Stable GFlowNets with Probabilistic Guarantees
arXiv (Cornell University) · 2026-05-03
preprintOpen accessGenerative Flow Networks (GFlowNets) learn to sample states proportional to an unnormalized reward. Despite their theoretical promise, practical training is often unstable, exhibiting severe loss spikes and mode collapse. To tackle this, we first assess the sensitivity of GFlowNet objectives, demonstrating that a small Total Variation (TV) distance between the learned and target distributions does not preclude unbounded training loss. Motivated by this mismatch, we establish converse guarantees by deriving loss-to-TV bounds that certify global fidelity from bounded trajectory balance losses. Lastly, we propose Stable GFlowNets, an algorithm that leverages our theoretical results to stabilize training, and empirically demonstrate improved training behavior and superior distributional fidelity.
Stable GFlowNets with Probabilistic Guarantees
ArXiv.org · 2026-05-03
articleOpen accessGenerative Flow Networks (GFlowNets) learn to sample states proportional to an unnormalized reward. Despite their theoretical promise, practical training is often unstable, exhibiting severe loss spikes and mode collapse. To tackle this, we first assess the sensitivity of GFlowNet objectives, demonstrating that a small Total Variation (TV) distance between the learned and target distributions does not preclude unbounded training loss. Motivated by this mismatch, we establish converse guarantees by deriving loss-to-TV bounds that certify global fidelity from bounded trajectory balance losses. Lastly, we propose Stable GFlowNets, an algorithm that leverages our theoretical results to stabilize training, and empirically demonstrate improved training behavior and superior distributional fidelity.
Chasing the Public Score: User Pressure and Evaluation Exploitation in Coding Agent Workflows
arXiv (Cornell University) · 2026-04-22
preprintOpen accessFrontier coding agents are increasingly used in workflows where users supervise progress primarily through repeated improvement of a public score, namely the reported score on a public evaluation file with labels in the workspace, rather than through direct inspection of the agent's intermediate outputs. We study whether multi-round user pressure to improve that score induces public score exploitation: behavior that raises the public score through shortcuts without improving hidden private evaluation. We begin with a preliminary single-script tabular classification task, where GPT-5.4 and Claude Opus 4.6 both exploit label information within 10 rounds of user-agent interaction. We then build AgentPressureBench, a 34-task machine-learning repository benchmark spanning three input modalities, and collect 1326 multi-round trajectories from 13 coding agents. On our benchmark, we observe 403 exploitative runs, spanning across all tasks. We also find that stronger models have higher exploitation rates, supported by a significant Spearman rank correlation of 0.77. Our ablation experiments show that higher user pressure leads to earlier exploitation, reducing the average first exploit round by 15.6 rounds (i.e., 19.67 to 4.08). As a mitigation, adding explicit anti-exploit wordings in prompt mostly eliminates exploitation (100% to 8.3%). We hope that our work can bring attention to more careful use of coding agents workflow, and developing more robust coding agents under user pressure. Our project page is at https://ucsc-vlaa.github.io/AgentPressureBench .
Chasing the Public Score: User Pressure and Evaluation Exploitation in Coding Agent Workflows
arXiv (Cornell University) · 2026-04-22
articleOpen accessFrontier coding agents are increasingly used in workflows where users supervise progress primarily through repeated improvement of a public score, namely the reported score on a public evaluation file with labels in the workspace, rather than through direct inspection of the agent's intermediate outputs. We study whether multi-round user pressure to improve that score induces public score exploitation: behavior that raises the public score through shortcuts without improving hidden private evaluation. We begin with a preliminary single-script tabular classification task, where GPT-5.4 and Claude Opus 4.6 both exploit label information within 10 rounds of user-agent interaction. We then build AgentPressureBench, a 34-task machine-learning repository benchmark spanning three input modalities, and collect 1326 multi-round trajectories from 13 coding agents. On our benchmark, we observe 403 exploitative runs, spanning across all tasks. We also find that stronger models have higher exploitation rates, supported by a significant Spearman rank correlation of 0.77. Our ablation experiments show that higher user pressure leads to earlier exploitation, reducing the average first exploit round by 15.6 rounds (i.e., 19.67 to 4.08). As a mitigation, adding explicit anti-exploit wordings in prompt mostly eliminates exploitation (100% to 8.3%). We hope that our work can bring attention to more careful use of coding agents workflow, and developing more robust coding agents under user pressure. Our project page is at https://ucsc-vlaa.github.io/AgentPressureBench .
VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation
ArXiv.org · 2026-04-23
articleOpen accessAutonomous GUI agents face two fundamental challenges: early stopping, where agents prematurely declare success without verifiable evidence, and repetitive loops, where agents cycle through the same failing actions without recovery. We present VLAA-GUI, a modular GUI agentic framework built around three integrated components that guide the system on when to Stop, Recover, and Search. First, a mandatory Completeness Verifier enforces UI-observable success criteria and verification at every finish step -- with an agent-level verifier that cross-examines completion claims with decision rules, rejecting those lacking direct visual evidence. Second, a mandatory Loop Breaker provides multi-tier filtering: switching interaction mode after repeated failures, forcing strategy changes after persistent screen-state recurrence, and binding reflection signals to strategy shifts. Third, an on-demand Search Agent searches online for unfamiliar workflows by directly querying a capable LLM with search ability, returning results as plain text. We additionally integrate a Coding Agent for code-intensive actions and a Grounding Agent for precise action grounding, both invoked on demand when required. We evaluate VLAA-GUI across five top-tier backbones, including Opus 4.5, 4.6 and Gemini 3.1 Pro, on two benchmarks with Linux and Windows tasks, achieving top performance on both (77.5% on OSWorld and 61.0% on WindowsAgentArena). Notably, three of the five backbones surpass human performance (72.4%) on OSWorld in a single pass. Ablation studies show that all three proposed components consistently improve a strong backbone, while a weaker backbone benefits more from these tools when the step budget is sufficient. Further analysis also shows that the Loop Breaker nearly halves wasted steps for loop-prone models.
VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation
arXiv (Cornell University) · 2026-04-23
preprintOpen accessAutonomous GUI agents face two fundamental challenges: early stopping, where agents prematurely declare success without verifiable evidence, and repetitive loops, where agents cycle through the same failing actions without recovery. We present VLAA-GUI, a modular GUI agentic framework built around three integrated components that guide the system on when to Stop, Recover, and Search. First, a mandatory Completeness Verifier enforces UI-observable success criteria and verification at every finish step -- with an agent-level verifier that cross-examines completion claims with decision rules, rejecting those lacking direct visual evidence. Second, a mandatory Loop Breaker provides multi-tier filtering: switching interaction mode after repeated failures, forcing strategy changes after persistent screen-state recurrence, and binding reflection signals to strategy shifts. Third, an on-demand Search Agent searches online for unfamiliar workflows by directly querying a capable LLM with search ability, returning results as plain text. We additionally integrate a Coding Agent for code-intensive actions and a Grounding Agent for precise action grounding, both invoked on demand when required. We evaluate VLAA-GUI across five top-tier backbones, including Opus 4.5, 4.6 and Gemini 3.1 Pro, on two benchmarks with Linux and Windows tasks, achieving top performance on both (77.5% on OSWorld and 61.0% on WindowsAgentArena). Notably, three of the five backbones surpass human performance (72.4%) on OSWorld in a single pass. Ablation studies show that all three proposed components consistently improve a strong backbone, while a weaker backbone benefits more from these tools when the step budget is sufficient. Further analysis also shows that the Loop Breaker nearly halves wasted steps for loop-prone models.
Control Barrier Function based Attack-Recovery with Provable Guarantees
IEEE Transactions on Automatic Control · 2026-01-01
preprintOpen accessSenior authorThis paper investigates security guarantees for cyber-physical systems (CPS) against actuator attacks. We in troduce a new attack detection mechanism based on zeroing control barrier function (ZCBF) conditions. We propose an adaptive recovery mechanism that responds based on the system's proximity to safety violations. Our attack-detection mechanism has been proven to be sound, meaning it consistently detects adversarial attacks without any false negatives. Additionally, we propose a novel hybrid control law that addresses delays in attack detection and prevents Zeno behavior. We also propose a sampling-based method to verify whether a set is a viability domain for CPS. Finally, we employ a Quadratic Programming (QP) approach for synthesizing control laws for the hybrid control policy, utilizing the viability domain to ensure safety in the presence of adversarial attacks on system actuators. The efficacy of the proposed method is demonstrated in a simulation case study involving a quadrotor system.
D4: Dynamic Data-Driven Discovery of Adversarial Vehicle Maneuvers
Lecture notes in computer science · 2025-08-25
book-chapterSenior authorLarge Language Models are Autonomous Cyber Defenders
2025-05-05 · 2 citations
preprintOpen accessSenior authorFast and effective incident response is essential to prevent adversarial cyberattacks. Autonomous Cyber Defense (ACD) aims to automate incident response through Artificial Intelligence (AI) agents that plan and execute actions. Most ACD approaches focus on single-agent scenarios and leverage Reinforcement Learning (RL). However, ACD RL-trained agents depend on costly training, and their reasoning is not always explainable or transferable. Large Language Models (LLMs) can address these concerns by providing explainable actions in general security contexts. Researchers have explored LLM agents for ACD but have not evaluated them on multi-agent scenarios or interacting with other ACD agents. In this paper, we show the first study on how LLMs perform in multi-agent ACD environments by proposing a new integration to the CybORG CAGE 4 environment. We examine how ACD teams of LLM and RL agents can interact by proposing a novel communication protocol. Our results highlight the strengths and weaknesses of LLMs and RL and help us identify promising research directions to create, train, and deploy future teams of ACD agents.
Cybersecurity for Next-Generation Road Transportation: A Review
ACM Journal on Autonomous Transportation Systems · 2025-06-14 · 4 citations
reviewOpen accessEmerging transportation technologies, including electric vehicles (EVs), autonomous vehicles (AVs), and connected vehicles (CVs), are poised to revolutionize mobility and logistics through advancements in connectivity, automation, and electrification. However, the convergence of these systems introduces substantial cybersecurity challenges, including data breaches, spoofing, and infrastructure attacks, due to increasingly complex and interconnected attack surfaces. This article presents a comprehensive survey of cybersecurity threats, vulnerabilities, and evaluation practices within the AV, CV, and EV landscape. We analyze 227 peer-reviewed studies published between 2021 and 2025 and introduce two novel taxonomies: a three-stage cyber-attack lifecycle framework—Stream to Information (S2I), Information to Decision (I2D), and Decision to Actuation (D2A)—and 11 representative attack paths. Our findings reveal that the stream and data processing stages are most frequently exploited, with limited standardization in evaluation metrics and inadequate emphasis on real-world operational consequences. We highlight the growing importance of integrating quantum-safe cryptography and AI-driven anomaly detection as proactive defense mechanisms. Finally, we offer stage-wise design recommendations and identify future research directions, including the need for cross-domain evaluation frameworks, cyber-social risk assessments, and secure integration of emerging multi-modal systems. This survey aims to support policymakers, researchers, and industry stakeholders in developing resilient, secure, and trustworthy next-generation road transportation ecosystems.
Recent grants
NSF · $164k · 2015–2018
CPS: Medium: Collaborative Research: Security vs. Privacy in Cyber-Physical Systems
NSF · $655k · 2019–2024
NeTS: Small: Collaborative Research: Measurement and Modeling of Industrial Control Networks
NSF · $249k · 2017–2019
NSF · $345k · 2016–2019
NSF · $486k · 2019–2024
Frequent coauthors
- 52 shared
Jairo Giraldo
Massachusetts Institute of Technology
- 52 shared
Nicanor Quijano
- 44 shared
Luis Francisco Cómbita
District University of Bogotá
- 36 shared
Xenofon Koutsoukos
Vanderbilt University
- 36 shared
Erik Yoon
Massachusetts Institute of Technology
- 36 shared
Girish V. Chowdhary
- 36 shared
Shashank Shekhar
- 36 shared
Peter W. Sauer
University of Illinois Urbana-Champaign
- Resume-aware match score
- Save to shortlist
- AI-drafted outreach
See your match with Alvaro Cardenas
PhdFit ranks faculty by your research interests, methods, and publications — grounded in their actual work, not templates.
- Free to start
- No credit card
- 30-second signup