Red Teaming on Agentic AI

GPT-5

It is the era of Agentic AI but Agentic AI still suffers from trustworthiness issues, including weakness on jailbreaking. We advance the status of Agentic AI by debugging its issues via red teaming. In particular, we consider the following question:

How to learn red agentic AI to debug target agentic AI continuously?

We exploit our knowledge and experience on Trustworthy Agentic AI to build red Agentic AI for blue Agentic AI.

On-going/Potential Projects

Red Agent Learning: Learn red Agents for blue LLMs via RL.
Red Agent Continual Learning: Learn continual red teaming Agents.

Do you have creative ideas in building red Agentic AI?

Keywords

Red Teaming
Agentic AI
Reinforcement Learning

AIxCC-Winner

Red Teaming on Agentic AI

On-going/Potential Projects

Keywords

Related Work