Foundations of AI Alignment

Large language models pose many practical problems, including hallucination and harmful responses. This raises the following question.

How do we build AI systems that are reliably truthful, safe, and secure?

Keywords

  • Large Language Models
  • Conformal Abstention
  • Conformal Prediction
  • Uncertainty Quantification
  • Reinforcement Learning
Kyungmin Kim, Youngbin Choi, Seoyeon Lee, Suhyeon Jun, Dongwoo Kim, Sangdon Park
ICML RLxF Workshop 2026
Kyungmin Kim, Youngbin Choi, Hyounghun Kim, Dongwoo Kim, Sangdon Park
EMNLP Findings 2025
Jeongyeon Hwang, Junyoung Park, Hyejin Park, Dongwoo Kim, Sangdon Park, Jungseul Ok
EMNLP 2025
Minjae Lee*, Yoonjae Jung*, Sangdon Park
2025
🏆 Best Paper Finalist from CKAIA
Minjae Lee*, Kyungmin Kim*, Taesoo Kim, Sangdon Park
NeurIPS 2024
🏆 Spotlight (Top 2.08%) 🏆 POSTECH GSAI BK21 Best Paper Award
Hyejin Park*, Jeongyeon Hwang*, Sunung Mun, Sangdon Park, Jungseul Ok
CVPR 2024
Shuo Li, Sangdon Park, Insup Lee, Osbert Bastani
NAACL 2024
🏆 ICML'23 TEACH Workshop Best Paper Award
Wenwen Si, Sangdon Park, Insup Lee, Edgar Dobriban, Osbert Bastani
ICLR 2024
Wenwen Si, Shuo Li, Sangdon Park, Insup Lee, Osbert Bastani
CVPR 2023
Sangdon Park, Osbert Bastani, Taesoo Kim
Security 2023
Sangdon Park, Edgar Dobriban, Insup Lee, Osbert Bastani
NeurIPS 2022
Sooyong Jang, Sangdon Park, Insup Lee, Osbert Bastani
ICML 2022
Shuo Li, Sangdon Park, Xiayan Ji, Insup Lee, Osbert Bastani
2022
Sangdon Park, Edgar Dobriban, Insup Lee, Osbert Bastani
ICLR 2022
Sangdon Park, Shuo Li, Insup Lee, Osbert Bastani
ICLR 2021
Ramneet Kaur, Susmit Jha, Anirban Roy, Sangdon Park, Edgar Dobriban, Oleg Sokolsky, Insup Lee
AAAI 2021
Sangdon Park, Osbert Bastani, James Weimer, Insup Lee
AISTATS 2020
Sangdon Park, Osbert Bastani, Nikolai Matni, Insup Lee
ICLR 2020