Foundations of AI Alignment
Large language models raise many practical issues, including hallucination and harmful responses. This raises the following question.
How do we build AI systems that are reliably truthful, safe, and secure?
Keywords
- Large Language Models
- Conformal Abstention
- Conformal Prediction
- Uncertainty Quantification
- Reinforcement Learning
Related Publications
Kyungmin Kim
,
Youngbin Choi
,
Seoyeon Lee
,
Suhyeon Jun
,
Dongwoo Kim
,
Sangdon Park
ICML RLxF Workshop
2026
Jaewoo Jeong
,
Taesoo Kim
,
Sangdon Park
ICML
2026
Junyoung Yang*
,
Kyungmin Kim*
,
Sangdon Park
ICLR
2026
Kyungmin Kim
,
Youngbin Choi
,
Hyounghun Kim
,
Dongwoo Kim
,
Sangdon Park
EMNLP Findings
2025
Saemi Moon*
,
Minjong Lee*
,
Sangdon Park
,
Dongwoo Kim
ICCV
2025
Jeongyeon Hwang
,
Junyoung Park
,
Hyejin Park
,
Dongwoo Kim
,
Sangdon Park
,
Jungseul Ok
EMNLP
2025
Minjae Lee*
,
Yoonjae Jung*
,
Sangdon Park
2025
🏆 Best Paper Finalist from CKAIA
Minjae Lee*
,
Kyungmin Kim*
,
Taesoo Kim
,
Sangdon Park
NeurIPS
2024
🏆 Spotlight (Top 2.08%)🏆 POSTECH GSAI BK21 Best Paper Award
Hyejin Park*
,
Jeongyeon Hwang*
,
Sunung Mun
,
Sangdon Park
,
Jungseul Ok
CVPR
2024
Shuo Li
,
Sangdon Park
,
Insup Lee
,
Osbert Bastani
NAACL
2024
🏆 ICML'23 TEACH Workshop Best Paper Award
Wenwen Si
,
Sangdon Park
,
Insup Lee
,
Edgar Dobriban
,
Osbert Bastani
ICLR
2024
Wenwen Si
,
Shuo Li
,
Sangdon Park
,
Insup Lee
,
Osbert Bastani
CVPR
2023
Sangdon Park
,
Osbert Bastani
,
Taesoo Kim
Security
2023
Sangdon Park
,
Edgar Dobriban
,
Insup Lee
,
Osbert Bastani
NeurIPS
2022
Sooyong Jang
,
Sangdon Park
,
Insup Lee
,
Osbert Bastani
ICML
2022
Shuo Li
,
Sangdon Park
,
Xiayan Ji
,
Insup Lee
,
Osbert Bastani
2022
Sangdon Park
,
Edgar Dobriban
,
Insup Lee
,
Osbert Bastani
ICLR
2022
Sangdon Park
,
Shuo Li
,
Insup Lee
,
Osbert Bastani
ICLR
2021
Ramneet Kaur
,
Susmit Jha
,
Anirban Roy
,
Sangdon Park
,
Edgar Dobriban
,
Oleg Sokolsky
,
Insup Lee
AAAI
2021
Sangdon Park
,
Osbert Bastani
,
James Weimer
,
Insup Lee
AISTATS
2020
Sangdon Park
,
Osbert Bastani
,
Nikolai Matni
,
Insup Lee
ICLR
2020
Sangdon Park
,
James Weimer
,
Insup Lee
ICCPS
2017