Research
I'm working on human-AI safety, LLM planning & alignment, and embodied AI. My long-term goal is to build safe, reliable AI systems that effectively assist humans. Previously, I also did research on video retrieval and out-of-distribution generalization. I am happy to collaborate and answer questions about my research.
- I've been considering these questions on human-AI safety & alignment:
Are we underestimating the ways LLMs might subtly influence daily human decisions?
Do we recognize that certain harms might surface long after the interaction?
How should our alignment algorithms account for these risks?
- I've been considering these questions on embodied intelligence: LLMs are powerful for reasoning, but are they sufficient for robot planning? What are their capabilities, when should we use them, and how can we augment them with other tools?
|
|
RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation
Kaiqu Liang,
Haimin Hu,
Ryan Liu,
Thomas L. Griffiths,
Jaime Fernández Fisac
Preprint & NeurIPS Safe Generative AI Workshop
paper
/
website
/
poster
/
tweet
We found that RLHF can induce significant misalignment when humans provide feedback while implicitly predicting future outcomes, creating incentives for LLM deception. To address this, we propose RLHS (Hindsight Simulation): By simulating future outcomes of the interaction before providing feedback, we drastically reduce misalignment.
|
|
Introspective Planning: Aligning Robots' Uncertainty with Inherent Task Ambiguity
Kaiqu Liang,
Zixu Zhang,
Jaime Fernández Fisac
Neural Information Processing Systems (NeurIPS), 2024
paper
/
website
/
code
/
poster
/
tweet
We propsoed introspective planning as a systematic approach that aligns LLM’s uncertainty
with the inherent ambiguity of the task.
|
|
Who Plays First? Optimizing the Order of Play in Stackelberg Games with Many Robots
Haimin Hu,
Gabriele Dragotto,
Zixu Zhang,
Kaiqu Liang,
Bartolomeo Stellato,
Jaime Fernández Fisac
Robotics: Science and Systems (RSS), 2024
We introduced Branch and Play (B&P), an algorithm that effectively resolves multi-agent spatial navigation problems by determining the optimal order of play.
|
|
Simple Baselines for Interactive Video Retrieval with Questions and Answers
Kaiqu Liang,
Samuel Albanie
International Conference on Computer Vision (ICCV), 2023
paper
/
code
We proposed several simple yet effective baselines for interactive video retrieval via question-answering.
|
|
Path Independent Equilibrium Models Can Better Exploit Test-Time Computation
Cem Anil*,
Ashwini Pokle*,
Kaiqu Liang*,
Johannes Treutlein,
Yuhuai Wu,
Shaojie Bai,
Zico Kolter,
Roger Grosse
Neural Information Processing Systems (NeurIPS), 2022
We demonstrated that equilibrium model improves generalization in harder instances due to their path independence, highlighting its importance for model performance and scalability.
|
|
Out-of-Distribution Generalization with Deep Equilibrium Models
Kaiqu Liang*,
Cem Anil*,
Yuhuai Wu,
Roger Grosse
ICML Workshop on Uncertainty and Robustness in Deep Learning , 2021
We demonstrated and discussed why Deep Equilibrium (DEQ) Models outperform fixed-depth counterparts in generalizing under distribution shifts.
|
Education
|
Princeton University, USA
Ph.D. in Computer Science • Aug. 2022 to Now
|
|
Cambridge University, UK
MPhil in Machine Learning and Machine Intelligence • Oct. 2021 to Aug. 2022
|
|
University of Toronto, Canada
Honours Bachelor of Science • Sep. 2017 to May 2021
Computer Science Specialist & Statistics Major & Mathematics Minor
CGPA: 3.99/4.00 (94.1%)
|
|
|
Teaching
|
Teaching Assistant • ECE346/COS348/MAE346: Intelligent Robotic Systems • Princeton University
Teaching Assistant • COS 350: Ethics of computing • Princeton University
Teaching Assistant • CSC165: Mathematical Expression and Reasoning for Computer Science • University of Toronto
|
Reviewer services
|
International Conference on Learning Representations (ICLR)
Neural Information Processing Systems (NeurIPS)
International Conference on Machine Learning (ICML)
European Conference on Computer Vision (ECCV)
|
|