Kaiqu Liang

I am a 3rd year PhD student in Computer Science at Princeton University, advised by Jaime Fernández Fisac. I also work closely with Tom Griffiths.

Previously, I completed my MPhil in Machine Learning at the University of Cambridge, advised by Samuel Albanie and Bill Byrne. I did my undergraduate studies at the University of Toronto, where I was advised by Roger Grosse and Sven Dickinson. During my undergraduate years, I was also a student researcher in Vector Institute, advised by Roger Grosse.

Email  /  Google Scholar  /  Twitter  /  Github  /  Linkedin

profile photo

News

12/2024: Our workshop proposal - Public Trust in Autonomous Systems - is accepted by ICRA 2025! We look forward to hosting it in May 2025.
09/2024: Our paper Introspective Planning is accepted to NeurIPS 2024.

Research

I'm working on human-AI safety, LLM planning & alignment, and embodied AI. My long-term goal is to build safe, reliable AI systems that effectively assist humans. Previously, I also did research on video retrieval and out-of-distribution generalization. I am happy to collaborate and answer questions about my research.

- I've been considering these questions on human-AI safety & alignment: Are we underestimating the ways LLMs might subtly influence daily human decisions? Do we recognize that certain harms might surface long after the interaction? How should our alignment algorithms account for these risks?

- I've been considering these questions on embodied intelligence: LLMs are powerful for reasoning, but are they sufficient for robot planning? What are their capabilities, when should we use them, and how can we augment them with other tools?

intro RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation
Kaiqu Liang, Haimin Hu, Ryan Liu, Thomas L. Griffiths, Jaime Fernández Fisac
Preprint & NeurIPS Safe Generative AI Workshop
paper / website / poster / tweet

We found that RLHF can induce significant misalignment when humans provide feedback while implicitly predicting future outcomes, creating incentives for LLM deception. To address this, we propose RLHS (Hindsight Simulation): By simulating future outcomes of the interaction before providing feedback, we drastically reduce misalignment.

intro Introspective Planning: Aligning Robots' Uncertainty with Inherent Task Ambiguity
Kaiqu Liang, Zixu Zhang, Jaime Fernández Fisac
Neural Information Processing Systems (NeurIPS), 2024
paper / website / code / poster / tweet

We propsoed introspective planning as a systematic approach that aligns LLM’s uncertainty with the inherent ambiguity of the task.

intro Who Plays First? Optimizing the Order of Play in Stackelberg Games with Many Robots
Haimin Hu, Gabriele Dragotto, Zixu Zhang, Kaiqu Liang, Bartolomeo Stellato, Jaime Fernández Fisac
Robotics: Science and Systems (RSS), 2024

We introduced Branch and Play (B&P), an algorithm that effectively resolves multi-agent spatial navigation problems by determining the optimal order of play.

intro Simple Baselines for Interactive Video Retrieval with Questions and Answers
Kaiqu Liang, Samuel Albanie
International Conference on Computer Vision (ICCV), 2023
paper / code

We proposed several simple yet effective baselines for interactive video retrieval via question-answering.

intro Path Independent Equilibrium Models Can Better Exploit Test-Time Computation
Cem Anil*, Ashwini Pokle*, Kaiqu Liang*, Johannes Treutlein, Yuhuai Wu, Shaojie Bai, Zico Kolter, Roger Grosse
Neural Information Processing Systems (NeurIPS), 2022

We demonstrated that equilibrium model improves generalization in harder instances due to their path independence, highlighting its importance for model performance and scalability.

intro Out-of-Distribution Generalization with Deep Equilibrium Models
Kaiqu Liang*, Cem Anil*, Yuhuai Wu, Roger Grosse
ICML Workshop on Uncertainty and Robustness in Deep Learning , 2021

We demonstrated and discussed why Deep Equilibrium (DEQ) Models outperform fixed-depth counterparts in generalizing under distribution shifts.


Education

Princeton University, USA
Ph.D. in Computer Science • Aug. 2022 to Now
Cambridge University, UK
MPhil in Machine Learning and Machine Intelligence • Oct. 2021 to Aug. 2022
University of Toronto, Canada
Honours Bachelor of Science • Sep. 2017 to May 2021
Computer Science Specialist & Statistics Major & Mathematics Minor
CGPA: 3.99/4.00 (94.1%)

Teaching

Teaching Assistant • ECE346/COS348/MAE346: Intelligent Robotic Systems • Princeton University
Teaching Assistant • COS 350: Ethics of computing • Princeton University
Teaching Assistant • CSC165: Mathematical Expression and Reasoning for Computer Science • University of Toronto

Reviewer services

International Conference on Learning Representations (ICLR)
Neural Information Processing Systems (NeurIPS)
International Conference on Machine Learning (ICML)
European Conference on Computer Vision (ECCV)

Website source from Jon Barron