Publications

* means equal contributions.

[WAFR’24] Dongjie Yu*, Hang Xu*, Yizhou Chen, Yi Ren and Jia Pan. “BiKC: Keypose-Conditioned Consistency Policy for Bimanual Robotic Manipulatio”. In The 16th International Workshop on the Algorithmic Foundations of Robotics, 2024.
1. tldr: We present Bimanual Keypose-Conditioned Consistency Policy (BiKC) to enhance the operational reliability and eﬃciency by overcoming per-stage error and per-step latency accumulated across multiple stages.
2. [Paper] [Code]
[ICML’22] Dongjie Yu*, Haitong Ma*, Shengbo Eben Li and Jianyu Chen. “Reachability Constrained Reinforcement Learning”. In Proceedings of the 39th International Conference on Machine Learning, PMLR 162:25636-25655, 2022.
1. tldr: We characterize feasible sets (persistently safe states) with Hamilton-Jacobi reachability analysis in constrained RL. The algorithm reachs a zero-violation policy with competitive performance.
2. [Paper] [Code_learning] [Code_env]

[T-ASE] Dongjie Yu*, Wenjun Zou, Yujie Yang, Haitong Ma, Shengbo Eben Li, Yuming Yin, Jianyu Chen and Jingliang Duan. “Safe Model-Based Reinforcement Learning with an Uncertainty-Aware Reachability Certificate”. IEEE Transactions on Automation Science and Engineering, DOI: 10.1109/TASE.2023.3292388, 2023.
1. tldr: We propose a distributional form of reachability certificate to consider the uncertaity of the system dynamics in safe RL. The algorithm is able to learn a zero-violation policy with competitive performance, significantly reducing violations during exploration.
2. [Paper] [Code]