Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities
Chung-En Sun, Xiaodong Liu, Weiwei Yang, Tsui-Wei Weng, Hao Cheng, Aidan San, Michel Galley, Jianfeng Gao
NAACL 2025 | April 2025
Selected for oral presentation
Zhuoshi Pan, Qianhui Wu, Huiqiang Jiang, Xufang Luo, Hao Cheng, Dongsheng Li, Yuqing Yang, Chin-Yew Lin, H. Vicky Zhao, Lili Qiu, Jianfeng Gao
ICLR 2025 | April 2025
Shirley Wu, Michel Galley, Baolin Peng, Hao Cheng, Gavin Li, Yao Dou, Weixin Cai, James Zou, J. Leskovec, Jianfeng Gao
February 2025
Yangyu Huang, Tianyi Gao, Haoran Xu, Qihao Zhao, Yang Song, Zhipeng Gui, Tengchao Lv, Hao Cheng, Lei Cui, Scarlett Li, Furu Wei
January 2025
Tong Chen, Hao Fang, Patrick Xia, Xiaodong Liu, Ben Van Durme, Luke Zettlemoyer, Jianfeng Gao, Hao Cheng
ICLR 2025 | November 2024
Xiao Yu, Baolin Peng, Vineeth Vajipey, Hao Cheng, Michel Galley, Jianfeng Gao, Zhou Yu
MSR-TR-2024-63 | October 2024
Published by Microsoft
Chengyu Dong, Liyuan Liu, Hao Cheng, Jingbo Shang, Jianfeng Gao, Xiaodong Liu
ICLR 2024 | October 2023
Pan Lu, Hritik Bansal, Tony Xia, Jiacheng Liu, Chun-yue Li, Hannaneh Hajishirzi, Hao Cheng, Kai-Wei Chang, Michel Galley, Jianfeng Gao
ICLR 2024 | October 2023
Weizhi Wang, Li Dong, Hao Cheng, Xiaodong Liu, Xifeng Yan, Jianfeng Gao, Furu Wei
NeurIPS 2023 | October 2023
Pan Lu, Baolin Peng, Hao Cheng, Michel Galley, Kai-Wei Chang, Ying Nian Wu, Song-Chun Zhu, Jianfeng Gao
NeurIPS 2023 | October 2023
Hao Cheng, Hao Fang, Xiaodong Liu, Jianfeng Gao
ACL 2023 | July 2023
Zelalem Gero, Chandan Singh, Hao Cheng, Tristan Naumann, Michel Galley, Jianfeng Gao, Hoifung Poon
May 2023
Baolin Peng, Michel Galley, Pengcheng He, Hao Cheng, Yujia Xie, Yu Hu, Qiuyuan Huang, Lars Liden, Zhou Yu, Weizhu Chen, Jianfeng Gao
MSR-TR-2023-46 | March 2023
Published by Microsoft
Yichong Xu, Chenguang Zhu, Shuohang Wang, Siqi Sun, Hao Cheng, Xiaodong Liu, Jianfeng Gao, Pengcheng He, Michael Zeng, Xuedong Huang
IJCAI 2022 | July 2022
human parity result on CommonsenseQA
Sheng Zhang, Hao Cheng, Shikhar Vashishth, Cliff Wong, Jinfeng Xiao, Xiaodong Liu, Tristan Naumann, Jianfeng Gao, Hoifung Poon
EMNLP 2022 | May 2022
Kaixin Ma, Hao Cheng, Xiaodong Liu, Eric Nyberg, Jianfeng Gao
ACL 2022 | May 2022
Subhabrata (Subho) Mukherjee, Xiaodong Liu, Guoqing Zheng, Saghar Hosseini, Hao Cheng, Greg Yang, Chris Meek, Ahmed Awadallah, Jianfeng Gao
NeurIPS 2021 | December 2021
Chia-Hsuan Lee, Hao Cheng, Mari Ostendorf
EMNLP 2021 | September 2021
Yu Wang, Jinchao Li, Tristan Naumann, Chenyan Xiong, Hao Cheng, Rob Tinn, Cliff Wong, Naoto Usuyama, Rick Rogahn, Zhihong Shen, Yang Qin, Eric Horvitz, Paul Bennett, Jianfeng Gao, Hoifung Poon
KDD 2021 | August 2021
Hao Cheng, Yelong Shen, Xiaodong Liu, Pengcheng He, Wei Chen, Jianfeng Gao
ACL-IJCNLP 2021 | December 2020
Hao Cheng, Xiaodong Liu, Lis Pereira, Yaoliang Yu, Jianfeng Gao
October 2020
Yu Gu, Robert Tinn, Hao Cheng, Michael Lucas, Naoto Usuyama, Xiaodong Liu, Tristan Naumann, Jianfeng Gao, Hoifung Poon
August 2020
Hao Cheng, Ming-Wei Chang, Kenton Lee, Kristina Toutanova
ACL 2020 | June 2020
Xiaodong Liu, Yu Wang, Jianshu Ji, Hao Cheng, Xueyun Zhu, Emmanuel Awa, Pengcheng He, Weizhu Chen, Hoifung Poon, Guihong Cao, Jianfeng Gao
ACL 2020 | February 2020