Research
My current research interests include large language models, vision-language models, and trustworthy AI. I lead the development of Instella, a series of fully open language models at AMD.
We are hiring full-time research scientists and research interns in all areas of generative AI. Feel free to drop me an email with your CV if interested. Research collaborations are also welcome!
|
TTT-Bench: A Benchmark for Evaluating Reasoning Ability with Simple and Novel Tic-Tac-Toe-style Games
Prakamya Mishra, Jiang Liu, Jialian Wu, Xiaodong Yu, Zicheng Liu, Emad Barsoum
EMNLP Main Conference, 2025
Project Page /
arXiv /
Data
|
Agent Laboratory: Using LLM Agents as Research Assistants
Samuel Schmidgall, Yusheng Su, Ze Wang, Ximeng Sun, Jialian Wu, Xiaodong Yu, Jiang Liu, Michael Moor, Zicheng Liu, and Emad Barsoum
EMNLP Findings, 2025
Project Page /
arXiv /
code /
bibtex
|
Instella-T2I: Pushing the Limits of 1D Discrete
Latent Space Image Generation
Ze Wang, Hao Chen, Benran Hu, Jiang Liu, Ximeng Sun, Jialian Wu, Yusheng Su, Xiaodong Yu, Emad
Barsoum, Zicheng Liu
Tech Report, 2025
Blog /
Code /
arXiv /
Huggingface
|
Instella-Math: A Fully Open Language Model with Reasoning Capability
Xiaodong Yu, Jiang Liu, Yusheng Su, Gowtham Ramesh, Zicheng Liu et al.
Tech Report, 2025
Blog /
Code /
Huggingface
|
Instella-Long: A Fully Open Language Model with Long-Context Capability
Jialian Wu, Jiang Liu, Sudhanshu Ranjan, Xiaodong Yu, Gowtham Ramesh, Prakamya Mishra, Zicheng Liu, et al.
Tech Report, 2025
Blog /
Code /
Huggingface
|
Instella: New State-of-the-art Fully Open 3B Language Models
Jiang Liu, Jialian Wu, Xiaodong Yu, Prakamya Mishra, Sudhanshu Ranjan, Zicheng Liu, et al.
Tech Report, 2025
Blog /
Code /
Huggingface
|
Unleashing Hour-Scale Video Training for Long Video-Language Understanding
Jingyang Lin, Jialian Wu, Ximeng Sun, Ze Wang, Jiang Liu, Yusheng Su, Xiaodong Yu, Hao Chen, Jiebo Luo, Zicheng Liu, Emad Barsoum
arXiv, 2025
Project Page /
arXiv /
code /
dataset
|
DIFFNAT: Improving Diffusion Image Quality Using Natural Image Statistics
Aniket Roy, Maiterya Suin, Anshul Shah, Ketul Shah, Jiang Liu, Rama Chellappa
TMLR, 2025
arXiv
|
MOVi: Training-free Text-conditioned Multi-Object Video Generation
Aimon Rahman, Jiang Liu, Ze Wang, Ximeng Sun, Jialian Wu, Xiaodong Yu, Yusheng Su, Vishal M. Patel, Zicheng Liu, Emad Barsoum
arXiv, 2025
arXiv /
code
|
KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation
Xingrui Wang, Jiang Liu, Ze Wang, Xiaodong Yu, Jialian Wu, Ximeng Sun, Yusheng Su, Alan Yuille, Zicheng Liu, Emad Barsoum
ICCV Workshop Gen4AVC, 2025
Project Page /
arXiv /
code
|
Self-Taught Agentic Long Context Understanding
Yufan Zhuang, Xiaodong Yu, Jialian Wu, Ximeng Sun, Ze Wang, Jiang Liu, Yusheng Su, Jingbo Shang, Zicheng Liu, Emad Barsoum
ACL, 2025
PDF /
code
|
SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer
Hao Chen, Ze Wang, Xiang Li, Ximeng Sun, Fangyi Chen, Jiang Liu,
Jindong Wang, Bhiksha Raj, Zicheng Liu, Emad Barsoum
CVPR, 2025
PDF /
code
|
AMD OLMo: Introducing the First AMD 1B Language Models
Jiang Liu, Jialian Wu, Prakamya Mishra, Zicheng Liu et al.
Tech Report, 2025
Blog /
Huggingface
|
Instruct2Attack: Language-Guided Semantic Adversarial Attacks
Jiang Liu,
Chen Wei, Yuxiang Guo, Heng Yu, Alan Yuille, Soheil Feizi, Chun Pong Lau, Rama Chellappa
Under Submission, 2024
arXiv /
bibtex
|
DiffProtect: Generate Adversarial Examples with Diffusion Models for Facial Privacy Protection
Jiang Liu,
Chun Pong Lau, Yuxiang Guo, Zhaoyang Wang, Rama Chellappa
Under Submission, 2023
arXiv /
bibtex /
code
|
Interpolated Joint Space Adversarial Training for Robust and Generalizable Defenses
Chun Pong Lau,
Jiang Liu,
Hossein Souri, Wei-An Lin, Soheil Feizi, Rama Chellappa
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
IEEE /
arXiv /
bibtex
|
One Model to Synthesize Them All: Multi-contrast Multi-scale Transformer for Missing Data Imputation
Jiang Liu*,
Srivathsa Pasumarthi*, Ben Duffy, Enhao Gong, Keshav Datta, Greg Zaharchuk (*equal contribution)
IEEE Transactions on Medical Imaging (TMI), 2023
IEEE /
arXiv /
bibtex
|
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation
Jiang Liu*, Hui Ding*, Zhaowei Cai, Yuting Zhang, Ravi Kumar Satzoda, Vijay Mahadevan, R. Manmatha (*equal contribution)
CVPR, 2023
Project Page /
arXiv /
code /
bibtex
|
Segment and Complete: Defending Object Detectors Against Adversarial Patch Attacks With Robust Patch Detection
Jiang Liu,
Alexander Levine, Chun Pong Lau, Rama Chellappa, Soheil Feizi
CVPR, 2022
PDF /
Supp /
arXiv /
bibtex /
code /
Apricot-Mask Dataset
|
Mutual Adversarial Training: Learning together is better than going alone
Jiang Liu,
Chun Pong Lau,
Hossein Souri,
Soheil Feizi,
Rama Chellappa
IEEE Transactions on Information Forensics and Security (TIFS), 2022
IEEE /
arXiv /
bibtex
|
- Conference Reviewer:
CVPR, ICCV, ECCV, ACL, ACM MM, FG, ACCV, AAAI, MICCAI
- Journal Reviewer:
TPAMI, TIP, TIFS, TAI, TOPS, TMI, TCSVT, IJCV, MEDIA
|