| 
            
            | Research 
                My current research interests include large language models, vision-language models, and trustworthy AI. I lead the development of Instella, a series of fully open language models at AMD.
                 We are hiring full-time research scientists and research interns in all areas of generative AI. Feel free to drop me an email with your CV if interested. Research collaborations are also welcome! |  
        
        
          
          
            | DRIFT: Directional Reasoning Injection for Fine-Tuning MLLMs Chao Huang, Zeliang Zhang, Jiang Liu, Ximeng Sun, Jialian Wu, Xiaodong Yu, Ze Wang, Chenliang Xu, Emad Barsoum, Zicheng Liu
 arXiv, 2025
 Project Page /
              arXiv
 |  
            | XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models Xingrui Wang, Jiang Liu, Chao Huang, Xiaodong Yu, Ze Wang, Ximeng Sun, Jialian Wu, Alan Yuille, Emad Barsoum, Zicheng Liu
 arXiv, 2025
 Project Page / 
              arXiv
 |  
            | ImageDoctor: Diagnosing Text-to-Image Generation via Grounded Image Reasoning Yuxiang Guo*, Jiang Liu*, Ze Wang, Hao Chen, Ximeng Sun, Yang Zhao, Jialian Wu, Xiaodong Yu, Zicheng Liu, Emad Barsoum
 arXiv, 2025
 arXiv /
              Project Page
 |  
            | Latent Visual Reasoning Bangzheng Li, Ximeng Sun, Jiang Liu, Ze Wang, Jialian Wu, Xiaodong Yu, Hao Chen, Emad Barsoum, Muhao Chen, Zicheng Liu
 arXiv, 2025
 arXiv /
              Project Page
 |  
            | APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation Yuzhen Zhou, Jiajun Li, Yusheng Su, Gowtham Ramesh, Zilin Zhu, Xiang Long, Chenyang Zhao, Jin Pan, Xiaodong Yu, Ze Wang, Kangrui Du, Jialian Wu, Ximeng Sun, Jiang Liu, Qiaolin Yu, Hao Chen, Zicheng Liu, Emad Barsoum
 arXiv, 2025
 arXiv /
              Code
 |  
          | TTT-Bench: A Benchmark for Evaluating Reasoning Ability with Simple and Novel Tic-Tac-Toe-style Games Prakamya Mishra, Jiang Liu, Jialian Wu, Xiaodong Yu, Zicheng Liu, Emad Barsoum
 EMNLP Main Conference, 2025
 Project Page /
            arXiv /
            Data
 |  
          | Agent Laboratory: Using LLM Agents as Research Assistants Samuel Schmidgall, Yusheng Su, Ze Wang, Ximeng Sun, Jialian Wu, Xiaodong Yu, Jiang Liu, Michael Moor, Zicheng Liu, and Emad Barsoum
 EMNLP Findings, 2025
 Project Page /
            arXiv /
            code /
            bibtex
 |  
              | Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation Ze Wang, Hao Chen, Benran Hu, Jiang Liu, Ximeng Sun, Jialian Wu, Yusheng Su, Xiaodong Yu, Emad Barsoum, Zicheng Liu
 Tech Report, 2025
 Blog /
                Code /
                arXiv /
                Huggingface
 |  
              | Instella-Math: A Fully Open Language Model with Reasoning Capability Xiaodong Yu, Jiang Liu, Yusheng Su, Gowtham Ramesh, Zicheng Liu, et al.
 Tech Report, 2025
 Blog /
                Code /
                Huggingface
 |  
              | Instella-Long: A Fully Open Language Model with Long-Context Capability Jialian Wu, Jiang Liu, Sudhanshu Ranjan, Xiaodong Yu, Gowtham Ramesh, Prakamya Mishra, Zicheng Liu, et al.
 Tech Report, 2025
 Blog /
                Code /
                Huggingface
 |  
              | Instella: New State-of-the-art Fully Open 3B Language Models Jiang Liu, Jialian Wu, Xiaodong Yu, Prakamya Mishra, Sudhanshu Ranjan, Zicheng Liu, et al.
 Tech Report, 2025
 Blog /
                Code /
                Huggingface
 |  
          | Unleashing Hour-Scale Video Training for Long Video-Language Understanding Jingyang Lin, Jialian Wu, Ximeng Sun, Ze Wang, Jiang Liu, Yusheng Su, Xiaodong Yu, Hao Chen, Jiebo Luo, Zicheng Liu, Emad Barsoum
 NeurIPS, 2025 (Spotlight)
 Project Page /
            arXiv /
            code /
            dataset
 |  
          | DIFFNAT: Improving Diffusion Image Quality Using Natural Image Statistics Aniket Roy, Maitreya Suin, Anshul Shah, Ketul Shah, Jiang Liu, Rama Chellappa
 TMLR, 2025
 arXiv
 |  
          | MOVi: Training-free Text-conditioned Multi-Object Video Generation Aimon Rahman, Jiang Liu, Ze Wang, Ximeng Sun, Jialian Wu, Xiaodong Yu, Yusheng Su, Vishal M. Patel, Zicheng Liu, Emad Barsoum
 arXiv, 2025
 arXiv /
            code
 |  
          | KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation Xingrui Wang, Jiang Liu, Ze Wang, Xiaodong Yu, Jialian Wu, Ximeng Sun, Yusheng Su, Alan Yuille, Zicheng Liu, Emad Barsoum
 ICCV Workshop Gen4AVC, 2025
 Project Page /
            arXiv /
            code
 |  
          | Self-Taught Agentic Long Context Understanding Yufan Zhuang, Xiaodong Yu, Jialian Wu, Ximeng Sun, Ze Wang, Jiang Liu, Yusheng Su, Jingbo Shang, Zicheng Liu, Emad Barsoum
 ACL, 2025
 PDF /
            code
 |  
          | SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer Hao Chen, Ze Wang, Xiang Li, Ximeng Sun, Fangyi Chen, Jiang Liu, Jindong Wang, Bhiksha Raj, Zicheng Liu, Emad Barsoum
 CVPR, 2025
 PDF /
            code
 |  
              | AMD OLMo: Introducing the First AMD 1B Language Models Jiang Liu, Jialian Wu, Prakamya Mishra, Zicheng Liu, et al.
 Tech Report, 2025
 Blog /
                Huggingface
 |  
          | Instruct2Attack: Language-Guided Semantic Adversarial Attacks Jiang Liu, Chen Wei, Yuxiang Guo, Heng Yu, Alan Yuille, Soheil Feizi, Chun Pong Lau, Rama Chellappa
 Under Submission, 2024
 arXiv /
            bibtex
 |  
          | DiffProtect: Generate Adversarial Examples with Diffusion Models for Facial Privacy Protection Jiang Liu, Chun Pong Lau, Yuxiang Guo, Zhaoyang Wang, Rama Chellappa
 Under Submission, 2023
 arXiv /
            bibtex /
            code
 |  
          | Interpolated Joint Space Adversarial Training for Robust and Generalizable Defenses Chun Pong Lau, Jiang Liu, Hossein Souri, Wei-An Lin, Soheil Feizi, Rama Chellappa
 IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
 IEEE /
            arXiv /
            bibtex
 |  
          | One Model to Synthesize Them All: Multi-contrast Multi-scale Transformer for Missing Data Imputation Jiang Liu*, Srivathsa Pasumarthi*, Ben Duffy, Enhao Gong, Keshav Datta, Greg Zaharchuk (*equal contribution)
 IEEE Transactions on Medical Imaging (TMI), 2023
 IEEE /
            arXiv /
            bibtex
 |  
          | PolyFormer: Referring Image Segmentation as Sequential Polygon Generation Jiang Liu*, Hui Ding*, Zhaowei Cai, Yuting Zhang, Ravi Kumar Satzoda, Vijay Mahadevan, R. Manmatha (*equal contribution)
 CVPR, 2023
 Project Page /
            arXiv /
             code /
            bibtex
 |  
          | Segment and Complete: Defending Object Detectors Against Adversarial Patch Attacks With Robust Patch Detection Jiang Liu, Alexander Levine, Chun Pong Lau, Rama Chellappa, Soheil Feizi
 CVPR, 2022
 PDF /
            Supp /
            arXiv /
            bibtex /
            code /
            Apricot-Mask Dataset
 |  
          | Mutual Adversarial Training: Learning together is better than going alone Jiang Liu, Chun Pong Lau, Hossein Souri, Soheil Feizi, Rama Chellappa
 IEEE Transactions on Information Forensics and Security (TIFS), 2022
 IEEE /
            arXiv /
            bibtex
 |  
                Conference Reviewer: CVPR, ICCV, ECCV, ACL, ACM MM, FG, ACCV, AAAI, MICCAI
                Journal Reviewer: TPAMI, TIP, TIFS, TAI, TOPS, TMI, TCSVT, IJCV, MEDIA |