Achieving Human Parity on Automatic Chinese to English News Translation
Technical Report 2018
Zhirui Zhang
Researcher and builder across large-model systems, translation, multimodal capability, and deployable intelligence.
Most recently: AI Technical Advisor and entrepreneurial partner at IDEA Research.
Profile
I focus on post-training and reasoning for large language models, building on a long research track in multilingual NLP, machine translation, speech translation, and dialogue systems. Across industry labs and product teams, I have worked on frontier-scale pretraining, general post-training, reasoning-oriented optimization, translation and multilingual capability building, and practical deployment in user-facing systems.
I received my Ph.D. from the University of Science and Technology of China through a joint training program with Microsoft Research Asia. My recent work bridges academic research and real-world model development, with a sustained interest in reliable, iterative, and deployable large-model systems.
Current Focus
Selected Work
Technical Report 2018
EMNLP 2023
ACL-IJCNLP 2021
AAAI 2019
NeurIPS 2020
ICLR 2023
arXiv 2026
NeurIPS 2025
ICML 2025
arXiv 2025
ACL 2024 Findings
Experience
AI Technical Advisor and entrepreneurial partner working on model-and-tool systems for MoonBit and more reliable code intelligence.
Technical expert leading general post-training, reasoning-model exploration, multilingual capability building, and practical large-model deployment.
Algorithm expert contributing to trillion-parameter MoE pretraining, FP8 language-model exploration, and long-context capability validation.
Senior researcher building translation training platforms, interactive translation models, personalized MT, and multilingual research systems.
Algorithm expert for multilingual translation, speech translation, automated training pipelines, and commercial translation services.
Research intern across MSRA and Redmond, working on neural machine translation, dialogue systems, and controllable text generation.
Service & Recognition
Long-term reviewer or program committee member for ACL, EMNLP, NAACL, AAAI, IJCAI, NeurIPS, ICML, and ICLR, with prior service as an ACL area chair.
Writing / Notes
I am setting up a lightweight blog for notes on post-training, translation, multimodal systems, and practical lessons from building deployable model stacks.
Contact