I am an associate professor in the Department of Computer and Information Science at the University of Macau, leading the Natural Language Processing & Portuguese-Chinese Machine Translation Research Group (NLP2CT Lab). I am working on natural language processing, focusing on deep learning and machine translation. I received the Second Prize in the Technological Invention Award and the Second-class Prize in the Science and Technology Progress Award category of the Macao Science and Technology Award in 2022 and 2012, respectively. My recent NSFC, MOST, and FDCT projects focus on low-resource machine translation.

Computational Linguistics, Natural Language Processing, Machine Translation, Machine Learning

News

  • 09-2024: We have four papers accepted to the NeurIPS 2024. Congratulations to Junchao Wu, Runzhe Zhan, Jianhui Pang, and all co-authors.
  • 09-2024: We have two papers accepted to the EMNLP 2024 main conference and two Findings papers. Congratulations to Renhao Li, Shudong Liu, Zhaocong Li, Pei Zhang, and all co-authors.
  • 06-2024: I am excited to receive the university-level Teaching Excellence Award, presented at the 2024 Congregation held on June 2.
  • 05-2024: I have been invited to serve as an Action Editor for TACL.
  • 05-2024: We have one paper accepted to the ACL 2024 main conference and six ACL Findings papers. Congratulations to Shanshan Wang, Runzhe Zhan, Jianhui Pang, Kaixin Lan, and all co-authors. Check out our papers on What is the Best Way for ChatGPT to Translate Poetry?, Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model, FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism for Language Models, Anchor-based Large Language Models, Benchmarking and Improving Long-Text Translation with Large Language Models, Domain-Aware k-Nearest-Neighbor Knowledge Distillation for Machine Translation, and Towards Demonstration-Aware Large Language Models for Machine Translation.
  • 03-2024: Received the Teaching Excellence Award, presented by the Faculty of Science and Technology (FST) of the University of Macau (UM).
  • 02-2024: We have three papers accepted to the LREC-COLING 2024 main conference. Congratulations to Jianhui Pang, Guanhua Chen, Xinyu Ma, and all co-authors. Check out our papers on MoNMT: Modularly Leveraging Monolingual and Bilingual Knowledge for Neural Machine Translation, A Two-Stage Prediction-Aware Contrastive Learning Framework for Multi-Intent NLU, and 3AM: An Ambiguity-Aware Multimodal Machine Translation Dataset.
  • 10-2023: We have a paper accepted to the EMNLP 2023. Congratulations to Andy Cheang and all co-authors. Check out our paper on Can LMs Generalize to Future Data? An Empirical Analysis on Text Summarization.
  • 06-2023: Our paper, Rethinking the Exploitation of Monolingual Data for Low-Resource Neural Machine Translation, was accepted by Computational Linguistics. 
  • 05-2023: We have five papers accepted to the ACL 2023 main conference, and two ACL Findings papers. Congratulations to Runzhe Zhan, Tao Fang, Shudong Liu, and all co-authors. Check out our papers on Test-time Adaptation for Machine Translation Evaluation by Uncertainty Minimization, kNN-TL: k-Nearest-Neighbor Transfer Learning for Low-Resource Neural Machine Translation, Revisiting Commonsense Reasoning in Machine Translation: Training, Evaluation and Challenge, Toward Human-Like Evaluation for Natural Language Generation with Error Analysis, TemplateGEC: Improving Grammatical Error Correction with Detection Template, Improving Grammatical Error Correction with Multimodal Feature Integration, and TransGEC: Improving Grammatical Error Correction with Translationese.
  • 02-2023: Our demo paper was accepted by the EACL 2023. Congratulations to Jingkun Ma and Runzhe Zhan. Check out the paper on Yu Sheng: Human-in-Loop Classical Chinese Poetry Generation System and the online system at: https://yusheng.cis.um.edu.mo/.
  • University of Macau, Macau SAR
    • Jul. 2022 ~ Jun. 2023: Interim Associate Dean of Faculty of Science and Technology
    • Aug. 2017 ~ present: Associate Professor
    • Feb. 2008 ~ Jul. 2017: Assistant Professor
    • Sep. 2005 ~ Jan. 2008: Research Fellow
  • Dublin City University, Dublin, Ireland
    • Sep. 2018 ~ Nov. 2018: Visiting Scholar
  • Institute of Systems and Computer Engineering of Macau, INESC-Macau, Macau SAR (2nd Appointment)
    • Feb. 2008 ~ Dec. 2011: Project Manager
    • Sep. 2005 ~ Jan. 2008: Research Fellow
  • University-level Teaching Excellence Award 2023/2024, University of Macau, 2024.
  • FST Teaching Excellence Award 2023/2024, University of Macau, 2024.
  • The Second Prize of the Technological Invention Award, Macao Science and Technology Award 2022
  • 2021/2022 Incentive Scheme for Outstanding Academic Staff, presented by the University of Macau Development Foundation (UMDF) of the University of Macau (UM), 2022.
  • FST Research Excellence Award, presented by the Faculty of Science and Technology (FST) of the University of Macau (UM), 2022.
  • FST Teaching Excellence Award 2017/2018, University of Macau, 2018.
  • The Second Prize of Macau Science and Technology Progress Award, Macao Science and Technology Award 2012.
  • Research on Chinese-Oriented Low-Resource Machine Translation Technologies, NSFC-FDCT, FDCT/060/2022/AFJ, 2023/01-2025/12, PI.
  • Modularized Semantic Neural Machine Translation: Fundamentals & Application, MOST-FDCT, FDCT/070/2022/AMJ, 2022/12-2025/12, PI.
  • Pretraining-based Automatic Evaluation of Machine Translation, MYRG, MYRG-GRG2023-00006-FST-UMDF, 2024/01-2025/12, PI.
  • Large Language Models for Machine Translation with Enhanced Discourse Modeling, Tencent AI Lab Rhino-Bird, 2023/07-2024/06, PI.
  • Research of Non-Autoregressive Neural Machine Translation, MYRG, MYRG2020-00054-FST, 2021/01-2023/12, PI.
  • How does a Machine Translate Human Language? The 4th Macau Young Scientists Conference Seminars and Award Ceremony, Macau, 2022
  • Advances in Neural Network Machine Translation and Its Applications, The NewTranx Forum: The Fourth Lecture, Shenzhen, 2022
  • Portuguese-Chinese Machine Translation Research, Artificial Intelligence & International Communication Forum, Beijing, 2022
  • Word or Attributes: Learning Better Bilingual Word Embeddings for NMT, First Symposium on Frontiers of Science and Technology, University of Macau, 2022
  • Neural Machine Translation Model Training & Evaluation Methods, Beihang University, 2021
  • Difficulty-Aware NMT Model Training & Evaluation Methods, The 2021 International Conference on Big Data and Machine Intelligence: Models, Algorithms and Applications (ICBM), 2021
  • A Cognitive Perspective for Understanding NMT Training and Evaluation, The Alibaba Apsara Conference 2021, 2021
  • Probing the Learning of an NMT Model and Its Training Strategy, NiuTrans Forum: The First Online Machine Translation Forum, 2021
  • Will AI Replace Human Translators? TEDx Senado Square 2020, 2020
  • Enhancing Neural MT via Linguistic Information, The iFLYTEK 1024 Global Developer Fest, 2019
  • Deep Learning for Portuguese-Chinese Machine Translation, Coimbra University, Portugal, 2019
  • Intelligent Translation Technologies: Building Multilingual Translation Ecosystem, Fábrica de Startup Rio, Rio de Janeiro, Brazil, 2019
  • Linguistic Inspired Machine Translation Model, Huawei Noah Ark Lab, 2019
  • The Research of Portuguese-Chinese Machine Translation Based on Deep Learning, Tsinghua University, 2019
  • Recent MT Research in UM: An Application to Portuguese-Chinese Translation, University of Macau, 2019
  • Enhancing NMT via Linguistic Information, ADAPT Centre, Dublin City University, Ireland, 2018
  • Bidirectional Hierarchical Representations for Attention-Based Neural Machine Translation, Soochow University, 2017
  • Chinese-Portuguese MT: Toward Better Chinese Word Segmentation for Statistical Machine Translation via Graph Knowledge Constraints, The 16th China-Japan Natural Language Processing Joint Promotion Conference (CJNLP 2016), Shenyang, China, 2016
  • Liangxin Liu, Xuebo Liu, Derek F. Wong, Dongfang Li, Ziyi Wang, Baotian Hu, and Min Zhang. SelectIT: Selective Instruction Tuning for LLMs via Uncertainty-Aware Self-Reflection. In: Proceedings of the Thirty-eighth Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, Canada, December 09 – 15, 2024.
  • YimingWang, Pei Zhang, Baosong Yang, Derek F.Wong, Zhuosheng Zhang, and RuiWang. Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning. In: Proceedings of the Thirtyeighth Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, Canada, December 09 – 15, 2024.
  • Junchao Wu, Runzhe Zhan, Derek F. Wong, Shu Yang, Xinyi Yang, Yulin Yuan, and Lidia S. Chao. DetectEval: Benchmarking LLM-Generated Text Detection in Real-World Scenarios. In: Proceedings of the Thirty-eighth Conference on Neural Information Processing Systems, Datasets and Benchmarks Track, NeurIPS 2024, Vancouver, Canada, December 09-15, 2024.
  • Fanghua Ye, Mingming Yang, Jianhui Pang, LongyueWang, Derek F. Wong, Emine Yilmaz, Shuming Shi, and Zhaopeng Tu. Benchmarking LLMs via Uncertainty Quantification. In: Proceedings of the Thirty-eighth Conference on Neural Information Processing Systems, Datasets and Benchmarks Track, NeurIPS 2024, Vancouver, Canada, December 09-15, 2024.
  • Renhao Li, Minghuan Tan, Derek F. Wong, and Min Yang. CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation. In: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024, Miami, Florida, November 12-16, 2024. Association for Computational Linguistics, 2024.
  • Shudong Liu, Zhaocong Li, Xuebo Liu, Runzhe Zhan, Derek F. Wong, Lidia S. Chao, and Min zhang. Can LLMs Learn Uncertainty on Their Own? Expressing Uncertainty Effectively in A Self-Training Manner. In: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024, Miami, Florida, November 12-16, 2024. Association for Computational Linguistics, 2024.
  • Zhipeng Qian, Pei Zhang, Baosong Yang, Kai Fan, Yiwei Ma, Derek F. Wong, Xiaoshuai Sun, and Rongrong Ji. AnyTrans: Translate AnyText in the Image with Large Scale Models. In: Findings of the Association for Computational Linguistics: EMNLP 2024. Association for Computational Linguistics, 2024.
  • Tianxiang Hu, Pei Zhang, Baosong Yang, Jun Xie, Derek F. Wong, and Rui Wang. Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning. In: Findings of the Association for Computational Linguistics: EMNLP 2024. Association for Computational Linguistics, 2024.
  • Jianhui Pang, Fanghua Ye, Longyue Wang, Dian Yu, Derek Fai Wong, Shuming Shi, and Zhaopeng Tu. Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models. In: Transactions of the Association for Computational Linguistics (TACL) 2024.
  • Yanming Sun, Xuebo Liu, Derek F.Wong, Yuchu Lin, Bei Li, Runzhe Zhan, Lidia S. Chao, and Min Zhang. Understanding and Improving Low-Resource Neural Machine Translation with Shallow Features. In: Proceedings of the Natural Language Processing and Chinese Computing – 13th CCF International Conference, NLPCC 2024, Hangzhou, China, October 31 – November 2, 2024. Ed. by Derek F.Wong, ZhongyuWei, and Muyun Yang. Lecture Notes in Computer Science. Springer, 2024.
  • Jingkun Ma, Runzhe Zhan, Derek F. Wong, and Lidia S. Chao. Activate Integrated Controllable Generation with Soft Prompt. In: Proceedings of the Natural Language Processing and Chinese Computing – 13th CCF International Conference, NLPCC 2024, Hangzhou, China, October 31 – November 2, 2024. Ed. by Derek F. Wong, ZhongyuWei, and Muyun Yang. Lecture Notes in Computer Science. Springer, 2024.
  • Xingchen Huang, Yujia Huo, Derek F.Wong, Yao Wang, Liqiong Cai, and Yonghong Jiang. AQLoRA: An Adaptive Quantization-based Efficient Fine-tuning Method for LLMs. In: Proceedings of the Natural Language Processing and Chinese Computing – 13th CCF International Conference, NLPCC 2024, Hangzhou, China, October 31 – November 2, 2024. Ed. by Derek F. Wong, Zhongyu Wei, and Muyun Yang. Lecture Notes in Computer Science. Springer, 2024.
  • ShanshanWang, Derek F.Wong, Jingming Yao, and Lidia S. Chao. What is the BestWay for ChatGPT to Translate Poetry? In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2024, Bangkok, Thailand, August 11-16, 2024.
  • Runzhe Zhan, Xinyi Yang, Derek F.Wong, Lidia S. Chao, and Yue Zhang. Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model. In: Findings of the 62nd Annual Meeting of the Association for Computational Linguistics: ACL 2024, Bangkok, Thailand, August 11-16, 2024.
  • Jianhui Pang, Fanghua Ye, Derek F.Wong, and LongyueWang. Anchor-based large language models. In: Findings of the 62nd Annual Meeting of the Association for Computational Linguistics: ACL 2024, Bangkok, Thailand, August 11-16, 2024.
  • Kaixin Lan, Tao Fang, Derek F.Wong, Yabo Xu, Lidia S. Chao, and Cecilia G Zhao. FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism for Language Models. In: Findings of the 62nd Annual Meeting of the Association for Computational Linguistics: ACL 2024, Bangkok, Thailand, August 11-16, 2024.
  • Chen Li, Meishan Zhang, Xuebo Liu, Zhaocong Li, Derek F.Wong, and Min Zhang. Towards Demonstration- Aware Large Language Models for Machine Translation. In: Findings of the 62nd Annual Meeting of the Association for Computational Linguistics: ACL 2024, Bangkok, Thailand, August 11-16, 2024.
  • Zhexuan Wang, Shudong Liu, Xuebo Liu, Miao Zhang, Derek F. Wong, and Min Zhang. Domain- Aware k-Nearest-Neighbor Knowledge Distillation for Machine Translation. In: Findings of the 62nd Annual Meeting of the Association for Computational Linguistics: ACL 2024, Bangkok, Thailand, August 11-16, 2024.
  • Longyue Wang, Zefeng Du, Wenxiang Jiao, Chenyang Lyu, Jianhui Pang, Leyang Cui, Kaiqiang Song, Derek F. Wong, Shuming Shi, and Zhaopeng Tu. Benchmarking and Improving Long-Text Translation with Large Language Models. In: Findings of the 62nd Annual Meeting of the Association for Computational Linguistics: ACL 2024, Bangkok, Thailand, August 11-16, 2024.
  • Pang, Jianhui, Baosong Yang, Derek F. Wong, Dayiheng Liu, Xiangpeng Wei, Jun Xie, and Lidia S. Chao. MoNMT: Modularly Leveraging Monolingual and Bilingual Knowledge for Neural Machine Translation. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 11560-11573. 2024.
  • Xinyu Ma, Xuebo Liu, Derek F. Wong, Jun Rao, Bei Li, Liang Ding, Lidia S. Chao, Dacheng Tao, and Min Zhang. 3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 1-13. 2024.
  • Guanhua Chen, Yutong Yao, Derek F. Wong, and Lidia S. Chao. A Two-Stage Prediction-Aware Contrastive Learning Framework for Multi-Intent NLU. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 1778–1788. 2024.
  • Chenyang Lyu, Zefeng Du, Jitao Xu, Yitao Duan, Minghao Wu, Teresa Lynn, Alham Fikri Aji, Derek F. Wong, and Longyue Wang. A Paradigm Shift: The Future of Machine Translation Lies with Large Language Models. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 1339-1352. 2024.
  • Guanhua Chen, Runzhe Zhan, Derek F. Wong, and Lidia S. Chao. Dynamic curriculum learning for conversation response selection. Knowledge-Based Systems 293 (2024): 111687.
  • Jianhui Pang, Baosong Yang, Derek F. WongYu Wan, Dayiheng Liu, Lidia S. Chao, and Jun Xie. Rethinking the Exploitation of Monolingual Data for Low-Resource Neural Machine TranslationComputational Linguistics (CL), pp. 1–22, 2024
  • Data
    • UM-Corpus: A large, multi-domain English-Chinese parallel corpus (Tian et al., 2014).
    • UM-pCorpus: A Large Portuguese-Chinese Parallel Corpus (Chao et al., 2018).
  • Software
    • UM-CAT: The University of Macau’s online computer-aided translation platform for Portuguese, Chinese, and English.
    • UM-NMT: An online neural machine translation for Portuguese and Chinese.
    • Pomotion: Portuguese Morphological Variation analyzer, featured with word lemmatizer and verb conjugation.
  • Graduate Courses
    • Fall, 2019 ~ 2021 Data Science and Data Visualization
    • Fall, 2016 ~ 2021 Applied Natural Language Processing
    • Spring, 2012 ~ 2015 Machine Translation
    • Fall, 2016 University Teaching II
    • Spring, 2015 University Teaching I
  • Undergraduate Courses
    • Spring 2018 ~ 2021 Natural Language Processing
    • Fall 2018 Introduction to Database Systems
    • Fall, 2012 ~ 2017 Introduction to Natural Language Processing
    • Fall, 2018 Computer Organization
    • Fall, 2009 ~ 2017 Compiler Construction
  • Current Students
    • Cuilian Zhang, PhD, since 2017
    • Mingzhou Xu, PhD, since 2017
    • Yu Wan, PhD, since 2018
    • Wai Lei Song, Master, since 2019
    • Tao Fang, PhD, since 2020
    • Songsheng Wang, PhD, since 2020
    • Jianjui Pang, PhD, since 2020
    • Zhaocong Li, Master, since 2020
    • Yanming Sun, Master, since 2020
    • Zefeng Du, Master, since 2020
    • Zhihong Huang, Master, since 2020
    • Runzhe Zhan, PhD, since 2021
    • Shanshan Wang, PhD, since 2022
    • Guanhua Chen, PhD, since 2023
  • Organizations
    • 2023 ~ present: Member, NLPCC  Technical Committee, China Computer Federation (CCF)
    • 2019 ~ present: Member, Machine Translation Steering Committee, Chinese Information Processing Society of China (CIPS)
    • 2019 ~ 2021: Member-at-Large, Executive Committee Members, Asian Federation of Natural Language (AFNLP)
    • 2019 ~ present: Vice President, Board of Director, Macau Greater Bay Area Artificial Intelligence Institute
    • 2015 ~ 2016: Chair, IEEE Engineering in Medicine and Biology Society (EMBS) Hong Kong – Macau Joint Chapter
    • 2012 ~ present: Member, Board of Director, Chinese Information Processing Society of China (CIPS)
    • 2012 ~ present: President, Board of Director, Macau Web Intelligence Consortium (WIC) Association
    • 2010 ~ present: Vice President, Board of Supervisors, Macau Society of Biomedical Engineering (MSBME)
  • Journals
    • Associate Editor: IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP, 2023~present)
    • Associate Editor: ACM Transactions on Asian and Low Resource Language Information Processing (TALLIP, 2022~present)
    • Action Editor: Transactions of the Association for Computational Linguistics (TACL, 2024~present)
    • Action Editor: ACL Rolling Review (ARR, 2023~present)
    • Editorial Board Member: Machine Translation Journal, Springer (2019~present)
    • Editorial Board Member: Natural Language Processing, Elsevier (2022~present)
    • Reviewer: ACL Rolling Review (ARR, 2021), Computational Linguistics (CL, 2020~present), Transactions of the Association for Computational Linguistics (TACL, 2020~2022), ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP, 2016~present), IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP, 2015~present), Neurocomputing (2021~present)
  • Conferences
    • Organizer: MT Summit (2023), IJCAI (2019), CWMT (2014), O-COCOSDA (2012)
    • Program Co-Chair: NLPCC (2024), CWMT (2017), O-COCOSDA (2012)
    • Senior PC Member/Area Chair: NAACL (2024), ECAI (2024), IJCAI (2020)
    • Area Co-Chair: LREC-COLING (2024), EMNLP (2020), CCL (2016, 2019, 2021), NLPCC (2019, 2021)
    • Demo Co-Chair: AACL-IJCNLP (2020), EMNLP (2019)
    • Publication Co-Chair: COLING (2020), IJCNLP (2017)
    • Local Organization Committee Member: EMNLP-IJCNLP (2019), IJCAI (2019), CWMT (2014)
    • Sponsorship Co-Chair: NLPCC (2020), WAIM (2014)
    • Student Research Workshop Co-Chair: IJCNLP-AACL (2023), ACL-IJCNLP (2021)
    • PC Member: ICML (2021~ ), ICLR (2021~ ), NeurIPS (2020~ ), AAAI (2019~ ), IJCAI (2019~ ), ACL (2014~ ), EMNLP (2015~ ), NAACL-HLT (2016~ ), EACL (2021), MT Summit (2021), COLING (2014, 2018, 2020), CCL (2016~ ), NLPCC (2015, 2018~ )

Copyright © Derek F. Wong. Last Updates: 2021-08