Chair of Baidu Technical Committee
Email: wu_hua@baidu.com
Address: Baidu Technology Park Building No. 1, No. 10 Xibeiwang East Road, Haidian District, Beijing, 100093, China
I joined Baidu in 2010 and am now the technical leader of Baidu's NLP department and knowledge graph department. Before that, I worked at the Toshiba (China) R&D Center and Microsoft Research Asia (MSRA). I obtained my Ph.D. degree in pattern recognition and intelligent systems from the Institute of Automation, Chinese Academy of Sciences, in 2001.
My research interests include dialogue systems, machine translation, natural language processing, and knowledge graphs.
News
- We are hiring (both interns and employees)! Please drop me an email with your resume if you are interested in working with us on NLP problems, including but not limited to Dialogue Systems, Machine Translation, Question Answering, Distributed Representation, Generation, and Knowledge Graph. Experience with machine learning (including but not limited to deep learning) for NLP is preferred.
- We are organizing the Workshop on Simultaneous Translation (2022, 2021, 2020), where there is a shared task on Chinese-English and English-Spanish simultaneous translation.
- Our PLATO-2 model was ranked top 1 at DSTC9 tracks 1, 2 and 3 and PLATO-XL was ranked top 1 at DSTC10 tracks 1 and 2.
- We launched LUGE (Language Understanding and Generation Evaluation Benchmarks) for Chinese NLP, which aims to provide researchers with a variety of datasets and evaluations and to jointly promote progress in Chinese NLP technology. A recent introduction is available here (in Chinese). If you are interested in LUGE or in sharing datasets, please contact me.
Professional Activities
- Program co-chair of AACL 2020, ACL 2014
- Action editor of TACL, starting from July, 2021
- Area chair or SPC of ACL, IJCAI and AAAI
- Co-organizer of the first Workshop on Automatic Simultaneous Translation 2020
- Co-organizer of the ICDAR Workshop of Document Image and Language 2021
Research
♠ Open-Domain Dialogue Systems
The aim of an open-domain dialogue system is to make machines capable of chatting, answering questions, and completing tasks, and to give them the ability to learn rapidly and evolve continuously. Its core competencies are as follows:
- Understanding: understand natural languages
- Expression: express in fluent natural languages
- Emotion: understand emotions and respond with appropriate emotions
- Thinking: Context-based calculation, reasoning and decision making
- Learning: Capable of learning and evolution
-
Large-scale pre-trained response generation model
Based on the available large-scale open-domain conversations, we pre-trained a response generation model, PLATO-2, via curriculum learning. We have released our English models and source code on GitHub. PLATO-2 was ranked top 1 in the DSTC 9 Track 1, Track 2, and Track 3 shared tasks. We also trained a model named PLATO-XL with 10 billion parameters.
-
Knowledge-grounded policy learning and response generation
We leverage graphs to guide policy learning. Different kinds of graphs are used, including knowledge graphs, conversation graphs constructed from query logs, and event graphs constructed from stories. Several papers were published at AAAI 2020, ACL 2020, and IJCAI 2020.
-
Datasets for knowledge-grounded dialogue systems
DuConv: This corpus is designed to facilitate research towards building a human-like conversational agent by endowing it with the ability to proactively lead the conversation. In DuConv, one participant acts as the conversation leader and the other acts as the follower. The leader is provided with a knowledge graph and asked to sequentially change the discussion topics, following the given conversation goal, while keeping the dialogue as natural and engaging as possible. DuConv poses a very challenging task, as the model needs to both understand the dialogue and plan over the given knowledge graph. The dataset contains about 270k utterances and 30k dialogues.
DuRecDial: This corpus is designed to facilitate conversational recommendation over multi-type dialogs, where the bot can proactively and naturally lead a conversation from a non-recommendation dialog (e.g., QA) to a recommendation dialog, taking the user's interests and feedback into account. DuRecDial contains about 10k dialogs and 156k utterances. In each dialog, the recommender proactively leads a multi-type dialog to approach recommendation targets and then makes multiple recommendations with rich interaction behavior. This dataset allows us to systematically investigate different parts of the overall problem, e.g., how to naturally lead a dialog and how to interact with users for recommendation.
♠ Machine Translation
Since 2010, we have been working on an online machine translation product named Baidu Translate, which translates among 203 languages. In 2011, we launched the statistical machine translation service. In May 2015, we launched the world’s first neural machine translation service. Besides text translation, Baidu Translate supports speech-to-speech translation, simultaneous translation, and OCR/image translation.
-
Simultaneous Translation
We co-organized the first Workshop on Automatic Simultaneous Translation 2020, where we released the first Chinese-English simultaneous translation dataset, which contains about 70 hours of Chinese speech audio, human transcripts, ASR results, and English translations. To balance translation quality against translation efficiency, we proposed several methods, including wait-k and an adaptive segmentation method based on meaningful units.
-
Multilingual Translation
For most language pairs, such as Chinese-Spanish, Chinese-Japanese, and Chinese-Thai, there is a data sparseness problem. Besides pivot language approaches, we proposed a one-to-many translation method in 2015, which shares the source language encoder and uses an individual decoder for each target language.
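The one-to-many idea above (one shared source encoder, one decoder per target language) can be illustrated with a minimal sketch. This is a toy under stated assumptions, not the published model: the class names `SharedEncoder`, `Decoder`, and `OneToManyModel`, the averaged-embedding encoder, and the single linear output layer are all hypothetical simplifications chosen only to show how parameters are shared.

```python
import random


def make_matrix(rows, cols, rng):
    """Small random weight matrix (toy initialisation)."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class SharedEncoder:
    """Source-side parameters, used for every target language."""

    def __init__(self, vocab_size, hidden, rng):
        self.embed = make_matrix(vocab_size, hidden, rng)

    def encode(self, token_ids):
        # Toy sentence representation: the mean of the token embeddings.
        hidden = len(self.embed[0])
        total = [0.0] * hidden
        for tok in token_ids:
            for i, value in enumerate(self.embed[tok]):
                total[i] += value
        return [value / len(token_ids) for value in total]


class Decoder:
    """Target-side parameters, owned by a single target language."""

    def __init__(self, hidden, vocab_size, rng):
        self.out = make_matrix(vocab_size, hidden, rng)

    def scores(self, state):
        # One score per target-vocabulary entry.
        return [sum(w * x for w, x in zip(row, state)) for row in self.out]


class OneToManyModel:
    """One shared encoder plus an individual decoder per target language."""

    def __init__(self, src_vocab, tgt_vocabs, hidden=8, seed=0):
        rng = random.Random(seed)
        self.encoder = SharedEncoder(src_vocab, hidden, rng)
        self.decoders = {
            lang: Decoder(hidden, size, rng) for lang, size in tgt_vocabs.items()
        }

    def translate_scores(self, token_ids, lang):
        state = self.encoder.encode(token_ids)  # same encoder for all languages
        return self.decoders[lang].scores(state)  # language-specific decoder


model = OneToManyModel(src_vocab=50, tgt_vocabs={"es": 30, "ja": 40})
es_scores = model.translate_scores([1, 2, 3], "es")
ja_scores = model.translate_scores([1, 2, 3], "ja")
```

Because every language pair passes through the same encoder, training data for one target language also updates the source representation used by the others, which is the intuition behind applying this setup to sparse language pairs.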
♠ Pre-trained Model: ERNIE
-
Recently, pre-trained models have achieved state-of-the-art results in various language understanding tasks. In order to extract lexical, syntactic, and semantic information from training corpora, we proposed a continual pre-training framework named ERNIE 2.0, which incrementally builds pre-training tasks and then learns pre-trained models on these constructed tasks via continual multi-task learning. Based on this framework, we constructed several tasks and trained the ERNIE 2.0 model to capture lexical, syntactic, and semantic aspects of information in the training data. Experimental results demonstrate that ERNIE 2.0 outperforms BERT and XLNet on 16 tasks, including English tasks on the GLUE benchmark and several similar tasks in Chinese. The source code and pre-trained models have been released.
-
We also trained ERNIE 3.0 with 10 billion parameters, which integrates knowledge graphs into the model and handles both natural language understanding and text generation in a unified framework. It achieved SOTA results on many NLP tasks.
♠ Question Answering and Machine Reading Comprehension
-
We developed Question Answering and Machine Reading Comprehension methods, which are used in the Baidu search engine. Recently, we proposed RocketQA, an optimized training approach to dense passage retrieval for open-domain question answering. RocketQA achieved first place on the leaderboard of the MS MARCO Passage Ranking task. We released a Chinese dataset, DuReader_robust, for evaluating the robustness of machine reading comprehension models, and we hosted a shared task based on DuReader_robust [Data&Code, Leaderboard].
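For readers unfamiliar with dense passage retrieval, the retrieval step itself can be sketched as follows: questions and passages are mapped to vectors, and passages are ranked by their dot product with the question. This is only an illustration under stated assumptions. The `embed` function is a hypothetical hashed bag-of-words stand-in for a trained neural encoder, and nothing here reflects how RocketQA trains its encoders.

```python
import math
import zlib

DIM = 64  # toy embedding size (assumption)


def embed(text):
    """Toy stand-in for a trained encoder: hashed bag of words, L2-normalised."""
    vec = [0.0] * DIM
    for word in text.lower().split():
        vec[zlib.crc32(word.encode("utf-8")) % DIM] += 1.0
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else vec


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def retrieve(question, passages, top_k=2):
    """Rank passages by dot-product similarity to the question vector."""
    q_vec = embed(question)
    scored = [(dot(q_vec, embed(p)), p) for p in passages]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


passages = [
    "machine translation among many languages",
    "open-domain dialogue systems chat with users",
    "knowledge graphs store entities and relations",
]
results = retrieve("machine translation among many languages", passages)
```

In a real system the passage vectors would be computed once and stored in an approximate nearest-neighbour index, so that only the question needs to be encoded at query time.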
Papers [Google Scholar]
-
Ernie-vil: Knowledge enhanced vision-language representations through scene graph
Fei Yu, Jiji Tang, Weichong Yin, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang
-
Plato-2: Towards building an open-domain chatbot via curriculum learning
Siqi Bao, Huang He, Fan Wang, Hua Wu, Haifeng Wang, Wenquan Wu, Zhen Guo, Zhibin Liu, Xinchao Xu
-
Knowledge Graph Grounded Goal Planning for Open-Domain Conversation Generation, AAAI 2020
Jun Xu, Haifeng Wang, Zhengyu Niu, Hua Wu, Wanxiang Che
-
ERNIE 2.0: A Continual Pre-Training Framework for Language Understanding, AAAI 2020
Yu Sun, Shuohuan Wang, Yu-Kun Li, Shikun Feng, Hao Tian, Hua Wu, Haifeng Wang
-
Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding, AAAI 2020
Yuchen Liu, Jiajun Zhang, Hao Xiong, Long Zhou, Zhongjun He, Hua Wu, Haifeng Wang, Chengqing Zong
-
Towards Conversational Recommendation over Multi-Type Dialogs, ACL 2020
Zeming Liu, Haifeng Wang, Zheng-Yu Niu, Hua Wu, Wanxiang Che, Ting Liu
-
Conversational Graph Grounded Policy Learning for Open-Domain Conversation Generation, ACL 2020
Jun Xu, Haifeng Wang, Zheng-Yu Niu, Hua Wu, Wanxiang Che, Ting Liu
-
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis, ACL 2020
Hao Tian, Can Gao, Xinyan Xiao, Hao Liu, Bolei He, Hua Wu, Haifeng Wang, Feng Wu
-
Leveraging Graph to Improve Abstractive Multi-Document Summarization, ACL 2020
Wei Li, Xinyan Xiao, Jiachen Liu, Hua Wu, Haifeng Wang, Junping Du
-
Exploring Contextual Word-level Style Relevance for Unsupervised Style Transfer, ACL 2020
Chulun Zhou, Liangyu Chen, Jiachen Liu, Xinyan Xiao, Jinsong Su, Sheng Guo, Hua Wu
-
PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable, ACL 2020
Siqi Bao, Huang He, Fan Wang, Hua Wu, Haifeng Wang
-
Syntactic and Semantic-driven Learning for Open Information Extraction, EMNLP (Findings) 2020
Jialong Tang, Yaojie Lu, Hongyu Lin, Xianpei Han, Le Sun, Xinyan Xiao, Hua Wu
-
Learning Adaptive Segmentation Policy for Simultaneous Translation, EMNLP (1) 2020
Ruiqing Zhang, Chuanqiang Zhang, Zhongjun He, Hua Wu, Haifeng Wang
-
ChiTeSQL: A Large-Scale and Pragmatic Chinese Text-to-SQL Dataset, EMNLP 2020
Lijie Wang, Ao Zhang, Kun Wu, Ke Sun, Zhenghua Li, Hua Wu, Min Zhang, Haifeng Wang
-
DuSQL: A Large-Scale and Pragmatic Chinese Text-to-SQL Dataset, EMNLP (1) 2020
Lijie Wang, Ao Zhang, Kun Wu, Ke Sun, Zhenghua Li, Hua Wu, Min Zhang, Haifeng Wang
-
Diversified Multiple Instance Learning for Document-Level Multi-Aspect Sentiment Classification, EMNLP (1) 2020
Yunjie Ji, Hao Liu, Bolei He, Xinyan Xiao, Hua Wu, Yanhua Yu
-
Enhancing Dialog Coherence with Event Graph Grounded Content Planning, IJCAI 2020
Jun Xu, Zeyang Lei, Haifeng Wang, Zheng-Yu Niu, Hua Wu, Wanxiang Che
-
ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation, IJCAI 2020
Dongling Xiao, Han Zhang, Yu-Kun Li, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang
-
DuReader_robust: A Chinese Dataset Towards Evaluating the Robustness of Machine Reading Comprehension Models, CoRR abs/2004
Hongxuan Tang, Jing Liu, Hongyu Li, Yu Hong, Hua Wu, Haifeng Wang
-
Modeling Coherence for Discourse Neural Machine Translation, AAAI 2019
Hao Xiong, Zhongjun He, Hua Wu, Haifeng Wang
-
Addressing the Under-Translation Problem from the Entropy Perspective, AAAI 2019
Yang Zhao, Jiajun Zhang, Chengqing Zong, Zhongjun He, Hua Wu
-
ARNOR: Attention Regularization based Noise Reduction for Distant Supervision Relation Classification, ACL (1) 2019
Wei Jia, Dai Dai, Xinyan Xiao, Hua Wu
-
Know More about Each Other: Evolving Dialogue Strategy via Compound Assessment, ACL (1) 2019
Siqi Bao, Huang He, Fan Wang, Rongzhong Lian, Hua Wu
-
Proactive Human-Machine Conversation with Explicit Conversation Goal, ACL (1) 2019
Wenquan Wu, Zhen Guo, Xiangyang Zhou, Hua Wu, Xiyuan Zhang, Rongzhong Lian, Haifeng Wang
-
Enhancing Pre-Trained Language Representations with Rich Knowledge for Machine Reading Comprehension, ACL (1) 2019
An Yang, Quan Wang, Jing Liu, Kai Liu, Yajuan Lyu, Hua Wu, Qiaoqiao She, Sujian Li
-
STACL: Simultaneous Translation with Implicit Anticipation and Controllable Latency using Prefix-to-Prefix Framework, ACL (1) 2019
Mingbo Ma, Liang Huang, Hao Xiong, Renjie Zheng, Kaibo Liu, Baigong Zheng, Chuanqiang Zhang, Zhongjun He, Hairong Liu, Xing Li, Hua Wu, Haifeng Wang
-
D-NET: A Pre-Training and Fine-Tuning Framework for Improving the Generalization of Machine Reading Comprehension, MRQA@EMNLP 2019
Hongyu Li, Xiyuan Zhang, Yibing Liu, Yiming Zhang, Quan Wang, Xiangyang Zhou, Jing Liu, Hua Wu, Haifeng Wang
-
Enhancing Local Feature Extraction with Global Representation for Neural Text Classification, EMNLP/IJCNLP (1) 2019
Guocheng Niu, Hengru Xu, Bolei He, Xinyan Xiao, Hua Wu, Sheng Gao
-
Multi-agent Learning for Neural Machine Translation, EMNLP/IJCNLP (1) 2019
Tianchi Bi, Hao Xiong, Zhongjun He, Hua Wu, Haifeng Wang
-
Knowledge Aware Conversation Generation with Explainable Reasoning over Augmented Graphs, EMNLP/IJCNLP (1) 2019
Zhibin Liu, Zheng-Yu Niu, Hua Wu, Haifeng Wang
-
A Key-Phrase Aware End2end Neural Response Generation Model, NLPCC (2) 2019
Jun Xu, Haifeng Wang, Zhengyu Niu, Hua Wu, Wanxiang Che
-
An Overview of the 2019 Language and Intelligence Challenge, NLPCC (2) 2019
Quan Wang, Wenquan Wu, Yabing Shi, Hongyu Li, Zhen Guo, Wei He, Hongyu Liu, Ying Chen, Yajuan Lyu, Hua Wu
-
Baidu Neural Machine Translation Systems for WMT19, WMT (2) 2019
Meng Sun, Bojian Jiang, Hao Xiong, Zhongjun He, Hua Wu, Haifeng Wang
-
Learning to Select Knowledge for Response Generation in Dialog Systems, IJCAI 2019
Rongzhong Lian, Min Xie, Fan Wang, Jinhua Peng, Hua Wu
-
Generating Multiple Diverse Responses with Multi-Mapping and Posterior Mapping Selection, IJCAI 2019
Chaotao Chen, Jinhua Peng, Fan Wang, Jun Xu, Hua Wu
-
ERNIE: Enhanced Representation through Knowledge Integration, CoRR abs/1904
Yu Sun, Shuohuan Wang, Yu-Kun Li, Shikun Feng, Xuyi Chen, Han Zhang, Xin Tian, Danxiang Zhu, Hao Tian, Hua Wu
-
DuTongChuan: Context-aware Translation Model for Simultaneous Interpreting, CoRR abs/1907
Hao Xiong, Ruiqing Zhang, Chuanqiang Zhang, Zhongjun He, Hua Wu, Haifeng Wang
-
CoKE: Contextualized Knowledge Graph Embedding, CoRR abs/1911
Quan Wang, Pingping Huang, Haifeng Wang, Songtai Dai, Wenbin Jiang, Jing Liu, Yajuan Lyu, Yong Zhu, Hua Wu
-
Companion Proceedings of The 2019 World Wide Web Conference
Jinhua Peng, Zongyang Ma, Di Jiang, Hua Wu
-
End-to-end speech translation with knowledge distillation
Yuchen Liu, Hao Xiong, Zhongjun He, Jiajun Zhang, Hua Wu, Haifeng Wang, Chengqing Zong
-
Multi-Channel Encoder for Neural Machine Translation, AAAI 2018
Hao Xiong, Zhongjun He, Xiaoguang Hu, Hua Wu
-
DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications, QA@ACL 2018
Wei He, Kai Liu, Jing Liu, Yajuan Lyu, Shiqi Zhao, Xinyan Xiao, Yuan Liu, Yizhong Wang, Hua Wu, Qiaoqiao She, Xuan Liu, Tian Wu, Haifeng Wang
-
Multi-Turn Response Selection for Chatbots with Deep Attention Matching Network, ACL (1) 2018
Xiangyang Zhou, Lu Li, Daxiang Dong, Yi Liu, Ying Chen, Wayne Xin Zhao, Dianhai Yu, Hua Wu
-
Multi-Passage Machine Reading Comprehension with Cross-Passage Answer Verification, ACL (1) 2018
Yizhong Wang, Kai Liu, Jing Liu, Wei He, Yajuan Lyu, Hua Wu, Sujian Li, Haifeng Wang
-
Addressing Troublesome Words in Neural Machine Translation, EMNLP 2018
Yang Zhao, Jiajun Zhang, Zhongjun He, Chengqing Zong, Hua Wu
-
A New Method of Region Embedding for Text Classification, ICLR (Poster) 2018
Chao Qiao, Bo Huang, Guocheng Niu, Daren Li, Daxiang Dong, Wei He, Dianhai Yu, Hua Wu
-
Familia: A configurable topic modeling framework for industrial text engineering
Di Jiang, Yuanfeng Song, Rongzhong Lian, Siqi Bao, Jinhua Peng, Huang He, Hua Wu
-
An End-to-End Model for Question Answering over Knowledge Base with Cross-Attention Combining Global Knowledge, ACL (1) 2017
Yanchao Hao, Yuanzhe Zhang, Kang Liu, Shizhu He, Zhanyi Liu, Hua Wu, Jun Zhao
-
Improved Neural Machine Translation with SMT Features, AAAI 2016
Wei He, Zhongjun He, Hua Wu, Haifeng Wang
-
Semi-Supervised Learning for Neural Machine Translation, ACL (1) 2016
Yong Cheng, Wei Xu, Zhongjun He, Wei He, Hua Wu, Maosong Sun, Yang Liu
-
Active Learning for Dependency Parsing with Partial Annotation, ACL (1) 2016
Zhenghua Li, Min Zhang, Yue Zhang, Zhanyi Liu, Wenliang Chen, Hua Wu, Haifeng Wang
-
Minimum Risk Training for Neural Machine Translation, ACL (1) 2016
Shiqi Shen, Yong Cheng, Zhongjun He, Wei He, Hua Wu, Maosong Sun, Yang Liu
-
"Shall I Be Your Chat Companion?": Towards an Online Human-Computer Conversation System, CIKM 2016
Rui Yan, Yiping Song, Xiangyang Zhou, Hua Wu
-
Chinese Poetry Generation with Planning based Neural Network, COLING 2016
Zhe Wang, Wei He, Hua Wu, Haiyang Wu, Wei Li, Haifeng Wang, Enhong Chen
-
Latent topic embedding, COLING 2016
Di Jiang, Lei Shi, Rongzhong Lian, Hua Wu
-
Multi-view Response Selection for Human-Computer Conversation, EMNLP 2016
Xiangyang Zhou, Daxiang Dong, Hua Wu, Shiqi Zhao, Dianhai Yu, Hao Tian, Xuan Liu, Rui Yan
-
Agreement-Based Joint Training for Bidirectional Attention-Based Neural Machine Translation, IJCAI 2016
Yong Cheng, Shiqi Shen, Zhongjun He, Wei He, Hua Wu, Maosong Sun, Yang Liu
-
Learning to Respond with Deep Neural Networks for Retrieval-Based Human-Computer Conversation System, SIGIR 2016
Rui Yan, Yiping Song, Hua Wu
-
Question Answering over Knowledge Base with Neural Attention Combining Global Knowledge Information, CoRR abs/1606
Yuanzhe Zhang, Kang Liu, Shizhu He, Guoliang Ji, Zhanyi Liu, Hua Wu, Jun Zhao
-
Multi-Task Learning for Multiple Language Translation, ACL (1) 2015
Daxiang Dong, Hua Wu, Wei He, Dianhai Yu, Haifeng Wang
-
Exploiting Collective Hidden Structures in Webpage Titles for Open Domain Entity Extraction, WWW 2015
Wei Song, Shiqi Zhao, Chao Zhang, Hua Wu, Haifeng Wang, Lizhen Liu, Hanshi Wang
-
Policy Learning for Domain Selection in an Extensible Multi-domain Spoken Dialogue System, EMNLP 2014
Zhuoran Wang, Hongliang Chen, Guanchun Wang, Hao Tian, Hua Wu, Haifeng Wang
-
Improve Statistical Machine Translation with Context-Sensitive Bilingual Semantic Embedding Model, EMNLP 2014
Haiyang Wu, Daxiang Dong, Xiaoguang Hu, Dianhai Yu, Wei He, Hua Wu, Haifeng Wang, Ting Liu
-
Transformation from Discontinuous to Continuous Word Alignment Improves Translation Quality, EMNLP 2014
Zhongjun He, Hua Wu, Haifeng Wang, Ting Liu
-
Improving Pivot-Based Statistical Machine Translation by Pivoting the Co-occurrence Count of Phrase Pairs, EMNLP 2014
Xiaoning Zhu, Zhongjun He, Hua Wu, Conghui Zhu, Haifeng Wang, Tiejun Zhao
-
Improving Pivot-Based Statistical Machine Translation Using Random Walk, EMNLP 2013
Xiaoning Zhu, Zhongjun He, Hua Wu, Haifeng Wang, Conghui Zhu, Tiejun Zhao
-
Generalization of words for Chinese dependency parsing, IWPT 2013
Xianchao Wu, Jie Zhou, Yu Sun, Zhanyi Liu, Dianhai Yu, Hua Wu, Haifeng Wang
-
Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information, ACL (1) 2012
Jinsong Su, Hua Wu, Haifeng Wang, Yidong Chen, Xiaodong Shi, Huailin Dong, Qun Liu
-
Improve SMT Quality with Automatically Extracted Paraphrase Rules, ACL (1) 2012
Wei He, Hua Wu, Haifeng Wang, Ting Liu
-
Opening Machine Translation Black Box for Cross-Language Information Retrieval, AIRS 2012
Yanjun Ma, Jian-Yun Nie, Hua Wu, Haifeng Wang
-
Two-Word Collocation Extraction Using Monolingual Word Alignment Method, ACM Trans
Zhanyi Liu, Haifeng Wang, Hua Wu, Sheng Li
-
Reordering with Source Language Collocations, ACL 2011
Zhanyi Liu, Haifeng Wang, Hua Wu, Ting Liu, Sheng Li
-
Improving Statistical Machine Translation with Monolingual Collocation, ACL 2010
Zhanyi Liu, Haifeng Wang, Hua Wu, Sheng Li
-
Exploiting Heterogeneous Treebanks for Parsing, ACL/IJCNLP 2009
Zheng-Yu Niu, Haifeng Wang, Hua Wu
-
Revisiting Pivot Language Approach for Machine Translation, ACL/IJCNLP 2009
Hua Wu, Haifeng Wang
-
Collocation Extraction Using Monolingual Word Alignment Method, EMNLP 2009
Zhanyi Liu, Haifeng Wang, Hua Wu, Sheng Li
-
Domain Adaptation for Statistical Machine Translation with Domain Dictionary and Monolingual Corpora, COLING 2008
Hua Wu, Haifeng Wang, Chengqing Zong
-
Predicting and Tagging Dialog-Act Using MDP and SVM, ISCSLP 2008
Keyan Zhou, Chengqing Zong, Hua Wu, Haifeng Wang
-
The TCH machine translation system for IWSLT 2008, IWSLT 2008
Haifeng Wang, Hua Wu, Xiaoguang Hu, Zhanyi Liu, Jianfeng Li, Dengjun Ren, Zheng-Yu Niu
-
Pivot language approach for phrase-based statistical machine translation, Machine Translation 2007
Hua Wu, Haifeng Wang
-
Pivot Language Approach for Phrase-Based Statistical Machine Translation, ACL 2007
Hua Wu, Haifeng Wang
-
Comparative study of word alignment heuristics and phrase-based SMT, MT Summit 2007
Hua Wu, Haifeng Wang
-
Log-Linear generation models for example-based machine translation, MT Summit 2007
Zhanyi Liu, Haifeng Wang, Hua Wu
-
Using RBMT Systems to Produce Bilingual Corpus for SMT, EMNLP-CoNLL 2007
Xiaoguang Hu, Haifeng Wang, Hua Wu
-
Example-based machine translation based on tree-string correspondence and statistical generation, Machine translation 2006
Zhanyi Liu, Haifeng Wang, Hua Wu
-
Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs, ACL 2006
Haifeng Wang, Hua Wu, Zhanyi Liu
-
Boosting Statistical Word Alignment Using Labeled and Unlabeled Data, ACL 2006
Hua Wu, Haifeng Wang, Zhanyi Liu
-
Alignment Model Adaptation for Domain-Specific Word Alignment, ACL 2005
Hua Wu, Haifeng Wang, Zhanyi Liu
-
Improving Statistical Word Alignment with Ensemble Methods, IJCNLP 2005
Hua Wu, Haifeng Wang
-
Improving Translation Memory with Word Alignment Information, MT SUMMIT 2005
Hua Wu, Haifeng Wang, Zhanyi Liu, Kai Tang
-
Boosting statistical word alignment, MT SUMMIT 2005
Hua Wu, Haifeng Wang
-
Improving Domain-Specific Word Alignment for Computer Assisted Translation, ACL (Poster and Demonstration) 2004
Hua Wu, Haifeng Wang
-
Improving Domain-Specific Word Alignment with a General Bilingual Corpus, AMTA 2004
Hua Wu, Haifeng Wang
-
Improving Statistical Word Alignment with a Rule-Based Machine Translation System, COLING 2004
Hua Wu, Haifeng Wang
-
Synonymous collocation extraction using translation information, ACL 2003
Hua Wu, Ming Zhou
-
Optimizing synonym extraction using monolingual and bilingual resources, Proceedings of the second international workshop on Paraphrasing 2003
Hua Wu, Ming Zhou
-
Speech Corpus Collection and Annotation
Li Aijun, Chen Xiaoxia, Sun Guohua, Hua Wu, Yin Zhigang, Zu Yiqing
-
A Generation System for Chinese Texts, Sixth International Conference on Spoken Language Processing 2002
Hua Wu, Taiyi HUANG, Bo Xu
-
Chinese generation in a spoken dialogue translation system, COLING 2002
Hua Wu, Taiyi HUANG, Chengqing Zong
-
Analysis on Characteristics of Chinese Spoken Language, Proc. of 5th Natural Language Processing Pacific Rim Symposium 1999
Chengqing Zong, Hua Wu, Taiyi HUANG, Bo XU
-
An Interlingua for Dialogue Translation
Hua Wu, Taiyi HUANG, Bo XU