Abstract
In this chapter, we review state-of-the-art machine translation systems and discuss innovative methods for machine translation, highlighting the most promising techniques and applications. Machine translation (MT) has been revitalized over the last 10 years or so, after a period of relatively slow activity. In 2005 the field received a jump start when a powerful, complete experimental package for building MT systems from scratch became freely available through the unified efforts of the MOSES international consortium. Around the same time, hierarchical methods were introduced by Chinese researchers, which allowed syntactic information to be used in translation modeling. Furthermore, advances in the related field of computational linguistics, which made off-the-shelf taggers and parsers readily available, gave MT an additional boost. Yet there is still more progress to be made. For example, MT will be greatly enhanced when both syntax and semantics are on board: this remains a major challenge, though many advanced research groups are currently pursuing ways to meet it head-on. The next generation of MT will consist of a collection of hybrid systems. The outlook is also good for the mobile environment, as we look forward to more advanced and improved technologies, i.e. speech recognition and speech synthesis, that enable Speech-To-Speech machine translation on hand-held devices. We review all of these developments and, in the final section, point out some of the most promising research avenues for the future of MT.
Notes
- 1.
Consequently, I will not be concerned with commercial versions of MT systems, nor with the field of computer-assisted translation systems or translation aids: these are all listed at http://www.hutchinsweb.me.uk/Compendium.htm, a document compiled by John Hutchins and updated almost yearly. Hutchins is also responsible for the main source of information on MT, which is regularly posted on the Machine Translation Archive (http://www.mt-archive.info).
References
Abney S (1989) Parsing by chunks. In: Tenny C (ed) The MIT parsing volume, 1988–89. Center for Cognitive Science, MIT, Cambridge
Adly N, Alansary S (2009) Evaluation of Arabic machine translation system based on the universal networking language. In: The 14th international conference on applications of natural language to information systems “NLDB 2009”, Saarland University, Saarbrucken, Germany, 23–26 June 2009
Alansary S, Nagi M, Adly N (2009) The universal networking language in action in English-Arabic machine translation. In: 9th conference on language engineering, Cairo, pp 1–12
Alegria I, Diaz de Ilarraza A, Labaka G, Lersundi M, Mayor A, Sarasola K (2007) Transfer-based MT from Spanish into Basque: reusability, standardization and open source, vol 4394, Lecture Notes in Computer Science. Springer, Berlin/New York, pp 374–384
Alexandrescu A, Kirchhoff K (2009) Graph-based learning for statistical machine translation. In: Human language technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, pp 119–127
Alkuhlani S, Habash N (2011) A corpus for modeling morpho-syntactic agreement in Arabic: gender, number and rationality. In: Proceedings of the 49th annual meeting of the Association for Computational Linguistics (ACL’11), Portland, Oregon, USA
Ambati V, Lavie A (2008) Improving syntax driven translation models by restructuring divergent and non-isomorphic parse tree structures. In: Proceedings of the eighth conference of the association for machine translation in the Americas, Waikiki, pp 235–244
Ambati V, Lavie A, Carbonell J (2009) Extraction of syntactic translation models from parallel data using syntax from source and target languages. In: MT Summit XII: proceedings of the twelfth machine translation summit, Ottawa, 26–30 Aug 2009, pp 190–197
Ambati V, Vogel S, Carbonell J (2011) Multi-strategy approaches to active learning for statistical machine translation.
Associated press article on the web: http://www.cnn.com/2006/TECH/11/06/darpa.translation.ap/index.html
Apidianaki M (2009) Data-driven semantic analysis for multilingual WSD and lexical selection in translation. In: Proceedings of the 12th conference of the European chapter of the association for computational linguistics (EACL), Athens, 30 Mar–3 Apr 2009, pp 77–85
Ashburner M, Ball C, Blake J, Botstein D, Butler H, Cherry J, Davis A et al (2000) Gene ontology: tool for the unification of biology. The gene ontology consortium. Nat Genet 25(1):25–29
Ayan NF, Zheng J, Wang W (2008) Improving alignments for better confusion networks for combining machine translation systems. In: Proceedings of the Coling’08, pp 33–40
Baker K, Bloodgood M, Dorr BJ, Callison-Burch C, Filardo NW, Piatko C, Levin L, Miller S (2011) Modality and negation in SIMT – use of modality and negation in semantically-informed syntactic MT. Comput Linguist 38(2):1–48 (accepted for publication)
Baker K, Bloodgood M, Dorr BJ, Callison-Burch C, Filardo NW, Piatko C, Levin L, Miller S (2012) Modality and negation in SIMT – use of modality and negation in semantically-informed syntactic MT. Comput Linguist 1:1–48
Banchs RE, Costa-jussà MR (2010) A non-linear semantic mapping technique for cross-language sentence matching. In: Proceedings of the 7th international conference on advances in natural language processing (IceTAL), Reykjavik, pp 57–66
Banchs RE, Costa-jussà MR (2011) A semantic feature for statistical machine translation. In: Proceedings of SSST-5, fifth workshop on syntax, semantics and structure in statistical translation, ACL HLT 2011, Portland, June 2011, pp 126–134
Banerjee S, Lavie A (2005) METEOR: an automatic metric for MT evaluation with improved correlation with human judgments. In: Proceedings of the ACL workshop on intrinsic and extrinsic evaluation measures for machine translation and/or summarization, Ann Arbor, June, pp 65–72
Banerjee P, Dandapat S, Forcada ML, Groves D, Penkale S, Tinsley J, Way A (2011) Technical report: OpenMaTrEx, a free, open-source hybrid data-driven machine translation system
Béchara H, Ma Y, van Genabith J (2011) Statistical post-editing for a statistical MT system. MT Summit XIII: the thirteenth machine translation summit [organized by the] Asia-Pacific association for machine translation (AAMT), Xiamen, 19–23 Sept 2011, pp 308–315
Bellegarda J (2001) Robustness in statistical language modeling: review and perspectives. In: Junqua J, van Noord G (eds) Robustness in language and speech technology. Kluwer, Dordrecht/Boston, pp 101–121
Bellegarda J (2003) Statistical language model adaptation: review and perspectives. Speech Commun 42:93–108
Bisani M, Ney H (2008) Joint sequence models for grapheme-to-phoneme conversion. Speech Commun 50(5):434–451
Blackwood G, de Gispert A, Byrne W (2008) Phrasal segmentation models for statistical machine translation. In: Proceedings of 22nd international conference on computational linguistics (COLING), Manchester
Blain F, Senellart J, Schwenk H, Plitt M, Roturier J (2011) Qualitative analysis of post-editing for high quality machine translation. In: Proceedings of MTS: 13th machine translation summit, Xiamen, pp 164–171
Bontcheva K (2005) Generating tailored textual summaries from ontologies. In: The semantic web: research and applications, Springer, pp 531–545
Brown RD (1996) Example-based machine translation in the Pangloss system. In: Proceedings of the 16th international conference on computational linguistics (COLING-96), Copenhagen, pp 169–174
Brown PF, Della Pietra SA, Della Pietra VJ, Goldsmith MJ, Hajic J, Mercer RL, Mohanty S (1993) But dictionaries are data too. In: Human language technology: proceedings of a workshop held at Plainsboro, New Jersey, USA, Morgan Kaufmann, San Francisco, 21–24 March 1993, pp 202–205
Cai S, Chiang D, Goldberg Y (2011) Language-independent parsing with empty elements. In: Proceedings of the 49th annual meeting of the ACL, Portland, pp 212–216
Callison-Burch C, Koehn P, Osborne M (2006) Improved statistical machine translation using paraphrases. In: HLT-NAACL 2006: proceedings of the human language technology conference of the North American chapter of the ACL, New York, June 2006, pp 17–24
Carl M, Way A (eds) (2003) Recent advances in example-based machine translation. Kluwer, Dordrecht. Introduction to the workshop on EBMT, xxxi
Carpuat M, Diab M (2010) Task-based evaluation of multiword expressions: a pilot study in statistical machine translation. In: Proceedings of HLT-NAACL 2010, Los Angeles, pp 242–245
Carpuat M, Wu D (2007) Improving statistical machine translation using word sense disambiguation. In: Proceedings of the 2007 joint conference on empirical methods in natural language processing and computational natural language learning (EMNLP-CoNLL 2007), Prague, pp 61–72
Caseli HM, Nunes MGV, Forcada ML (2006) Automatic induction of bilingual resources from aligned parallel corpora: application to shallow-transfer machine translation. Mach Transl 20(4):227–245
Cettolo M, Bertoldi N, Federico M (2011) Methods for smoothing the optimizer instability in SMT. In: Proceedings of the 13th machine translation summit, Asia-Pacific Association for Machine Translation, pp 32–39
Cettolo M, Girardi C, Federico M (2012) WIT3: web inventory of transcribed and translated talks. In: Proceedings of EAMT, Trento, Italy, pp 261–268
Chang Pi-Chuan, Huihsin Tseng, Jurafsky D, Manning CD (2011) Discriminative reordering with Chinese grammatical relations features. In: Proceedings of SSST-3, third workshop on syntax and structure in statistical translation, pp 51–59
Chao Wang, Collins M, Koehn P (2007) Chinese syntactic reordering for statistical machine translation. In: Proceedings of the 2007 joint conference on empirical methods in natural language processing and computational natural language learning (EMNLP-CoNLL), Prague, Czech Republic, pp 737–745
Chappelier J-C, Rajman M (1998) A generalized CYK algorithm for parsing stochastic CFG. In: Proceedings of tabulation in parsing and deduction (TAPD’98), Paris, France
Chen Stanley F, Goodman JT (1998) An empirical study of smoothing techniques for language modeling. Technical Report TR-10-98, Computer Science Group, Harvard University
Chen Yu, Eisele A (2010) Integrating a rule-based with a hierarchical translation system. In: LREC 2010: proceedings of the seventh international conference on language resources and evaluation, Valletta, Malta, 17–23 May 2010, pp 1746–1752
Chenqing Zong, Bo Xu, Taiyi Huang (2002) Interactive Chinese-to-English speech translation based on dialogue management. In: Proceedings of the workshop on speech-to-speech translation: algorithms and systems, pp 61–68
Cherry C, Lin D (2003) A probability model to improve word alignment. ACL-2003: 41st annual meeting of the Association for Computational Linguistics, Sapporo, Japan, 7–12 July 2003
Chiang D (2005) A hierarchical phrase-based model for statistical machine translation. In: Proceedings of ACL, pp 263–270 (Best paper award)
Chiang D (2007) Hierarchical phrase-based translation. Comput Linguist 33(2):202–228
Chiang D (2010) Learning to translate with source and target syntax. In: Proceedings of ACL10, Stroudsburg, PA, USA, pp 1443–1452
Chiang D, Marton Y, Resnik P (2008) Online large-margin training of syntactic and structural translation features. In: EMNLP 2008: proceedings of the 2008 conference on empirical methods in natural language processing, Honolulu, 25–27 Oct 2008, pp 224–233
Chiang D, Knight K, Wang W (2009) 11,001 new features for statistical machine translation. In: Proceedings of HLT-NAACL09, Boulder, pp 218–226
Choi JD, Palmer M, Nianwen Xue (2009) Using parallel propbanks to enhance word-alignments. In: Proceedings of ACL-IJCNLP workshop on linguistic annotation (LAW‘09), pp 121–124
Chung Tagyoung, Gildea D (2010) Effects of empty categories on machine translation. In: Proceedings of EMNLP, pp 636–645
Cicekli I, Güvenir HA (2001) Learning translation templates from bilingual translation examples. Appl Intell 15(1):57–76
Cimiano P, Montiel-Ponsoda E, Buitelaar P, Espinoza M, Gomez-Pérez A (2010) A note on ontology localization. J Appl Ontology 5:127–137
Clark J, Dyer C, Lavie A, Smith N (2011) Better hypothesis testing for statistical machine translation: controlling for optimizer instability. In: Proceedings of ACL, Portland
Collier N, Doan S, Kawazoe A, Goodwin RM, Conway M, Tateno Y, Ngo Q-H, Dien D, Kawtrakul A, Takeuchi K, Shigematsu M, Taniguchi K (2008) Bio-Caster: detecting public health rumors with a web-based text mining system. Bioinformatics 24(24):2940–2941
Cowan B, Kučerová I, Collins M (2006) A discriminative model for tree-to-tree translation. In: EMNLP-2006: proceedings of the 2006 conference on empirical methods in natural language processing, Sydney, July 2006, pp 232–241
Cui Lei, Dongdong Zhang, Mu Li, Ming Zhou, Tiejun Zhao (2010) A joint rule selection model for hierarchical phrase-based translation. In: ACL 2010: the 48th annual meeting of the association for computational linguistics, Proceedings of the conference short papers, Uppsala, 11–16 July 2010, pp 6–11
Cui Lei, Dongdong Zhang, Mu Li, Ming Zhou (2011) Function word generation in statistical machine translation systems. MTS:139–146
Daelemans W, Hoste V (eds) (2009) Evaluation of translation technology. Artesis University College, Department of Translators & Interpreters, Antwerp, 261 p
Dan Melamed I (2004) Statistical machine translation by parsing. In: Proceedings of the ACL 2004: 42nd annual meeting of the association for computational linguistics, Barcelona, 21–26 July 2004, pp 653–660
Dandapat S, Forcada ML, Groves D, Penkale S, Tinsley J, Way A (2010) OpenMaTrEx: a free/open-source marker-driven example-based machine translation system. In: Loftsson H et al (eds) Advances in natural language processing: 7th international conference on NLP, IceTAL 2010, Reykjavík, 16–18 Aug 2010. College lecture notes in artificial intelligence, vol 6233. Springer, Berlin/Heidelberg, pp 121–126
Dandapat S, Morrissey S, Way A, Forcada ML (2011) Using example-based MT to support statistical MT when translating homogeneous data in a resource-poor setting. In: Forcada ML, Depraetere H, Vandeghinste V (eds) Proceedings of the 15th conference of the European association for machine translation, pp 201–208
DARPA project commented at: 1. http://www.slate.com/articles/technology/future_tense/2012/05/darpa_s_transtac_bolt_and_other_machine_translation_programs_search_for_meaning_.html. 2. http://www.darpa.mil/Our_Work/I2O/Personnel/Dr_Bonnie_Dorr.aspx. 3. http://www.darpa.mil/NewsEvents/Releases/2011/2011/04/19_DARPA_initiates_overarching_language_translation_research_Publishes_Broad_Agency_Announcement_for_Broad_Operational_Language_Translation_program.aspx
de Marneffe M-C, Manning CD (2008) The Stanford typed dependencies representation. In: Proceedings of the COLING workshop on cross-framework and cross-domain parser evaluation
de Marneffe M-C, MacCartney B, Manning CD (2006) Generating typed dependency parses from phrase structure parses. In: Proceedings of LREC-06
Declerck T, Krieger H-U, Thomas SM, Buitelaar P, O’Riain S, Wunner T, Maguet G, McCrae J, Spohr D, Montiel-Ponsoda E (2010) Ontology-based multilingual access to financial reports for sharing business knowledge across Europe. In: Rooz J, Ivanyos J (eds) Internal financial control assessment applying multilingual ontology framework, HVG Press Kft., Budapest, pp 67–76
Delmonte R (2009a) Understanding implicit entities and events with Getaruns. In: ICSC, 2009 IEEE international conference on semantic computing, Berkeley, pp 25–32
Delmonte R (2009b) A computational approach to implicit entities and events in text and discourse. In: International journal of speech technology (IJST), Springer, pp 1–14
Delmonte R, Bristot A, Tonelli S (2007) VIT – Venice Italian Treebank: syntactic and quantitative features. In: De Smedt K, Hajic J, Kübler S (eds) Proceedings of sixth international workshop on Treebanks and linguistic theories, Nealt proceedings series vol 1, ISSN 1736–6305, pp 43–54
DeNeefe S, Knight K, Wang W, Marcu D (2007) What can syntax-based MT learn from phrase-based MT? In: Proceedings of EMNLP-CoNLL, pp 755–763
Denkowski M, Lavie A (2010) METEOR-NEXT and the METEOR paraphrase tables: improved evaluation support for five target languages. In: ACL 2010: joint fifth workshop on statistical machine translation and MetricsMATR. Proceedings of the workshop, Uppsala University, Uppsala, 15–16 July 2010, pp 339–342
Deyi Xiong, Min Zhang, Haizhou Li (2010) Learning translation boundaries for phrase-based decoding. In: Proceedings of HLT-NAACL 2010
Dienes P, Dubey A (2003) Antecedent recovery: experiments with a trace tagger. In: Proceedings of EMNLP, Sapporo
Ding Yuan, Palmer M (2005) Machine translation using probabilistic synchronous dependency insertion grammars. In: ACL-2005: 43rd annual meeting of the association for computational linguistics, University of Michigan, Ann Arbor, 25–30 June 2005, pp 541–548
Doddington G (2002) Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. In: Marcus M (ed) HLT 2002: human language technology conference: proceedings of the second international conference on human language technology research, San Diego, 24–27 Mar 2002, [Morgan Kaufmann for DARPA, San Francisco], pp 138–145
Dugast L, Senellart J, Koehn P (2007) Statistical post-editing on SYSTRAN’s rule-based translation system. In: Proceedings of the second workshop on statistical machine translation, Prague, pp 220–223
Dugast L, Senellart J, Koehn P (2008) Can we relearn an RBMT system? In: Proceedings of the third workshop on statistical machine translation, Columbus, pp 175–178
Dugast L, Senellart J, Koehn P (2009) Selective addition of corpus-extracted phrasal lexical rules to a rule-based machine translation system. In: MT Summit XII: proceedings of the twelfth machine translation summit, Ottawa, 26–30 Aug 2009, pp 222–229
Dyer C, Muresan S, Resnik P (2008) Generalizing word lattice translation. In: Proceedings of ACL, Columbus, pp 1012–1020
Ebling S, Way A, Volk M, Naskar SK (2011) Combining semantic and syntactic generalization in example-based machine translation. In: Forcada ML, Depraetere H, Vandeghinste V (eds) Proceedings of the 15th conference of the European association for machine translation, Leuven, pp 209–216
Eisele A, Federmann C, Uszkoreit H, Saint-Amand H, Kay M, Jellinghaus M, Hunsicker S, Herrmann T, Chen Y (2008) Hybrid machine translation architectures within and beyond the Euro Matrix project. In: Hutchins J, von Hahn W (eds) Proceedings of EAMT 2008: 12th annual conference of the European association for machine translation, Hamburg, 22–23 Sept 2008, pp 27–34
Eisner J (2003) Learning non-isomorphic tree mappings for machine translation. ACL-2003: 41st annual meeting of the association for computational linguistics, Sapporo, 7–12 July 2003
El Kholy A, Habash N (2010) Orthographic and morphological processing for English-Arabic statistical machine translation. In: Proceedings of Traitement Automatique du Langage Naturel (TALN-10). Montréal, Canada
España-Bonet C, Giménez J, Màrquez L (2009) Discriminative phrase-based models for Arabic machine translation. ACM Trans Asian Lang Info Process J 8(4):1–20
España-Bonet C, Labaka G, Díaz de Ilarraza A, Màrquez L (2011) Hybrid machine translation guided by a rule-based system, MTS, pp 554–561
Espinoza M, Gomez-Pérez A, Mena E (2008) Enriching an ontology with multilingual information. In: Proceedings of the 5th annual of the European semantic web conference (ESWC08), Tenerife, pp 333–347
Espinoza M, Montiel-Ponsoda E, Gomez-Pérez A (2009). Ontology localization. In: Proceedings of the 5th international conference on knowledge capture (KCAP09), pp 33–40
Esplà-Gomis M, Sánchez-Cartagena VM, Pérez-Ortiz JA (2011) Multimodal building of monolingual dictionaries for machine translation by non-expert users. In: Proceedings of the 13th MTS, Xiamen, pp 147–154
Fai Wong, Dong-Cheng Hu, Yu-Hang Mao, Ming-Chui Dong, Yi-Ping Li (2005) Machine translation based on constraint-based synchronous grammar. In: Proceedings of the 2nd international joint conference on natural language, Jeju Island, Republic of Korea, pp 612–623
Federmann C, Eisele A, Uszkoreit H, Chen Y, Hunsicker S, Xu J (2010) Further experiments with shallow hybrid MT systems. In: ACL 2010: joint fifth workshop on statistical machine translation and MetricsMATR. Proceedings of the workshop, Uppsala University, Uppsala, 15–16 July 2010, pp 77–81
Feifei Zhai, Jiajun Zhang, Yu Zhou, Chengqing Zong (2011) Simple but effective approaches to improving tree-to-tree model, MTS
Font-Llitjòs A, Carbonell JG, Lavie A (2005) A framework for interactive and automatic refinement of transfer-based machine translation. In: European association of machine translation (EAMT) 10th annual conference, Budapest, Hungary, Citeseer
Forcada ML, Depraetere H, Vandeghinste V (eds) (2011) Proceedings of the 15th conference of the European association for machine translation, Leuven, pp 13–20
Forcada ML, Ginestí-Rosell M, Nordfalk J, O’Regan J, Ortiz-Rojas S, Pérez-Ortiz JA, Sánchez-Martínez F, Ramírez-Sánchez G, Tyers FM (2011a) Apertium: a free/open-source platform for rule-based machine translation. Machine translation. Special issue on free/open-source machine translation (in press)
Forcada ML, Ginestí-Rosell M, Nordfalk J, O’Regan J, Ortiz-Rojas S, Pérez-Ortiz JA, Sánchez-Martínez F, Ramírez-Sánchez G, Tyers FM (2011b) Apertium: a free/open-source platform for rule-based machine translation. Mach Transl (Special issue on free/open-source machine translation) 25(2):127–144
Fraser A, Marcu D (2006) Semi-supervised training for statistical word alignment. In: Proceedings of ACL, Sydney, pp 769–776
Fraser A, Marcu D (2007) Getting the structure right for word alignment: LEAF. In: Proceedings of EMNLP, Prague, pp 51–60
Fraser A, Marcu D (2007b) Measuring word alignment quality for Statistical Machine Translation. Comput Linguist 33(3):293–303
Fu B, Brennan R, O’Sullivan D (2010) Cross-lingual ontology mapping and its use on the multilingual semantic web. In: Proceedings of the 1st workshop on the multilingual semantic web, at the 19th international World Wide Web Conference (WWW 2010)
Gales MJF, Liu X, Sinha R, Woodland PC, Yu K, Matsoukas S, Ng T, Nguyen K, Nguyen L, Gauvain J-L, Lamel L, Messaoudi A (2007) Speech recognition system combination for machine translation. In: IEEE international conference on acoustics, speech and signal processing, Honolulu, pp 1277–1280
Galley M, Hopkins M, Knight K, Marcu D (2004) What’s in a translation rule? In: HLT-NAACL 2004: human language technology conference and North American chapter of the association for computational linguistics annual meeting, The Park Plaza Hotel, Boston, 2–7 May 2004, pp 273–280
Galley M, Graehl J, Knight K, Marcu D, DeNeefe S, Wang Wei, Thayer I (2006) Scalable inference and training of context-rich syntactic translation models. In: Proceedings of the international conference on computational linguistics/association for computational linguistics (COLING/ACL-06), Sydney, pp 961–968
Gao Q, Vogel S (2011) Utilizing target-side semantic role labels to assist hierarchical phrase-based machine translation. In: Proceedings of SSST-5, fifth workshop on syntax, semantics and structure in statistical translation, ACL HLT 2011, Portland, June 2011, pp 107–115
Gao Yuqing, Bowen Zhou, Weizhong Zhu, Wei Zhang (2008) Handheld speech to speech translation system. Automatic speech recognition on mobile devices and over communication networks, Springer, London
Gildea D (2003) Loosely tree-based alignment for machine translation. In: ACL-2003: 41st annual meeting of the association for computational linguistics, Sapporo, 7–12 July 2003
Gough N, Way A (2003) Controlled generation in example-based machine translation. In: MT Summit IX, New Orleans, 23–27 Sept 2003, pp 133–140
Green T (1979) The necessity of syntax markers: two experiments with artificial languages. J Verbal Learn Behav 18:481–496
Habash N, Dorr B, Monz C (2009) Symbolic-to-statistical hybridization: extending generation-heavy machine translation. Mach Transl 23:23–63
Hanneman G, Lavie A (2009) Decoding with syntactic and non-syntactic phrases in a syntax-based machine translation system. In: Proceedings of SSST-3, third workshop on syntax and structure in statistical translation, ACL, pp 1–9
Haque R, Naskar SK, Ma Y, Way A (2009) Using supertags as source language context in SMT. In: Lluís Màrquez, Harold Somers (eds) EAMT-2009: proceedings of the 13th annual conference of the European association for machine translation, Universitat Politècnica de Catalunya, Barcelona, 14–15 May 2009, pp 234–241
He Xiaodong, Mei Yang, Jianfeng Gao, Patrick Nguyen, Robert Moore (2008) Indirect HMM based hypothesis alignment for combining outputs from machine translation systems. In: Proceedings of the 2008 conference on empirical methods in natural language processing, pp 98–107
Hermjakob U (2009) Improved word alignment with statistics and linguistic heuristics. In: Proceedings of EMNLP 2009, Singapore, pp 229–237
Heyn M (1996) Integrating machine translation into translation memory systems. In: Proceedings of the EAMT machine translation workshop, TKE ’96, Vienna, pp 113–126
Hoang H, Koehn P (2010) Improved translation with source syntax labels. In: Proceedings of the joint 5th workshop on statistical machine translation and metrics MATR, Uppsala, 11–16 July 2010, pp 409–417
Hoang H, Koehn P, Lopez A (2009) A unified framework for phrase-based, hierarchical, and syntax-based statistical machine translation. In: IWSLT 2009: proceedings of the international workshop on spoken language translation, National Museum of Emerging Science and Innovation, Tokyo, 1–2 Dec 2009, pp 152–159
Hovy E (1998) Combining and standardizing large-scale, practical ontologies for machine translation and other uses. In: Proceedings of the first international conference on language resources and evaluation, Granada
Hovy E, Nirenburg S (1992) Approximating an interlingua in a principled way. In: Proceedings of the DARPA speech and natural language workshop, Arden House
Hovy E, Marcus M, Palmer M, Pradhan S, Ramshaw L, Weischedel R (2006) OntoNotes: the 90% solution. In: Proceedings of the human language technology conference at the annual meeting of NAACL, New York
Huang Liang, David Chiang (2007) Forest rescoring: faster decoding with integrated language models. In: ACL 2007: proceedings of the 45th annual meeting of the association for computational linguistics, Prague, June 2007, pp 144–151
Huang Liang, Knight K, Joshi A (2006) Statistical syntax-directed translation with extended domain of locality. In: AMTA 2006: proceedings of the 7th conference of the association for machine translation in the americas, visions for the future of machine translation, Cambridge, MA, 8–12 Aug 2006, pp 66–73
Huang Liang, Hao Zhang, Daniel Gildea, Kevin Knight (2009) Binarization of synchronous context-free grammars. Comput Linguist 35(4):559–595
Huang Chu-Ren, Ru-Yng Chang, Hsiang-bin Lee (2010) Sinica BOW (Bilingual Ontological WordNet): integration of bilingual WordNet and SUMO. In: Huang et al (eds) Ontology and the lexicon, CUP, Cambridge, pp 201–211
Huang Zhongqiang, Martin Cmejrek, Bowen Zhou (2010) Soft syntactic constraint for hierarchical phrase-based translation using latent syntactic distributions. In: Proceedings of EMNLP10, Cambridge, MA
Hutchins J (2005a) State of the art reports natural language translation computer-based translation systems and tools. www.hutchinsweb.me.uk/BCS-NLT-2005.pdf
Hutchins J (2005b) Current commercial machine translation systems and computer-based translation tools: system types and their uses. Int J Transl 17(1–2):5–38
Hutchins J (2010) Outline of machine translation developments in Europe and America. JAPIO, Tokyo, pp 1–8
Hutchins WJ, Somers HL (1992) An introduction to machine translation. Academic, London, Xxi, 362pp
Hwidong Na, Jong-Hyeok Lee (2011) Multi-word unit dependency forest-based translation rule extraction. In: Proceedings of SSST-5, fifth workshop on syntax, semantics and structure in statistical translation, ACL HLT 2011, pp 41–51
Hyoung-Gyu Lee, Min-Jeong Kim, Gumwon Hong, Sang-Bum Kim, Young-Sook Hwang, Hae-Chang Rim (2010) Identifying idiomatic expressions using phrase alignments in bilingual parallel corpus. In: Proceedings of PRICAI 2010, Daegu, Korea
Jelinek F (2004) Stochastic analysis of structured language modeling. In: Johnson M, Khudanpur S, Ostendorf M, Rosenfeld R (eds) Mathematical foundations of speech and language processing. Springer, Berlin, pp 37–72
Jianjun Ma, Degen Huang, Haixia Liu, Wenfeng Sheng (2011) POS tagging of English particles for machine translation. In: Proceedings of the 13th machine translation summit, Asia-Pacific Association for Machine Translation, Xiamen, pp 57–63
Jijkoun V, de Rijke M (2004) Enriching the output of a parser using memory-based learning. In: Proceedings of the 42nd annual meeting of the Association for Computational Linguistics
Johannessen JB, Nordgård T, Nygaard L (2008) Evaluation of linguistics-based translation. In: LREC 2008: 6th language resources and evaluation conference, Marrakech, pp 26–30
Johansson R, Nugues P (2007) Extended constituent-to-dependency conversion for English. In: Proceedings of NODALIDA 2007, Tartu
Och FJ, Ney H (2003) A systematic comparison of various statistical alignment models. Comput Linguist 29(1):19–51
Och FJ, Ney H (2004) The alignment template approach to statistical machine translation. Comput Linguist 30(4):417–449
Karakos D, Khudanpur S (2008) Sequential system combination for machine translation of speech. In: Proceedings of the 2008 IEEE workshop on spoken language technology (SLT-08), Goa
Kay M (2011) Zipf’s Law and L’Arbitraire du Signe. Linguist Issues Lang Technol (LiLT) 6(8):1–25, CSLI Publications
Khadivi S, Zens R, Ney H (2006) Integration of speech to computer-assisted translation using finite-state automata. In: Proceedings of COLING/ACL 2006, Sydney
Khalilov M, Pretkalniņa L, Kuvaldina N, Pereseina V (2010) SMT of Latvian, Lithuanian and Estonian languages: a comparative study. In: Human language technologies – the Baltic perspective, international conference, Riga, 8 Oct 2010, 8pp
Kirchhoff K, Rambow O, Habash N, Diab M (2007) Semi-automatic error analysis for large-scale statistical machine translation systems. In: Proceedings of the machine translation summit (MT-Summit), Copenhagen
Klein D, Manning CD (2003) Accurate unlexicalized parsing. In: ACL 41, pp 423–430
Koehn P (2004) Statistical significance tests for machine translation evaluation. In: Proceedings of EMNLP, Barcelona, pp 388–395
Koehn P (2010) Statistical machine translation. Cambridge University Press, Cambridge, XII, 433p
Koehn P, Knight K (2003) Feature-rich statistical translation of noun phrases. In: Proceedings of ACL 2003, Hongkong
Koehn P, Monz C (2006) Manual and automatic evaluation of machine translation between European languages. In: NAACL 2006 workshop on statistical machine translation, ACL, New York, pp 102–121
Koehn P, Och FJ, Marcu D (2003) Statistical phrase-based translation. In: Proceedings of NAACL, Morristown, pp 48–54
Koehn P, Hoang H, Birch A, Callison-Burch C, Federico M, Bertoldi N, Cowan B, Shen W, Moran C, Zens R, Dyer C, Bojar O, Constantin A, Herbst E (2007) Moses: open source toolkit for statistical machine translation. In: Proceedings of the 45th annual meeting of the association for computational linguistics companion volume proceedings of the demo and poster sessions, pp 177–180
Labaka G (2010) EUSMT: incorporating linguistic information into SMT for a morphologically rich language. Its use in SMT-RBMT-EBMT hybridation. Ph.D. thesis, University of the Basque Country
Lafferty J, McCallum A, Pereira F (2001) Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of ICML, pp 282–289
Lagarda A-L, Alabau V, Casacuberta F, Silva R, Díaz-de-Liaño E (2009) Statistical post-editing of a rule-based machine translation system. In: Proceedings of NAACL HLT 2009. Human language technologies: the 2009 annual conference of the North American chapter of the ACL, short papers, Boulder, 31 May–5 June 2009, pp 217–220
Lambert P, Banchs R (2005) Data inferred multi-word expressions for statistical machine translation. In: Proceedings of MT summit X, Phuket
Lamel L et al (2011) Speech recognition for machine translation in Quaero. In: Proceedings of IWSLT, San Francisco
Lavie A (2008) Stat-XFER: a general search-based syntax-driven framework for machine translation. In: Computational linguistics and intelligent text processing, Springer, LNCS, New York, pp 362–375
Lavie A, Agarwal A (2007) METEOR: an automatic metric for MT evaluation with high levels of correlation with human judgments. In: Proceedings of the second workshop on statistical machine translation, Prague, pp 228–231
Lavie A, Waibel A et al (1996) Translation of conversational speech with JANUS-II. In: Proceedings of ICSLP 96, Philadelphia
Lavie A et al (2002) A multi-perspective evaluation of the NESPOLE! speech-to-speech translation system. In: Proceedings of ACL 2002 workshop on speech-to-speech translation: algorithms and systems, Philadelphia
Lavie A, Yarowsky D, Knight K, Callison-Burch C, Habash N, Mitamura T (2006) MINDS workshops machine translation working group final report. http://www-nlpir.nist.gov/MINDS/FINAL/MT.web.pdf
Lavie A, Parlikar A, Ambati V (2008) Syntax-driven learning of sub-sentential translation equivalents and translation rules from parsed parallel corpora. In: Proceedings of the second ACL workshop on syntax and structure in statistical translation, Columbus, pp 87–95
Lei Cui, Zhang D, Li M, Zhou M, Zhao T (2010) A joint rule selection model for hierarchical phrase-based translation. In: Proceedings of ACL10, Uppsala, pp 6–11
Lei Cui, Dongdong Zhang, Mu Li, Ming Zhou (2011) Function word generation in Statistical Machine Translation Systems, MTS 2011, pp 139–146
Levin L, Lavie A, Woszczyna M, Gates D, Gavaldà M, Koll D, Waibel A (2000) The JANUS-III translation system: speech-to-speech translation in multiple domains. Mach Trans 15:3–25
Levy R, Manning C (2004) Deep dependencies from context–free statistical parsers: correcting the surface dependency approximation. In: Proceedings of the ACL
Li Zhifei, Callison-Burch C, Dyer C, Ganitkevitch J, Khudanpur S, Schwartz L, Thornton WNG, Weese J, Zaidan OF (2009) Demonstration of Joshua: an open source toolkit for parsing-based machine translation. In: Proceedings of the 47th annual meeting of the ACL and the 4th IJCNLP, Software demonstrations, Suntec, 3 Aug 2009, pp 25–28
Liu Ding, Gildea D (2008) Improved tree-to-string transducer for machine translation. In: Proceedings of ACL-08: HLT. Third workshop on statistical machine translation (ACL WMT-08), The Ohio State University, Columbus, 19 June 2008, pp 62–69
Liu Ding, Gildea D (2010) Semantic role features for machine translation. In: Proceedings of COLING 2010: 23rd international conference on computational linguistics, Beijing International Convention Center, Beijing, 23–27 Aug 2010, pp 716–724
Liu Yang, Qun Liu, Shouxun Lin (2006) Tree-to-string alignment template for statistical machine translation. In: Coling-ACL 2006: proceedings of the 21st international conference on computational linguistics and 44th annual meeting of the association for computational linguistics, Sydney, 17–21 July 2006, pp 609–616
Liu Yang, Yajuan Lü, Qun Liu (2009) Improving tree-to-tree translation with packed forests. In: Proceedings of the 47th annual meeting of the ACL and the 4th IJCNLP, Suntec, 2–7 Aug 2009, pp 558–566
Liu Zhanyi, Haifeng Wang, Hua Wu, Sheng Li (2010) Improving statistical machine translation with monolingual collocation. In: Proceedings of ACL 2010, Uppsala
Liu Lemao, Tiejun Zhao, Chao Wang, Hailong Cao (2011a) A unified and discriminative soft syntactic constraint model for hierarchical phrase-based translation
Liu Shujie, Chi-Ho Li, Ming Zhou (2011b) A unified SMT framework combining MIRA and MERT in MTS, pp 181–188
Liu Yang, Qun Liu, Yajuan Lü (2011) Adjoining tree-to-string translation. In: ACL-HLT 2011: proceedings of the 49th annual meeting of the association for computational linguistics, Portland, 19–24 June 2011, pp 1278–1287
Llitjós AF, Vogel S (2007) A walk on the other side. Adding statistical components to a transfer-based translation system. In: Proceedings of the HLT-NAACL workshop on syntax and structure in statistical translation, Rochester, pp 72–79
Lopez A, Resnik P (2006) Word-based alignment, phrase-based translation: what’s the link. In: Proceedings of the AMTA, Cambridge MA, pp 90–99
Maletti A (2010) Why synchronous tree substitution grammars? In: Proceedings of the 2010 meeting of the North American chapter of the association for computational linguistics (NAACL-10), pp 876–884
Maletti A, Graehl J, Hopkins M, Knight K (2009) The power of extended top-down tree transducers. SIAM J Comput 39:410–430
Marcu D, Wei Wang, Echihabi A, Knight K (2006) SPMT: statistical machine translation with syntactified target language phrases. In: Proceedings of EMNLP 2006, pp 44–52
Mareček D (2009a) Improving word alignment using alignment of deep structures. In: Proceedings of the 12th international conference on text, speech and dialogue, pp 56–63
Mareček D (2009b) Using tectogrammatical alignment in phrase-based machine translation. In: Proceedings of WDS 2009 contributed papers, pp 22–27
Marton Y, Resnik P (2008) Soft syntactic constraints for hierarchical phrased-based translation. Proceedings of the ACL-08: HLT 46th annual meeting of the association for computational linguistics: human language technologies, The Ohio State University, Columbus, 15–20 June 2008, pp 1003–1011
Maruyama H, Watanabe H (1992) Tree cover search algorithm for example-based translation. In: Fourth international conference on theoretical and methodological issues in machine translation (TMI-92), empiricist vs. rationalist methods in MT, Montreal, CCRIT-CWARC, 25–27 June 1992, pp 173–184
Snover MG, Madnani N, Dorr B, Schwartz R (2009) TER-Plus: paraphrase, semantic, and alignment enhancements to translation error rate. Mach Trans 23(2/3):117–127
McCrae J, Campana J, Cimiano P (2010) CLOVA: an architecture for cross-language semantic data querying. In: Proceedings of the first multilingual semantic web workshop
McCrae J, Espinoza M, Montiel-Ponsoda E, Aguado-de-Cea G, Cimiano P (2011) Combining statistical and semantic approaches to the translation of ontologies and taxonomies. In: Proceedings of SSST-5, fifth workshop on syntax, semantics and structure in statistical translation, ACL HLT 2011, Portland, pp 116–125
Menezes A, Quirk C (2008) Syntactic models for structural word insertion and deletion. In: Proceedings of EMNLP
Menezes A, Richardson SD (2001) A best-first alignment algorithm for automatic extraction of transfer mappings from bilingual corpora. In: Proceedings of MT summit VIII workshop on example-based machine translation, Santiago de Compostela, 18–22 Sept 2001
Menezes A, Toutanova K, Quirk C (2006) Microsoft research treelet translation system: NAACL 2006 Europarl evaluation. In: HLT-NAACL 2006: proceedings of the workshop on statistical machine translation, New York, June 2006, pp 158–161
Mi Haitao, Liang Huang, Qun Liu (2008) Forest-based translation. In: Proceedings of the ACL-08: HLT: 46th annual meeting of the association for computational linguistics: human language technologies, 15–20 June 2008, The Ohio State University, Columbus, pp 192–199
Carl M, Way A (eds) (2003) Recent advances in example-based machine translation. Kluwer Academic, Dordrecht. Introduction to the workshop on EBMT, xxxi
Mischo W (1982) Library of congress subject headings. Catalog Classif Quart 1(2):105–124
Morimoto T et al (1993) ATR’s speech translation system: ASURA. In: Proceedings of EuroSpeech 93, pp 1291–1294
Na Hwidong, Lee Jong-Hyeok (2011) Multi-word unit dependency forest-based translation rule extraction. In: Proceedings of SSST-5, fifth workshop on syntax, semantics and structure in statistical translation, ACL HLT 2011, pp 41–51
Nakazawa T, Kurohashi S (2008) Linguistically-motivated tree-based probabilistic phrase alignment. In: AMTA-2008 MT at work: proceedings of the eighth conference of the association for machine translation in the Americas, Waikiki, 21–25 Oct 2008, pp 163–171
Nakazawa T, Kurohashi S (2011) Statistical phrase alignment model using dependency relation probability. In: Proceedings of SSST-3, third workshop on syntax and structure in statistical translation, pp 10–18
Navigli R, Ponzetto SP (2010) Babelnet: building a very large multilingual semantic network. In: Proceedings of the 48th annual meeting of the association for computational linguistics, pp 216–225
Nießen S, Och FJ, Leusch G, Ney H (2000) An evaluation tool for machine translation: fast evaluation for MT research. In: Proceedings of LREC-2000: second international conference on language resources and evaluation, Athens, 31 May–2 June 2000, pp 39–45
Och FJ (2003) Minimum error rate training in statistical machine translation. In: Proceedings of ACL, pp 160–167
Och FJ, Ney H (2002) Discriminative training and maximum entropy models for statistical machine translation. In: ACL 2002: proceedings of the 40th annual meeting of the association for computational linguistics (best paper award), Philadelphia, July 2002, pp 295–302
Och FJ, Ney H (2003a) A systematic comparison of various statistical alignment models. Comput Linguist 29(1):19–51
Och FJ, Ney H (2003b) The alignment template approach to statistical machine translation. Comput Linguist 30(4):417–449
Oepen S, Lønning JT (2006) Discriminant-based MRS banking. In: Proceedings of the 4th international conference on language resources and evaluation
Oepen S, Dyvik H, Lønning JT, Velldal E, Beermann D, Carroll J, Flickinger D, Hellan L, Johannessen JB, Meurer P, Nordgård T, Rosén V (2004) Som å kapp-ete med trollet? Towards MRS-based Norwegian–English machine translation. In: Proceedings of the 10th international conference on theoretical and methodological issues in machine translation, Baltimore, pp 11–20
Oepen S, Dyvik H, Flickinger D, Lønning JT, Meurer P, Rosén V (2005) Holistic regression testing for high-quality MT. Some methodological and technological reflections. In: Proceedings of the 10th annual conference of the European association for machine translation, Budapest
Oepen S, Velldal E, Lønning JT, Meurer P, Rosén V, Flickinger D (2007) Towards hybrid quality-oriented machine translation—on linguistics and probabilities in MT. In: TMI-2007: proceedings of the 11th international conference on theoretical and methodological issues in machine translation, Skövde, 7–9 Sept 2007, pp 144–153
Oflazer K, El-Kahlout ID (2007) Exploring different representational units in English-to-Turkish statistical machine translation. In: Proceedings of the second workshop on statistical machine translation, ACL, pp 25–32
Owczarzak K, van Genabith J (2007) Evaluating machine translation with LFG dependencies. Mach Trans 21(2):95–119
Papineni K, Roukos S, Ward T, Wei-jing Zhu (2002) BLEU: a method for automatic evaluation of machine translation. In: Proceedings of ACL
Pekar V, Mitkov R, Blagoev D, Mulloni A (2006) Finding translations for low-frequency words in comparable corpora. Mach Transl 20(4):247–266
Petrov S, Barrett L, Thibaux R, Klein D (2006) Learning accurate, compact, and interpretable tree annotation. In: Proceedings of COLING-ACL
Phillips AB (2011) Cunei: open-source machine translation with relevance-based models of each translation instance. Mach Trans 25(2):161–177 (special issue: free/open-source machine translation)
Philpot A, Hovy E, Pantel P (2010) The OMEGA ontology. In: Huang CR et al (eds) Ontology and the lexicon. Cambridge University Press, Cambridge, UK, pp 258–270
Pi-Chuan Chang, Huihsin Tseng, Dan Jurafsky, Manning CD (2011) Discriminative reordering with Chinese grammatical relations features. In: Proceedings of SSST-3, third workshop on syntax and structure in statistical translation, Boulder, pp 51–59
Pighin D, Màrquez L (2011) Automatic projection of semantic structures: an application to pairwise translation ranking. In: Proceedings of SSST-5, fifth workshop on syntax, semantics and structure in statistical translation, Portland, pp 1–9
Popović M, Ney H (2006) POS-based reorderings for statistical machine translation. In: LREC-2006: fifth international conference on language resources and evaluation, Genoa, 22–28 May 2006, pp 1278–1283
Przybocki M, Peterson K, Bronsart S (2008) Translation adequacy and preference evaluation tool (TAP-ET). In: LREC 2008: 6th language resources and evaluation, Marrakech, 26–30 May 2008, 8pp
Qin Gao, Vogel S (2011) Proceedings of SSST-5, fifth workshop on syntax, semantics and structure in statistical translation, ACL HLT 2011, Portland, pp 107–115
Quirk C, Menezes A, Cherry C (2005) Dependency treelet translation: syntactically informed phrasal SMT. In: ACL-2005: 43rd annual meeting of the association for computational linguistics, University of Michigan, Ann Arbor, 25–30 June 2005, pp 271–279
Ravi S, Knight K (2010) Does GIZA++ make search errors? Comput Linguist Squibs Discuss 36(3):295–302
Ren, Zhixiang, Yajuan Lü, Jie Cao, Qun Liu, and Yun Huang (2009) Improving statistical machine translation using domain bilingual multiword expressions. In: Proceedings of MWE 2009 (ACL-IJCNLP)
Riezler S, Maxwell JT III (2006) Grammatical machine translation. In: Proceedings of the human language technology conference and annual meeting of the North American association for computational linguistics, pp 248–255
Rosti AVI, Bing Xiang, Matsoukas S, Schwartz R, Ayan NF, Dorr BJ (2007a) Combining outputs from multiple machine translation systems. In: Proceedings of HLT-NAACL, Rochester, pp 228–235
Rosti AVI, Matsoukas S, Schwartz R (2007b) Improved word-level system combination for machine translation. In: Proceedings of ACL-07, pp 312–319
Rosti AVI, Bing Zhang, Matsoukas S, Schwartz R (2008) Incremental hypothesis alignment for building confusion networks with application to machine translation system combination. In: Proceedings of ACL/WMT 2008, pp 183–186
Roturier J (2009) Deploying novel MT technology to raise the bar for quality: a review of key advantages and challenges. Keynote slides, the twelfth machine translation summit, International association for machine translation, Ottawa
Saers M (2011) Translation as linear transduction: models and algorithms for efficient learning in statistical machine translation. Ph.D. thesis, Uppsala University, Department of Linguistics and Philology, Uppsala
Saers M, Nivre J, Wu D (2010) Word alignment with stochastic bracketing linear inversion transduction grammar. In: HLT/NAACL2010, ACL, pp 341–344
Sag IA, Baldwin T, Bond F, Copestake A, Flickinger D (2001) Multiword expressions: a pain in the neck for NLP. In: Proceedings of WP
Sanchez-Cartagena VM, Pérez-Ortiz JA (2010) Tradubi: open-source social translation for the Apertium machine translation platform. In: Open source tools for machine translation, MT Marathon 2010, pp 47–56
Sanchez-Cartagena VM, Sanchez-Martínez F, Pérez-Ortiz JA (2011) Integrating shallow-transfer rules into phrase-based statistical machine translation. In: MT Summit XIII: the thirteenth machine translation summit, Asia-Pacific association for machine translation (AAMT), Xiamen, 19–23 Sept 2011, pp 562–569
Sanchez-Martinez F, Forcada ML (2009) Inferring shallow-transfer machine translation rules from small parallel corpora. J Artif Intell Res 34:605–635
Schafer C, Yarowsky D (2003) Statistical machine translation using coercive two-level syntactic transduction. In: EMNLP-2003: proceedings of the 2003 conference on empirical methods in natural language processing, a meeting of SIGDAT, a special interest group of the ACL, held in conjunction with ACL-03, Sapporo, 11–12 July 2003, 8pp
Schafer C, Yarowsky D (2002) Inducing translation lexicons via diverse similarity measures and bridge languages. In: CoNLL, Taipei
Schmid H (2006) Trace prediction and recovery with unlexicalized PCFGs and slash features. In: Proceedings of the COLING-ACL, Sydney
Schwenk H, Abdul-Rauf S, Barrault L, Senellart J (2009) SMT and SPE machine translation system for WMT’09. In: Proceedings of the fourth workshop on statistical machine translation, Athens, 30–31 March 2009, pp 130–134
Shen Libin, Jinxi Xu, Weischedel R (2008) A new string-to-dependency machine translation algorithm with a target dependency language model. In: ACL-08: HLT: 46th annual meeting of the association for computational linguistics: human language technologies, proceedings of the conference, The Ohio State University, Columbus, 15–20 June 2008, pp 577–585
Shen Libin, Jinxi Xu, Bing Zhang, Matsoukas S, Weischedel R (2009) Effective use of linguistic and contextual information for statistical machine translation. In: EMNLP-2009: proceedings of the 2009 conference on empirical methods in natural language processing, Singapore, 6–7 Aug 2009, pp 72–80
Shu Cai, Chiang D, Goldberg Y (2011) Language-independent parsing with empty elements. In: Proceedings of the 49th annual meeting of the ACL, Portland, pp 212–216
Sim KC, Byrne WJ, Gales MJF, Sahbi H, Woodland PC (2007) Consensus network decoding for statistical machine translation system combination. In: Proceedings of the 32nd IEEE international conference on acoustics, speech, and signal processing (ICASSP), pp 105–108
Simard M, Ueffing N, Isabelle P, Kuhn R (2007) Rule-based translation with statistical phrase-based post-editing. In: Proceedings of the second workshop on statistical machine translation, ACL, pp 203–206
Simard M, Goutte C, Isabelle P (2007) Statistical phrase-based post-editing. In: NAACL-HLT-2007: human language technology: the conference of the North American chapter of the association for computational linguistics, Rochester, 22–27 April 2007, pp 508–515
Smith DA, Eisner J (2006) Minimum risk annealing for training log-linear models. In: Proceedings of the COLING/ACL on main conference poster sessions, ACL, pp 787–794
Snover M, Dorr B, Schwartz R, Micciulla L, Makhoul J (2006) A study of translation edit rate with targeted human annotation. In: Proceedings of the 7th conference of the association for machine translation in the Americas (AMTA-2006), Cambridge, MA, Aug, pp 223–231
Soltau H, Metze F, Fügen C, Waibel A (2001) A one pass-decoder based on polymorphic linguistic context assignment. In: IEEE ASRU, Madonna di Campiglio
Somers H (2003) An overview of EBMT. In: Carl M, Way A (eds) Recent advances in example-based machine translation. Kluwer Academic, Dordrecht, pp 3–57
Specia L, Cancedda N, Turchi M, Cristianini N (2009) Estimating the sentence-level quality of machine translation systems. In: Proceedings of the 13th annual conference of the EAMT, pp 28–35
Specia L, Saunders C, Turchi M, Wang Z, Shawe-Taylor J (2009) Improving the confidence of machine translation quality estimates. MT summit XII
Stein D, Peitz S, Vilar D, Ney H (2010) A cocktail of deep syntactic features for hierarchical machine translation. In: AMTA 2010: the ninth conference of the association for machine translation in the Americas, Denver, 31 Oct–4 Nov 2010, 9pp
Stolcke A (2002) SRILM – an extensible language modeling toolkit. In: Proceedings of the international conference of spoken language processing, vol 2, Denver, pp 901–904
Suzuki H (2011) Automatic post-editing based on SMT and its selective application by sentence-level automatic quality evaluation, MTS, pp 156–163
Tan Ming, Wenli Zhou, Lei Zheng, Shaojun Wang (2012) A scalable distributed syntactic, semantic and lexical language model, to appear in computational linguistics Just accepted MS, pp 1–66
Teramusa E (2007) Rule based machine translation combined with statistical post-editor for Japanese to English patent translation. Tokyo University of Science, Suwas
Thurmair G (2009) Comparing different architectures of hybrid machine translation systems. In: Proceedings of MT summit XII
Tiedemann J (2011) Bitext alignment. Morgan & Claypool Publishers, viii, 153 p
Tiedemann J, Kotzé G (2009) Building a large machine-aligned parallel treebank. In: Proceedings of the 8th international workshop on treebanks and linguistic theories (TLT), Milan, pp 197–208
Tillmann C (2004) A unigram orientation model for statistical machine translation. In: HLT-NAACL 2004: human language technology conference and North American chapter of the association for computational linguistics annual meeting, short papers, The Park Plaza Hotel, Boston, 2–7 May 2004, pp 101–104
Tinsley J, Hearne M, Way A (2007) Exploiting parallel treebanks to improve phrase-based statistical machine translation. In: Proceedings of the sixth international workshop on treebanks and linguistic theories, pp 175–187
Tonelli S, Delmonte R, Bristot A (2008) Enriching the Venice Italian Treebank with dependency and grammatical relations. In: Proceedings of LREC 2008, Marrakech
Turian JP, Shen L, Melamed ID (2003) Evaluation of machine translation and its evaluation. In: MT Summit IX, New Orleans, 23–27 Sept 2003, pp 386–393
Tyers FM (2009) Rule-based augmentation of training data in Breton-French statistical machine translation. In: Proceedings of the 13th annual conference of the European association for machine translation, pp 213–217
Ueffing N, Haffari G, Sarker A (2007) Semi-supervised model adaptation for statistical machine translation. Mach Trans 21(2):77–94
Vandeghinste V, Martens S (2010) Bottom-up transfer in example-based machine translation. In: Yvon F, Hansen V (eds) Proceedings of the 14th international conference of the European association for machine translation (EAMT-2010), 8pp
Vandeghinste V, Van den Bogaert J, Martens S, Kotzé G (2011) PaCo-MT: parse and corpus-based machine translation. In: Forcada ML, Depraetere H, Vandeghinste V (eds) Proceedings of the 15th annual conference of the European association for machine translation, p 347
Veale T, Way A (1997) Gaijin: a bootstrapping, template-driven approach to example-based MT. In: Proceedings of the 2nd international conference on recent advances in natural language processing, Tzigov Chark, pp 239–244
Velldal E, Oepen S (2006) Statistical ranking in tactical generation. In: Proceedings of the conference on empirical methods in natural language processing, Sydney
Velldal E, Oepen S, Flickinger D (2004) Paraphrasing treebanks for stochastic realization ranking. In: Proceedings of the 3rd workshop on treebanks and linguistic theories, pp 149–160
Venkatapathy S, Sangal R, Joshi A, Gali K (2010) A discriminative approach for dependency based statistical machine translation. In: Proceedings of SSST, pp 66–74
Venugopal A, Zollmann A, Smith NA, Vogel S (2009) Preference grammars: softening syntactic constraints to improve statistical machine translation. In: NAACL HLT 2009: human language technologies: the 2009 annual conference of the North American chapter of the ACL, Boulder, 31 May–5 June 2009, pp 37–45
Vossen P (1998) EuroWordNet: a multilingual database with lexical semantic networks. Kluwer, Dordrecht
Waibel A, Badran A, Black AW, Frederking R, Gates D, Lavie A, Levin L, Lenzo K, Tomokiyo LM, Reichert J, Schultz T, Wallace D, Woszczyna M, Zhang J (2003) Speechalator: two-way speech-to-speech translation in your hand. In: Proceedings of HLT-NAACL 2003, Demonstrations, May–June 2003, pp 29–30
Wallach H (2006) Topic modeling: beyond bag-of-words. In: Proceedings of the 23rd international conference on machine learning (ICML), New York, pp 977–984
Wang S, Wang S, Greiner R, Schuurmans D, Cheng L (2005) Exploiting syntactic, semantic and lexical regularities in language modeling via directed Markov random fields. In: Proceedings of the 22nd international conference on machine learning (ICML), Bonn, pp 953–960
Wang S, Wang S, Cheng L, Greiner R, Schuurmans D (2006) Stochastic analysis of lexical and semantic enhanced structural language model. In: Proceedings of the 8th international colloquium on grammatical inference (ICGI), Tokyo, pp 97–111
Wang Chao, Michael Collins, Philipp Koehn (2007) Chinese syntactic reordering for statistical machine translation. In: Proceedings of the 2007 joint conference on empirical methods in natural language processing and computational natural language learning (EMNLP-CoNLL), Prague, pp 737–745
Wang Wei, Kevin Knight, Daniel Marcu (2007) Binarizing syntax trees to improve syntax-based machine translation accuracy. In: Proceedings of the 2007 joint conference on empirical methods in natural language processing and computational natural language learning, Prague, pp 746–754
Wei Wang, May J, Knight K, Marcu D (2010) Re-structuring, re-labeling, and re-aligning for syntax-based machine translation. Comput Linguist 36(2):247–277
Watanabe Hideo, Sadao Kurohashi, Eiji Aramaki (2000) Finding structural correspondences from bilingual parsed corpus for corpus-based translation. In: Coling 2000 in Europe: the 18th international conference on computational linguistics, proceedings of the conference, Universität des Saarlandes, Saarbrücken, 31 July–4 Aug 2000, pp 906–912
Wehrli E (2007) Fips, a “deep” linguistic multilingual parser. In: Proceedings of the ACL 2007 workshop on deep linguistic processing, Prague, pp 120–127
Wehrli E, Nerima L, Seretan V, Scherrer Y (2009) On-line and off-line translation aids for non-native readers. In: Proceedings of the international multiconference on computer science and information technology, vol 4, pp 299–303
Wehrli E, Seretan V, Nerima L, Russo L (2009) Collocations in a rule-based MT system: a case study evaluation of their translation adequacy. In: EAMT-2009: proceedings of the 13th annual conference of the European association for machine translation, Lluís Màrquez, Harold Somers (eds), 14–15 May 2009, Universitat Politècnica de Catalunya, Barcelona, pp 128–135
Wilks Y (1994) Stone soup and the French room. In: Zampolli A, Calzolari N, Palmer M (eds) Current issues in computational linguistics: in honour of Don Walker, vol 9–10, Linguistica Computazionale. Giardini Editori/Kluwer Academic, Pisa/Dordrecht, pp 585–594
Wilks Y (2009) Machine translation: its scope and limits. Springer, New York, 252 p
Wong Fai, Dong-Cheng Hu, Yu-Hang Mao, Ming-Chui Dong, Yi-Ping Li (2005) Machine translation based on constraint-based synchronous grammar. In: Proceedings of the 2nd international joint conference on natural language, Jeju Island, pp 612–623
Wu D (1997) Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Comput Ling 23(3):378–403
Wu Dekai, Hongsing Wong (1998) Machine translation with a stochastic grammatical channel. In: Coling-ACL ’98: 36th annual meeting of the association for computational linguistics and 17th international conference on computational linguistics, Université de Montréal, Montreal, 10–14 Aug 1998, pp 1408–1415
Wu Shumin, Martha Palmer (2011) Semantic mapping using automatic word alignment and semantic role labeling. In: Proceedings of SSST-5, fifth workshop on syntax, semantics and structure in statistical translation, Portland, pp 21–30
Wu Dekai, Pascale Fung (2009) Semantic roles for SMT: a hybrid two-pass model. In: NAACL HLT 2009: human language technologies: the 2009 annual conference of the North American chapter of the ACL, Short papers, Boulder, 31 May–5 June 2009, pp 13–16
Wu Shumin, Jinho D Choi, Martha Palmer (2010) Detecting cross-lingual semantic similarity using parallel propbanks. In: Proceedings of the 9th conference of the association for machine translation in the Americas
Wuebker J, Mauser A, Ney H (2010) Training phrase translation models with leaving-one-out. In: Proceedings of ACL, pp 475–484
Tong Xiao, Jingbo Zhu, Shujie Yao, Hao Zhang (2011) Document-level consistency verification in machine translation, MTS 2011, pp 131–138
Xiong Deyi, Qun Liu, Shouxun Lin (2006) Maximum entropy based phrase reordering model for statistical machine translation. In: Coling-ACL 2006: proceedings of the 21st international conference on computational linguistics and 44th annual meeting of the association for computational linguistics, Sydney, 17–21 July 2006, pp 521–528
Xiong D, Zhang M, Li H (2010) Error detection for statistical machine translation using linguistic features. In: ACL 2010: the 48th annual meeting of the association for computational linguistics, Uppsala, pp 604–611
Xiong Deyi, Min Zhang, Haizhou Li (2010a) Learning translation boundaries for phrase-based decoding. In: Proceedings of HLT-NAACL 2010
Yamabana Kiyoshi et al (2003) A speech translation system with mobile wireless clients. In: Proceedings of ACL 2003
Yamada Kenji, Kevin Knight (2001) A syntax-based statistical translation model ACL-EACL-2001: 39th annual meeting [of the association for computational linguistics] and 10th conference of the European chapter (of ACL), Toulouse, 9–11 July 2001, pp 523–530
Yamada Kenji, Kevin Knight (2002) A decoder for syntax-based statistical MT. In: ACL-2002: 40th annual meeting of the association for computational linguistics, Philadelphia, July 2002, pp 303–310
Yaqin Yang, Nianwen Xue (2010) Chasing the ghost: recovering empty categories in the Chinese Treebank. In: Proceedings of COLING, Beijing
Yonggang Deng, Jia Xu, Yuqing Gao (2008) Phrase table training for precision and recall: what makes a good phrase and a good phrase pair? In: Proceedings of ACL, Columbus, pp 81–88
Zens Richard, Hermann Ney (2006) Discriminative reordering models for statistical machine translation. In: HLT-NAACL 2006: proceedings of the workshop on statistical machine translation, New York, June 2006, pp 55–63
Zhai Feifei, Jiajun Zhang, Yu Zhou, Chengqing Zong (2011) Simple but effective approaches to improving tree-to-tree model, MTS
Zhang Y (2008) Structured language models for statistical machine translation. Ph.D. dissertation, Carnegie Mellon University
Zhang Hao, Licheng Fang, Peng Xu, Xiaoyun Wu (2011) Binarized forest to string translation. In: ACL-HLT 2011: proceedings of the 49th annual meeting of the association for computational linguistics, Portland, 19–24 June 2011, pp 835–845
Zhang Ying (Joy) http://projectile.sv.cmu.edu/research/public/talks/speechtranslation/sst-survey-joy.pdf
Zhang Ying, Stephan Vogel (2007) PanDoRA: a large-scale two-way statistical machine translation system for hand-held devices. In: Proceedings of MT Summit XI, Copenhagen, pp 10–14
Zhang Min, Hongfei Jiang, Ai Ti Aw, Jun Sun, Sheng Li, Chew Lim Tan (2007) A tree-to-tree alignment-based model for statistical machine translation. In: Proceedings of MT Summit XI, Copenhagen, 10–14 Sept 2007, pp 535–542
Zhang Min, Hongfei Jiang, Aiti Aw, Haizhou Li, Chew Lim Tan, Sheng Li (2008) A tree sequence alignment-based tree-to-tree translation model. In: Proceedings of the conference ACL-08: HLT. 46th annual meeting of the association for computational linguistics: human language technologies, The Ohio State University, Columbus, 15–20 June 2008, pp 559–567
Zhanyi Liu, Haifeng Wang, Hua Wu, Sheng Li (2010) Improving statistical machine translation with monolingual collocation. In: Proceedings of ACL 2010
Zhixiang Ren, Yajuan Lü, Jie Cao, Qun Liu, Yun Huang (2009) Improving statistical machine translation using domain bilingual multiword expressions. In: Proceedings of MWE 2009 (ACL-IJCNLP)
Zhongqiang Huang, Cmejrek M, Zhou B (2010) Soft syntactic constraint for hierarchical phrase-based translation using latent syntactic distributions. In: Proceedings of EMNLP 2010, Stroudsburg
Zollmann A, Venugopal A (2006) Syntax augmented machine translation via chart parsing. In: Proceedings of the workshop on statistical machine translation, New York, pp 138–141
Zong Chengqing, Bo Xu, Taiyi Huang (2002) Interactive Chinese-to-English speech translation based on dialogue management. In: Proceedings of the workshop on speech-to-speech translation: algorithms and systems, pp 61–68
Zong Chengqing, Heyan Huang, Shuming Shi (2008) Application of machine translation during Olympic Games 2008. In: AMTA-2008. MT at work: proceedings of the eighth conference of the association for machine translation in the Americas, Waikiki, 21–25 Oct 2008, pp 470–479
Online MT Systems and Tools
https://apertium.svn.sourceforge.net/svnroot/apertium/branches/apertium-dixtools-paradigmlearning
http://itunes.apple.com/us/app/voicetra-speech-to-speech/id383542155?mt=8
http://www.bbn.com/technology/speech/speech_to_speech_translation
http://www.research.att.com/projects/Speech_Translation/index.html?fbid=0GMC-dWS68d
http://www-01.ibm.com/software/websphere/products/mobilespeech/
http://tldp.org/HOWTO/Speech-Recognition-HOWTO/software.html
http://www.sdl.com/en/language-technology/products/automated-translation/
© 2013 Springer Science+Business Media New York
About this chapter
Cite this chapter
Delmonte, R. (2013). Getting Past the Language Gap: Innovations in Machine Translation. In: Neustein, A., Markowitz, J. (eds) Mobile Speech and Advanced Natural Language Solutions. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6018-3_6
DOI: https://doi.org/10.1007/978-1-4614-6018-3_6
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-6017-6
Online ISBN: 978-1-4614-6018-3
eBook Packages: Engineering, Engineering (R0)