Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 Moses statistical machine translation decoder  





2 Europarl corpus  





3 Other interests and activities in chronological order  





4 Awards and recognition  





5 References  














Philipp Koehn







Add links
 









Article
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Cite this page
Get shortened URL
Download QR code
Wikidata item
 




Print/export  



Download as PDF
Printable version
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 


Philipp Koehn
Born (1971-08-01) 1 August 1971 (age 52)
CitizenshipGermany
Alma materAlbert Schweitzer High School (Erlangen), University of Erlangen-Nuremberg, University of Tennessee, University of Southern California
Known forEuroparl corpus, Moses
AwardsFinalist – 2013 EPO European Inventor Award
Scientific career
Fieldscomputer science, natural language processing, machine translation, cross-language information retrieval
InstitutionsUniversity of Edinburgh, Johns Hopkins University
Doctoral advisorKevin Knight

Philipp Koehn (born 1 August 1971 in Erlangen, West Germany) is a computer scientist and researcher in the field of machine translation.[1][2] His primary research interest is statistical machine translation and he is one of the inventors of a method called phrase based machine translation. This is a sub-field of statistical translation methods that employs sequences of words (or so-called "phrases") as the basis of translation, expanding the previous word based approaches.  A 2003 paper which he authored with Franz Josef Och and Daniel Marcu called Statistical phrase-based translation has attracted wide attention in Machine translation community and has been cited over a thousand times.[3] Phrase based methods are widely used in machine translation applications in industry.

Philipp Koehn received his PhD in computer science in 2003 from the University of Southern California, where he worked at the Information Sciences Institute advised by Kevin Knight. After a year as a postdoctoral fellow under Michael Collins at the Massachusetts Institute of Technology, he joined the University of Edinburgh as a lecturer in the School of Informatics in 2005. He was appointed reader in 2010 and professor in 2012. In 2014, he was appointed professor at the computer science department of The Johns Hopkins University, where he is affiliated with the Center for Language and Speech Processing.

Moses statistical machine translation decoder[edit]

The Moses machine translation decoder is an open source project that was created by and is maintained under the guidance of Philipp Koehn.[4] The Moses decoder is a platform for developing Statistical machine translation systems given a parallel corpus for any language pair.[5] The decoder was mainly developed by Hieu Hoang and Philipp Koehn at the University of Edinburgh and extended during a Johns Hopkins University Summer Workshop and further developed under Euromatrix and GALE project funding.  The decoder (which is part of a complete statistical machine translation toolkit) is the de facto benchmark for research in the field.

Although Koehn continues to play a major role in the development of Moses, the Moses decoder was supported by the European Framework 6 projects Euromatrix, TC-Star, the European Framework 7 projects EuroMatrixPlus, Let's MT, META-NET and MosesCore and the DARPA GALE project, as well as several universities such as the University of Edinburgh, the University of Maryland, ITC-irst, Massachusetts Institute of Technology, and others.  Substantial additional contributors to the Moses decoder include Hieu Hoang, Chris Dyer, Josh Schroeder, Marcello Federico, Richard Zens, and Wade Shen.

Europarl corpus[edit]

The Europarl corpus is a set of documents that consists of the proceedings of the European Parliament from 1996 to the present.  The corpus has been compiled and expanded by a group of researchers led by Philipp Koehn at University of Edinburgh.  The data that makes up the corpus was extracted from the website of the European Parliament and then prepared for linguistic research.  The latest release (2012) comprised up to 60 million words per language,[6] with 21 European languages represented: Romanic (French, Italian, Spanish, Portuguese, Romanian), Germanic (English, Dutch, German, Danish, Swedish), Slavic (Bulgarian, Czech, Polish, Slovak, Slovene), Finno-Ugric (Finnish, Hungarian, Estonian), Baltic (Latvian, Lithuanian), and Greek.

Other interests and activities in chronological order[edit]

Awards and recognition[edit]

References[edit]

  • ^ "philipp koehn" – Google Akademik
  • ^ Moses Manual
  • ^ mloss | Projects authored by philipp koehn
  • ^ Europarl Home Page
  • ^ SMT Group Edinburgh – Main/HomePage
  • ^ Philipp Koehn's online resume
  • ^ Press Release – CLSI acquires a controlling stake in SYSTRAN
  • ^ "CLSI website". Archived from the original on 4 February 2015. Retrieved 6 May 2020.
  • ^ Philipp Koehn's online resume
  • ^ Omniscien Technologies – About Us/Company
  • ^ Philipp Koehn: Statistical Machine Translation
  • ^ Sánchez-Martínez, Felipe; Pérez-Ortiz, Juan Antonio (2010). "Philipp Koehn, Statistical machine translation". Machine Translation. 24 (3–4): 273–278. doi:10.1007/s10590-010-9083-4.
  • ^ Statistical machine translation – contents
  • ^ Philipp Koehn: Neural Machine Translation
  • ^ EPO: Found in translation: a present-day Rosetta Stone
  • ^ "International Association for Machine Translation". Archived from the original on 24 June 2010. Retrieved 8 September 2016.

  • Retrieved from "https://en.wikipedia.org/w/index.php?title=Philipp_Koehn&oldid=1229398947"

    Categories: 
    Academics of the University of Edinburgh
    German computer scientists
    Living people
    1971 births
    People from Erlangen
    German expatriates in the United States
    Johns Hopkins University faculty
    Natural language processing researchers
    University of Southern California alumni
    Hidden categories: 
    Articles with short description
    Short description matches Wikidata
    EngvarB from February 2018
    Use dmy dates from February 2018
    Articles with hCards
    Articles with ISNI identifiers
    Articles with VIAF identifiers
    Articles with WorldCat Entities identifiers
    Articles with GND identifiers
    Articles with J9U identifiers
    Articles with LCCN identifiers
    Articles with NKC identifiers
    Articles with NTA identifiers
    Articles with DBLP identifiers
    Articles with MATHSN identifiers
    Articles with MGP identifiers
    Articles with SUDOC identifiers
     



    This page was last edited on 16 June 2024, at 16:00 (UTC).

    Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.



    Privacy policy

    About Wikipedia

    Disclaimers

    Contact Wikipedia

    Code of Conduct

    Developers

    Statistics

    Cookie statement

    Mobile view



    Wikimedia Foundation
    Powered by MediaWiki