ImTranslator blog

translate your mind….

  • Home
  • About
  • About Us
Blue Theme Red Theme
RSS Feeds:
Posts
Comments
  • Categories
  • Google (9)
    • Google Translator (5)
  • Guestbook (93)
  • ImTranslator (40)
    • Plugin for IE and Firefox (10)
    • Portal (39)
      • Development News (12)
      • Features (27)
        • Dictionary (3)
        • eMailer (3)
        • Russian Decoder (3)
        • Spell Checker (3)
        • Translator (6)
        • TTS Voice (2)
        • Virtual Keyboard (3)
      • Tips & Tricks (8)
      • Tutorials (10)
  • Language Information (18)
    • French (3)
    • German (2)
    • Portuguese (2)
    • Russian (4)
    • Spanish (2)
  • Machine Translation (28)
    • Rule-based MT (14)
    • Statistical MT (4)
  • PARS (1)
  • Press Releases (14)
  • PROMT (10)
  • Reviews (1)
  • Systran (63)
    • F.A.Q. (19)
  • Systran products (57)
  • Recent Posts
  • This translator is helping me
  • Your web site is the best ever
  • You are the best!
  • I love your website!
  • This is wonderful to be able to use
  • Such a marvelous software
  • Calendar
June 2007
Sun Mon Tue Wed Thu Fri Sat
« May   Jul »
 12
3456789
10111213141516
17181920212223
24252627282930
  • Links
  • Partners

    • Dependable Translation
  • Our Services

    • ImTranslator Portal
    • ImTranslator in your Language
    • Solutions Guide
    • Virtual Keyboard
    • Spell Checker
    • Russian Decoder
    • ImTranslator - Promt Online
    • Free Translation
    • Web Page Translation
    • Multilingual Search
    • Smart Link Catalog
    • Context Online
    • Promt Online
  • Archives
  • October 2009
  • September 2009
  • July 2009
  • June 2009
  • August 2008
  • January 2008
  • August 2007
  • July 2007
  • June 2007
  • May 2007
  • April 2007
  • March 2007
  • January 2007
  • March 2006
  • Login
  • Log in

How does the statistical machine translation work?

Jun 18th, 2007 by Julia

Statistical machine translation (SMT) is a machine translation paradigm where translations are generated on the basis of statistical models whose parameters are derived from the analysis of bilingual text corpora. The statistical approach contrasts with the rule-based approaches to machine translation. The first ideas of statistical machine translation were introduced by Warren Weaver in 1949. Statistical machine translation was re-introduced in 1991 by researchers at IBM’s Thomas J. Watson Research Center and has contributed to the significant resurgence in interest in machine translation in recent years. As of 2006, it is by far the most widely-studied machine translation paradigm.

The benefits of statistical machine translation over traditional paradigms that are most often cited are the following:

Better use of resources

  • There is a great deal of natural language in machine-readable format.
  • Generally, SMT systems are not tailored to any specific pair of languages.
  • Rule-based translation systems require the manual development of linguistic rules, which can be costly, and which often do not generalize to other languages.

More natural translations

The ideas behind statistical machine translation come out of information theory. Essentially, the document is translated on the probability p(e | f) that a string e in native language (for example, English) is the translation of a string f in foreign language (for example, French). Generally, these probabilities are estimated using techniques of parameter estimation.

Statistical machine translation tries to generate translations using statistical methods based on bilingual text corpora. The statistical translation has been implemented by Google in its Google Translate in the BETA version for certain translations (Arabic, Russian and Asian languages). The system has been trained using the United Nations Documents as a corpus. The corpus is 200 billion words worth of content. It uses existing source and target language translations (done by human translators at the U.N.) to find patterns it then uses to build rules for translating between those languages. Accuracy of the translation has improved.

Source Wikipedia. (Read the entire article. This article is licensed under the GNU Free Documentation License. It uses material from the Wikipedia article “Metasyntactic variable”)

Posted in Statistical MT | 1 Comment

One Comment to “How does the statistical machine translation work?”

  1. on 27 Jun 2008 at 8:59 am1arachesostufo

    ho trovato siti e blog con una barra di bandiere da cliccare per avere la traduzione automatica. come si fa?

    vi prego di darmi qualche consiglio qui o su arachesostufo@libero.it

ImTranslator blog © 2009 All Rights Reserved.
Back to Top