{"id":752,"date":"2016-08-21T23:22:07","date_gmt":"2016-08-21T23:22:07","guid":{"rendered":"http:\/\/lingenio.de\/?page_id=752"},"modified":"2016-08-24T16:36:32","modified_gmt":"2016-08-24T16:36:32","slug":"translates-linguistic-capabilities","status":"publish","type":"page","link":"https:\/\/lingenio.de\/en\/language-technology\/translates-linguistic-capabilities\/","title":{"rendered":"<i>translate<\/i>&#8216;s Linguistic Capabilities"},"content":{"rendered":"<p style=\"text-align: justify;\">The objective of a machine translation system is to translate as well and as quickly as possible. Translation quality is dependent on a number of parameters, the most important being:<\/p>\n<ul>\n<li>suitability of the source text for machine translation: sentence length and complexity, ambiguity<\/li>\n<li>availability of the necessary vocabulary<\/li>\n<li>parsing and translation capability of the system<\/li>\n<li>stylistic standard expected of the translation.<\/li>\n<\/ul>\n<p style=\"text-align: justify;\">The <em>translate<\/em> system has been extensively tested, both systematically for handling linguistic constructions and translation problems, and by use on sample texts, mostly taken from the field of computing.<\/p>\n<p style=\"text-align: justify;\">The speed at which <em>translate<\/em> translates obviously depends on the available hardware and software configuration, but is also dependent on the complexity of the sentences to be translated.<\/p>\n<p style=\"text-align: justify;\">Some examples will demonstrate what <em>translate<\/em> is capable of. They have been translated from English into German as printed, and the results can be readily reproduced using the corresponding settings.<\/p>\n<h2>Homonyms<\/h2>\n<p style=\"text-align: justify;\">Homonyms are words with several different meanings. These different meanings can have completely different morphosyntactic properties. 
A word can be more than one part of speech, for example, such as <em>walk<\/em>, which is a verb and a noun. Homonyms are a common phenomenon in English:<\/p>\n<blockquote><p>He watches her watches.<br \/>\nShe books some books.<br \/>\nIn order to eat they order a meal.<br \/>\nThey can can the fish in a can.<\/p><\/blockquote>\n<p style=\"text-align: justify;\">are sentences which demonstrate two (in the last sentence, three) meanings of words. These are stored in the dictionary together with criteria for when to use which meaning. <em>translate<\/em> provides the following German translations:<\/p>\n<blockquote><p>Er beobachtet ihre Uhren.<br \/>\nSie bucht einige B\u00fccher.<br \/>\nUm zu essen, bestellen sie eine Mahlzeit.<br \/>\nSie k\u00f6nnen den Fisch in einer Dose eindosen.<\/p><\/blockquote>\n<h2>Semantic Types<\/h2>\n<p style=\"text-align: justify;\">Ambiguity of words can often be described by\u00a0assigning them different semantic types (generic terms), and\u00a0describing the semantic types of their slots (complements).\u00a0A German word like <em>Bank<\/em> can be described as an institution and as furniture. 
Only in the context of a sentence is it possible to decide the specific meaning of Bank and hence the required translation.<\/p>\n<p>With the German verb <em>erheben<\/em> the different semantic types of the slots result in different translations, as the following examples show:<\/p>\n<blockquote><p>Der Staatsanwalt erhob Anklage gegen den M\u00f6rder.<\/p>\n<p>The public prosecutor brought charges against the murderer.<\/p><\/blockquote>\n<blockquote><p>Der Gemeinderat hat eine Geb\u00fchr auf Abfall erhoben.<\/p>\n<p>The district council has levied charges on waste.<\/p><\/blockquote>\n<blockquote><p>Die Ten\u00f6re erhoben die Stimme.<\/p>\n<p>The tenors raised the voice.<\/p><\/blockquote>\n<blockquote><p>Er erhob sich.<\/p>\n<p>He rose.<\/p><\/blockquote>\n<h2>Compound Words<\/h2>\n<p style=\"text-align: justify;\">Compound words are groups of words the meaning of which cannot be derived from their component parts, and which therefore must be translated differently than the individual words they contain. They must be included in the dictionary if they are to be translated correctly. Compound words can contain all parts of speech; their component parts are often inflected (i.e. they change their form) and can often occur separated from each other. 
It is not easy to enter the properties of compound words correctly, and it was decided that users of Translate should not define all the different types of compound words.<\/p>\n<p style=\"text-align: justify;\">The dictionary entry for the German verb <em>stellen<\/em> includes the information that when used together with <em>Verf\u00fcgung<\/em> it should be translated as <em>provide<\/em> (rather than using the normal translation).<\/p>\n<blockquote><p>Ich werde die Vase zur Lampe stellen.<\/p>\n<p>I will place the vase next to the lamp.<\/p><\/blockquote>\n<blockquote><p>Ich werde die Vase zur Verf\u00fcgung stellen.<\/p>\n<p>I will provide the vase.<\/p><\/blockquote>\n<p>Some compound words are mirrored in German and English:<\/p>\n<blockquote><p>Er verlor sein Gesicht.<\/p>\n<p>He lost his face.<\/p><\/blockquote>\n<p style=\"text-align: justify;\">There is then no need for special definitions in the dictionary.<\/p>\n<p style=\"text-align: justify;\">The most common compound words in English are noun-noun compounds, nouns with prepositional attributes and adjective-noun groups such as:<\/p>\n<blockquote><p>railway station<\/p>\n<p>Bahnhof<\/p><\/blockquote>\n<blockquote><p>table of contents<\/p>\n<p>Inhaltsverzeichnis<\/p><\/blockquote>\n<blockquote><p>environmental pollution<\/p>\n<p>Umweltverschmutzung<\/p><\/blockquote>\n<h2>Passive Constructions<\/h2>\n<p style=\"text-align: justify;\">English passive constructions differ somewhat from German ones. 
<em>translate<\/em> takes account of this difference when it translates.<\/p>\n<p style=\"text-align: justify;\">English passive constructions are created using <em>to be<\/em>, but German passive constructions use <em>to become<\/em> (<em>werden<\/em>).\u00a0In German, creating a passive sentence makes only the accusative object the subject, but in English the indirect object can be made passive in addition to the direct object.\u00a0When the indirect object is the subject of an English passive sentence, the roles must be switched in the translated version.\u00a0The following active sentences result in the same translation:<\/p>\n<blockquote><p>Alice has given John the book.<br \/>\nAlice has given the book to John.<\/p>\n<p>Alice hat John das Buch gegeben.<\/p><\/blockquote>\n<p>When the direct object is made passive, this changes to:<\/p>\n<blockquote><p>The book was given to John by Alice.<\/p>\n<p>Das Buch wurde John von Alice gegeben.<\/p><\/blockquote>\n<p>When the indirect object is made passive, the result is:<\/p>\n<blockquote><p>John was given the book by Alice.<\/p>\n<p>John wurde das Buch von Alice gegeben.<\/p><\/blockquote>\n<p>There are two different forms of the passive in German:<\/p>\n<p>Expressing a process:<\/p>\n<blockquote><p>Das Programm wird geladen.<\/p><\/blockquote>\n<p>Expressing a completed state:<\/p>\n<blockquote><p>Das Programm ist geladen.<\/p><\/blockquote>\n<p>When translated into English, both sentences are rendered as:<\/p>\n<blockquote><p>The program is loaded.<\/p><\/blockquote>\n<p style=\"text-align: justify;\">The problem here is that when translating from English into German it is often difficult to tell which option is correct.<\/p>\n<h2>Coordinating Conjunctions<\/h2>\n<p>Translate uses a complex algorithm to analyze coordinating conjunctions, which makes it possible to correctly translate both simple conjunctions (like <em>and<\/em> or <em>or<\/em>) and conjunctional phrases (like <em>both &#8230; 
and<\/em>):<\/p>\n<blockquote><p>John hears and Mary sees the car.<\/p>\n<p>John h\u00f6rt, und Mary sieht das Auto.<\/p><\/blockquote>\n<blockquote><p>Both John and Mary see the car.<\/p>\n<p>Sowohl John als auch Mary sehen das Auto.<\/p><\/blockquote>\n<blockquote><p>John Cleverman wants to buy a new car but tries to avoid having to pay too much for it.<\/p>\n<p>John Cleverman will ein neues Auto kaufen, aber versucht, es zu vermeiden, zu viel daf\u00fcr bezahlen zu m\u00fcssen.<\/p><\/blockquote>\n<h2>Implicit Subjects<\/h2>\n<p style=\"text-align: justify;\">Translate is capable of recognizing the implicit subjects of incomplete verb forms. The English verb <em>want<\/em> is an example. The system recognizes that in the following examples:<\/p>\n<blockquote><p>John wants to leave.<\/p>\n<p>John will gehen.<\/p><\/blockquote>\n<p>the subject of <em>leave<\/em> is linked to the subject of <em>wants<\/em>, while in<\/p>\n<blockquote><p>John wants Frank to leave.<\/p>\n<p>John will, dass Frank geht.<\/p><\/blockquote>\n<p>the subject of <em>leave<\/em> is the object of <em>wants<\/em> (Frank).<\/p>\n<p style=\"text-align: justify;\">Of course, there are exceptions to this general rule which do not present any problems.<\/p>\n<p>Both in<\/p>\n<blockquote><p>John promised to leave.<\/p>\n<p>John versprach zu gehen.<\/p><\/blockquote>\n<p>and in<\/p>\n<blockquote><p>John promised Frank to leave.<\/p>\n<p>John versprach Frank zu gehen.<\/p><\/blockquote>\n<p style=\"text-align: justify;\">the subject of the infinitive clause is linked to the subject of the main clause.<\/p>\n<h2>Interrogative Sentences<\/h2>\n<p style=\"text-align: justify;\">As you would expect, <em>translate<\/em> also handles interrogative sentences correctly. 
Here are some examples:<\/p>\n<blockquote><p>When did you arrive?<\/p>\n<p>Wann kamen Sie an?<\/p><\/blockquote>\n<blockquote><p>When will she leave us?<\/p>\n<p>Wann verl\u00e4sst sie uns?<\/p><\/blockquote>\n<blockquote><p>Who did she try to find?<\/p>\n<p>Wen versuchte sie zu finden?<\/p><\/blockquote>\n<blockquote><p>Can you explain to me the way to the station?<\/p>\n<p>K\u00f6nnen Sie mir den Weg zum Bahnhof erkl\u00e4ren?<\/p><\/blockquote>\n<blockquote><p>Who did they say John wanted to find?<\/p>\n<p>Von wem sagten sie, dass John ihn finden wollte?<\/p><\/blockquote>\n<h2>Incomplete Sentences<\/h2>\n<p style=\"text-align: justify;\"><em>translate<\/em> can also be used to translate individual words, and groups of words that make grammatical sense, if they are<\/p>\n<ul>\n<li style=\"text-align: justify;\">terminated by an end-of-sentence mark<\/li>\n<li style=\"text-align: justify;\">terminated by a new-line character<\/li>\n<li style=\"text-align: justify;\">selected, and translated using <em>translate &#8211; Translate Sentence<\/em>.<\/li>\n<\/ul>\n<blockquote><p>tree<\/p>\n<p>Baum<\/p><\/blockquote>\n<blockquote><p>yellow flowers<\/p>\n<p>gelbe Blumen<\/p><\/blockquote>\n<blockquote><p>The building beside the station.<\/p>\n<p>Das Geb\u00e4ude neben dem Bahnhof.<\/p><\/blockquote>\n<blockquote><p>The man watching the car.<\/p>\n<p>Der Mann, der das Auto beobachtet.<\/p><\/blockquote>\n<p style=\"text-align: justify;\">Please note that it can be more difficult to resolve ambiguity in sentence fragments than in complete sentences, so it is advisable to take particular care over checking such translations.<\/p>\n<h2>Punctuation<\/h2>\n<p>Punctuation is very important for <em>translate<\/em>.<\/p>\n<p style=\"text-align: justify;\"><em>End-of-sentence characters<\/em> like periods (.), exclamation marks (!), and question marks (?) are used to break down a text into individual sentences. 
Note that the period can have a number of different functions:<\/p>\n<ol>\n<li>end of sentence<\/li>\n<li>to denote abbreviations<\/li>\n<li>decimal period (English)<\/li>\n<li>as a separator when writing large numbers in digits (German)<\/li>\n<li>to denote ordinal numbers (German)<\/li>\n<\/ol>\n<p style=\"text-align: justify;\"><em>Separator characters<\/em> like commas (,), semicolons (;), dashes (-), dashes in lists, and colons (:) are used to separate sentence parts from each other. When analyzing the source-language text, <em>translate<\/em> uses a certain license with regard to such separators. However, it should be noted that the presence or absence of separators can involve a decisive change in meaning.<\/p>\n<blockquote><p>Er befiehlt ihm zu helfen.<\/p>\n<p>He orders him to help.<\/p><\/blockquote>\n<p style=\"text-align: justify;\">Unfortunately, it is not possible to ensure that <em>translate<\/em> always puts commas in the right place, so you should always check translations carefully for correct commas.<\/p>\n<p style=\"text-align: justify;\"><em>Parentheses<\/em> are symbols that occur in pairs, such as round brackets, square brackets, braces, dashes and quotation marks, that can enclose words or groups of words.<\/p>\n<blockquote><p>John wears the (blue) shirt (which he bought yesterday).<\/p>\n<p>John tr\u00e4gt das (blaue) Hemd (das er gestern kaufte).<\/p><\/blockquote>\n<blockquote><p>John, after he had left the office, went to the bank.<\/p>\n<p>John ging, nachdem er das\u00a0B\u00fcro verlassen hatte, zur Bank.<\/p><\/blockquote>\n<blockquote><p>I like the book I bought yesterday.<\/p>\n<p>Das Buch, das ich gestern kaufte, gef\u00e4llt mir.<\/p><\/blockquote>\n<h2>Ambiguity<\/h2>\n<p style=\"text-align: justify;\">The ambiguity of natural language is one of the major problems faced by machine translation. 
<em>translate<\/em> is equipped with a number of strategies for dealing with language ambiguity, including:<\/p>\n<ul>\n<li style=\"text-align: justify;\">coding the different meanings of words and expressions with regard to part of speech, and syntactic and semantic properties<\/li>\n<li style=\"text-align: justify;\">assessing how probable different analyses of a group of words are. The variant with the highest rating is then output as the translation. The rating is based on general grammatical rules. This means that translations that are not complete sentences are excluded when a more complete analysis exists.<\/li>\n<\/ul>\n<p style=\"text-align: justify;\">The English word <em>like<\/em> will be used as an example to explain the principle. There are two translations for <em>like<\/em>, <em>gefallen<\/em> (verb) and <em>wie<\/em> (conjunction). In the following simple English sentence this gives rise to two possibilities:<\/p>\n<blockquote><p>I like it.<\/p>\n<p>Es gef\u00e4llt mir.<br \/>\nIch wie es.\u00a0<em>(wrong !)<\/em><\/p><\/blockquote>\n<p style=\"text-align: justify;\"><em>translate<\/em> recognizes that the second variant is unlikely to be correct, and does not present it as a proposed translation.<\/p>\n<p style=\"text-align: justify;\">If a sentence can have several meanings, and there are therefore several correct translations, one variant is always the simplest one in grammatical terms. The program opts for this variant. The translation of the following ambiguous question corresponds to the less probable meaning:<\/p>\n<blockquote><p>Which horse do you want to win?<\/p>\n<p>Welches Pferd wollen Sie gewinnen?<\/p><\/blockquote>\n<h2>Multiple Translations<\/h2>\n<p style=\"text-align: justify;\">There are often several different ways of translating a sentence. Translate assesses the different translations and generally outputs the one with the highest score. 
It&#8217;s possible that this is not the translation you want, so Translate now allows you to create a number of different translations and have them displayed. Below are some examples, based on English-German translation:<\/p>\n<blockquote><p>They complained to the guide that they could not see.<\/p>\n<p>Sie beklagten sich beim F\u00fchrer dar\u00fcber, dass sie nicht sehen konnten.<br \/>\nSie beschwerten sich beim F\u00fchrer, den sie nicht sehen konnten.<br \/>\nSie klagten zum F\u00fchrer dar\u00fcber, dass sie nicht sehen konnten.<\/p><\/blockquote>\n<blockquote><p>She saw John leaving.<\/p>\n<p>Sie sah John gehen.<br \/>\nSie sah John, als sie ging.<\/p><\/blockquote>\n<blockquote><p>Die F\u00fchrung w\u00e4hlt das Team.<\/p>\n<p>The leadership chooses the team.<br \/>\nThe team chooses the leadership.<\/p><\/blockquote>\n<h2>Influencing Translation Results<\/h2>\n<p style=\"text-align: justify;\"><em>translate<\/em> offers several translation options, and the way they are set can have a major impact on the translation. A key benefit of <em>translate<\/em> is the fact that it also allows words to be translated differently from normal usage in the context of specific subject areas. Consider the English word <em>enter<\/em>, for example. 
It is generally used as the equivalent of the German word <em>betreten<\/em>, but in connection with computers it must be translated as <em>eingeben<\/em>.<\/p>\n<ul>\n<li>Subject areas<\/li>\n<li>Impersonal imperative<\/li>\n<li>Translate impersonal request with imperative<\/li>\n<li>Translate <em>Sie<\/em>\u00a0as <em>you<\/em>\u00a0instead of <em>they<\/em><\/li>\n<li>Translate <em>you<\/em>\u00a0as <em>Sie<\/em>\u00a0instead of\u00a0<em>du<\/em><\/li>\n<li>Recognize what pronouns\u00a0refer to<\/li>\n<li>Line break as end of sentence<\/li>\n<li>American English \/ British English<\/li>\n<li>Provide multiple translations<\/li>\n<li>Time limit per sentence<\/li>\n<\/ul>\n<h2>Spelling<\/h2>\n<p style=\"text-align: justify;\"><em>translate<\/em> makes every effort to follow the rules for German and English spelling and punctuation when producing its translations, including the correct use of upper-case and lower-case letters in German. However, some errors still occur, partly due to specific contexts, partly due to coding errors which were not detected at an early enough stage to allow corrective measures to be taken. Generally, in German translations the current German spelling rules are used, rather than the reformed rules.<\/p>\n<p style=\"text-align: justify;\">With regard to the source-language text, the aim has been to be liberal in the use of spelling rules, assuming they do not lead to ambiguity. In particular, this concerns the rules governing use of upper case \/ lower case in German, as well as writing words as one word or two. 
With respect to the rules for putting commas, an option has been introduced to select between the old style and the new (&#8220;liberal&#8221;) style.<\/p>\n<blockquote><p>Im allgemeinen schreibt der Chef richtig.<br \/>\nIm Allgemeinen schreibt der Chef richtig.<\/p>\n<p>The boss generally writes correctly.<\/p><\/blockquote>\n<blockquote><p>Der Brief ist verlorengegangen.<br \/>\nDer Brief ist verloren gegangen.<\/p>\n<p>The letter has been lost.<\/p><\/blockquote>\n<blockquote><p>Er wei\u00df, da\u00df die Ma\u00dfe stimmen.<br \/>\nEr wei\u00df, dass die Ma\u00dfe stimmen.<\/p>\n<p>He knows that the measures are correct.<\/p><\/blockquote>\n","protected":false},"excerpt":{"rendered":"<p>The objective of a machine translation system is to translate as well and as quickly as possible. Translation quality is dependent on a number of parameters, the most important being: suitability of the source text for machine translation: sentence length and complexity, ambiguity availability of the necessary vocabulary parsing and translation capability of the system &hellip; <a href=\"https:\/\/lingenio.de\/en\/language-technology\/translates-linguistic-capabilities\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;<i>translate<\/i>&#8216;s Linguistic 
Capabilities&#8221;<\/span><\/a><\/p>\n","protected":false},"author":3,"featured_media":0,"parent":737,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_locale":"en_US","_original_post":"752","footnotes":""},"class_list":["post-752","page","type-page","status-publish","hentry","en-US"],"_links":{"self":[{"href":"https:\/\/lingenio.de\/wp-json\/wp\/v2\/pages\/752"}],"collection":[{"href":"https:\/\/lingenio.de\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/lingenio.de\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/lingenio.de\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/lingenio.de\/wp-json\/wp\/v2\/comments?post=752"}],"version-history":[{"count":8,"href":"https:\/\/lingenio.de\/wp-json\/wp\/v2\/pages\/752\/revisions"}],"predecessor-version":[{"id":1348,"href":"https:\/\/lingenio.de\/wp-json\/wp\/v2\/pages\/752\/revisions\/1348"}],"up":[{"embeddable":true,"href":"https:\/\/lingenio.de\/wp-json\/wp\/v2\/pages\/737"}],"wp:attachment":[{"href":"https:\/\/lingenio.de\/wp-json\/wp\/v2\/media?parent=752"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}