000 08480cam a2201021 i 4500
001 ocn886940632
003 OCoLC
005 20240523125537.0
006 m o d
007 cr cnu---unuuu
008 140814s2014 enk ob 001 0 eng d
040 _aN$T
_beng
_erda
_epn
_cN$T
_dOCLCO
_dIDEBK
_dEBLCP
_dDG1
_dCDX
_dE7B
_dRECBK
_dOCLCQ
_dYDXCP
_dOCLCF
_dOCLCO
_dDEBSZ
_dCOO
_dDEBBG
_dCOCUF
_dDG1
_dMOR
_dLIP
_dPIFAG
_dZCU
_dMERUC
_dOCLCQ
_dYDX
_dU3W
_dOCLCQ
_dSTF
_dICG
_dINT
_dVT2
_dAU@
_dOCLCQ
_dTKN
_dOCLCQ
_dDKC
_dOCLCQ
_dUKAHL
_dOCLCQ
_dOCLCO
_dOCLCQ
_dOCLCO
019 _a885019380
_a961585956
_a962644922
_a1007396263
020 _a9781119002529
_q(electronic bk.)
020 _a1119002524
_q(electronic bk.)
020 _a1322007748
_q(electronic bk.)
020 _a9781322007748
_q(electronic bk.)
020 _a9781119002659
_q(electronic bk.)
020 _a1119002656
_q(electronic bk.)
020 _z9781848216891
020 _z1848216890
029 1 _aAU@
_b000056006140
029 1 _aCHBIS
_b010259774
029 1 _aCHBIS
_b010286926
029 1 _aCHNEW
_b000943443
029 1 _aCHVBK
_b332764486
029 1 _aCHVBK
_b480236666
029 1 _aDEBBG
_bBV043397050
029 1 _aDEBBG
_bBV044069516
029 1 _aDEBSZ
_b431731659
029 1 _aDEBSZ
_b46887447X
029 1 _aGBVCP
_b814717624
029 1 _aNZ1
_b15908972
035 _a(OCoLC)886940632
_z(OCoLC)885019380
_z(OCoLC)961585956
_z(OCoLC)962644922
_z(OCoLC)1007396263
037 _a632025
_bMIL
050 4 _aP98
072 7 _aCOM
_x000000
_2bisacsh
072 7 _aCOM
_x042000
_2bisacsh
082 0 4 _a006.3/5
_223
049 _aMAIN
100 1 _aDelpech, Estelle Maryline,
_eauthor.
245 1 0 _aComparable corpora and computer-assisted translation /
_cEstelle Maryline Delpech.
264 1 _aLondon :
_bISTE,
_c2014.
300 _a1 online resource
336 _atext
_btxt
_2rdacontent
337 _acomputer
_bc
_2rdamedia
338 _aonline resource
_bcr
_2rdacarrier
490 1 _aCognitive science and knowledge management series
588 0 _aPrint version record.
504 _aIncludes bibliographical references and index.
520 _aComputer-assisted translation (CAT) has always used translation memories, which require the translator to have a corpus of previous translations that the CAT software can use to generate bilingual lexicons. This can be problematic when the translator does not have such a corpus, for instance, when the text belongs to an emerging field. To solve this issue, CAT research has looked into the leveraging of comparable corpora, i.e. a set of texts, in two or more languages, which deal with the same topic but are not translations of one another. This work had two primary objectives. The first is to assess the input of lexicons extracted from comparable corpora in the context of a specialized human translation task. The second objective is to identify bilingual-lexicon-extraction methods which best match the translators' needs, determining the current limits of these techniques and suggesting improvements. The author focuses, in particular, on the identification of fertile translations, the management of multiple morphological structures, and the ranking of candidate translations. The experiments are carried out on two language pairs (English-French and English-German) and on specialized texts dealing with breast cancer. This research puts significant emphasis on applicability - methodological choices are guided by the needs of the final users. This book is organized in two parts: the first part presents the applicative and scientific context of the research, and the second part is given over to efforts to improve compositional translation. The research work presented in this book received the PhD Thesis award 2014 from the French association for natural language processing (ATALA).
505 0 _aCover; Title Page; Copyright; Contents; Acknowledgments; Introduction; PART 1: Applicative and Scientific Context; Chapter 1: Leveraging Comparable Corpora for Computer-assisted Translation ; 1.1. Introduction; 1.2. From the beginnings of machine translation to comparable corpora processing; 1.2.1. The dawn of machine translation; 1.2.2. The development of computer-assisted translation; 1.2.3. Drawbacks of parallel corpora and advantages of comparable corpora; 1.2.4. Difficulties of technical translation; 1.2.5. Industrial context.
505 8 _a1.3. Term alignment from comparable corpora: a state-of-the-art1.3.1. Distributional approach principle; 1.3.2. Term alignment evaluation; 1.3.2.1. Precision at rank N or TopN; 1.3.2.2. MRR; 1.3.2.3. MAP; 1.3.3. Improvement and variants of the distributional approach; 1.3.3.1. Favoring distributional symmetry; 1.3.3.2. Using syntactic contexts; 1.3.3.3. Relying on trusted elements; 1.3.3.4. Improving semantic information representation; 1.3.3.5. Using second-order semantic affinities; 1.3.3.6. Improving the bilingual resource with semantic classes; 1.3.3.7. Translating polylexical units.
505 8 _a1.3.4. Influence of data and parameters on alignment quality1.3.4.1. Data; 1.3.4.2. Parameters; 1.3.5. Limits of the distributional approach; 1.4. CAT software prototype for comparable corpora processing; 1.4.1. Implementation of a term alignment method; 1.4.1.1. Implementation and data; 1.4.1.2. Extraction of the terms to be aligned; 1.4.1.3. Collecting context vectors; 1.4.1.3.1. Monolexical term context vectors; 1.4.1.4. Polylexical term context vectors; 1.4.1.5. Translation of the source context vectors; 1.4.1.6. Term alignment; 1.4.2. Terminological records extraction.
505 8 _a1.4.3. Lexicon consultation interface1.5. Summary; Chapter 2: User-Centered Evaluation of Lexicons Extracted from Comparable Corpora; 2.1. Introduction; 2.2. Translation quality evaluation methodologies; 2.2.1. Machine translation evaluation; 2.2.1.1. Automatic evaluation measures; 2.2.1.2. Human MT evaluation; 2.2.2. Human translation evaluation; 2.2.2.1. Quantitative models; 2.2.2.2. Non-quantitative models; 2.2.3. Discussion; 2.3. Design and experimentation of a user-centered evaluation; 2.3.1. Methodological aspects; 2.3.1.1. Evaluation criteria and purpose.
505 8 _a2.3.1.2. Subject matter expertise2.3.1.3. Basis for comparison; 2.3.2. Experimentation protocol; 2.3.2.1. Data; 2.3.2.1.1. Comparable corpora and extracted lexica; 2.3.2.1.2. Texts to be translated; 2.3.2.1.3. Resources used in the translation situation; 2.3.2.1.4. Translators and judges; 2.3.2.2. Evaluation progress; 2.3.2.2.1. Translation phase; 2.3.2.2.2. Translation quality evaluation phase; 2.3.3. Results; 2.3.3.1. Lexicons usability; 2.3.3.1.1. Translation speed; 2.3.3.1.2. Use of resources; 2.3.3.1.3. Translators' impressions on the lexicons extracted from comparable corpora.
590 _aJohn Wiley and Sons
_bWiley Online Library: Complete oBooks
650 0 _aComputational linguistics.
650 0 _aCorpora (Linguistics)
650 0 _aTranslators (Computer programs)
650 0 _aKnowledge management.
650 6 _aLinguistique informatique.
650 6 _aCorpus (Linguistique)
650 6 _aTraducteurs (Logiciels)
650 6 _aGestion des connaissances.
650 7 _acomputational linguistics.
_2aat
650 7 _aCOMPUTERS
_xGeneral.
_2bisacsh
650 7 _aCOMPUTERS
_xNatural Language Processing.
_2bisacsh
650 7 _aComputational linguistics
_2fast
650 7 _aCorpora (Linguistics)
_2fast
650 7 _aKnowledge management
_2fast
650 7 _aTranslators (Computer programs)
_2fast
776 0 8 _iPrint version:
_aDelpech, Estelle Maryline.
_tComparable corpora and computer-assisted translation
_z9781848216891
_w(OCoLC)884550434
830 0 _aCognitive science and knowledge management series.
856 4 0 _uhttps://onlinelibrary.wiley.com/doi/book/10.1002/9781119002659
938 _aAskews and Holts Library Services
_bASKH
_nAH26939508
938 _aAskews and Holts Library Services
_bASKH
_nAH26918951
938 _aCoutts Information Services
_bCOUT
_n28706002
938 _aEBL - Ebook Library
_bEBLB
_nEBL1752712
938 _aebrary
_bEBRY
_nebr10899820
938 _aEBSCOhost
_bEBSC
_n817378
938 _aProQuest MyiLibrary Digital eBook Collection
_bIDEB
_ncis28706002
938 _aRecorded Books, LLC
_bRECE
_nrbeEB00582667
938 _aYBP Library Services
_bYANK
_n12002291
938 _aYBP Library Services
_bYANK
_n12004539
938 _aYBP Library Services
_bYANK
_n12888521
994 _a92
_bINLUM
999 _c12123
_d12123