MARC View

000			08480cam a2201021 i 4500
001			ocn886940632
003			OCoLC
005			20240523125537.0
006			m o d
007			cr cnu---unuuu
008			140814s2014 enk ob 001 0 eng d
040			_aN$T _beng _erda _epn _cN$T _dOCLCO _dIDEBK _dEBLCP _dDG1 _dCDX _dE7B _dRECBK _dOCLCQ _dYDXCP _dOCLCF _dOCLCO _dDEBSZ _dCOO _dDEBBG _dCOCUF _dDG1 _dMOR _dLIP _dPIFAG _dZCU _dMERUC _dOCLCQ _dYDX _dU3W _dOCLCQ _dSTF _dICG _dINT _dVT2 _dAU@ _dOCLCQ _dTKN _dOCLCQ _dDKC _dOCLCQ _dUKAHL _dOCLCQ _dOCLCO _dOCLCQ _dOCLCO
019			_a885019380 _a961585956 _a962644922 _a1007396263
020			_a9781119002529 _q(electronic bk.)
020			_a1119002524 _q(electronic bk.)
020			_a1322007748 _q(electronic bk.)
020			_a9781322007748 _q(electronic bk.)
020			_a9781119002659 _q(electronic bk.)
020			_a1119002656 _q(electronic bk.)
020			_z9781848216891
020			_z1848216890
029	1		_aAU@ _b000056006140
029	1		_aCHBIS _b010259774
029	1		_aCHBIS _b010286926
029	1		_aCHNEW _b000943443
029	1		_aCHVBK _b332764486
029	1		_aCHVBK _b480236666
029	1		_aDEBBG _bBV043397050
029	1		_aDEBBG _bBV044069516
029	1		_aDEBSZ _b431731659
029	1		_aDEBSZ _b46887447X
029	1		_aGBVCP _b814717624
029	1		_aNZ1 _b15908972
035			_a(OCoLC)886940632 _z(OCoLC)885019380 _z(OCoLC)961585956 _z(OCoLC)962644922 _z(OCoLC)1007396263
037			_a632025 _bMIL
050		4	_aP98
072		7	_aCOM _x000000 _2bisacsh
072		7	_aCOM _x042000 _2bisacsh
082	0	4	_a006.3/5 _223
049			_aMAIN
100	1		_aDelpech, Estelle Maryline, _eauthor.
245	1	0	_aComparable corpora and computer-assisted translation / _cEstelle Maryline Delpech.
264		1	_aLondon : _bISTE, _c2014.
300			_a1 online resource
336			_atext _btxt _2rdacontent
337			_acomputer _bc _2rdamedia
338			_aonline resource _bcr _2rdacarrier
490	1		_aCognitive science and knowledge management series
588	0		_aPrint version record.
504			_aIncludes bibliographical references and index.
520			_aComputer-assisted translation (CAT) has always used translation memories, which require the translator to have a corpus of previous translations that the CAT software can use to generate bilingual lexicons. This can be problematic when the translator does not have such a corpus, for instance, when the text belongs to an emerging field. To solve this issue, CAT research has looked into the leveraging of comparable corpora, i.e. a set of texts, in two or more languages, which deal with the same topic but are not translations of one another. This work had two primary objectives. The first is to assess the input of lexicons extracted from comparable corpora in the context of a specialized human translation task. The second objective is to identify bilingual-lexicon-extraction methods which best match the translators' needs, determining the current limits of these techniques and suggesting improvements. The author focuses, in particular, on the identification of fertile translations, the management of multiple morphological structures, and the ranking of candidate translations. The experiments are carried out on two language pairs (English-French and English-German) and on specialized texts dealing with breast cancer. This research puts significant emphasis on applicability - methodological choices are guided by the needs of the final users. This book is organized in two parts: the first part presents the applicative and scientific context of the research, and the second part is given over to efforts to improve compositional translation. The research work presented in this book received the PhD Thesis award 2014 from the French association for natural language processing (ATALA).
505	0		_aCover; Title Page; Copyright; Contents; Acknowledgments; Introduction; PART 1: Applicative and Scientific Context; Chapter 1: Leveraging Comparable Corpora for Computer-assisted Translation ; 1.1. Introduction; 1.2. From the beginnings of machine translation to comparable corpora processing; 1.2.1. The dawn of machine translation; 1.2.2. The development of computer-assisted translation; 1.2.3. Drawbacks of parallel corpora and advantages of comparable corpora; 1.2.4. Difficulties of technical translation; 1.2.5. Industrial context.
505	8		_a1.3. Term alignment from comparable corpora: a state-of-the-art1.3.1. Distributional approach principle; 1.3.2. Term alignment evaluation; 1.3.2.1. Precision at rank N or TopN; 1.3.2.2. MRR; 1.3.2.3. MAP; 1.3.3. Improvement and variants of the distributional approach; 1.3.3.1. Favoring distributional symmetry; 1.3.3.2. Using syntactic contexts; 1.3.3.3. Relying on trusted elements; 1.3.3.4. Improving semantic information representation; 1.3.3.5. Using second-order semantic affinities; 1.3.3.6. Improving the bilingual resource with semantic classes; 1.3.3.7. Translating polylexical units.
505	8		_a1.3.4. Influence of data and parameters on alignment quality1.3.4.1. Data; 1.3.4.2. Parameters; 1.3.5. Limits of the distributional approach; 1.4. CAT software prototype for comparable corpora processing; 1.4.1. Implementation of a term alignment method; 1.4.1.1. Implementation and data; 1.4.1.2. Extraction of the terms to be aligned; 1.4.1.3. Collecting context vectors; 1.4.1.3.1. Monolexical term context vectors; 1.4.1.4. Polylexical term context vectors; 1.4.1.5. Translation of the source context vectors; 1.4.1.6. Term alignment; 1.4.2. Terminological records extraction.
505	8		_a1.4.3. Lexicon consultation interface1.5. Summary; Chapter 2: User-Centered Evaluation of Lexicons Extracted from Comparable Corpora; 2.1. Introduction; 2.2. Translation quality evaluation methodologies; 2.2.1. Machine translation evaluation; 2.2.1.1. Automatic evaluation measures; 2.2.1.2. Human MT evaluation; 2.2.2. Human translation evaluation; 2.2.2.1. Quantitative models; 2.2.2.2. Non-quantitative models; 2.2.3. Discussion; 2.3. Design and experimentation of a user-centered evaluation; 2.3.1. Methodological aspects; 2.3.1.1. Evaluation criteria and purpose.
505	8		_a2.3.1.2. Subject matter expertise2.3.1.3. Basis for comparison; 2.3.2. Experimentation protocol; 2.3.2.1. Data; 2.3.2.1.1. Comparable corpora and extracted lexica; 2.3.2.1.2. Texts to be translated; 2.3.2.1.3. Resources used in the translation situation; 2.3.2.1.4. Translators and judges; 2.3.2.2. Evaluation progress; 2.3.2.2.1. Translation phase; 2.3.2.2.2. Translation quality evaluation phase; 2.3.3. Results; 2.3.3.1. Lexicons usability; 2.3.3.1.1. Translation speed; 2.3.3.1.2. Use of resources; 2.3.3.1.3. Translators' impressions on the lexicons extracted from comparable corpora.
590			_aJohn Wiley and Sons _bWiley Online Library: Complete oBooks
650		0	_aComputational linguistics.
650		0	_aCorpora (Linguistics)
650		0	_aTranslators (Computer programs)
650		0	_aKnowledge management.
650		6	_aLinguistique informatique.
650		6	_aCorpus (Linguistique)
650		6	_aTraducteurs (Logiciels)
650		6	_aGestion des connaissances.
650		7	_acomputational linguistics. _2aat
650		7	_aCOMPUTERS _xGeneral. _2bisacsh
650		7	_aCOMPUTERS _xNatural Language Processing. _2bisacsh
650		7	_aComputational linguistics _2fast
650		7	_aCorpora (Linguistics) _2fast
650		7	_aKnowledge management _2fast
650		7	_aTranslators (Computer programs) _2fast
776	0	8	_iPrint version: _aDelpech, Estelle Maryline. _tComparable corpora and computer-assisted translation _z9781848216891 _w(OCoLC)884550434
830		0	_aCognitive science and knowledge management series.
856	4	0	_uhttps://onlinelibrary.wiley.com/doi/book/10.1002/9781119002659
938			_aAskews and Holts Library Services _bASKH _nAH26939508
938			_aAskews and Holts Library Services _bASKH _nAH26918951
938			_aCoutts Information Services _bCOUT _n28706002
938			_aEBL - Ebook Library _bEBLB _nEBL1752712
938			_aebrary _bEBRY _nebr10899820
938			_aEBSCOhost _bEBSC _n817378
938			_aProQuest MyiLibrary Digital eBook Collection _bIDEB _ncis28706002
938			_aRecorded Books, LLC _bRECE _nrbeEB00582667
938			_aYBP Library Services _bYANK _n12002291
938			_aYBP Library Services _bYANK _n12004539
938			_aYBP Library Services _bYANK _n12888521
994			_a92 _bINLUM
999			_c12123 _d12123