Comparable corpora and computer-assisted translation / (Record no. 12123)
000 -LEADER | |
---|---|
fixed length control field | 08480cam a2201021 i 4500 |
001 - CONTROL NUMBER | |
control field | ocn886940632 |
003 - CONTROL NUMBER IDENTIFIER | |
control field | OCoLC |
005 - DATE AND TIME OF LATEST TRANSACTION | |
control field | 20240523125537.0 |
006 - FIXED-LENGTH DATA ELEMENTS--ADDITIONAL MATERIAL CHARACTERISTICS | |
fixed length control field | m o d |
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION | |
fixed length control field | cr cnu---unuuu |
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION | |
fixed length control field | 140814s2014 enk ob 001 0 eng d |
040 ## - CATALOGING SOURCE | |
Original cataloging agency | N$T |
Language of cataloging | eng |
Description conventions | rda |
-- | pn |
Transcribing agency | N$T |
Modifying agency | OCLCO |
-- | IDEBK |
-- | EBLCP |
-- | DG1 |
-- | CDX |
-- | E7B |
-- | RECBK |
-- | OCLCQ |
-- | YDXCP |
-- | OCLCF |
-- | OCLCO |
-- | DEBSZ |
-- | COO |
-- | DEBBG |
-- | COCUF |
-- | DG1 |
-- | MOR |
-- | LIP |
-- | PIFAG |
-- | ZCU |
-- | MERUC |
-- | OCLCQ |
-- | YDX |
-- | U3W |
-- | OCLCQ |
-- | STF |
-- | ICG |
-- | INT |
-- | VT2 |
-- | AU@ |
-- | OCLCQ |
-- | TKN |
-- | OCLCQ |
-- | DKC |
-- | OCLCQ |
-- | UKAHL |
-- | OCLCQ |
-- | OCLCO |
-- | OCLCQ |
-- | OCLCO |
019 ## - | |
-- | 885019380 |
-- | 961585956 |
-- | 962644922 |
-- | 1007396263 |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 9781119002529 |
Qualifying information | (electronic bk.) |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 1119002524 |
Qualifying information | (electronic bk.) |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 1322007748 |
Qualifying information | (electronic bk.) |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 9781322007748 |
Qualifying information | (electronic bk.) |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 9781119002659 |
Qualifying information | (electronic bk.) |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 1119002656 |
Qualifying information | (electronic bk.) |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
Canceled/invalid ISBN | 9781848216891 |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
Canceled/invalid ISBN | 1848216890 |
029 1# - OTHER SYSTEM CONTROL NUMBER (OCLC) | |
OCLC library identifier | AU@ |
System control number | 000056006140 |
029 1# - OTHER SYSTEM CONTROL NUMBER (OCLC) | |
OCLC library identifier | CHBIS |
System control number | 010259774 |
029 1# - OTHER SYSTEM CONTROL NUMBER (OCLC) | |
OCLC library identifier | CHBIS |
System control number | 010286926 |
029 1# - OTHER SYSTEM CONTROL NUMBER (OCLC) | |
OCLC library identifier | CHNEW |
System control number | 000943443 |
029 1# - OTHER SYSTEM CONTROL NUMBER (OCLC) | |
OCLC library identifier | CHVBK |
System control number | 332764486 |
029 1# - OTHER SYSTEM CONTROL NUMBER (OCLC) | |
OCLC library identifier | CHVBK |
System control number | 480236666 |
029 1# - OTHER SYSTEM CONTROL NUMBER (OCLC) | |
OCLC library identifier | DEBBG |
System control number | BV043397050 |
029 1# - OTHER SYSTEM CONTROL NUMBER (OCLC) | |
OCLC library identifier | DEBBG |
System control number | BV044069516 |
029 1# - OTHER SYSTEM CONTROL NUMBER (OCLC) | |
OCLC library identifier | DEBSZ |
System control number | 431731659 |
029 1# - OTHER SYSTEM CONTROL NUMBER (OCLC) | |
OCLC library identifier | DEBSZ |
System control number | 46887447X |
029 1# - OTHER SYSTEM CONTROL NUMBER (OCLC) | |
OCLC library identifier | GBVCP |
System control number | 814717624 |
029 1# - OTHER SYSTEM CONTROL NUMBER (OCLC) | |
OCLC library identifier | NZ1 |
System control number | 15908972 |
035 ## - SYSTEM CONTROL NUMBER | |
System control number | (OCoLC)886940632 |
Canceled/invalid control number | (OCoLC)885019380 |
-- | (OCoLC)961585956 |
-- | (OCoLC)962644922 |
-- | (OCoLC)1007396263 |
037 ## - SOURCE OF ACQUISITION | |
Stock number | 632025 |
Source of stock number/acquisition | MIL |
050 #4 - LIBRARY OF CONGRESS CALL NUMBER | |
Classification number | P98 |
072 #7 - SUBJECT CATEGORY CODE | |
Subject category code | COM |
Subject category code subdivision | 000000 |
Source | bisacsh |
072 #7 - SUBJECT CATEGORY CODE | |
Subject category code | COM |
Subject category code subdivision | 042000 |
Source | bisacsh |
082 04 - DEWEY DECIMAL CLASSIFICATION NUMBER | |
Classification number | 006.3/5 |
Edition number | 23 |
049 ## - LOCAL HOLDINGS (OCLC) | |
Holding library | MAIN |
100 1# - MAIN ENTRY--PERSONAL NAME | |
Personal name | Delpech, Estelle Maryline, |
Relator term | author. |
245 10 - TITLE STATEMENT | |
Title | Comparable corpora and computer-assisted translation / |
Statement of responsibility, etc. | Estelle Maryline Delpech. |
264 #1 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE | |
Place of production, publication, distribution, manufacture | London : |
Name of producer, publisher, distributor, manufacturer | ISTE, |
Date of production, publication, distribution, manufacture, or copyright notice | 2014. |
300 ## - PHYSICAL DESCRIPTION | |
Extent | 1 online resource |
336 ## - CONTENT TYPE | |
Content type term | text |
Content type code | txt |
Source | rdacontent |
337 ## - MEDIA TYPE | |
Media type term | computer |
Media type code | c |
Source | rdamedia |
338 ## - CARRIER TYPE | |
Carrier type term | online resource |
Carrier type code | cr |
Source | rdacarrier |
490 1# - SERIES STATEMENT | |
Series statement | Cognitive science and knowledge management series |
588 0# - SOURCE OF DESCRIPTION NOTE | |
Source of description note | Print version record. |
504 ## - BIBLIOGRAPHY, ETC. NOTE | |
Bibliography, etc. note | Includes bibliographical references and index. |
520 ## - SUMMARY, ETC. | |
Summary, etc. | Computer-assisted translation (CAT) has always used translation memories, which require the translator to have a corpus of previous translations that the CAT software can use to generate bilingual lexicons. This can be problematic when the translator does not have such a corpus, for instance, when the text belongs to an emerging field. To solve this issue, CAT research has looked into leveraging comparable corpora, i.e. sets of texts, in two or more languages, which deal with the same topic but are not translations of one another. This work has two primary objectives. The first is to assess the contribution of lexicons extracted from comparable corpora in the context of a specialized human translation task. The second is to identify the bilingual-lexicon-extraction methods which best match translators' needs, determining the current limits of these techniques and suggesting improvements. The author focuses, in particular, on the identification of fertile translations, the management of multiple morphological structures, and the ranking of candidate translations. The experiments are carried out on two language pairs (English-French and English-German) and on specialized texts dealing with breast cancer. This research puts significant emphasis on applicability: methodological choices are guided by the needs of the final users. This book is organized in two parts: the first part presents the applicative and scientific context of the research, and the second part is given over to efforts to improve compositional translation. The research work presented in this book received the PhD Thesis award 2014 from the French association for natural language processing (ATALA). |
505 0# - FORMATTED CONTENTS NOTE | |
Formatted contents note | Cover; Title Page; Copyright; Contents; Acknowledgments; Introduction; PART 1: Applicative and Scientific Context; Chapter 1: Leveraging Comparable Corpora for Computer-assisted Translation; 1.1. Introduction; 1.2. From the beginnings of machine translation to comparable corpora processing; 1.2.1. The dawn of machine translation; 1.2.2. The development of computer-assisted translation; 1.2.3. Drawbacks of parallel corpora and advantages of comparable corpora; 1.2.4. Difficulties of technical translation; 1.2.5. Industrial context. |
505 8# - FORMATTED CONTENTS NOTE | |
Formatted contents note | 1.3. Term alignment from comparable corpora: a state-of-the-art; 1.3.1. Distributional approach principle; 1.3.2. Term alignment evaluation; 1.3.2.1. Precision at rank N or TopN; 1.3.2.2. MRR; 1.3.2.3. MAP; 1.3.3. Improvement and variants of the distributional approach; 1.3.3.1. Favoring distributional symmetry; 1.3.3.2. Using syntactic contexts; 1.3.3.3. Relying on trusted elements; 1.3.3.4. Improving semantic information representation; 1.3.3.5. Using second-order semantic affinities; 1.3.3.6. Improving the bilingual resource with semantic classes; 1.3.3.7. Translating polylexical units. |
505 8# - FORMATTED CONTENTS NOTE | |
Formatted contents note | 1.3.4. Influence of data and parameters on alignment quality; 1.3.4.1. Data; 1.3.4.2. Parameters; 1.3.5. Limits of the distributional approach; 1.4. CAT software prototype for comparable corpora processing; 1.4.1. Implementation of a term alignment method; 1.4.1.1. Implementation and data; 1.4.1.2. Extraction of the terms to be aligned; 1.4.1.3. Collecting context vectors; 1.4.1.3.1. Monolexical term context vectors; 1.4.1.4. Polylexical term context vectors; 1.4.1.5. Translation of the source context vectors; 1.4.1.6. Term alignment; 1.4.2. Terminological records extraction. |
505 8# - FORMATTED CONTENTS NOTE | |
Formatted contents note | 1.4.3. Lexicon consultation interface; 1.5. Summary; Chapter 2: User-Centered Evaluation of Lexicons Extracted from Comparable Corpora; 2.1. Introduction; 2.2. Translation quality evaluation methodologies; 2.2.1. Machine translation evaluation; 2.2.1.1. Automatic evaluation measures; 2.2.1.2. Human MT evaluation; 2.2.2. Human translation evaluation; 2.2.2.1. Quantitative models; 2.2.2.2. Non-quantitative models; 2.2.3. Discussion; 2.3. Design and experimentation of a user-centered evaluation; 2.3.1. Methodological aspects; 2.3.1.1. Evaluation criteria and purpose. |
505 8# - FORMATTED CONTENTS NOTE | |
Formatted contents note | 2.3.1.2. Subject matter expertise; 2.3.1.3. Basis for comparison; 2.3.2. Experimentation protocol; 2.3.2.1. Data; 2.3.2.1.1. Comparable corpora and extracted lexica; 2.3.2.1.2. Texts to be translated; 2.3.2.1.3. Resources used in the translation situation; 2.3.2.1.4. Translators and judges; 2.3.2.2. Evaluation progress; 2.3.2.2.1. Translation phase; 2.3.2.2.2. Translation quality evaluation phase; 2.3.3. Results; 2.3.3.1. Lexicons usability; 2.3.3.1.1. Translation speed; 2.3.3.1.2. Use of resources; 2.3.3.1.3. Translators' impressions on the lexicons extracted from comparable corpora. |
590 ## - LOCAL NOTE (RLIN) | |
Local note | John Wiley and Sons |
Provenance (VM) [OBSOLETE] | Wiley Online Library: Complete oBooks |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Computational linguistics. |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Corpora (Linguistics) |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Translators (Computer programs) |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Knowledge management. |
650 #6 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Linguistique informatique. |
650 #6 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Corpus (Linguistique) |
650 #6 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Traducteurs (Logiciels) |
650 #6 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Gestion des connaissances. |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | computational linguistics. |
Source of heading or term | aat |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | COMPUTERS |
General subdivision | General. |
Source of heading or term | bisacsh |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | COMPUTERS |
General subdivision | Natural Language Processing. |
Source of heading or term | bisacsh |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Computational linguistics |
Source of heading or term | fast |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Corpora (Linguistics) |
Source of heading or term | fast |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Knowledge management |
Source of heading or term | fast |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Translators (Computer programs) |
Source of heading or term | fast |
776 08 - ADDITIONAL PHYSICAL FORM ENTRY | |
Relationship information | Print version: |
Main entry heading | Delpech, Estelle Maryline. |
Title | Comparable corpora and computer-assisted translation |
International Standard Book Number | 9781848216891 |
Record control number | (OCoLC)884550434 |
830 #0 - SERIES ADDED ENTRY--UNIFORM TITLE | |
Uniform title | Cognitive science and knowledge management series. |
856 40 - ELECTRONIC LOCATION AND ACCESS | |
Uniform Resource Identifier | https://onlinelibrary.wiley.com/doi/book/10.1002/9781119002659 |
938 ## - | |
-- | Askews and Holts Library Services |
-- | ASKH |
-- | AH26939508 |
938 ## - | |
-- | Askews and Holts Library Services |
-- | ASKH |
-- | AH26918951 |
938 ## - | |
-- | Coutts Information Services |
-- | COUT |
-- | 28706002 |
938 ## - | |
-- | EBL - Ebook Library |
-- | EBLB |
-- | EBL1752712 |
938 ## - | |
-- | ebrary |
-- | EBRY |
-- | ebr10899820 |
938 ## - | |
-- | EBSCOhost |
-- | EBSC |
-- | 817378 |
938 ## - | |
-- | ProQuest MyiLibrary Digital eBook Collection |
-- | IDEB |
-- | cis28706002 |
938 ## - | |
-- | Recorded Books, LLC |
-- | RECE |
-- | rbeEB00582667 |
938 ## - | |
-- | YBP Library Services |
-- | YANK |
-- | 12002291 |
938 ## - | |
-- | YBP Library Services |
-- | YANK |
-- | 12004539 |
938 ## - | |
-- | YBP Library Services |
-- | YANK |
-- | 12888521 |
994 ## - | |
-- | 92 |
-- | INLUM |