The Resource Building and using comparable corpora, Serge Sharoff ... [et al.], editors, (electronic resource)

Label
Building and using comparable corpora
Title
Building and using comparable corpora
Statement of responsibility
Serge Sharoff ... [et al.], editors
Language
eng
Summary
The 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and terminology mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native speakers of any given language than are translated. This situation resulted in a natural drive towards the use of comparable corpora, i.e. non-parallel texts in the same domain or genre. Nevertheless, this research direction has not produced a single authoritative source suitable for researchers and students coming to the field. The proposed volume provides a reference source, identifying the state of the art in the field as well as future trends. The book is intended for specialists and students in natural language processing, machine translation and computer-assisted translation.
Cataloging source
DKDLA
Image bit depth
0
LC call number
  • P98
  • P98-98.5
Literary form
non fiction
Nature of contents
dictionaries
http://library.link/vocab/relatedWorkOrContributorName
  • SpringerLink
  • Sharov, S. A.
http://library.link/vocab/subjectName
  • Translations
  • Neural Networks (Computer)
  • Linguistics
  • Software
  • Computer science
  • Translators (Computer programs)
  • Computational linguistics
  • Language Translation and Linguistics
  • Information Systems Applications (incl. Internet)
  • COMPUTERS / General
Label
Building and using comparable corpora, Serge Sharoff ... [et al.], editors, (electronic resource)
Instantiates
Publication
Antecedent source
mixed
Bibliography note
Includes bibliographical references
Color
not applicable
Contents
Preface - Building and Using Comparable Corpora. S.Sharoff, R.Rapp, P.Zweigenbaum -- Overviewing Important Aspects of the Last 20 Years of Research in Comparable Corpora. S.Sharoff, R.Rapp, P.Zweigenbaum -- Part I: Compiling and Measuring Comparable Corpora -- Multilingual Corpus Collection. S.Shi, P.Fung -- Automatic Comparable Web Corpora Collection and Bilingual Terminology Extraction for Specialized Dictionary Making. A.Gurrutxaga, I.Leturia, I.San Vicente, X.Saralegi -- Statistical Comparability: Methodological Caveats. R.Köhler -- Methods for Collection and Evaluation of Comparable Documents. M.Lestari Paramita, D.Guthrie, E.Kanoulas, R.Gaizauskas, P.Clough, M.Sanderson -- Measuring the Distance between Comparable Corpora between Languages. S.Sharoff -- Exploiting Comparable Corpora for Lexicon Extraction: Measuring and Improving Corpus Quality. B.Li, E.Gaussier -- Statistical Corpus and Language Comparison on Comparable Corpora. T.Eckart, U.Quasthoff -- Comparable Multilingual Patents as Large-scale Parallel Corpora. B.Lu, B.Tsou -- Part II: Using Comparable Corpora -- Extracting Parallel Phrases from Comparable Data. S.Hewavitharana, S.Vogel -- Exploiting Comparable Corpora. D.S.Munteanu, D.Marcu -- Paraphrase Detection in Comparable Monolingual Corpora. L.Deleger, B.Cartoni, P.Zweigenbaum -- Information Network Construction and Alignment from Automatically Acquired Comparable Corpora. H.Ji, W.-P.Lin -- Bilingual Terminology Mining from Comparable Corpora. B.Daille, E.Morin -- The Place of Comparable Corpora in Providing Terminological Reference Information to Online Translators: A Strategic Framework. K.Kageura, T.Abekawa -- Old Needs, New Solutions: Comparable Corpora for Language Professionals. S.Bernardini, A.Ferraresi -- Exploiting the Incomparability of Comparable Corpora for Contrastive Linguistics and Translation Studies. S.Neumann, S.Hansen-Schirra
Dimensions
unknown
Extent
1 online resource (xiii, 335 p.)
File format
multiple file formats
Form of item
  • online
  • electronic
Isbn
9783642201271
Level of compression
uncompressed
Other control number
10.1007/978-3-642-20128-8
Other physical details
ill. (some col.)
Quality assurance targets
absent
Reformatting quality
access
Specific material designation
remote
System control number
  • (OCoLC)868638906
  • (OCoLC)ocn868638906

Library Locations

  • African Studies Library
    771 Commonwealth Avenue, 6th Floor, Boston, MA, 02215, US
  • Alumni Medical Library
    72 East Concord Street, Boston, MA, 02118, US
  • Astronomy Library
    725 Commonwealth Avenue, 6th Floor, Boston, MA, 02445, US
  • Fineman and Pappas Law Libraries
    765 Commonwealth Avenue, Boston, MA, 02215, US
  • Frederick S. Pardee Management Library
    595 Commonwealth Avenue, Boston, MA, 02215, US
  • Howard Gotlieb Archival Research Center
    771 Commonwealth Avenue, 5th Floor, Boston, MA, 02215, US
  • Mugar Memorial Library
    771 Commonwealth Avenue, Boston, MA, 02215, US
  • Music Library
    771 Commonwealth Avenue, 2nd Floor, Boston, MA, 02215, US
  • Pickering Educational Resources Library
    2 Silber Way, Boston, MA, 02215, US
  • School of Theology Library
    745 Commonwealth Avenue, 2nd Floor, Boston, MA, 02215, US
  • Science & Engineering Library
    38 Cummington Mall, Boston, MA, 02215, US
  • Stone Science Library
    675 Commonwealth Avenue, Boston, MA, 02445, US