6th Workshop on Building and Using Comparable Corpora
Conference Program
| Session: Invited talk | |
| 9:00-10:00 | Three dimensions of comparable corpora: same or different language, given or inferred comparability, means to an end or end in itself |
| Hinrich Schütze | |
| Session: (10:00-12:30) Terminology | |
| 10:00-10:30 | Cross-lingual WSD for Translation Extraction from Comparable Corpora |
| Marianna Apidianaki, Nikola Ljubešić and Darja Fišer | |
| Coffee break: (10:30-11:00) | |
| 11:00-11:30 | Bilingual Lexicon Extraction via Pivot Language and Word Alignment Tool |
| Hong-seok Kwon, Hyeong-won Seo and Jae-hoon Kim | |
| 11:30-12:00 | Using WordNet and Semantic Similarity for Bilingual Terminology Mining from Comparable Corpora |
| Dhouha Bouamor, Nasredine Semmar and Pierre Zweigenbaum | |
| 12:00-12:30 | A Comparison of Smoothing Techniques for Bilingual Lexicon Extraction from Comparable Corpora |
| Amir Hazem and Emmanuel Morin | |
| Session: (14:00-15:00) Comparable corpora | |
| 14:00-14:30 | Finding More Bilingual Webpages with High Credibility via Link Analysis |
| Chengzhi Zhang, Xuchen Yao and Chunyu Kit | |
| 14:30-15:00 | A modular open-source focused crawler for mining monolingual and bilingual corpora from the web |
| Vassilis Papavassiliou, Prokopis Prokopidis and Gregor Thurmair | |
| Session: (15:00-15:30) Posters with Booster Session | |
| 15:00-15:03 | Building basic vocabulary across 40 languages |
| Judit Ács, Katalin Pajkossy and András Kornai | |
| 15:04-15:07 | Scientific registers and disciplinary diversification: a comparable corpus approach |
| Elke Teich, Stefania Degaetano-Ortlieb, Hannah Kermes and Ekaterina Lapshinova-Koltunski | |
| 15:08-15:11 | Improving MT System Using Extracted Parallel Fragments of Text from Comparable Corpora |
| Rajdeep Gupta, Santanu Pal and Sivaji Bandyopadhyay | |
| 15:12-15:15 | VARTRA: A Comparable Corpus for Analysis of Translation Variation |
| Ekaterina Lapshinova-Koltunski | |
| 15:16-15:19 | Building Ontologies from Collaborative Knowledge Bases to Search and Interpret Multilingual Corpora |
| Yegin Genc, Elizabeth Lennon, Winter Mason and Jeffrey Nickerson | |
| 15:20-15:23 | Using a Random Forest Classifier to recognise translations of biomedical terms across languages |
| Georgios Kontonatsios, Ioannis Korkontzelos, Sophia Ananiadou and Jun’ichi Tsujii | |
| 15:24-15:27 | Comparing Multilingual Comparable Articles Based On Opinions |
| Motaz Saad, David Langlois and Kamel Smaili | |
| Coffee break: (15:30-16:00) | |
| Session: (16:00-18:00) Comparable corpora | |
| 16:00-16:30 | Mining for Domain-specific Parallel Text from Wikipedia |
| Magdalena Plamada and Martin Volk | |
| 16:30-17:00 | Gathering and Generating Paraphrases from Twitter with Application to Normalization |
| Wei Xu, Alan Ritter and Ralph Grishman | |
| 17:00-17:30 | Learning Comparable Corpora from Latent Semantic Analysis Simplified Document Space |
| Ekaterina Stambolieva | |
| 17:30-18:00 | Chinese-Japanese Parallel Sentence Extraction from Quasi-Comparable Corpora |
| Chenhui Chu, Toshiaki Nakazawa and Sadao Kurohashi | |
Serge Sharoff 2013-07-10
