default search action
5th LREC 2006: Genoa, Italy
- Nicoletta Calzolari, Khalid Choukri, Aldo Gangemi, Bente Maegaard, Joseph Mariani, Jan Odijk, Daniel Tapias:
Proceedings of the Fifth International Conference on Language Resources and Evaluation, LREC 2006, Genoa, Italy, May 22-28, 2006. European Language Resources Association (ELRA) 2006 - Yoshihiko Hayashi, Toru Ishida:
A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons. 1-6 - Claudia Soria, Maurizio Tesconi, Francesca Bertagna, Nicoletta Calzolari, Andrea Marchetti, Monica Monachini:
Moving to dynamic computational lexicons with LeXFlow. 7-12 - Violetta Cavalli-Sforza, Abdelhadi Soudi:
IMORPHĒ: An Inheritance and Equivalence Based Morphology Description Compiler. 13-18 - Alon Itai, Shuly Wintner, Shlomo Yona:
A Computational Lexicon of Contemporary Hebrew. 19-22 - Eneko Agirre, Izaskun Aldezabal, Jone Etxeberria, Eli Izagirre, Karmele Mendizabal, Eli Pociello, Mikel Quintian:
A methodology for the joint development of the Basque WordNet and Semcor. 23-28 - Sabry ElKateb, William Black, Horacio Rodríguez, Musa Alkhalifa, Piek Vossen, Adam Pease, Christiane Fellbaum:
Building a WordNet for Arabic. 29-34 - Valentin Tablan, Tamara Polajnar, Hamish Cunningham, Kalina Bontcheva:
User-friendly ontology authoring using a controlled language. 35-40 - Luke Nezda, Andrew Hickl, John Lehmann, Sarmad Fayyaz:
What in the world is a Shahab?: Wide Coverage Named Entity Recognition for Arabic. 41-46 - Ionas Michailidis, Konstantinos I. Diamantaras, Spiros Vasileiadis, Yannick Frère:
Greek Named Entity Recognition using Support Vector Machines, Maximum Entropy and Onetime. 47-52 - Bruno Pouliquen, Marco Kimler, Ralf Steinberger, Camelia Ignat, Tamara Oellinger, Ken Blackler, Flavio Fuart, Wajdi Zaghouani, Anna Widiger, Ann-Charlotte Forslund, Clive Best:
Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation. 53-58 - Voula Giouli, Alexis Konstandinidis, Elina Desypri, Harris Papageorgiou:
Multi-domain Multi-lingual Named Entity Recognition: Revisiting & Grounding the resources issue. 59-64 - Corina Forascu, Ionut Pistol, Dan Cristea:
Temporality in relation with discourse structure. 65-70 - Branimir Boguraev, Rie Kubota Ando:
Analysis of TimeBank as a Resource for TimeML Parsing. 71-76 - Feng Pan, Rutu Mulkar, Jerry R. Hobbs:
An Annotated Corpus of Typical Durations of Events. 77-82 - Flora Ramírez Bustamante, Alfredo Arnaiz, Mar Ginés:
A Spell Checker for a World Language: The New Microsoft's Spanish Spell Checker. 83-86 - Martin Reynaert:
Corpus-Induced Corpus Clean-up. 87-92 - Flora Ramírez Bustamante, Enrique López Díaz:
Spelling Error Patterns in Spanish for Word Processing Applications. 93-98 - Ernesto William De Luca, Andreas Nürnberger:
Rebuilding Lexical Resources for Information Retrieval using Sense Folder Detection and Merging Methods. 99-102 - José Iria, Fabio Ciravegna:
A Methodology and Tool for Representing Language Resources for Information Extraction. 103-106 - Nizar Habash, Clinton Mah, Sabiha Imran, Randy Calistri-Yeh, Páraic Sheridan:
Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval. 107-112 - Alexander Klassmann, Freddy Offenga, Daan Broeder, Romuald Skiba, Peter Wittenburg:
Comparison of Resource Discovery Methods. 113-116 - Christopher Cieri, Walter D. Andrews, Joseph P. Campbell, George R. Doddington, John J. Godfrey, Shudong Huang, Mark Liberman, Alvin F. Martin, Hirotaka Nakasone, Mark A. Przybocki, Kevin Walker:
The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research. 117-120 - Elena Grishina:
Spoken Russian in the Russian National Corpus (RNC). 121-124 - Emma Barker, Ryuichiro Higashinaka, François Mairesse, Robert J. Gaizauskas, Marilyn A. Walker, Jonathan Foster:
Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives. 125-130 - Abdillahi Nimaan, Pascal Nocera, Jean-François Bonastre:
Towards automatic transcription of Somali language. 131-134 - Catia Cucchiarini, Hugo Van hamme, Olga van Herwijnen, Felix Smits:
JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality. 135-138 - Sylvain Galliano, Edouard Geoffrois, Guillaume Gravier, Jean-François Bonastre, Djamel Mostefa, Khalid Choukri:
Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News. 139-142 - Joseph Polifroni, Marilyn A. Walker:
Learning Database Content for Spoken Dialogue System Design. 143-148 - Djamel Mostefa, Olivier Hamon, Khalid Choukri:
Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR: Results from the first evaluation campaign. 149-154 - Olivier Hamon, Martin Rajman:
X-Score: Automatic Evaluation of Machine Translation Grammaticality. 155-160 - Keith J. Miller, Michelle Vanni:
Formal v. Informal: Register-Differentiated Arabic MT Evaluation in the PLATO Paradigm. 161-166 - Elliott Macklovitch:
TransType2 : The Last Word. 167-172 - Ulrich Schäfer, Daniel Beck:
Automatic Testing and Evaluation of Multilingual Language Technology Resources and Components. 173-178 - Olivier Hamon, Andrei Popescu-Belis, Khalid Choukri, Marianne Dabbadie, Anthony Hartley, Widad Mustafa El Hadi, Martin Rajman, Ismaïl Timimi:
CESTA: First Conclusions of the Technolangue MT Evaluation Campaign. 179-184 - Stephanie M. Strassel, Christopher Cieri, Andrew Cole, Denise DiPersio, Mark Liberman, Xiaoyi Ma, Mohamed Maamouri, Kazuaki Maeda:
Integrated Linguistic Resources for Language Exploitation Technologies. 185-190 - Philipp Cimiano, Matthias Hartung, Esther Ratsch:
Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus. 191-196 - José Iria, Christopher Brewster, Fabio Ciravegna, Yorick Wilks:
An Incremental Tri-Partite Approach To Ontology Learning. 197-202 - Gregory Grefenstette, Fathi Debili, Christian Fluhr, Svitlana Zinger:
Exploiting text for extracting image processing resources. 203-206 - Rita Marinelli, Adriana Roventini, Giovanni Spadoni:
Using Core Ontology for Domain Lexicon Structuring. 207-212 - Sujian Li, Qin Lu, Wenjie Li, Ruifeng Xu:
Interaction between Lexical Base and Ontology with Formal Concept Analysis. 213-218 - Roberto Bartolini, Caterina Caracciolo, Emiliano Giovannetti, Alessandro Lenci, Simone Marchi, Vito Pirrelli, Chiara Renso, Laura Spinsanti:
Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases. 219-224 - Nancy Ide, Laurent Romary:
Representing Linguistic Corpora and Their Annotations. 225-228 - Thierry Declerck:
SynAF: Towards a Standard for Syntactic Annotation. 229-232 - Gil Francopoulo, Monte George, Nicoletta Calzolari, Monica Monachini, Núria Bel, Mandy Pet, Claudia Soria:
Lexical Markup Framework (LMF). 233-236 - Mark van Assem, Aldo Gangemi, Guus Schreiber:
Conversion of WordNet to a standard RDF/OWL representation. 237-242 - Lina Henriksen, Claus Povlsen, Andrejs Vasiljevs:
EuroTermBank - a Terminology Resource based on Best Practice. 243-246 - Giovanni Tummarello, Christian Morbidoni, Fábio N. Kepler, Francesco Piazza, Paolo Puliti:
A novel Textual Encoding paradigm based on Semantic Web tools and semantics. 247-252 - Paula Chesley, Susanne Salmon-Alt:
Automatic extraction of subcategorization frames for French. 253-258 - Anders Berglund, Richard Johansson, Pierre Nugues:
Extraction of Temporal Information from Texts in Swedish. 259-264 - Michael Schiehlen, Kristina Spranger:
The Mass-Count Distinction: Acquisition and Disambiguation. 265-270 - Pavel Smrz:
Automatic Acquisition of Semantics-Extraction Patterns. 271-274 - Yi Zhang, Valia Kordoni:
Automated Deep Lexical Acquisition for Robust Open Texts Processing. 275-280 - Saba Amsalu:
Data-driven Amharic-English Bilingual Lexicon Acquisition. 281-286 - Qian Yang, Jean-Pierre Martens, Nanneke Konings, Henk van den Heuvel:
Development of a phoneme-to-phoneme (p2p) converter to improve the grapheme-to-phoneme (g2p) conversion of names. 287-292 - Stefan Breuer, Sven Bergmann, Ralf Dragon, Sebastian Möller:
Set-up of a Unit-Selection Synthesis with a Prominent Voice. 293-296 - Dominika Oliver, Krzysztof Szklanny:
Creation and analysis of a Polish speech database for use in unit selection synthesis. 297-302 - Javier Pérez, Antonio Bonafonte, Horst-Udo Hain, Eric Keller, Stefan Breuer, Jilei Tian:
ECESS Inter-Module Interface Specification for Speech Synthesis. 303-306 - Marie-Neige Garcia, Christophe d'Alessandro, Gérard Bailly, Philippe Boula de Mareüil, Michel Morel:
A joint prosody evaluation of French text-to-speech synthesis systems. 307-310 - Antonio Bonafonte, Harald Höge, Imre Kiss, Asunción Moreno, Ute Ziegenhain, Henk van den Heuvel, Horst-Udo Hain, Xia S. Wang, Marie-Neige Garcia:
TC-STAR: Specifications of Language Resources and Evaluation for Speech Synthesis. 311-314 - Patrick Paroubek, Isabelle Robba, Anne Vilnat, Christelle Ayache:
Data, Annotations and Measures in EASY the Evaluation Campaign for Parsers of French. 315-320 - Günter Neumann, Berthold Crysmann:
Exploring HPSG-based Treebanks for Probabilistic Parsing HPSG grammar extraction. 321-326 - Rajen Subba, Barbara Di Eugenio, Elena Terenzi:
Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations. 327-332 - Brian Roark, Mary P. Harper, Eugene Charniak, Bonnie J. Dorr, Mark Johnson, Jeremy G. Kahn, Yang Liu, Mari Ostendorf, John Hale, Anna Krasnyanskaya, Matthew Lease, Izhak Shafran, Matthew G. Snover, Robin Stewart, Lisa Yung:
SParseval: Evaluation Metrics for Parsing Speech. 333-338 - Alexandre Denis, Matthieu Quignard, Guillaume Pitel:
A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation. 339-344 - Tristan van Rullen, Philippe Blache, Jean-Marie Balfourier:
Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy. 345-350 - Cédrick Fairon, Sébastien Paumier:
A translated corpus of 30, 000 French SMS. 351-354 - Irene Castellón, Ana Fernández Montraveta, Glòria Vázquez, Laura Alonso Alemany, Joan Antoni Capilla:
The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level. 355-358 - Stefan Klatt:
A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German. 359-364 - Kentaro Inui, Toru Hirano, Ryu Iida, Atsushi Fujita, Yuji Matsumoto:
Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences. 365-368 - Martin Forst, Ronald M. Kaplan:
The importance of precise tokenizing for deep grammars. 369-372 - Jaroslava Hlavácová:
New Approach to Frequency Dictionaries - Czech Example. 373-378 - Fu Lee Wang, Xiaotie Deng, Feng Zou:
Towards Unified Chinese Segmentation Algorithm. 379-384 - Felix Pîrvan, Dan Tufis:
Tagset Mapping and Statistical Training Data Cleaning-up. 385-390 - Nick Campbell, Toshiyuki Sadanobu, Masataka Imura, Naoto Iwahashi, Noriko Suzuki, Damien Douxchamps:
Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow. 391-394 - Donna K. Byron, Eric Fosler-Lussier:
The OSU Quake 2004 corpus of two-party situated problem-solving dialogs. 395-400 - Anders Green, Helge Hüttenrauch, Elin Anna Topp, Kerstin Severinson Eklundh:
Developing a ContextualizedMultimodal Corpus for Human-Robot Interaction. 401-406 - Saturnino Luz, Matt-Mouley Bouamrane, Masood Masoodian:
Gathering a corpus of multimodal computer-mediated meetings. 407-412 - Alina Andreevskaia, Sabine Bergler:
Semantic Tag Extraction from WordNet Glosses. 413-416 - Andrea Esuli, Fabrizio Sebastiani:
SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining. 417-422 - Carlo Strapparava, Alessandro Valitutti, Oliviero Stock:
The Affective Weight of Lexicon. 423-426 - Maite Taboada, Caroline Anthony, Kimberly D. Voll:
Methods for Creating Semantic Orientation Dictionaries. 427-432 - Felice Dell'Orletta, Alessandro Lenci, Simonetta Montemagni, Vito Pirrelli:
Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment. 433-438 - Václav Novák, Jan Hajic:
Perspectives of Turning Prague Dependency Treebank into a Knowledge Base. 439-442 - Mohamed Maamouri, Ann Bies, Tim Buckwalter, Mona T. Diab, Nizar Habash, Owen Rambow, Dalila Tabessi:
Developing and Using a Pilot Dialectal Arabic Treebank. 443-448 - Marie-Catherine de Marneffe, Bill MacCartney, Christopher D. Manning:
Generating Typed Dependency Parses from Phrase Structure Parses. 449-454 - Antonietta Alonge:
The Italian Metaphor Database. 455-460 - Rainer Osswald, Hermann Helbig, Sven Hartrumpf:
The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon. 461-464 - Serge Sharoff, Bogdan Babych, Anthony Hartley:
Using collocations from comparable corpora to find translation equivalents. 465-470 - Manfred Sailer, Beata Trawinski:
The Collection of Distributionally Idiosyncratic Items: A Multilingual Resource for Linguistic Research. 471-474 - Dan Stefanescu, Dan Tufis:
Aligning Multilingual Thesauri. 475-478 - Gianmaria Ajani, Guido Boella, Leonardo Lesmo, Marco Martin, A. Mazze:
A Development Tool For Multilingual Ontology-based Conceptual. 479-484 - Baden Hughes, Timothy Baldwin, Steven Bird, Jeremy Nicholson, Andrew MacKinlay:
Reconsidering Language Identification for Written Language Resources. 485-488 - Xiaoyi Ma:
Champollion: A Robust Parallel Text Sentence Aligner. 489-492 - John Niekrasz, Alexander Gruenstein:
NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora. 493-498 - Katerina Pastra:
Beyond Multimedia Integration: corpora and annotations for cross-media decision mechanisms. 499-504 - Niels Ole Bernsen, Laila Dybkjær, Svend Kiilerich:
H. C. Andersen Conversation Corpus. 505-510 - Daniel Sonntag, Massimo Romanelli:
A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications. 511-516 - Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Padó:
SALTO - A Versatile Multi-Level Annotation Tool. 517-520 - Magesh Balasubramanya, Michael Higgins, Peter Lucas, Jeffrey Senn, Dominic Widdows:
Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources. 521-526 - Katrin Erk, Sebastian Padó:
Shalmaneser - A Toolchain For Shallow Semantic Parsing. 527-532 - Mustafa Yaseen, Mohammed Attia, Bente Maegaard, Khalid Choukri, Niklas Paulsson, S. Haamid, Steven Krauwer, Chomicha Bendahman, Hanne Fersøe, Mohsen A. Rashwan, Bassam Haddad, Chafic Mokbel, Abdelhak Mouradi, A. Al-Kufaishi, Mostafa Shahin, Noureddine Chenfour, Ahmed Ragheb:
Building Annotated Written and Spoken Arabic LRs in NEMLAR Project. 533-538 - Serge Sharoff:
A Uniform Interface to Large-Scale Linguistic Resources. 539-542 - Dragoç Ciobanu, Tony Hartley, Serge Sharoff:
Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language. 543-548 - Anna Feldman, Jirka Hana, Chris Brew:
A Cross-language Approach to Rapid Creation of New Morpho-syntactically Annotated Resources. 549-554 - Jonas Kuhn, Michael Jellinghaus:
Multilingual parallel treebanking: a lean and flexible approach. 555-558 - Owen Rambow, Bonnie J. Dorr, David Farwell, Rebecca Green, Nizar Habash, Stephen Helmreich, Eduard H. Hovy, Lori S. Levin, Keith J. Miller, Teruko Mitamura, Florence Reeder, Advaith Siddharthan:
Parallel Syntactic Annotation of Multiple Languages. 559-564 - Jonas Granfeldt, Pierre Nugues, Malin Ågren, Jonas Thulin, Emil Persson, Suzanne Schlyter:
CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling. 565-570 - Christophe Jouis:
Hierarchical Relationships "is-a": Distinguishing Belonging, Inclusion and Part/of Relationships. 571-574 - Olivier Ferret:
Building a network of topical relations from a corpus. 575-580 - André Le Meur, Marie-Jeanne Derouin:
Lemma-oriented dictionaries, concept-oriented terminology and translation memories. 581-586 - Ya-Min Chou, Chu-Ren Huang:
Hantology-A Linguistic Resource for Chinese Language Processing and Studying. 587-590 - Jean-François Couturier, Sylvain Neuvel, Patrick Drouin:
Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts. 591-594 - Beatrice Alex, Malvina Nissim, Claire Grover:
The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text. 595-600 - Baden Hughes:
Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community. 601-604 - Daan Broeder, Andreas Claus, Freddy Offenga, Romuald Skiba, Paul Trilsbeek, Peter Wittenburg:
LAMUS: the Language Archive Management and Upload System. 605-608 - Maria Gavrilidou, Penny Labropoulou, Stelios Piperidis, Voula Giouli, Nicoletta Calzolari, Monica Monachini, Claudia Soria, Khalid Choukri:
Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology. 609-614 - Felix Sasaki:
Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources. 615-620 - Nancy Ide, Keith Suderman:
Integrating Linguistic Resources: The American National Corpus Model. 621-624 - Peter Wittenburg, Daan Broeder, Wolfgang Klein, Stephen C. Levinson, Laurent Romary:
Foundations of Modern Language Resource Archives. 625-628 - Ann Bies, Stephanie M. Strassel, Haejoong Lee, Kazuaki Maeda, Seth Kulick, Yang Liu, Mary P. Harper, Matthew Lease:
Linguistic Resources for Speech Parsing. 629-634 - Kiyotaka Uchimoto, Ryoji Hamabe, Takehiko Maruyama, Katsuya Takanashi, Tatsuya Kawahara, Hitoshi Isahara:
Dependency-structure Annotation to Corpus of Spontaneous Japanese. 635-638 - Olivier Baude, Michel Jacobson, Atanas Tchobanov, Richard Walter:
Interoperability of audio corpora : the case of the French corpora. 639-642 - Miriam Voghera, Francesco Cutugno:
An observatory on Spoken Italian linguistic resources and descriptive standards. 643-646 - Annalisa Sandrelli, Claudio Bendazzoli:
Tagging a Corpus of Interpreted Speeches: the European Parliament Interpreting Corpus (EPIC). 647-652 - Kazuaki Maeda, Christopher Cieri, Kevin Walker:
Low-cost Customized Speech Corpus Creation for Speech Technology Applications. 653-656 - Abdelrahim Abdelsapor, Noha Adly, Kareem Darwish, Ossama Emam, Walid Magdy, Magdi Nagi:
Building a Heterogeneous Information Retrieval Collection of Printed Arabic Documents. 657-662 - Hercules Dalianis, Bart Jongejan:
Hand-crafted versus Machine-learned Inflectional Rules: The Euroling-SiteSeeker Stemmer and CST's Lemmatiser. 663-666 - Lun-Wei Ku, Yu-Ting Liang, Hsin-Hsi Chen:
Tagging Heterogeneous Evaluation Corpora for Opinionated Tasks. 667-670 - Atsushi Fujii, Makoto Iwayama, Noriko Kando:
Test Collections for Patent Retrieval and Patent Classification in the Fifth NTCIR Workshop. 671-674 - Carol Peters:
The Impact of Evaluation on Multilingual Information Retrieval System Development. 675-678 - Shisanu Tongchim, Prapass Srichaivattana, Virach Sornlertlamvanich, Hitoshi Isahara:
Blind Evaluation for Thai Search Engines. 679-684 - Jesús Giménez, Enrique Amigó:
IQmt: A Framework for Automatic Machine Translation Evaluation. 685-690 - Andrei Popescu-Belis, Paula Estrella, Margaret King, Nancy L. Underwood:
A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation. 691-696 - David Vilar, Jia Xu, Luis Fernando D'Haro, Hermann Ney:
Error Analysis of Statistical Machine Translation Output. 697-702 - Kiyotaka Uchimoto, Naoko Hayashida, Toru Ishida, Hitoshi Isahara:
Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence. 703-708 - Alexandre Patry, Fabrizio Gotti, Philippe Langlais:
MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation. 709-714 - Arne Mauser, Evgeny Matusov, Hermann Ney:
Training a Statistical Machine Translation System without GIZA++. 715-720 - Gerhard Fliedner:
Towards Natural Interactive Question Answering. 721-726 - Davide Buscaldi, Paolo Rosso:
Mining Knowledge fromWikipedia for the Question Answering task. 727-730 - Véronique Moriceau:
Language Challenges for Data Fusion in Question-Answering. 731-736 - Liang Zhou, Chin-Yew Lin, Eduard H. Hovy:
Summarizing Answers for Complicated Questions. 737-740 - Sanda M. Harabagiu, Cosmin Adrian Bejan:
An Answer Bank for Temporal Inference. 741-746 - Gregory Marton, Boris Katz:
Using Semantic Overlap Scoring in Answering TREC Relationship Questions. 747-752 - Shuichi Itahashi, Chiu-yu Tseng, Satoshi Nakamura:
Oriental COCOSDA: Past, Present and Future. 753-756 - Bente Maegaard, Jens Erik Fenstad, Lars Ahrenberg, Knut Kvale, Katarina Heimann Mühlenbock, Bernt-Erik Heid:
KUNSTI - Knowledge Generation for Norwegian Language Technology. 757-760 - Elisabeth D'Halleweyn, Jan Odijk, Lisanne Teunissen, Catia Cucchiarini:
The Dutch-Flemish HLT Programme STEVIN: Essential Speech and Language Technology Resources. 761-766 - Stéphane Chaudiron, Joseph Mariani:
Techno-langue: The French National Initiative for Human Language Technologies (HLT). 767-772 - Bente Maegaard, Steven Krauwer, Khalid Choukri, Lise Damsgaard Jørgensen:
The BLARK concept and BLARK for Arabic. 773-778 - Christopher Cieri, Mark Liberman:
More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities. 779-782 - Manny Rayner, Pierrette Bouillon, Beth Ann Hockey, Nikos Chatzichrisafis:
REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications. 783-788 - Tomoyuki Kato, Tomiki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria. 789-792 - Julie Mauclair, Yannick Estève, Simon Petit-Renaud, Paul Deléglise:
Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions. 793-798 - Dafydd Gibbon, Shu-Chuan Tseng:
Discourse functions of duration in Mandarin: resource design and implementation. 799-802 - Jonathan G. Fiscus, Jerome Ajot, Nicolas Radde, Christophe Laprun:
Multiple Dimension Levenshtein Edit Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech. 803-808 - Susanne Burger, Zachary Sloane, Jie Yang:
Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages. 809-814 - Richard Johansson, Pierre Nugues:
Construction of a FrameNet Labeler for Swedish Text. 815-820 - Magnus Sahlgren:
Towards pertinent evaluation methodologies for word-space models. 821-824 - Sabine Schulte im Walde:
Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet. 825-830 - Rebecca J. Passonneau:
Measuring Agreement on Set-valued Items (MASI) for Semantic and Pragmatic Annotation. 831-836 - Anna Rumshisky, James Pustejovsky:
Inducing Sense-Discriminating Context Patterns from Sense-Tagged Corpora. 837-840 - Andreas Eisele:
Parallel Corpora and Phrase-Based Statistical Machine Translation for New Language Pairs via Multiple Intermediaries. 845-848 - Yoshihiko Nitta, Masashi Saraki, Satoru Ikehara:
Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy. 849-854 - Sasa Hasan, Anas El Isbihani, Hermann Ney:
Creating a Large-Scale Arabic to French Statistical MachineTranslation System. 855-858 - Xiaoyi Ma, Christopher Cieri:
Corpus Support for Machine Translation at LDC. 859-864 - Olena Medelyan, Stefan Schulz, Jan Paetzold, Michael Poprat, Kornél G. Markó:
Language Specific and Topic Focused Web Crawling. 865-868 - Dan Tufis, Elena Irimia:
RoCo-News: A Hand Validated Journalistic Corpus of Romanian. 869-872 - Claire Grover, Richard Tobin:
Rule-Based Chunking and Reusability. 873-878 - Eva Hajicová, Petr Sgall:
Corpus Annotation as a Test of a Linguistic Theory. 879-884 - Roberta Catizone, Angelo Dalli, Yorick Wilks:
Evaluating Automatically Generated Timelines from the Web. 885-888 - Oana Postolache, Dan Cristea, Constantin Orasan:
Transferring Coreference Chains through Word Alignment. 889-892 - Olga Uryupina:
Coreference Resolution with and without Linguistic Knowledge. 893-898 - Eduard H. Hovy, Chin-Yew Lin, Liang Zhou, Junichi Fukumoto:
Automated Summarization Evaluation with Basic Elements. 899-902 - Juan José Rodríguez Soler, Pedro Concejero Cerezo, Carlos Lázaro Ávila, Daniel Tapias Merino:
Usability evaluation of 3G multimodal services in Telefónica Móviles España. 903-908 - Hans Dybkjær, Laila Dybkjær:
Act-Topic Patterns for Automatically Checking Dialogue Models. 909-914 - Djamel Mostefa, Marie-Neige Garcia, Khalid Choukri:
Evaluation of multimodal components within CHIL: The evaluation packages and results. 915-918 - Harry Bunt:
Dimensions in Dialogue Act Annotation. 919-924 - Attila Novák:
Morphological Tools for Six Small Uralic Languages. 925-930 - Joris Vaneyghen, Guy De Pauw, Dirk Van Compernolle, Walter Daelemans:
A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora. 931-934 - Margot Mieskes, Michael Strube:
Part-of-Speech Tagging of Transcribed Speech. 935-938 - Baden Hughes, Dafydd Gibbon, Thorsten Trippel:
Feature-based Encoding and Querying Language Resources with Character Semantics. 939-944 - Widad Mustafa El Hadi, Ismaïl Timimi, Marianne Dabbadie, Khalid Choukri, Olivier Hamon, Yun-Chuang Chiao:
Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model. 945-948 - Thatsanee Charoenporn, Canasai Kruengkrai, Thanaruk Theeramunkong, Virach Sornlertlamvanich, Hitoshi Isahara:
Word Knowledge Acquisition for Computational Lexicon Construction. 949-954 - Mathieu Lafourcade:
Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence. 955-958 - Naoaki Okazaki, Sophia Ananiadou:
Clustering acronyms in biomedical text for disambiguation. 959-962 - Bernardo Magnini, Emanuele Pianta, Christian Girardi, Matteo Negri, Lorenza Romano, Manuela Speranza, Valentina Bartalesi Lenzi, Rachele Sprugnoli:
I-CAB: the Italian Content Annotation Bank. 963-968 - Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Padó:
The SALSA Corpus: a German Corpus Resource for Lexical Semantics. 969-974 - Harris Papageorgiou, Elina Desipri, Maria Koutsombogera, Kanella Pouli, Prokopis Prokopidis:
Adding multi-layer semantics to the Greek Dependency Treebank. 975-980 - Eneko Agirre, Izaskun Aldezabal, Jone Etxeberria, Eli Pociello:
A Preliminary Study for Building the Basque PropBank. 981-986 - Jerneja Zganec-Gros, Varja Cvetko-Oresnik, Primoz Jakopin, Ales Mihelic:
SI-PRON: A Pronunciation Lexicon for Slovenian. 987-992 - Stefan Schaden, Ute Jekosch:
“Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus. 993-998 - David Graff, Tim Buckwalter, Mohamed Maamouri, Hubert Jin:
Lexicon Development for Varieties of Spoken Colloquial Arabic. 999-1004 - Thomas Pellegrini, Lori Lamel:
Experimental detection of vowel pronunciation variants in Amharic. 1005-1008 - Milena Slavcheva:
Semantic Descriptors: The Case of Reflexive Verbs. 1009-1014 - Anna Korhonen, Yuval Krymolowski, Ted Briscoe:
A Large Subcategorization Lexicon for Natural Language Processing Applications. 1015-1020 - Patrick Saint-Dizier:
PrepNet: a Multilingual Lexical Description of Prepositions. 1021-1026 - Karin Kipper, Anna Korhonen, Neville Ryant, Martha Palmer:
Extending VerbNet with Novel Verb Classes. 1027-1032 - Aminul Islam, Diana Inkpen:
Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words. 1033-1038 - Dominic Widdows, Adil Toumouh, Beate Dorow, Ahmed Lehireche:
Ongoing Developments in Automatically Adapting Lexical Resources to the Biomedical Domain. 1039-1044 - Manolis Maragoudakis, Katia Kermanidis, Aristogiannis Garbis, Nikos Fakotakis:
Dealing with Imbalanced Data using Bayesian Techniques. 1045-1050 - Ben Medlock:
An Introduction to NLP-based Textual Anonymisation. 1051-1056 - Rita Marinelli, Remo Bindi:
Proper Names and Linguistic Dynamics. 1057-1060 - Vasile Rus, Art Graesser:
The Look and Feel of a Confident Entailer. 1061-1064 - Marie-Claude L'Homme, Hee-Sook Bae:
A Methodology for Developing Multilingual Resources for Terminology. 1065-1070 - Goran Nenadic, Naoaki Okazaki, Sophia Ananiadou:
Towards a terminological resource for biomedical text mining. 1071-1076 - Boris V. Dobrov, Natalia V. Loukachevitch:
Development of Linguistic Ontology on Natural Sciences and Technology. 1077-1082 - Ronny Melz, Pum-Mo Ryu, Key-Sun Choi:
Compiling large language resources using lexical similarity metrics for domain taxonomy learning. 1083-1088 - Wim Peters, Maria-Teresa Sagri, Daniela Tiscornia, Sara Castagnoli:
The LOIS Project. 1089-1094 - Stefanie Anstein, Gerhard Kremer, Uwe Reyle:
Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology. 1095-1098 - Chloé Clavel, Ioana Vasilescu, Laurence Devillers, Thibaut Ehrette, Gaël Richard:
Fear-type emotions of the SAFE Corpus: annotation issues. 1099-1104 - Laurence Devillers, Roddy Cowie, Jean-Claude Martin, Ellen Douglas-Cowie, Sarkis Abrilian, Margaret McRorie:
Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches. 1105-1110 - Kornel Laskowski, Susanne Burger:
Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus. 1111-1116 - Dennis Reidsma, Dirk Heylen, Roeland Ordelman:
Annotating Emotions in Meetings. 1117-1122 - Thurid Vogt, Elisabeth André:
Improving Automatic Emotion Recognition from Speech via Gender Differentiaion. 1123-1126 - Jean-Claude Martin, George Caridakis, Laurence Devillers, Kostas Karpouzis, Sarkis Abrilian:
Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews. 1127-1132 - Laurent Gillard, Patrice Bellot, Marc El-Bèze:
Question Answering Evaluation Survey. 1133-1138 - Jeongwoo Ko, Laurie Hiyakumoto, Eric Nyberg:
Exploiting Multiple Semantic Resources for Answer Selection. 1139-1142 - Hideki Shima, Mengqiu Wang, Frank Lin, Teruko Mitamura:
Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering. 1143-1146 - V. Finley Lacatusu, Andrew Hickl, Sanda M. Harabagiu:
Impact of Question Decomposition on the Quality of Answer Summaries. 1147-1152 - Bernardo Magnini, Danilo Giampicollo, Lili Aunimo, Christelle Ayache, Petya Osenova, Anselmo Peñas, Maarten de Rijke, Bogdan Sacaleanu, Diana Santos, Richard F. E. Sutcliffe:
The Multilingual Question Answering Track at CLEF. 1153-1156 - Christelle Ayache, Brigitte Grau, Anne Vilnat:
EQueR: the French Evaluation campaign of Question-Answering Systems. 1157-1160 - Anders Nøklestad, Øystein Reigem, Christer Johansson:
Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains. 1161-1166 - Laura Hasler, Constantin Orasan, Karin Naumann:
NPs for Events: Experiments in Coreference Annotation. 1167-1172 - Tommaso Caselli, Irina Prodanof:
Annotating Bridging Anaphors in Italian: in Search of Reliability. 1173-1176 - Daniela Goecke, Andreas Witt:
Exploiting logical document structure for anaphora resolution. 1177-1180 - Hisami Suzuki, Gary Kacmarcik:
RefRef: A Tool for Viewing and Exploring Coreference Space. 1181-1184 - Whitney Gegg-Harrison, Donna K. Byron:
PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit. 1185-1190 - Massimo Poesio, Mijail A. Kabadjov, P. Goux, Udo Kruschwitz, Elizabeth Bishop, Louise Corti:
An Anaphora Resolution-Based Anonymization Module. 1191-1193 - Georgiana Puscasu, Ruslan Mitkov:
If “it” were “then”, then when was “it”? Establishing the anaphoric role of “then”. 1194-1199 - Dimitrios Kokkinakis:
Collection, Encoding and Linguistic Processing of a Swedish Medical Corpus - The MEDLEX Experience. 1200-1205 - Nelleke Oostdijk, Lou Boves:
User requirements analysis for the design of a reference corpus of written Dutch. 1206-1211 - Corinna Onelli, Domenico Proietti, Corrado Seidenari, Fabio Tamburini:
The DiaCORIS project: a diachronic corpus of written Italian. 1212-1215 - Diana Santos, Susana Inácio:
Annotating COMPARA, a Grammar-aware Parallel Corpus. 1216-1221 - David Guthrie, Ben Allison, Wei Liu, Louise Guthrie, Yorick Wilks:
A Closer Look at Skip-gram Modelling. 1222-1225 - Aimilios Chalamandaris, Athanassios Protopapas, Pirros Tsiakoulis, Spyros Raptis:
All Greek to me! An automatic Greeklish to Greek transliteration system. 1226-1229 - Agam Patel, Dragomir R. Radev:
Lexical similarity can distinguish between automatic and manual translations. 1230-1235 - Ondrej Bojar, Magdalena Prokopová:
Czech-English Word Alignment. 1236-1239 - Lea Cyrus:
Building a resource for studying translation shifts. 1240-1245 - Andi Wu, Kirk Lowery:
A Hebrew Tree Bank Based on Cantillation Marks. 1246-1249 - Stephan Oepen, Jan Tore Lønning:
Discriminant-Based MRS Banking. 1250-1255 - Ivana Kruijff-Korbayová, Klára Chvátalová, Oana Postolache:
Annotation Guidelines for Czech-English Word Alignment. 1256-1261 - Hiroyuki Kaji, Mariko Watanabe:
Automatic Construction of Japanese WordNet. 1262-1267 - Reinhard Rapp, Carlos Martín-Vide:
Example-Based Machine Translation Using a Dictionary of Word Pairs. 1268-1273 - Bettina Schrader:
Non-probabilistic alignment of rare German and English nominal expressions. 1274-1277 - Maja Popovic, Hermann Ney:
POS-based Word Reorderings for Statistical Machine Translation. 1278-1283 - Vincent Vandeghinste, Ineke Schuurman, Michael Carl, Stella Markantonatou, Toni Badia:
METIS-II: Machine Translation for Low Resource Languages. 1284-1289 - Radu Ion, Alexandru Ceausu, Dan Tufis:
Dependency-Based Phrase Alignment. 1290-1293 - Márton Miháltz, Gábor Pohl:
Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation. 1294-1297 - Jorge Civera, Alfons Juan:
Bilingual Machine-Aided Indexing. 1302-1305 - Suzan Verberne, Lou Boves, Nelleke Oostdijk, Peter-Arno Coppen:
Data for question answering: The case of why. 1306-1311 - Horacio Saggion:
Multilingual Multidocument Summarization Tools and Evaluation. 1312-1317 - Horacio Saggion, Robert J. Gaizauskas:
Language Resources for Background Gathering. 1318-1321 - Wei Li, Wenjie Li, Qin Lu:
Mining Implicit Entities in Queries. 1322-1325 - Francesca Bertagna:
Representation and Inference for Open-Domain QA: Strength and Limits of two Italian Semantic Lexicons. 1326-1331 - Roser Saurí, Marc Verhagen, James Pustejovsky:
SlinkET: A Partial Modal Parser for Events. 1332-1337 - Vít Novácek, Pavel Smrz, Jan Pomikálek:
Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator. 1338-1343 - Daisuke Kawahara, Sadao Kurohashi:
Case Frame Compilation from the Web using High-Performance Computing. 1344-1347 - Benoît Sagot, Lionel Clément, Éric Villemonte de la Clergerie, Pierre Boullier:
The Lefff 2 syntactic lexicon for French: architecture, acquisition, use. 1348-1351 - Kyoko Kanzaki, Qing Ma, Eiko Yamamoto, Hitoshi Isahara:
Semantic Analysis of Abstract Nouns to Compile a Thesaurus of Adjectives. 1352-1357 - Ben Wellner, Marc B. Vilain:
Leveraging Machine Readable Dictionaries in Discriminative Sequence Models. 1358-1361 - Núria Bel, Sergio Espeja, Montserrat Marimon:
New tools for the encoding of lexical data extracted from corpus. 1362-1367 - Yasunori Ohishi, Katunobu Itou, Kazuya Takeda, Atsushi Fujii:
Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus. 1368-1371 - Maria Teresa Pazienza, Marco Pennacchiotti, Fabio Massimo Zanzotto:
Mixing WordNet, VerbNet and PropBank for studying verb relations. 1372-1377 - Juri D. Apresjan, Igor Boguslavsky, Boris Iomdin, Leonid L. Iomdin, Andrei Sannikov, Victor G. Sizov:
A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects. 1378-1381 - Nianwen Xue:
Annotating the Predicate-Argument Structure of Chinese Nominalizations. 1382-1387 - Saso Dzeroski, Tomaz Erjavec, Nina Ledinek, Petr Pajas, Zdenek Zabokrtský, Anreja Zele:
Towards a Slovene Dependency Treebank. 1388-1391 - Joakim Nivre, Jens Nilsson, Johan Hall:
Talbanken05: A Swedish Treebank with Phrase Structure and Dependency Annotation. 1392-1395 - Raffaella Bernardi, Andrea Bolognesi, Corrado Seidenari, Fabio Tamburini:
POS tagset design for Italian. 1396-1401 - Tomoko Ohta, Yuka Tateisi, Jin-Dong Kim, Akane Yakushiji, Jun'ichi Tsujii:
Linguistic and Biological Annotations of Biological Interaction Events. 1402-1405 - Nerea Areta, Antton Gurrutxaga, Igor Leturia, Ziortza Polin, Rafa Saiz, Iñaki Alegria, Xabier Artola, Arantza Díaz de Ilarraza, Nerea Ezeiza, Aitor Sologaistoa, Aitor Soroa, Andoni Valverde:
Structure, Annotation and Tools in the Basque ZT Corpus. 1406-1411 - Masaki Noguchi, Hiroshi Ichikawa, Taiichi Hashimoto, Takenobu Tokunaga:
A new approach to syntactic annotation. 1412-1417 - Yuji Matsumoto, Masayuki Asahara, Kiyota Hashimoto, Yukio Tono, Akira Ohtani, Toshio Morita:
An Annotated Corpus Management Tool: ChaKi. 1418-1421 - Yunqing Xia, Kam-Fai Wong, Wenjie Li:
Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach. 1422-1427 - Jaeyoung Jung, Maki Miyake, Hiroyuki Akama:
Recurrent Markov Cluster (RMCL) Algorithm for the Refinement of the Semantic Network. 1428-1431 - Véronique Hoste, Guy De Pauw:
KNACK-2002: a Richly Annotated Corpus of Dutch Written Text. 1432-1437 - Florbela Barreto, António Branco, Eduardo Ferreira, Amália Mendes, Maria Fernanda Bacelar do Nascimento, Filipe Nunes, João Ricardo Silva:
Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project. 1438-1443 - Harry Bunt, Amanda Schiffrin:
Methodological Aspects of Semantic Annotation. 1444-1449 - Ming-Shun Lin, Hsin-Hsi Chen:
Constructing a Named Entity Ontology from Web Corpora. 1450-1453 - Khurshid Ahmad, Maria Teresa Musacchio, Giuseppe Palumbo:
Ontological and Terminological Commitments and the Discourse of Specialist Communities. 1454-1459 - Leonardo Lesmo, Livio Robaldo:
From Natural Language to Databases via Ontologies. 1460-1465 - Alberto Simões, José João Almeida:
T2O - Recycling Thesauri into a Multilingual Ontology. 1466-1471 - Luciana Bordoni, Tiziana Mazzoli:
Towards an Ontology for Art and Colours. 1472-1475 - Mats Lundälv, Katarina Heimann Mühlenbock, Bengt Farre, Annika Brännström:
SYMBERED - a Symbol-Concept Editing Tool. 1476-1481 - Yoji Kiyota, Hiroshi Nakagawa:
A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames. 1482-1487 - Hennie Brugman, Véronique Malaisé, Luit Gazendam:
A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs. 1488-1491 - Thierry Declerck, Asunción Gómez-Pérez, Ovidiu Vela, Zeno Gantner, David Manzano-Macho:
Multilingual Lexical Semantic Resources for Ontology Translation. 1492-1495 - Anna Estellés, Amparo Alcina, Victoria Soler:
Retrieving Terminological Data from the TxtCeram Tagged Domain Corpus: A First Step towards a Terminological Ontology. 1496-1501 - Luís Sarmento, Belinda Maia, Diana Santos, Ana Sofia Pinto, Luís Cabral:
Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering. 1502-1505 - Yasuko Senda, Yasusi Sinohara, Manabu Okumura:
Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing. 1506-1509 - Alessandro Moschitti, Roberto Basili:
A Tree Kernel approach to Question and Answer Classification in Question Answering Systems. 1510-1513 - Irene M. Cramer, Jochen L. Leidner, Dietrich Klakow:
Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia. 1514-1519 - Luís Fernando Costa, Luís Sarmento:
Component Evaluation in a Question Answering System. 1520-1523 - Brigitte Grau, Anne-Laure Ligozat, Isabelle Robba, Anne Vilnat, Laura Monceaux:
FRASQUES: A Question Answering system in the EQueR evaluation campaign. 1524-1529 - Tomoyosi Akiba:
Exploiting Dynamic Passage Retrieval for Spoken Question Recognition and Context Processing towards Speech-driven Information Access Dialogue. 1530-1535 - Matthew W. Bilotti, Eric Nyberg:
Evaluation for Scenario Question Answering Systems. 1536-1541 - Martin Hassel, Jonas Sjöbergh:
Towards Holistic Summarization - Selecting Summaries, Not Sentences. 1542-1547 - Constantin Orasan, Laura Hasler:
Computer-aided summarisation - what the user really wants. 1548-1551 - Peter Berck, Albert Russel:
ANNEX - a web-based Framework for Exploiting Annotated Media Resources. 1552-1555 - Peter Wittenburg, Hennie Brugman, Albert Russel, Alexander Klassmann, Han Sloetjes:
ELAN: a Professional Framework for Multimodality Research. 1556-1559 - Andrei Popescu-Belis, Maria Georgescul:
TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface. 1560-1565 - Fabian Behrens, Jan-Torsten Milde:
The Eclipse Annotator: an extensible system for multimodal corpus creation. 1566-1569 - Kazuaki Maeda, Haejoong Lee, Julie Medero, Stephanie M. Strassel:
A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit. 1570-1573 - Khurshid Ahmad, Craig Bennett, Tim Oliver:
Visual Surveillance and Video Annotation and Description. 1574-1577 - Nina Grønnum:
DanPASS - A Danish Phonetically Annotated Spontaneous Speech Corpus. 1578-1583 - Yuki Irie, Shigeki Matsubara, Nobuo Kawaguchi, Yukiko Yamaguchi, Yasuyoshi Inagaki:
Layered Speech-Act Annotation for Spoken Dialogue Corpus. 1584-1589 - Tomohiro Ohno, Shigeki Matsubara, Hideki Kashioka, Naoto Kato, Yasuyoshi Inagaki:
A Syntactically Annotated Corpus of Japanese Spoken Monologue. 1590-1595 - Isabella Chiari:
Slips and errors in spoken data transcription. 1596-1599 - Dafydd Gibbon, Flaviane Romani Fernandes, Thorsten Trippel:
A BLARK extension for temporal annotation mining. 1600-1605 - Patrizia Paggio:
Annotating Information Structure in a Corpus of Spoken Danish. 1606-1609 - Tien-Ping Tan, Laurent Besacier:
A French Non-Native Corpus for Automatic Speech Recognition. 1610-1613 - Anna Bogacka, Katarzyna Dziubalska-Kolaczyk, Grzegorz Krynicki, Dawid Pietrala, Mikolaj Wypych:
General and Task-Specific Corpus Resources for Polish Adult Learners of English. 1614-1619 - Andrej Zgank, Darinka Verdonik, Alexandra Zögling Markus, Zdravko Kacic:
SINOD - Slovenian non-native speech database. 1620-1623 - Vicente Alabau, Carlos D. Martínez-Hinarejos:
Bilingual speech corpus in two phonetically similar languages. 1624-1627 - Moritz Kaiser, Hannes Mögele, Florian Schiel:
Bikers Accessing the Web: The SmartWeb Motorbike Corpus. 1628-1631 - Asunción Moreno, Albert Febrer, Lluís Màrquez:
Generation of Language Resources for the Development of Speech Technologies in Catalan. 1632-1635 - José-Miguel Benedí, Eduardo Lleida, Amparo Varona, María José Castro, Isabel Galiano, Raquel Justo, Iñigo López de Letona, Antonio Miguel:
Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA. 1636-1639 - Sophie Rosset, Sandra Petel:
The Ritel Corpus - An annotated Human-Machine open-domain question answering spoken dialog corpus. 1640-1643 - Hynek Boril, Tomás Boril, Petr Pollák:
Methodology of Lombard Speech Database Acquisition: Experiences with CLSD. 1644-1647 - Eline Westerhout, Paola Monachesi:
A pilot study for a Corpus of Dutch Aphasic Speech (CoDAS). 1648-1653 - Renata Savy, Francesco Cutugno, Claudia Crocco:
Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb). 1654-1659 - Folkert de Vriend, Lou Boves, Henk van den Heuvel, Roeland van Hout, Joep Kruijsen, Jos Swanenberg:
A Unified Structure for Dutch Dialect Dictionary Data. 1660-1665 - Mathieu Mangeot, Antoine Chalvin:
Dictionary Building with the Jibiki Platform: the GDEF case. 1666-1669 - Viktor Trón, Péter Halácsy, Péter Rebrus, András Rung, Péter Vajda, Eszter Simon:
Morphdb.hu: Hungarian lexical database and morphological grammar. 1670-1673 - Bruno Cartoni:
Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes. 1674-1677 - Tomaz Erjavec, Darja Fiser:
Building Slovene WordNet. 1678-1683 - Stefan Schulz, Kornél G. Markó, Philipp Daumke, Udo Hahn, Susanne Hanser, Percy Nohama, Roosewelt L. Andrade, Edson José Pacheco, Martin Romacker:
Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon. 1684-1687 - Sanni Nimb:
LEXADV - a multilingual semantic Lexicon for Adverbs. 1688-1691 - Cvetana Krstev, Ranka Stankovic, Dusko Vitas, Ivan Obradovic:
WS4LR: A Workstation for Lexical Resources. 1692-1697 - Alexander Geyken, Norbert Schrader:
LexikoNet - a lexical database based on type and role hierarchies. 1698-1701 - James Pustejovsky, Catherine Havasi, Jessica Littman, Anna Rumshisky, Marc Verhagen:
Towards a Generative Lexical Resource: The Brandeis Semantic Ontology. 1702-1705 - Toshiyuki Kanamaru, Masaki Murata, Hitoshi Isahara:
Creation of a Japanese Adverb Dictionary that Includes Information on the Speaker's Communicative Intention Using Machine Learning. 1706-1709 - Adriana Roventini:
Linking Verbal Entries of Different Lexical Resources. 1710-1715 - Nilda Ruimy:
Merging two Ontology-based Lexical Resources. 1716-1721 - Sonja Bosch, Laurette Pretorius, Jackie Jones:
Towards machine-readable lexicons for South African Bantu languages. 1722-1727 - Markéta Lopatková, Zdenek Zabokrtský, Karolina Skwarska:
Valency Lexicon of Czech Verbs: Alternation-Based Model. 1728-1733 - Grazyna Vetulani, Zygmunt Vetulani, Tomasz Obrêbski:
Syntactic Lexicon of Polish Predicative Nouns. 1734-1737 - Ruli Manurung, Dave O'Mara, Helen Pain, Graeme Ritchie, Annalu Waller:
Building a Lexical Database for an Interactive Joke-Generator. 1738-1741 - Caroline Sporleder, Marieke van Erp, Tijn Porcelijn, Antal van den Bosch, Pim Arntzen:
Identifying Named Entities in Text Databases from the Natural History Domain. 1742-1745 - Peter Lucas, Magesh Balasubramanya, Dominic Widdows, Michael Higgins:
The Information Commons Gazetteer. 1746-1751 - Pawel P. Mazur, Robert Dale:
Named Entity Extraction with Conjunction Disambiguation. 1752-1755 - Ulrich Schäfer:
OntoNERdIE - Mapping and Linking Ontologies to Named Entity Recognition and Information Extraction Resources. 1756-1761 - Valentin Tablan, Wim Peters, Diana Maynard, Hamish Cunningham:
Creating Tools for Morphological Analysis of Sumerian. 1762-1765 - Christoph Benzmüller, Helmut Horacek, Henri Lesourd, Ivana Kruijff-Korbayová, Marvin R. G. Schiller, Magdalena Wolska:
A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material. 1766-1769 - Cristina Bosco, Vincenzo Lombardo:
Comparing linguistic information in treebank annotations. 1770-1775 - Irene Langkilde-Geary, Justin Betteridge:
A Factored Functional Dependency Transformation of the English Penn Treebank for Probabilistic Surface Generation. 1776-1781 - Adeline Nazarenko, Érick Alphonse, Julien Derivière, Thierry Hamon, Guillaume Vauvert, Davy Weissenbacher:
The ALVIS Format for Linguistically Annotated Documents. 1782-1786 - Luís Sarmento:
BACO - A large database of text and co-occurrences. 1787-1790 - Maria Fernanda Bacelar do Nascimento, José Bettencourt Gonçalves, Luísa Pereira, Antónia Estrela, Alfonso Pereira, Rui Santos, Sancho Oliveira:
The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon. 1791-1794 - Liviu P. Dinu, Anca Dinu:
On the data base of Romanian syllables and some of its quantitative and cryptographic aspects. 1795-1798 - Uwe Quasthoff, Matthias Richter, Christian Biemann:
Corpus Portal for Search in Monolingual Corpora. 1799-1802 - Ivan Berlocher, Hyun-Gue Huh, Éric Laporte, Jee-Sun Nam:
Morphological annotation of Korean with Directly Maintainable Resources. 1803-1806 - Gertjan van Noord, Ineke Schuurman, Vincent Vandeghinste:
Syntactic Annotation of Large Corpora in STEVIN. 1811-1814 - May Lai-Yin Wong:
Skeleton Parsing in Chinese: Annotation Scheme and Guidelines. 1815-1820 - Kari Tenfjord, Paul Meurer, Knut Hofland:
The ASK Corpus - a Language Learner Corpus of Norwegian as a Second Language. 1821-1824 - Andrew W. Cole:
Corpus Development and Publication. 1825-1830 - Maria Clara Paixão de Sousa, Thorsten Trippel:
Building a historical corpus for Classical Portuguese: some technological aspects. 1831-1836 - Ruska Ivanovska-Naskova:
Development of the First LRs for Macedonian: Current Projects. 1837-1841 - Tonio Wandmacher, Jean-Yves Antoine:
Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People. 1842-1845 - Nella Cucurullo, Simonetta Montemagni, Matilde Paoli, Eugenio Picchi, Eva Sassolini:
Dialectal resources on-line: the ALT-Web experience. 1846-1851 - Monica Monachini, Nicoletta Calzolari, Khalid Choukri, Jochen Friedrich, Giulio Maltese, Michele Mammini, Jan Odijk, Marisa Ulivieri:
Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian. 1852-1857 - Federico Calzolari, Eva Sassolini, Manuela Sassi, Sebastiana Cucurullo, Eugenio Picchi, Francesca Bertagna, Alessandro Enea, Monica Monachini, Claudia Soria, Nicoletta Calzolari:
Next Generation Language Resources using Grid. 1858-1861 - Marc Kemps-Snijders, Mark-Jan Nederhof, Peter Wittenburg:
LEXUS, a web-based tool for manipulating lexical resources lexicon. 1862-1865 - Freddy Offenga, Daan Broeder, Peter Wittenburg, Julien Ducret, Laurent Romary:
Metadata Profile in the ISO Data Category Registry. 1866-1869 - Hans Uszkoreit, Feiyu Xu, Jörg Steffen, Ilhan Aslan:
The pragmatic combination of different crosslingual resources. 1870-1873 - Isa Maks, Bob Boelhouwer:
Exploring opportunities for Comparability and Enrichment by Linking lexical databases. 1874-1879 - Ruifeng Xu, Qin Lu, Sujian Li:
The Design and Construction of A Chinese Collocation Bank. 1880-1885 - Leo Wanner, Margarita Alonso Ramos:
Local Document Relevance Clustering in IR Using Collocation Information. 1886-1889 - Silvie Cinková, Pavel Pecina, Petr Podveský, Pavel Schlesinger:
Semi-automatic Building of Swedish Collocation Lexicon. 1890-1893 - Nicole Grégoire:
Elaborating the parameterized Equivalence Class Method for Dutch. 1894-1899 - Amália Mendes, Sandra Antunes, Maria Fernanda Bacelar do Nascimento, João Miguel Casteleiro, Luísa Pereira, Tiago Sá:
COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions. 1900-1905 - Winston N. Anderson, Petronella M. Kotzé:
Finite state tokenisation of an orthographical disjunctive agglutinative language: The verbal segment of Northern Sotho. 1906-1911 - Magnar Brekke, Kai Innselset, Marita Kristiansen, Kari Øvsthus:
Automatic Term Extraction from Knowledge Bank of Economics. 1912-1916 - Alessandro Panunzi, Marco Fabbri, Massimo Moneglia:
Integrating Methods and LRs for Automatic Keyword Extraction from Open Domain Texts. 1917-1920 - Hans Hjelm:
Extraction of Cross Language Term Correspondences. 1921-1924 - Julia Ritz, Ulrich Heid:
Extraction tools for collocations and their morphosyntactic specificities. 1925-1930 - Rachada Kongkachandra, Kosin Chamnongthai:
Semantic-Based Keyword Recovery Function for Keyword Extraction System. 1931-1936 - Peter Waiganjo Wagacha, Guy De Pauw, Pauline Githinji:
A Grapheme-Based Approach for Accent Restoration in Gikuyu. 1937-1940 - Jirí Semecký:
On Automatic Assignment of Verb Valency Frames in Czech. 1941-1944 - Kyo Kageura, Genichiro Kikui:
A Self-Referring Quantitative Evaluation of the ATR Basic Travel Expression Corpus (BTEC). 1945-1950 - Rebecca J. Passonneau, Nizar Habash, Owen Rambow:
Inter-annotator Agreement on a Multilingual Semantic Annotation Task. 1951-1956 - György Szarvas, Richárd Farkas, László Felföldi, András Kocsor, János Csirik:
A highly accurate Named Entity corpus for Hungarian. 1957-1960 - Eckhard Bick:
Turning a Dependency Treebank into a PSG-style Constituent Treebank. 1961-1964 - Motoko Ueyama:
Evaluation of Web-based Corpora: Effects of Seed Selection and Time Interval. 1965-1970 - Dimitrios Kokkinakis, Dana Dannélls:
Recognizing Acronyms and their Definitions in Swedish Medical Texts. 1971-1974 - Yun-Chuang Chiao, Olivier Kraif, Dominique Laurent, Thi Minh Huyen Nguyen, Nasredine Semmar, François Stuck, Jean Véronis, Wajdi Zaghouani:
Evaluation of multilingual text alignment systems: the ARCADE II project. 1975-1979 - Marko Salmenkivi:
Finding representative sets of dialect words for geographical regions. 1980-1985 - Diana Santos, Nuno Seco, Nuno Cardoso, Rui Vilela:
HAREM: An Advanced NER Evaluation Contest for Portuguese. 1986-1991 - Jan Frederik Maas, Britta Wrede:
BITT: A Corpus for Topic Tracking Evaluation on Multimodal Human-Robot-Interaction. 1992-1995 - Jérémie Segouat, Annelies Braffort, Emilie Martin:
Sign Language corpus analysis: Synchronisation of linguistic annotation and numerical data. 1996-1999 - Jan Bungeroth, Daniel Stein, Philippe Dreuw, Morteza Zahedi, Hermann Ney:
A German Sign Language Corpus of the Domain Weather Report. 2000-2003 - Sarkis Abrilian, Laurence Devillers, Jean-Claude Martin:
Annotation of Emotions in Real-Life Video Interviews: Variability between Coders. 2004-2009 - Gilles Le Chenadec, Valérie Maffiolo, Noël Chateau, Jean-Marc Colletta:
Creation of a corpus of multimodal spontaneous expressions of emotions in Human-Machine Interaction. 2010-2013 - Petra-Maria Strauß, Holger Hoffmann, Wolfgang Minker, Heiko Neumann, Günther Palm, Stefan Scherer, Friedhelm Schwenker, Harald C. Traue, Welf Walter, Ulrich Weidenbacher:
Wizard-of-Oz Data Collection for Perception and Interaction in Multi-User Environments. 2014-2017 - Ivana Kruijff-Korbayová, Tilman Becker, Nate Blaylock, Ciprian Gerstenberger, Michael Kaisser, Peter Poller, Verena Rieser, Jan Schehl:
The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player. 2018-2023 - Jeongwoo Ko, Fumihiko Murase, Teruko Mitamura, Eric Nyberg, Masahiko Tateishi, Ichiro Akahori:
Analyzing the Effects of Spoken Dialog Systems on Driving Behavior. 2024-2027 - Alexander Raake, Brian F. G. Katz:
US-based Method for Speech Reception Threshold Measurement in French. 2028-2033 - Philippe Boula de Mareüil, Christophe d'Alessandro, Alexander Raake, Gérard Bailly, Marie-Neige Garcia, Michel Morel:
A joint intelligibility evaluation of French text-to-speech synthesis systems: the EvaSy SUS/ACR campaign. 2034-2037 - Mark A. Przybocki, Gregory A. Sanders, Audrey N. Le:
Edit Distance: A Metric for Machine Translation Evaluation. 2038-2043 - Gábor Hodász:
Evaluation Methods of a Linguistically Enriched Translation Memory System. 2044-2047 - Jamal Laoudi, Calanda R. Tate, Clare R. Voss:
Task-based MT Evaluation: From Who/When/Where Extraction to Event Understanding. 2048-2053 - Hélène Bonneau-Maynard, Christelle Ayache, Frédéric Béchet, Alexandre Denis, Anne Kuhn, Fabrice Lefèvre, Djamel Mostefa, Matthieu Quignard, Sophie Rosset, Christophe Servan, Jeanne Villaneau:
Results of the French Evalda-Media evaluation campaign for literal understanding. 2054-2059 - Bernt Andrassy, Harald Höge:
Human and machine recognition as a function of SNR. 2060-2063 - Jan Hoidekr, Josef V. Psutka, Ales Prazák, Josef Psutka:
Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries. 2064-2067 - Niels Ole Bernsen, Thomas Hansen, Svend Kiilerich, Torben Kruchov Madsen:
Field Evaluation of a Single-Word Pronunciation Training System. 2068-2073 - Rafael E. Banchs, Antonio Bonafonte, Javier Pérez:
Acceptance Testing of a Spoken Language Translation System. 2074-2079 - Evie Coussé, Steven Gillis:
Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus. 2080-2083 - Dirk Bühler, Wolfgang Minker:
Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign. 2084-2087 - Juan José Rodríguez Soler, Pedro Concejero Cerezo, Daniel Tapias Merino, Alberto J. Sánchez García:
MEDUSA: User-Centred Design and usability evaluation of Automatic Speech Recognition telephone services in Telefónica Móviles España. 2088-2091 - Daniela Braga, L. M. S. Coelho, João Paulo Ramos Teixeira, Diamantino Freitas:
Progmatica: A Prosodic Database for European Portuguese. 2092-2095 - Ingunn Amdal, Torbjørn Svendsen:
FonDat1: A Speech Synthesis Corpus for Norwegian. 2096-2101 - Marti Umbert, Asunción Moreno, Pablo Daniel Agüero, Antonio Bonafonte:
Spanish Synthesis Corpora. 2102-2105 - Hannes Mögele, Moritz Kaiser, Florian Schiel:
SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus. 2106-2111 - Christoph Draxler, Klaus Jänsch:
Speech Recordings in Public Schools in Germany - the Perfect Show Case for Web-based Recordings and Annotation. 2112-2115 - Zhongqiang Huang, Lei Chen, Mary P. Harper:
An Open Source Prosodic Feature Extraction Tool. 2116-2121 - Benjamin K. Tsou, Tom B. Y. Lai, King Kui Sin, Lawrence Y. L. Cheung:
Court Stenography-To-Text ("STT") in Hong Kong: A Jurilinguistic Engineering Effort. 2122-2125 - Ibon Saratxaga, Eva Navas, Inmaculada Hernáez, Iker Luengo:
Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque. 2126-2129 - Beáta Megyesi, Anna Sågvall Hein, Éva Csató Johanson:
Building a Swedish-Turkish Parallel Corpus. 2130-2133 - Alexandru Ceausu, Dan Stefanescu, Dan Tufis:
Acquis Communautaire Sentence Alignment using Support Vector Machines. 2134-2137 - Tomaz Erjavec:
The English-Slovene ACQUIS corpus. 2138-2141 - Ralf Steinberger, Bruno Pouliquen, Anna Widiger, Camelia Ignat, Tomaz Erjavec, Dan Tufis, Dániel Varga:
The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages. 2142-2147 - Bente Maegaard, Lene Offersgaard, Lina Henriksen, Hanne Jansen, Xavier Lepetit, Costanza Navarretta, Claus Povlsen:
The MULINCO corpus and corpus platform. 2148-2153 - Jörg Tiedemann:
ISA & ICA - Two Web Interfaces for Interactive Alignment of Bitexts alignment of parallel texts. 2154-2159 - Emiliano Guevara, Sergio Scalise, Antonietta Bisetto, Chiara Melloni:
MORBO/COMP: a multilingual database of compound words. 2160-2163 - Andrea Sansò:
Documenting variation across Europe and the Mediterranean: the Pavia Typological Database. 2164-2169 - Silvie Cinková:
From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description. 2170-2175 - Doaa Samy, Antonio Moreno-Sandoval, José María Guirao, Enrique Alfonseca:
Building a Parallel Multilingual Corpus (Arabic-Spanish-English). 2176-2181 - Wei-Yun Ma, Chu-Ren Huang:
Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus. 2182-2185 - Markus Geilfuss, Jan-Torsten Milde:
SAM - an annotation editor for parallel texts. 2186-2189 - Jorge Kinoshita, Laís do Nascimento Salvador, Carlos Eduardo Dantas de Menezes:
CoGrOO: a Brazilian-Portuguese Grammar Checker based on the CETENFOLHA Corpus. 2190-2193 - Petr Nemec:
Tree Searching/Rewriting Formalism. 2194-2199 - Hong Phuong Le, Nguyên Thi Minh Huyên, Laurent Romary, Azim Roussanaly:
A Lexicalized Tree-Adjoining Grammar for Vietnamese. 2200-2205 - Christian Rohrer, Martin Forst:
Improving coverage and parsing quality of a large-scale LFG for German. 2206-2211 - Timothy Baldwin, Su'ad Awab:
Open Source Corpus Analysis Tools for Malay. 2212-2215 - Joakim Nivre, Johan Hall, Jens Nilsson:
MaltParser: A Data-Driven Parser-Generator for Dependency Parsing. 2216-2219 - Benoît Sagot, Pierre Boullier:
Deep non-probabilistic parsing of large corpora. 2220-2221 - Sónia Frota, Marina Vigário, Fernando Martins:
FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text. 2224-2230 - Roger Levy, Galen Andrew:
Tregex and Tsurgeon: tools for querying and manipulating tree data structures. 2231-2234 - Ulrich Heid, Elsabé Taljard, Danie J. Prinsloo:
Grammar-based tools for the creation of tagging resources for an unresourced language: the case of Northern Sotho. 2235-2240 - Elaine Uí Dhonnchadha, Josef van Genabith:
A Part-of-speech tagger for Irish using Finite-State Morphology and Constraint Grammar Disambiguation. 2241-2244 - Péter Halácsy, András Kornai, Csaba Oravecz, Viktor Trón, Dániel Varga:
Using a morphological analyzer in high precision POS tagging of Hungarian. 2245-2248 - Marc Verhagen, Robert Knippen, Inderjeet Mani, James Pustejovsky:
Annotation of Temporal Relations with Tango. 2249-2252 - Catherine Havasi, James Pustejovsky, Marc Verhagen:
BULB: A Unified Lexical Browser. 2253-2258 - Tomasz Obrêbski, Michal Stolarski:
UAM Text Tools - a flexible NLP architecture. 2259-2262 - Benjamin Waldron, Ann A. Copestake, Ulrich Schäfer, Bernd Kiefer:
Preprocessing and Tokenisation Standards in DELPH-IN Tools. 2263-2268 - Yoshihide Kato, Shigeki Matsubara, Yasuyoshi Inagaki:
A Corpus Search System Utilizing Lexical Dependency Structure. 2269-2272 - Cédrick Fairon, Sébastien Paumier:
A framework for real-time dictionary updating. 2273-2276 - Enrique Alfonseca, Antonio Moreno-Sandoval, José María Guirao, Maria Ruiz-Casado:
The wraetlic NLP suite. 2277-2280 - Jordi Atserias, Bernardino Casas, Elisabet Comelles, Meritxell González, Lluís Padró, Muntsa Padró:
FreeLing 1.3: Syntactic and semantic services in an open-source NLP library. 2281-2286 - Raffaella Bernardi, Diego Calvanese, Luca Dini, Vittorio Di Tomaso, Elisabeth Frasnelli, Ulrike Kugler, Barbara Plank:
Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano. 2287-2290 - Daan Broeder, Freddy Offenga, Peter Wittenburg, Peter van der Kamp, David Nathan, Sven Strömqvist:
Technologies for a Federation of Language Resource Archives. 2291-2294 - Peter Berck, Hans-Jörg Bibiko, Marc Kemps-Snijders, Albert Russel, Peter Wittenburg:
Ontology-based Language Archive Utilization. 2295-2298 - Marc Kemps-Snijders, Julien Ducret, Laurent Romary, Peter Wittenburg:
An API for accessing the Data Category Registry. 2299-2302 - Michel Boekestein, Griet Depoorter, Remco van Veenendaal:
Functioning of the Centre for Dutch Language and Speech Technology. 2303-2306 - Emiko Suzuki, Tomomi Suzuki, Kyoko Kakihana:
On the Web Trilingual Sign Language Dictionary to Learn the foreign Sign Language without Learning a Target Spoken Language. 2307-2310 - Alessandro Oltramari:
LexiPass methodology: a conceptual path from frames to senses and back. 2311-2314 - Paul Morarescu:
Principles for annotating and reasoning with spatial information. 2315-2320 - Paul Buitelaar, Philipp Cimiano, Stefania Racioppa, Melanie Siegel:
Ontology-based Information Extraction with SOBA. 2321-2324 - Eiko Yamamoto, Kyoko Kanzaki, Hitoshi Isahara:
Detection of inconsistencies in concept classifications in a large dictionary - Toward an improvement of the EDR electronic dictionary. 2325-2330 - Dominique Dutoit:
Alexandria: A Powerful Multilingual Resource for Web. 2331-2334 - Asanee Kawtrakul, Chaveevan Pechsiri, Trakul Permpool, Dussadee Thamvijit, Phukao Sornprasert, Chaiyakorn Yingsaeree, Mukda Suktarachan:
Ontology Driven K-Portal Construction and K-Service Provision. 2335-2340 - Paolo Bouquet, Luciano Serafini, Stefano Zanobini:
The role of lexical resources in matching classification schemas. 2341-2346 - Kiril Ivanov Simov, Petya Osenova:
Shallow Semantic Annotation of Bulgarian. 2347-2352 - Thierry Declerck, Mihaela Vela:
Generic NLP Tools for Supporting Shallow Ontology Building. 2353-2356 - Dorothee Beermann, Lars Hellan:
Word Sense Disambiguation and Semantic Disambiguation for Construction Types in Deep Processing Grammars. 2357-2360 - Porfírio P. Filipe, Nuno J. Mamede:
A Framework to Integrate Ubiquitous Knowledge Modeling. 2361-2366 - Hiromi Itoh Ozaku, Akinori Abe, Kaoru Sagara, Noriaki Kuwahara, Kiyoshi Kogure:
Features of Terms in Actual Nursing Activities. 2367-2372 - Angelika Storrer, Sandra Wellinghoff:
Automated detection and annotation of term definitions in German text corpora. 2373-2376 - Judit Feliu, Jorge Vivaldi, M. Teresa Cabré:
SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations. 2377-2382 - Yi-Rong Chen, Qin Lu, Wenjie Li, Zhifang Sui, Luning Ji:
A Study on Terminology Extraction Based on Classified Corpora. 2383-2386 - Andrea Mulloni, Viktor Pekar:
Automatic Detection of Orthographics Cues for Cognate Recognition. 2387-2390 - Benjamin K. Tsou, Oi Yee Kwong:
Toward a Pan-Chinese Thesaurus. 2391-2394 - Gabriella Pardelli, Manuela Sassi, Sara Goggi, Paola Orsolini:
Natural Language Processing: A Terminological and Statistical Approach. 2395-2398 - Masaki Itagaki, Anthony Aue, Takako Aikawa:
Detecting Inter-domain Semantic Shift using Syntactic Similarity. 2399-2402 - Kamel Smaïli, Caroline Lavecchia, Jean-Paul Haton:
Linguistic features modeling based on Partial New Cache. 2403-2406 - Nilda Ruimy:
Structuring a Domain Vocabulary in a General Knowledge Environment. 2407-2412 - Rebecca J. Passonneau, Roberta Blitz, David K. Elson, Angela Giral, Judith Klavans:
CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project. 2413-2418 - Canasai Kruengkrai, Virach Sornlertlamvanich, Hitoshi Isahara:
A Conditional Random Field Framework for Thai Morphological Analysis. 2419-2424 - Kow Kuroda, Masao Utiyama, Hitoshi Isahara:
Getting Deeper Semantics than Berkeley FrameNet with MSFA. 2425-2430 - Milen Kouylekov, Bernardo Magnini:
Building a Large-Scale Repository of Textual Entailment Rules. 2437-2440 - Stefan Schaden:
Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure. 2441-2446 - Ulrik Petersen:
Querying Both Parallel And Treebank Corpora: Evaluation Of A Corpus Query System. 2454-2459 - Anna Sinopalnikova, Pavel Smrz:
Intelligent Dictionary Interfaces: Usability Evaluation of Access-Supporting Enhancements. 2464-2469 - Harry Halpin:
Automatic Evaluation and Composition of NLP Pipelines with Web Services. 2470-2473 - Ana-Maria Barbu, Emil Ionescu, Verginica Barbu Mititelu:
Romanian Valence Dictionary in XML Format. 2474-2477 - Agnes Lisowska, Nancy L. Underwood:
ROTE: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics. 2478-2481 - Margaret King, Nancy L. Underwood:
Evaluating Symbiotic Systems: the challenge. 2482-2485 - Nancy L. Underwood, Agnes Lisowska:
The Evolution of an Evaluation Framework for a Text Mining System. 2486-2491 - Fidelia Ibekwe-Sanjuan:
A task-oriented framework for evaluating theme detection systems: A discussion paper. 2492-2497 - Feng Zou, Fu Lee Wang, Xiaotie Deng, Song Han:
Evaluation of Stop Word Lists in Chinese Language. 2497-2500 - Laurianne Sitbon, Patrice Bellot:
Tools and methods for objective or contextual evaluation of topic segmentation. 2498-2503 - Nasredine Semmar, Meriama Laïb, Christian Fluhr:
A Deep Linguistic Analysis for Cross-language Information Retrieval. 2507-2510 - Alessandro Bahgat Shehata, Fabio Massimo Zanzotto:
A Dependency-based Algorithm for Grammar Conversion. 2508-2513 - Witold Abramowicz, Agata Filipowska, Jakub Piskorski, Krzysztof Wecel, Karol Wieloch:
Linguistic Suite for Polish Cadastral System. 2518-2523 - Khurshid Ahmad, Lee Gillam, David Cheng:
Sentiments on a Grid: Analysis of Streaming News and Views. 2524-2527 - Bolette Sandford Pedersen:
Query Expansion on Compounds. 2528-2533 - David M. Rojas, Takako Aikawa:
Predicting MT Quality as a Function of the Source Language. 2534-2537 - Martin Rajman, Marita Ailomaa, Agnes Lisowska, Miroslav Melichar, Susan Armstrong:
Extending the Wizard of Oz Methodologie for Multimodal Language-enabled Systems. 2538-2543 - Ronnie Taib, Natalie Ruiz:
Tangible Objects for the Acquisition of Multimodal Interaction Patterns. 2540-2545 - Stephan Raidt, Gérard Bailly, Frédéric Elisei:
Does a Virtual Talking Face Generate Proper Multimodal Cues to Draw User's Attention to Points of Interest? 2544-2549 - Darinka Verdonik, Matej Rojc:
Are you ready for a call? - Spontaneous conversations in tourism for speech-to-speech translation systems. 2556-2559 - Javier Pérez, Antonio Bonafonte:
GAIA: Common Framework for the Development of Speech Translation Technologies. 2560-2563 - Hitomi Tohyama, Shigeki Matsubara:
Collection of Simultaneous Interpreting Patterns by Using Bilingual Spoken Monologue Corpus. 2564-2569 - Henk van den Heuvel, Khalid Choukri, Christian Gollan, Asunción Moreno, Djamel Mostefa:
TC-STAR: New language resources for ASR and SLT purposes. 2570-2573 - Briony Williams, Rhys James Jones, Ivan Uemlianin:
Tools and resources for speech synthesis arising from a Welsh TTS project. 2574-2577 - Harold L. Somers, D. Gareth Evans, Zeinab Mohamed:
Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali. 2578-2581 - Dimou Athanassia-Lida, Aimilios Chalamandaris:
Language identification from suprasegmental cues: Speech synthesis of Greek utterances from different dialectal variations. 2582-2585 - Hiromichi Kawanami, Takahiro Kitamura, Kiyohiro Shikano:
Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech. 2586-2589 - Mohammad Bahrani, Hossein Sameti, Nazila Hafezi, H. Movassagh:
Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems. 2590-2593 - Jim Talley:
Bootstrapping New Language ASR Capabilities: Achieving Best Letter-to-Sound Performance under Resource Constraints. 2594-2599 - Pavel Ircing, Jan Hoidekr, Josef Psutka:
Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech. 2600-2603 - Antal van den Bosch, Ineke Schuurman, Vincent Vandeghinste:
Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development. - Zeljko Agic, Marko Tadic:
Evaluating Morphosyntactic Tagging of Croatian Texts. - Julie Medero, Kazuaki Maeda, Stephanie M. Strassel, Christopher Walker:
An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks.
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.