Several papers from CLARA were presented at the LREC 2014 conference in Reykjavik.
An overview paper with the title CLARA: A New Generation of Researchers in Common Language Resources and Their Applications presents the main aims, activities and results of CLARA.
The following papers were presented by former CLARA fellows (including some research done after their engagement in the project):
- Automatic Refinement of Syntactic Categories in Chinese Word Structures
- Chasing the Perfect Splitter: A Comparison of Different Compound Splitting Tools
- Combining Dependency Information and Generalization in a Pattern-based Approach to the Classification of Lexical-Semantic Relation Instances
- Pruning the Search Space of the Wolof LFG Grammar Using a Probabilistic and a Constraint Grammar Parser
- Crowdsourcing as a Preprocessing for Complex Semantic Annotation Tasks
- How to Tell a Schneemann from a Milchmann: An Annotation Scheme for Compound-Internal Relations
- Transfer Learning of Feedback Head Expressions in Danish and Polish Comparable Multimodal Corpora
- Votter Corpus: A Corpus of Social Polling Language
- The DWAN Framework: Application of a Web Annotation Framework for the General Humanities to the Domain of Language Resources
- VarClass: An Open-source Language Identification Tool for Language Varieties
- A Quality-based Active Sample Selection Strategy for Statistical Machine Translation
The project period of CLARA has been extended to 52 months. The new end date is March 31, 2014.
Few PhD students at Humanities faculties go spend some time at an industrial company. But that is precisely what Héctor Martínez did. He is a CLARA ESR at Copenhagen Universitity and had a research stay at Infomedia, a Danish IT company which has large text databases which are useful for his research. Read more in the Humanist on p. 34 (in Danish).