Press Release - Immediate
Paris, France, December 4th, 2015
Philadelphia, PA, USA
ELRA and LDC partner on a joint distribution of Language Resources from the 2006 CoNLL shared task.
The Conference on Computational Natural Language Learning (CoNLL) is accompanied every year by a shared task intended to promote natural language processing applications and evaluate them in a standard setting. In 2006, the shared task was devoted to the parsing of syntactic dependencies using corpora from up to thirteen languages. The task aimed to define and extend the then-current state of the art in dependency parsing, a technology that complemented previous tasks by producing a different kind of syntactic description of input text.
Within this framework, ELRA and LDC are pleased to announce the release of 2006 CoNLL Shared Task - Ten Languages and 2006 CoNLL Shared Task – Arabic & Czech, which consist of the dependency treebanks used in the CoNLL 2006 shared task on multi-lingual dependency parsing. The languages covered in 2006 CoNLL Shared Task – Ten Languages are: Bulgarian, Danish, Dutch, German, Japanese, Portuguese, Slovene, Spanish, Swedish and Turkish. The source data in the treebanks consists principally of various texts (e.g., textbooks, news, literature) annotated in dependency format.
These packages can be found in the ELRA and LDC catalogues under the following references:
2006 CoNLL Shared Task - Ten Languages
ELRA ID: ELRA-W0086
LDC ID: LDC2015T11
2006 CoNLL Shared Task – Arabic & Czech
ELRA ID: ELRA-W0087
LDC ID: LDC2015T12
*** About CoNLL and the 2006 shared task ***
More information about CoNLL and the 2006 shared task is available at http://ifarm.nl/signll/conll and http://ilk.uvt.nl/conll, respectively.
*** About ELRA ***
The European Language Resources Association (ELRA) is a non-profit organisation founded by the European Commission in 1995, with the mission of providing a clearing house for language resources and promoting Human Language Technologies (HLT).
To find out more about ELRA and its catalogue, please visit: http://www.elra.info and http://catalogue.elra.info
*** About LDC ***
The Linguistic Data Consortium (LDC) is an open consortium of universities, libraries, corporations and research laboratories that creates and distributes linguistic resources for language-related education, research and technology development.
To find out more about LDC and its catalogue, please visit: https://www.ldc.upenn.edu and https://catalog.ldc.upenn.edu
Denise DiPersio firstname.lastname@example.org
Valérie Mapelli email@example.com