
News

Call for the 4th China Workshop on Machine Translation (CWMT2008), to be held on November 27-28, 2008.

Call for Participation: The Fourth International Chinese Language Processing Bakeoff & the First Chinese Processing Evaluation


Newly Added Resources

863 program in 2005 machine translation evaluation data

863 program in 2005 information index evaluation data

863 program in 2005 speech recognition evaluation data

The contemporary Chinese general balanced corpus of National Language Committee (Raw)

The contemporary Chinese general balanced corpus of National Language Committee (Segmentation and part-of-speech annotated)

The contemporary Chinese general balanced corpus of National Language Committee (Syntactic Treebank)

The contemporary Chinese general balanced corpus of National Language Committee (Segmentation lexicon)

6 regional accent speech corpus: spoken language

6 regional accent speech corpus: recitation

About Chinese Linguistic Data Consortium (CLDC)

The Chinese Linguistic Data Consortium (CLDC) is a nationwide, voluntary, legally registered organization of researchers engaged in building Chinese linguistic data resources (including speech data). It is an academic, public-service, non-profit association. Its aim is to unite the many researchers in this area and to build a widely accepted Chinese linguistic database, raising Chinese speech processing to an international level by supporting the relevant fundamental research and application development, and advancing Chinese linguistic data processing technology. CLDC originated from the project "Image, Speech, Natural Language Understanding & Knowledge Exploration" (Subject No. G19980305), supported by the National Key Fundamental Research and Development Program (973), together with the Chinese Hi-tech Research and Development Program project "Generally Technical Research and Basic Database Establishment of Chinese Platform" (Subject No. 2001AA11401). CLDC is subordinate to the Chinese Information Association, which provides professional guidance and supervision, and it has its office at the Institute of Automation, Chinese Academy of Sciences.

The goal of CLDC is to build a general-purpose database of Chinese linguistic data that represents the current international state of the art in Chinese linguistic resources. To achieve this, CLDC creates and collects open Chinese linguistic data that is comprehensive, authoritative, and systematic, covering the language and speech data required in various areas, such as lexicons, corpora, datasets, and reference tools, and thereby establishes a uniform set of standards and specifications for its users. Alongside creating and collecting data, CLDC distributes existing data to organizations in education, scientific research, government, and industrial technology development, in order to support fundamental research and application development in Chinese linguistic data processing.

In this way, CLDC aims to advance Chinese speech information processing technology, enable it to keep pace with the corresponding technology for English, and help make Chinese one of the internationally used languages.

 COPYRIGHT (C) 2004, CHINESE LINGUISTIC DATA CONSORTIUM (CLDC) , ALL RIGHTS RESERVED