chinese treebank 可以士兵做什么任务去

【图文】LDC中文树库Chinese Treebank_百度文库
两大类热门资源免费畅读
续费一年阅读会员,立省24元!
LDC中文树库Chinese Treebank
&&LDC中文树库 Chinese Treebank
大小:684.00KB
登录百度文库,专享文档复制特权,财富值每天免费拿!
你可能喜欢Penn Chinese Treebank Project
The Penn - CU Chinese Treebank Project
Growing interest in Chinese Language Processing is leading to the development
of resources such as annotated corpora and automatic segmenters, part-of-speech
taggers and parsers.
Currently these are all being developed independently,
often with quite different standards for segmentation, part-of-speech tagging
and syntactic bracketing.
The time is ripe for an open discussion of the
methodological issues involved in achieving agreement on annotation
standards.
Unlike Western and Middle Eastern
Writing systems, Chinese writing does not have a
natural delimiter between words with the result that appropriate word
segmentation becomes a prerequisite for any other NLP tasks. In the literature
this problem has been discussed extensively.
The problem of part-of-speech
tagging is closely related.
These are both prerequisites to the establishment
of a Chinese Treebank that could be of general use.
We have completed building a 780K-thousand-word Chinese Treebank.
Our aim is to work towards a community
consensus on guidelines that will include the input of influential researchers
from Taiwan, Singapore, Hong Kong, China and the US.
To this end,
we held two workshops and a number of meetings between 7/1998 to 10/2000
in USA and abroad.
We are very interested in the community's
reaction to our guidelines and Treebank, and encourage anyone interested in
getting involved to please look into the guidelines we have attached below, use
the Treebank, which is available via , and
to get in touch with us with your comments.
Descriptions of the project:
Task: Building a segmented, POS tagged and bracketed Chinese corpus. The
data consists of Xinhua newswire, Hong Kong news and articles from Sinorama
news magazine. There is on-going effort to annotate broadcast news and broadcast
conversation data under the
DARPA GALE funding.
Latest release: The Chinese TreeBank (CTB) version 6.0, which has
has been officially
released via .
CTB6.0 data composition:
Xinhua newswire:
[001-325, 400-454, 600-885, 900-931]
Hong Kong news: [500-554]
Sinorama: [590-596, ], Broadcast news: []
for more information.
Coming soon! CTB6.0 is in the LDC publication pipeline.
Penn guidelines for Chinese Treebank
Segmentation guidelines (final version):
Guideline for POS tagging (final version):
Guideline for Bracketing (final version):
All three guidelines are now
technical reports. The ID numbers are
00-06, 00-07 and 00-08, respectively.
Publications
2005: The Penn Chinese TreeBank: Phrase Structure Annotation of a Large Corpus.
Nianwen Xue, Fei Xia, Fu-Dong Chiou, and Martha Palmer
Natural Language Engineering, 11(2)207-238.
Nianwen Xue, Fu-Dong Chiou, and Martha Palmer
Proceedings of the 19th. International Conference on Computational
Linguistics (COLING 2002), Taipei, Taiwan, 2002.
Fu-Dong Chiou, David Chiang, and Martha Palmer
Proceedings of the Human Language Technology Conference (HLT 2001), San
Diego, California, 2001.
Fei Xia, Martha Palmer, Nianwen Xue,
Mary Ellen Okurowski, John Kovarik, Fu-Dong Chiou,
Shizhe Huang, Tony Kroch, and Mitch Marcus
Proceedings of the second International Conference on Language Resources
and Evaluation (LREC 2000), Athens, Greece, 2000.
Principal Investigators:
Consultants:
Shizhe Huang,
Mary Ellen Okurowski,
John Kovarik,
Boyan A. Onyshkevyc
Project Managers:
Shudong Huang (September - December, 1998),
(September
1998 - December 2000),
(May 1999 - May 2000),
(January 2001 - present)
Guideline Designers:
Programming Support:
Zhibiao Wu (September 1998 - September 2000)
Scott Cotton (October - December, 2000)
Annotators:
Meiyu Chang (June 2003 - present)
(September 1998 - present)
Shudong Huang (September - December, 1998)
Tsan-Kuang Lee (June 2002 - present)
Nianwen Xue (September 1998 - May 2000; September 2001 - November
Sample Files
file 1: [],
file 2: [],
Treebank Releases on
Preliminary Release: June 2000,
Second Release: Dec 2000,
Workshops and meetings
meeting during ACL-98, Montreal, Canada (8/98)
meeting during ICCIP-98, Beijing, China (11/98)
meeting during ACL-99, Maryland, USA (6/99)
Links to other sites
Last modified on February 10, 2004.
This page has been viewed
times since March 5, 2003.您所在位置: &
&nbsp&&nbsp&nbsp&&nbsp
Treebank-based acquisition of a Chinese lexical-functional grammar.pdf全文-职业教育-在线文档 12页
本文档一共被下载:
次 ,您可全文免费在线阅读后下载本文档。
下载提示
1.本站不保证该用户上传的文档完整性,不预览、不比对内容而直接下载产生的反悔问题本站不予受理。
2.该文档所得收入(下载+内容+预览三)归上传者、原创者。
3.登录后可充值,立即自动返金币,充值渠道很便利
需要金币:145 &&
Treebank-based acquisition of a Chinese lexical-functional grammar.pdf
你可能关注的文档:
··········
··········
Treebank-Based Acquisition of a Chinese Lexical-Functional
Michael BURKE
Olivia LAM
+National Centre for Language Technology,
§Department of Linguistics,
School of Computing, Dublin City University
The University of Hong Kong,
and ?Centre for Advanced Studies, IBM,
Pokfulam, Hong Kong.
Dublin, Ireland.
olivia@hku.hk
mburke@computing.dcu.ie
Aoife CAHILL+
Rowena CHAN§
Ruth O’DONOVAN+
acahill@computing.dcu.ie
rowenac@graduate.hku.hk
rodonovan@computing.dcu.ie
Adams BODOMO§
Josef van GENABITH+?
Andy WAY+?
abbodomo@hku.hk
josef@computing.dcu.ie
away@computing.dcu.ie
wide-coverage,
constraint-based
Lexical-Functional
(LFG) (Kaplan and Bresnan, 1982; Bresnan, 2001) or Head-Driven Phrase Structure Grammars
unrestricted
knowledge-intensive,
time-consuming
prohibitively)
expensive.
researchers
automatically
wide-coverage,
probabilistic constraint-based grammatical resources from treebanks (Cahill et al., 2002, Cahill
Hockenmaier
Hockenmaier,
addressing
acquisition
bottleneck
正在加载中,请稍后...The Chinese Penn Treebank Tag Set中文宾州树库标记及其含义_百度文库
两大类热门资源免费畅读
续费一年阅读会员,立省24元!
The Chinese Penn Treebank Tag Set中文宾州树库标记及其含义
&&The Chinese Penn Treebank Tag Set中文宾州树库标记及其含义
阅读已结束,下载文档到电脑
想免费下载更多文档?
定制HR最喜欢的简历
下载文档到电脑,方便使用
还剩2页未读,继续阅读
定制HR最喜欢的简历
你可能喜欢您要找的是不是:
chinese treebank
汉语树库 ; 概念层次网络理论 ; 句类依存树库
[gap=560]Key words
Chinese treebank; hierarchical sentence-category dependency treebank
基于4个网页-
中文宾州树库
宾州大学中文树库
宾州中文树库
清华汉语树库
更多收起网络短语
宾州中文树库
- 引用次数:3
The automatic identification of coordination with overt conjunctions (COC) will prepare the work for building the Chinese Treebank, enhance the efficiency of the parser and be used for Machine Translation and Information Extraction.
有标记联合结构的自动识别将为汉语树库的构建做好预处理工作,提高句法分析器的工作效率,同时该识别成果可以直接应用于机器翻译、信息抽取等领域。
参考来源 - 有标记联合结构的自动识别
&2,447,543篇论文数据,部分数据来源于
Chinese automatic sentence analysis Grammar knowledge database Grammar function distribution 973 Treebank;
汉语自动句法分析; 语法知识库; 语法功能分布; 973汉语树库;
This paper reports the new improvement of the work on parsing the Penn Chinese treebank (CTB), one of the most important technologies of Chinese information processing.
报告了依托宾州中文树库进行句法分析研究的最新进展。
Experiments on Chinese TreeBank from different training set size are made. It shows that our approach improves the accuracy of POS tagging over the four training sets with different sizes.
本文以宾州中文树库为实验语料,考查了不同规模的标注数据对模型性能的影响,实验结果表明,本文提出的无监督词性标注方法提高了中文词性标注的性能。
$firstVoiceSent
- 来自原声例句
请问您想要如何调整此模块?
感谢您的反馈,我们会尽快进行适当修改!
请问您想要如何调整此模块?
感谢您的反馈,我们会尽快进行适当修改!

我要回帖

更多关于 任务管理器可以做什么 的文章

 

随机推荐