英语语法网 英语词汇网 高考英语网 中考英语网
精心组稿 精巧编排 精彩纷呈 全心打造英语第一品牌!
加入收藏
网站地图
购点说明
首    页 | 语法新闻 | 名词用法 | 代词用法 | 冠词用法 | 数词用法 | 介词用法 | 连词用法 | 形容词用法 | 副词用法 | 比较等级 | 动词用法 | 连系动词 | 情态动词 | 动词时态 | 被动语态 | 虚拟语气 | 非谓语动词 | 疑问句 | 祈使句 | 感叹句 | 否定句 | 倒装句 | 强调句 | there be存在句 | 省略句 | 独立主格 | 主谓一致 | 状语从句 | 定语从句 | 名词性从句 | it用法 | 语法练习 | 语法考试 | 语法综合 | 句子成分 | 语法连载 | 语法著作 | 英语语料库 | 语法与翻译 | 双语阅读 | 语法与惯用法 | 语法与写作 | 期刊精选 | 语法观点 | 语法挑刺 | 下载中心 | 趣味英语 | 会员之家 | 专家顾问 | 百家讲坛 | 答疑中心
您现在的位置: 首页 > 英语语法 > 英语语料库 >
英国国家语料库(BNC)介绍
作者:admin    文章来源:本站原创    点击数:    更新时间:2011-11-16    
        ★★★ 【字体:
说明:引用此文请注明出处,并务请保留后面的有效链接地址,谢谢!

 

How the BNC was created

The BNC project was carried out and is managed by the BNC Consortium, an industrial/academic consortium led by Oxford University Press, of which the other members are major dictionary publishers Addison-Wesley Longman and Larousse Kingfisher Chambers; academic research centres at Oxford University Computing Services (OUCS), the University Centre for Computer Corpus Research on Language (UCREL) at Lancaster University, and the British Library's Research and Innovation Centre. The project was funded by the commercial partners, the Science and Engineering Council (now EPSRC) and the UK government's Department of Trade and Industry under the Joint Framework for Information Technology (JFIT) programme. Additional support was provided by the British Library and the British Academy.

Creation process in brief

The creation of the corpus started with a careful planning stage where the design principles were drawn up. These principles included the selection criteria that were used as the basis for the collection of the texts (a separate section describes the selection criteria for the written and the spoken parts of the corpus).

Once a suitable texts was identified and permission to use it had been obtained, the text was converted to machine readable form. The conversion was performed by one of the commercial partners (OUP, Longman or Chambers). The resulting text was then converted to the standard project encoding format at OUCS, where its accuracy and internal consistency was also validated. The text was then passed to UCREL, where word class tagging was automatically added, and returned to OUCS for documentation and accession into the corpus. Each stage of corpus processing was recorded in a database maintained at OUCS.

Work on building the corpus commenced in 1991 and was completed in 1994. The first general release of the corpus for European researchers was announced in February 1995. After the completion of the first edition of the BNC, a phase of tagging improvement was undertaken at Lancaster University with funding from the Engineering and Physical Sciences Research Council (Research Grant No. GR/F 99847). This tagging enhancement project was led by Geoffrey Leech, Roger Garside and Tony McEnery. Correction and validation of the bibliographic and contextual information in all the BNC Headers was also carried out for this second version of the corpus, known as the BNC World Edition. BNC World was made available for world-wide distribution in 2001. In response to user feedback, the original SGML version of the corpus was later converted into XML. Additional mark-up for lemma and simplified word-class annotation was added and the treatment of multi-word units was improved. Minor errors and inconsistencies were also corrected. BNC XML Edition was released in 2007. Two sub-sets from the corpus have been published separately: the BNC Sampler and the BNCBaby.

web address:

http://www.scottishcorpus.ac.uk/cmsw/ 

more corpus addresses:

/Article/201111/2702.html

 

    ■点击→英语疑难问题·综合解答

 

上一页  [1] [2] 

引用地址:
文章录入:admin    责任编辑:admin 
  • 上一篇文章:

  • 下一篇文章:
  • 发表评论】【加入收藏】【告诉好友】【打印此文】【关闭窗口
    网友评论:(只显示最新10条。评论内容只代表网友观点,与本站立场无关!)