![]() ![]() |
英國國家語料庫(BNC)介紹 |
作者:admin 文章來源:本站原創(chuàng) 點擊數(shù) 更新時間:2011-11-16 文章錄入:admin 責(zé)任編輯:admin |
|
■How the BNC was created The BNC project was carried out and is managed by the BNC Consortium, an industrial/academic consortium led by Oxford University Press, of which the other members are major dictionary publishers Addison-Wesley Longman and Larousse Kingfisher Chambers; academic research centres at Oxford University Computing Services (OUCS), the University Centre for Computer Corpus Research on Language (UCREL) at Lancaster University, and the British Library's Research and Innovation Centre. The project was funded by the commercial partners, the Science and Engineering Council (now EPSRC) and the ■Creation process in brief The creation of the corpus started with a careful planning stage where the design principles were drawn up. These principles included the selection criteria that were used as the basis for the collection of the texts (a separate section describes the selection criteria for the written and the spoken parts of the corpus). Once a suitable texts was identified and permission to use it had been obtained, the text was converted to machine readable form. The conversion was performed by one of the commercial partners (OUP, Longman or Chambers). The resulting text was then converted to the standard project encoding format at OUCS, where its accuracy and internal consistency was also validated. The text was then passed to UCREL, where word class tagging was automatically added, and returned to OUCS for documentation and accession into the corpus. Each stage of corpus processing was recorded in a database maintained at OUCS. Work on building the corpus commenced in 1991 and was completed in 1994. The first general release of the corpus for European researchers was announced in February 1995. After the completion of the first edition of the BNC, a phase of tagging improvement was undertaken at ■web address: http://www.scottishcorpus.ac.uk/cmsw/ more corpus addresses: ■點擊→英語疑難問題·綜合解答■
|
![]() ![]() |