英汉语料库汇总
英汉语料库汇总

1.英语学习者语料库(书面语及口语)中国学习者语料库 CLEC(100万)广外、上海交大2.大学英语学习者口语语料库 COLSEC (5万) 上海交大3.香港科技大学学习者语料库 HKUST Learner Corpus 香港科技大学4.中国英语专业语料库 CEME (148万) 南京大学5.中国英语学习者口语语料库 SECCL (100万) 南京大学6.国际外语学习者英语口语语料库中国部分 LINSEI-China (10万) 华南师大7.硕士写作语料库 MWC (12万) 华中科技大学9.平行语料库汉英平行语料库 PCCE 北外10.南大-国关平行语料库南京大学11.英汉文学作品语料库;外研社12.冯友兰《中国哲学史》汉英对照语料库13.李约瑟(Joself Needham)《中国科学技术史》英汉对照语料库14.计算机专业的双语语料库;国家语言文字工作委员会语言文字应用研究所15.柏拉图(Plato)哲学名著《理想国》的双语语料库16.英汉双语语料库(15万对) 中科院软件所17.英汉双语语料库:LDC香港新闻英汉双语对齐语料36294段以及香港法律英汉双语对齐语料31万句子对中国科学院自动化研究所18.英汉双语语料库(100万),网上英汉语段电子词典及网上电子英汉搭配词典(1000万) 东北大学19.英汉双语语料库(40-50万句子对) 哈尔滨工业大学20.双语语料库(5万多对) 北京大学计算语言学研究所21.对比语料库 LIVAC(Linguistic variety in Chinese communities) 香港城市理工大学22.平衡语料库(Sinica Corpus);树图语料库(Sinica Treebank) 台湾23.特殊英语语料库中国英语(China English)语料库河南师范大学24.军事英语语料库(Corpus of Military Texts) 解放军外语学院25.新视野大学英语教材语料库上海交通大学26.汉语语料库汉语现代文学作品语料库(1979年,527万字) 武汉大学27.现代汉语语料库(1983年,2000万字) 北京航空航天大学28.中学语文教材语料库(1983年,106万8000字) 北京师范大学29.现代汉语词频统计语料库(1983年,182万字) 北京语言学院30.国家级大型汉语均衡语料库(2000万字) 国家语言文字工作委员会31.《人民日报》语料库(2700万字) 北京大学计算机语言学研究所32.大型中文语料库(5亿字,10分库) 北京语言文化大学33.现代汉语语料库(1亿字) 清华大学34.汉语新闻语料库;(1988年,250万字) 山西大学35.标准语料库(2000年,70万字)36.生语料库(3000万字);《作家文摘》的标注语料库(100万字) 上海师范大学37.现代自然口语语料库中国社会科学院语言所38.旅游咨询口语对话语料库和旅馆预定口语对话语料库中国科学院自动化所39.北京大学汉语语言学研究中心的三个语料库现代汉语语料库/yuliao.asp?item=1古代汉语语料库/yuliao.asp?item=2汉英双语语料库/yuliao.asp?item=3/printthread.php?t=2742汉语语料库使用权限国家语委语料库(http://219.238.40.213:8080/CpsQrySv.srf)”虽说是通用型平衡语料库,但不能完全免费使用;北京语言大学的汉语语料库(http://202.112.195.8)语料产出时间较早,且不能完全免费使用;北京大学汉语语言学研究中心语料库(现代汉语子库)”(/YuLiao_Contents.Asp)规模最大,逾亿字,但取样极不均衡,多半为文学作品;台湾“中央研究院”Sinica Corpus也是可免费使用的平衡汉语语料库。
常用的英语语料库

常用的英语语料库English corpora, or language corpora, are collections of text samples that are used for linguistic research and analysis. These corpora serve as valuable resources for studying language patterns, trends, and usage in various contexts. In this article, we will explore some of the commonly used English language corpora and their applications.1. British National Corpus (BNC)The British National Corpus is one of the most widely used language corpora for studying contemporary British English. It contains a diverse range of texts, including spoken conversations, written documents, and academic papers. Researchers can access the BNC to examine language usage in different genres and domains, such as science, politics, and fiction. The BNC provides valuable insights into the changes in the English language over time.2. Corpus of Contemporary American English (COCA)The Corpus of Contemporary American English is a comprehensive corpus that provides a vast collection of English texts from different genres, including spoken, written, and academic. It offers researchers the opportunity to investigate various aspects of American English, including vocabulary, syntax, and discourse patterns. The COCA is frequently used in linguistic research, language teaching, and corpus-based language analysis.3. Google Books Ngram ViewerThe Google Books Ngram Viewer is a powerful tool that allows researchers to analyze the frequency of words or phrases in the vast collection of books digitized by Google. It provides a visual representation of the usage of specific terms over time, offering insights into the historical development and popularity of certain expressions. This tool is useful for investigating language change and cultural shifts through the lens of published literature.4. Corpus Linguistics Toolkit (CLAWS)The Corpus Linguistics Toolkit, also known as CLAWS, is a suite of programs specifically designed for corpus analysis. It provides researchers with tools for processing, annotating, and analyzing text corpora. CLAWS allows for the extraction of linguistic features, such as part-of-speech tags and named entities, which can be utilized for various linguistic studies. The toolkit's versatility makes it a valuable resource for researchers in the field.5. International Corpus of English (ICE)The International Corpus of English is a collection of English language corpora from different countries and regions. It aims to capture the linguistic variations within the English language across different cultures and contexts. The ICE provides researchers with valuable data for studying dialectal differences, language contact phenomena, and sociolinguistic aspects of English.6. Oxford English Corpus (OEC)The Oxford English Corpus is a corpus of contemporary English texts that serves as a reference for the analysis of language usage and trends. Itincludes a wide range of written and spoken materials from various sources, such as books, newspapers, and online platforms. The OEC is frequently used for linguistic research, lexicography, and language teaching purposes.7. Corpus Query Language (CQL)Corpus Query Language is a specialized language used to search and retrieve specific linguistic patterns within corpora. It enables researchers to formulate complex queries and retrieve relevant linguistic data for analysis. CQL is widely used in corpus linguistics and facilitates the exploration of language patterns and structures within corpora.In conclusion, English language corpora play a vital role in linguistic research and analysis. The aforementioned corpora, including the British National Corpus, Corpus of Contemporary American English, Google Books Ngram Viewer, Corpus Linguistics Toolkit, International Corpus of English, Oxford English Corpus, and Corpus Query Language, provide valuable resources for investigating language usage, trends, and patterns in various contexts. These corpora aid in the understanding of language change, societal influences, and cultural shifts, making them invaluable tools for language researchers, educators, and language enthusiasts.。
语料库第十一章

语料库第十一章1. Caravan ['kærəvæn; kærə'væn] n.房车记忆方法:car 车+ van 箱子caravan holiday 乘旅行房车度过的假日例:We're heading for the caravan with the sun painted on it.2.navigation [ˌnævɪ'ɡeɪʃn] n. 导航;航行=direction finding 方位测定=释义:the guidance of ships or airplanes from place to place例:Mechanics discovered problems with the plane's navigation system.3.vegetarian [ˌvedʒə'teəriən] n. 素食者记住:vegetable 是蔬菜的意思Vegeta+rian an是表示人的后缀例:Not a few of my friends are vegetarian.4.campsite ['kæmpsaɪt] n. 露营地记:camp 露营+ site地点——露营地同义词:campground 野营地例:The campsite is set in the middle of a pine forest.5.mid-range adj.中等距离的记忆方法:range意思:动词排列:名词范围6.postage ['pəʊstɪdʒ] n. 邮费;邮资已付邮戳Stamp 邮票Postage stamp 邮票Postage due 欠邮费due 应付的Postage free 免邮费7.liberty ['lɪbəti] n. 自由;自由权用法:allow/grant sb liberty 给某人自由Invade the liberty of 侵犯......的自由例:We have the liberty to say what we want.8.transit ['trænzɪt] n. 运输;经过记:trans 前缀表示转换的意思transfer 翻译——转换意思transit 转换地点——运输例:Our ship used the canal to transit to the east.9.snow boarding 滑雪板Board n. 木板,布告板V. 登机,登船Board the ship /plane10.padding pool ['pædɪŋ] n. 嬉水池记:Pad 平板Padding 垫充例:Add in some padding and margin。
网上在线字典辞典大全

网上在线字典辞典大全翻译类字典辞典金山词霸在线版——国人自主开发的最权威的电子词典,词霸搜索-免费在线词典查词翻译_英汉_日汉_英语_成语中国专家翻译网——英文翻译公司--日文翻译公司--多语种翻译公司- 英语翻译--日语翻译——德语翻译——法语翻译中国译典@中国在线翻译网——线上最庞大的英汉-汉英翻译语料库百度词典搜索——百度词典搜索支持强大的英汉互译功能,中文成语的智能翻译Yahoo学生英汉字典——英语单词查询、举例Dict_CN 在线词典——在线搜索不重复汉英词条100万,英汉词条103万。
牛津英汉双解词典在线汉英双解新华字典林语堂当代汉英词典(繁)——较权威的在线汉英词典,繁体,备有汉字部首索引和汉语拼音检索功能华翼翻译-多语种在线电脑字典太阳雨英汉\汉英词典汉语输入方式:拼音、简体繁体洪恩双语词典中英文查询,提供词义、例句、词组、同义词、反义词英文类字典辞典——几十个语种和数百本词典的在线检索,最权威的在线词典门户之一,英文界面LEO English-German Dictionary——英文、德文互译在线词典,英文界面Dnelook Dictionary——英语、法语、德语、意大利语五种语言629本词典的在线检索,权威,英文Latin-English Dictionary——在线拉丁语和英语词典,历史较久,英文界面Webster's Collegiate Dictionary——著名的韦氏大词典在线版,使用方便,英文American Sign Language Dictionary——美国形体语言词典,独特的在线词典剑桥在线辞典——包括剑桥国际英语辞典、美国英语辞典、国际短语辞典及国际习语辞典,英文——查询、互译、流行词汇、站点导航,英文——英语同义词字典,英文几十个语种和数百本词典的在线检索,最权威的在线词典门户之一,英文界面LEO English-German Dictionary英文、德文互译在线词典,英文界面Onelook Dictionary 英语、法语、德语、意大利语五种语言629本词典的在线检索,权威,英文Latin-English Dictionary在线拉丁语和英语词典,历史较久,英文界面Travlang's Translating Dictionaries欧洲主要语言互译,很多链接资源,英文American Sign Language Dictionary美国形体语言词典,独特的在线词典Oxford English Dictionary 牛津英语大词典在线版,须注册才能使用,英文剑桥在线辞典包括剑桥国际英语辞典、美国英语辞典、国际短语辞典及国际习语辞典,英文英语同义词字典,英文其它语言类字典辞典德汉字典网华翼电脑字典-荷兰语、中文、英语、法语、...承隆科技Amasoft 在线、离线英汉翻译软件,英汉/汉英字典颜元叔教授主编-网路英英/英汉辞典提供英文对英文、中文和日文的翻译。
英汉社论平行语料库

英汉社论平行语料库1.引言1.1 概述概述部分:随着全球化的发展,英汉社论的重要性日益凸显。
社论作为一种新闻类文体,承载着媒体的立场和观点,并在舆论场中发挥着重要的作用。
因此,对于英汉社论的研究和理解具有重要的意义。
为了更好地研究英汉社论,建立一个英汉社论平行语料库是至关重要的。
英汉社论平行语料库是指收集和整理一定数量的英语社论与对应的中文翻译,以便进行对照和分析。
这样的平行语料库可以帮助研究人员深入了解英汉社论的语言特点、文体特征以及表达方式等。
建立英汉社论平行语料库的目的有两个方面。
首先,它可以作为翻译研究的重要资源,帮助翻译人员更好地进行英汉社论的互译。
其次,它可以为社会科学研究提供依据,例如新闻传播学、语言学和文化研究等领域的学者可以通过对英汉社论平行语料库的分析来揭示社论对于公众舆论形成的影响。
本文将从概述、文章结构和目的三个方面对英汉社论平行语料库进行全面介绍。
首先,我们将简要概述英汉社论的背景和重要性。
然后,我们将详细介绍英汉社论平行语料库的定义和意义。
接着,我们将讨论建立英汉社论平行语料库的方法和步骤,包括语料的采集、整理以及语言特征的标注。
最后,我们将展望英汉社论平行语料库的应用前景,并对整篇文章进行总结和展望。
通过对英汉社论平行语料库的研究和应用,我们可以更好地理解英汉社论的特点和规律,并且为相关领域的学术研究和实际应用提供支持和参考。
希望本文能够为英汉社论平行语料库的建设和应用提供启示,并促进跨文化交流和研究的发展。
1.2 文章结构本文将按照以下结构进行阐述和探讨英汉社论平行语料库的相关内容:1. 引言:首先,我们将概述本文的研究背景和意义,明确本文的研究目的。
通过引言部分,读者可以初步了解到本文所要探讨的问题及其重要性。
2. 正文:正文是本文的核心部分,旨在详细介绍英汉社论平行语料库的定义、意义、以及建立方法和步骤。
2.1 英汉社论平行语料库的定义和意义:首先,我们将解释什么是英汉社论平行语料库,即在英汉两种语言中,相互对应的社论文本的语料库。
免费的英语语料库汇总

免费的英语语料库汇总Here is a list of free English language corpora:1. British National Corpus (BNC): One of the most widely used corpora, it includes spoken and written texts from a range of genres and registers.2. Corpus of Contemporary American English (COCA): Contains over 520 million words of American English from a variety of sources, including fiction, non-fiction, newspapers, academic journals, and spoken language.3. Corpus of Historical American English (COHA): Covers American English from 1810 to 2024 and includes over 400 million words from a variety of genres.4. Corpus of Global Web-Based English (GloWbE): A web-based corpus that contains over 1.9 billion words from websites around the world. It includes texts from different countries and regions, allowing for the study of global variation in English.5. International Corpus of English (ICE): A collection of corpora representing different varieties of English, including British, American, Indian, Australian, and Hong Kong English.6. TIME Magazine Corpus: Contains articles from TIME Magazine published between 1923 and 2024. It is a useful resource for studying the use of language in news and current affairs.7. Open American National Corpus (OANC): A wide-ranging corpus that includes a variety of written and spoken texts from different sources, including newspapers, fiction, academic journals, and interviews.8. Santa Barbara Corpus of Spoken American English: A corpus of spoken American English that includes conversations between native speakers from different regions of the United States.9. EnTenTen Corpus: A web-based corpus that has over 20 billion words of English from a wide range of online sources. It is useful for studying contemporary English usage.10. BYU-BNC: A version of the British National Corpus that has been cleaned and lemmatized, making it easier to analyze.11. The Corpus of Contemporary American English under COCA: Similar to COCA, this corpus includes 560 million words of American English, allowing for detailed analysis of language use in various contexts.12. The Corpus of Contemporary American English under COCA-Spoken: Specifically focuses on spoken American English, with over 200 million words from conversations, interviews, and other spoken sources.13. The Hansard Corpus: Contains transcripts of parliamentary debates in the United Kingdom from 1803 to thepresent day. It is a valuable resource for studying political discourse and language change.14. TIMIT Corpus: A widely used speech database that contains recordings of speech from speakers of eight major American English dialects.15. The New York Times Annotated Corpus: An extensive collection of articles from The New York Times, allowing for analysis of language use in journalistic writing.。
ig英语口语语料库大全

ig英语口语语料库大全1、You should take the medicine after you read the _______. [单选题] *A. linesB. wordsC. instructions(正确答案)D. suggestions2、1.________my father ________ my mother is able to drive a car. So they are going to buy one. [单选题] *A.Neither; norB.Both; andC.Either; orD.Not only; but also(正确答案)3、5.Shanghais is known ________ “the Oriental Pearl”, so many foreigners come to visit Shanghai very year. [单选题] *A.forB.as (正确答案)C.withD.about4、I'm sorry I cannot see you immediately. But if you wait, I'll see you_____. [单选题] *A. for a momentB. in a moment(正确答案)C. for the momentD. at the moment5、The sun disappeared behind the clouds. [单选题] *A. 出现B. 悬挂C. 盛开D. 消失(正确答案)6、How _______ it rained yesterday! We had to cancel(取消) our football match. [单选题] *A. heavily(正确答案)B. lightC. lightlyD. heavy7、Growing vegetables()constantly watering. [单选题] *A. neededB. are neededC. were neededD. needs(正确答案)8、33.Will Mary's mother ______ this afternoon? [单选题] * A.goes to see a filmB.go to the filmC.see a film(正确答案)D.goes to the film9、He has bought an unusual car. [单选题] *A. 平常的B. 异常的(正确答案)C. 漂亮的D. 废弃的10、We need a _______ when we travel around a new place. [单选题] *A. guide(正确答案)B. touristC. painterD. teacher11、She passed me in the street, but took no()of me. [单选题] * Attention (正确答案)B. watchC. careD. notice12、I think _______ is nothing wrong with my car. [单选题] *A. thatB. hereC. there(正确答案)D. where13、If you don’t feel well, you’d better ask a ______ for help. [单选题] *A. policemanB. driverC. pilotD. doctor(正确答案)14、I have worked all day. I'm so tired that I need _____ . [单选题] *A. a night restB. rest of nightC. a night's rest(正确答案)D. a rest of night15、22.______ is convenient to travel between Pudong and Puxi now. [单选题] *A.It(正确答案)B.ThisC.ThatD.What16、I _______ play the game well. [单选题] *A. mustB. can(正确答案)C. wouldD. will17、—What can I do to help at the old people’s home?—You ______ read stories to the old people. ()[单选题] *A. could(正确答案)B. mustC. shouldD. would18、I _______ seeing you soon. [单选题] *A. look afterB. look forC. look atD. look forward to(正确答案)19、45.—Let's make a cake ________ our mother ________ Mother's Day.—Good idea. [单选题] *A.with; forB.for; on(正确答案)C.to; onD.for; in20、We _______ swim every day in summer when we were young. [单选题] *A. use toB. are used toC. were used toD. used to(正确答案)21、______ the morning of September 8th, many visitors arrived at the train station for a tour.()[单选题] *A. FromB. ToC. InD. On(正确答案)22、23.Hurry up! The train ________ in two minutes. [单选题] *A.will go(正确答案)B.goC.goesD.went23、You have failed two tests. You’d better start working harder, ____ you won’t pass the course. [单选题] *A. andB. soC. butD. or(正确答案)24、Will you see to()that the flowers are well protected during the rainy season? [单选题] *A. it(正确答案)B. meC. oneD. yourself25、—______ is the concert ticket?—It’s only 160 yuan.()[单选题] *A. How manyB How much(正确答案)C. How oftenD. How long26、When Max rushed to the classroom, his classmates _____ exercises attentively. [单选题] *A. didB. have doneC. were doing(正确答案)D. do27、Mrs. Green has given us some _______ on how to study English well. [单选题] *A. practiceB. newsC. messagesD. suggestions(正确答案)28、Many people prefer the bowls made of steel to the _____ made of plastic. [单选题] *A. itB. ones(正确答案)C. oneD. them29、—I can’t always get good grades. What should I do?—The more ______ you are under, the worse grades you may get. So take it easy!()[单选题] *A. wasteB. interestC. stress(正确答案)D. fairness30、Sorry, I can't accept your invitation. [单选题] *A. 礼物B. 观点C. 邀请(正确答案)D. 好意。
汉英双语语料库

汉英双语语料库1. 这个城市的夜景很美,特别是那些高楼大厦的灯光,让整个城市感觉都变得更加璀璨。
The night view of this city is beautiful, especially the lights of the tall buildings, which make the whole city feel even more dazzling.2. 这个博物馆收藏了许多有价值的文物和艺术品,是了解本地历史和文化的最佳场所。
This museum has collected many valuable relics and artworks, and is the best place to learn about local history and culture.3. 我们一家人喜欢去海边度假,享受阳光、沙滩和大海的美丽。
Our family likes to go to the beach for vacation, and enjoy the beauty of the sun, sand, and sea.4. 我们在学校的食堂里可以选择各种各样的美食,包括中餐、西餐和快餐。
We can choose a variety of cuisine in the school cafeteria, including Chinese, Western, and fast food.5. 这个地区的气候十分宜人,四季分明,春暖花开、夏日清凉、秋高气爽、冬日雪景,都让人感受到大自然的美好。
The climate of this region is very pleasant, with distinct seasons, spring blooms, cool summers, refreshing autumns, and snowy winters, all making people feel the beauty of nature.6. 现在越来越多的人喜欢做运动来保持身体健康,例如跑步、游泳、瑜伽等等。
- 1、下载文档前请自行甄别文档内容的完整性,平台不提供额外的编辑、内容补充、找答案等附加服务。
- 2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
- 3、如文档侵犯您的权益,请联系客服反馈,我们会尽快为您处理(人工客服工作时间:9:00-18:30)。
1.英语学习者语料库(书面语及口语)中国学习者语料库 CLEC(100万)广外、上海交大
2.大学英语学习者口语语料库 COLSEC (5万) 上海交大
3.香港科技大学学习者语料库 HKUST Learner Corpus 香港科技大学
4.中国英语专业语料库 CEME (148万) 南京大学
5.中国英语学习者口语语料库 SECCL (100万) 南京大学
6.国际外语学习者英语口语语料库中国部分 LINSEI-China (10万) 华南师大
7.硕士写作语料库 MWC (12万) 华中科技大学
9.平行语料库汉英平行语料库 PCCE 北外
10.南大-国关平行语料库南京大学
11.英汉文学作品语料库;外研社
12.冯友兰《中国哲学史》汉英对照语料库
13.李约瑟(Joself Needham)《中国科学技术史》英汉对照语料库
14.计算机专业的双语语料库;国家语言文字工作委员会语言文字应用研究所
15.柏拉图(Plato)哲学名著《理想国》的双语语料库
16.英汉双语语料库(15万对) 中科院软件所
17.英汉双语语料库:LDC香港新闻英汉双语对齐语料36294段以及香港法律英汉双语对齐语料31万句子对中国科学院自动化研究所
18.英汉双语语料库(100万),网上英汉语段电子词典及网上电子英汉搭配词典(1000万) 东北大学
19.英汉双语语料库(40-50万句子对) 哈尔滨工业大学
20.双语语料库(5万多对) 北京大学计算语言学研究所
21.对比语料库 LIVAC(Linguistic variety in Chinese communities) 香港城市理工大学
22.平衡语料库(Sinica Corpus);树图语料库(Sinica Treebank) 台湾
23.特殊英语语料库中国英语(China English)语料库河南师范大学
24.军事英语语料库(Corpus of Military Texts) 解放军外语学院
25.新视野大学英语教材语料库上海交通大学
26.汉语语料库汉语现代文学作品语料库(1979年,527万字) 武汉大学
27.现代汉语语料库(1983年,2000万字) 北京航空航天大学
28.中学语文教材语料库(1983年,106万8000字) 北京师范大学
29.现代汉语词频统计语料库(1983年,182万字) 北京语言学院
30.国家级大型汉语均衡语料库(2000万字) 国家语言文字工作委员会
31.《人民日报》语料库(2700万字) 北京大学计算机语言学研究所
32.大型中文语料库(5亿字,10分库) 北京语言文化大学
33.现代汉语语料库(1亿字) 清华大学
34.汉语新闻语料库;(1988年,250万字) 山西大学
35.标准语料库(2000年,70万字)
36.生语料库(3000万字);《作家文摘》的标注语料库(100万字) 上海师范大学
37.现代自然口语语料库中国社会科学院语言所
38.旅游咨询口语对话语料库和旅馆预定口语对话语料库中国科学院自动化所
39.北京大学汉语语言学研究中心的三个语料库
现代汉语语料库
/yuliao.asp?item=1
古代汉语语料库
/yuliao.asp?item=2
汉英双语语料库
/yuliao.asp?item=3
/printthread.php?t=2742
汉语语料库使用权限
国家语委语料库(http://219.238.40.213:8080/CpsQrySv.srf)”虽说是通用型平衡语料库,但不能完全免费使用;
北京语言大学的汉语语料库(http://202.112.195.8)语料产出时间较早,且不能完全免费使用;
北京大学汉语语言学研究中心语料库(现代汉语子库)”(/YuLiao_Contents.Asp)规模最大,逾亿字,但取样极不均衡,多半为文学作品;
台湾“中央研究院”Sinica Corpus也是可免费使用的平衡汉语语料库。
但是它只能代表台湾地区的汉语,无法反映中国大陆的汉语状况。
详情可访问Sinica Corpus官方网站.tw/ftms-bin/kiwi.sh。
PH语料库包含的是1990年1月至1991年3月新华社出版的新闻。
该语料库规模为3,260,416字。
通过ftp:///pub/chinese/可获得该语料库。