Freely Available English Corpus Resources

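To date, most learner corpora built internationally serve research on second language acquisition, chiefly the acquisition of English. Besides the two English learner corpora ICLE and NICE, other written English learner corpora that have been completed include the Hungarian learner English corpus (JPU), the Polish learner English corpus (PELCRA), the Swedish learner English corpus (USE), the Japanese learner English corpus (JEFLL), and the American learner English corpus (MELD). Two spoken learner corpora have been completed outside China: the international spoken English interlanguage corpus LINDSEI (Louvain International Database of Spoken English Interlanguage) and the Japanese standardized English oral test corpus (SSIC).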
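Development of learner corpora
The earliest learner corpus is the Longman Learners' Corpus, built by the Longman publishing group in the late 1980s, with about 10 million words. The Cambridge Learner Corpus, built by Cambridge University Press, contains 15 million words. The learner corpus most widely recognized internationally as the most important is ICLE (International Corpus of Learner English). It was launched in 1990 under Professor Sylviane Granger of the University of Louvain (Louvain-la-Neuve), Belgium. It contains more than 2 million words of written English produced by learners from different mother-tongue backgrounds, divided by first language into 14 subcorpora, and further subcorpora are still being added. To support contrastive studies, it is accompanied by a 300,000-word corpus of argumentative essays written by secondary school and university students who are native speakers of English (Granger, 1998; 2002).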
NICE-NNS
1) English study history
2) Language other than English
3) Length of studying other language
4) Qualifications: TOEIC, TOEFL, STEP
5) Experience going abroad
6) Daily amount of English reading, writing, listening, speaking
7) Essay writing (in Japanese or English) proficiency self-estimation
Japanese essay

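3 Corpus design
Three aspects of a corpus. A. The language data itself

Attribute           Values
Size                million words | tens of millions of words | hundreds of millions of words | …
Domain              politics | economics | sports | psychology | …
Genre               literature | practical writing | news | …
Period              synchronic | diachronic
Mode                written | spoken
Languages           monolingual | bilingual | multilingual (bilingual parallel corpus | bilingual comparable corpus)
Linguistic level    phonetics (syllable, prosody) | grammar (word, sentence, …)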
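Second-generation corpora
Shared characteristics: tens of millions of words; built for dictionary compilation; application-oriented.
COBUILD corpus: built in the 1980s by the University of Birmingham in cooperation with the publisher Collins, reaching 20 million tokens. The Collins Cobuild dictionary (1987), which was based on this corpus, was widely praised.
Longman corpus: built in the 1980s and made up of three corpora: LLELC (the Longman/Lancaster English corpus), LSC (the Longman spoken corpus), and LCLE (the Longman corpus of learner English). The aim was to compile English learner's dictionaries for foreign learners of English; the stated size is 50 million tokens.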
Selected markers in the London-Lund Corpus of Spoken English

Marker    Meaning
#         end of tone group
^         onset
/         rising nuclear tone
\         falling nuclear tone
^         rise-fall nuclear tone
Retrieval tools | human-computer interface | data interfaces | …
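Selecting texts for a corpus
Quality: prefer high-quality texts
Influence: prefer influential texts
Random selection
High circulation
Typicality
Ease of acquisition
Statistical validity: the texts should be meaningful as a statistical sample
Conformity to language norms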
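The random-selection and statistical-sample principles above can be combined in practice by sampling at random within each category of text. The following is only a minimal sketch under stated assumptions: the candidate pool, the genre labels, and the function name `stratified_sample` are all hypothetical and do not come from any corpus described here.

```python
import random
from collections import defaultdict


def stratified_sample(texts, per_stratum, seed=0):
    """Pick up to `per_stratum` text ids at random from each genre.

    `texts` is an iterable of (text_id, genre) pairs. A fixed seed keeps
    the selection reproducible.
    """
    rng = random.Random(seed)
    by_genre = defaultdict(list)
    for text_id, genre in texts:
        by_genre[genre].append(text_id)

    sample = {}
    for genre, ids in sorted(by_genre.items()):
        k = min(per_stratum, len(ids))  # a small stratum contributes all it has
        sample[genre] = sorted(rng.sample(ids, k))
    return sample


# Hypothetical candidate pool: (text id, genre).
candidates = [
    ("n01", "news"), ("n02", "news"), ("n03", "news"),
    ("f01", "fiction"), ("f02", "fiction"),
    ("a01", "academic"),
]

selection = stratified_sample(candidates, per_stratum=2)
for genre, ids in selection.items():
    print(genre, ids)
```

Sampling inside each genre rather than over the whole pool keeps a small category such as "academic" from being left out by chance, which is one simple way to respect the typicality and statistical-sample principles at the same time.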
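Summary of English and Chinese corpora

English learner corpora (written and spoken)
1. Chinese Learner English Corpus, CLEC (1 million words): Guangdong University of Foreign Studies and Shanghai Jiao Tong University
2. College learners' spoken English corpus, COLSEC (50,000 words): Shanghai Jiao Tong University
3. HKUST Learner Corpus: Hong Kong University of Science and Technology
4. Corpus of Chinese English majors, CEME (1.48 million words): Nanjing University
5. Spoken English corpus of Chinese learners, SECCL (1 million words): Nanjing University
6. Chinese component of the international spoken learner English corpus, LINDSEI-China (100,000 words): South China Normal University
7. Master's writing corpus, MWC (120,000 words): Huazhong University of Science and Technology

Parallel corpora
9. Chinese-English parallel corpus, PCCE: Beijing Foreign Studies University
10. Nanjing University / University of International Relations parallel corpus: Nanjing University
11. Corpus of English and Chinese literary works: Foreign Language Teaching and Research Press
12. Chinese-English aligned corpus of Feng Youlan's "A History of Chinese Philosophy"
13. English-Chinese aligned corpus of Joseph Needham's "Science and Civilisation in China"
14. Bilingual corpus for computer science: Institute of Applied Linguistics, State Language Commission
15. Bilingual corpus of Plato's "Republic"
16. English-Chinese bilingual corpus (150,000 pairs): Institute of Software, Chinese Academy of Sciences
17. English-Chinese bilingual corpus: LDC Hong Kong news aligned corpus (36,294 segments) and Hong Kong laws aligned corpus (310,000 sentence pairs): Institute of Automation, Chinese Academy of Sciences
18. English-Chinese bilingual corpus (1 million), online English-Chinese passage dictionary and online English-Chinese collocation dictionary (10 million): Northeastern University
19. English-Chinese bilingual corpus (400,000 to 500,000 sentence pairs): Harbin Institute of Technology
20. Bilingual corpus (more than 50,000 pairs): Institute of Computational Linguistics, Peking University

Comparable corpora
21. LIVAC (Linguistic Variety in Chinese Communities): City University of Hong Kong

Balanced corpora
22. Sinica Corpus; Sinica Treebank: Taiwan

Specialized English corpora
23. China English corpus: Henan Normal University
24. Corpus of Military Texts: PLA Foreign Languages Institute
25. New Horizon College English textbook corpus: Shanghai Jiao Tong University

Chinese corpora
26. Corpus of modern Chinese literary works (1979, 5.27 million characters): Wuhan University
27. Modern Chinese corpus (1983, 20 million characters): Beihang University
28. Secondary school Chinese textbook corpus (1983, 1,068,000 characters): Beijing Normal University
29. Modern Chinese word-frequency corpus (1983, 1.82 million characters): Beijing Language Institute
30. National large-scale balanced Chinese corpus (20 million characters): State Language Commission
31. People's Daily corpus (27 million characters): Institute of Computational Linguistics, Peking University
32. Large-scale Chinese corpus (500 million characters, 10 subcorpora): Beijing Language and Culture University
33. Modern Chinese corpus (100 million characters): Tsinghua University
34. Chinese news corpus (1988, 2.5 million characters): Shanxi University
35. Standard corpus (2000, 700,000 characters)
36. Raw corpus (30 million characters); annotated corpus of the newspaper 《作家文摘》 (Writers' Digest) (1 million characters): Shanghai Normal University
37. Corpus of contemporary natural spoken Chinese: Institute of Linguistics, Chinese Academy of Social Sciences
38. Tourism-enquiry spoken dialogue corpus and hotel-reservation spoken dialogue corpus: Institute of Automation, Chinese Academy of Sciences
39. Three corpora of the Center for Chinese Linguistics, Peking University:
    Modern Chinese corpus /yuliao.asp?item=1
    Classical Chinese corpus /yuliao.asp?item=2
    Chinese-English bilingual corpus /yuliao.asp?item=3

Access to the Chinese corpora
The State Language Commission corpus (http://219.238.40.213:8080/CpsQrySv.srf) is a general-purpose balanced corpus, but it cannot be used entirely free of charge.
The Chinese corpus of Beijing Language and Culture University (http://202.112.195.8) contains relatively old material and also cannot be used entirely free of charge.
The corpus of the Center for Chinese Linguistics, Peking University (modern Chinese subcorpus) (/YuLiao_Contents.Asp) is the largest, at more than 100 million characters, but its sampling is very unbalanced and most of it is literary works.
The Sinica Corpus of Academia Sinica in Taiwan is also a balanced Chinese corpus that can be used free of charge.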
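An English Speech Corpus as a Powerful Tool for Phonetic Research and Teaching: Introducing the IViE Corpus

Zhu Jiali, Hao Wenting

Abstract: Speech corpora, an indispensable technical means and tool in phonetic research and speech engineering, have developed vigorously in China and abroad. The IViE corpus (Intonational Variation in English) is a large speech corpus developed jointly by the University of Oxford and the University of Cambridge from 1997 to 2002 … native-speaker English speech data … English learners … annotation system … applications

I. Introduction

Chinese learners of English form the largest group of English learners in the world, and the group is internally very diverse, for example in the way the accents of different dialect regions affect spoken English …

In the past, when resources were scarce and technical conditions were relatively limited, phonetic …

… the Chinese speech corpus released by iFLYTEK, a company of the University of Science and Technology of China, and the "863 speech corpus" led by the Institute of Linguistics, Chinese Academy of Social Sciences. In order to reflect and record actual language use, to see how language systems develop, to compare systematic differences between language systems, and above all to examine how foreign language learners acquire a language and which teaching strategies follow from that, researchers have begun to pay attention to English learners …

… London, Cambridge, Cardiff, Liverpool, Bradford, Leeds, Newcastle, Belfast in Northern Ireland, and Dublin in Ireland. The selected locations are fairly widely dispersed: they include not only the so-called southern standard varieties of English (Cambridge, London) but also widely used "modern" or "mainstream" varieties of English (Belfast, Northern Ireland, Dublin). …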
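The Cambridge and Nottingham Business English Corpus

The Cambridge Business English Corpus is a business English corpus built jointly by Cambridge University Press and the University of Cambridge Language Centre. It collects business English texts from many fields, including business reports, minutes of business meetings, business contracts, and business letters. The texts cover business English in a range of professional areas such as marketing, international trade, finance, and human resources. Besides the raw texts, the corpus includes linguistic annotation such as part-of-speech tagging, syntactic parsing, and semantic role labelling. This annotation supports research on the lexical, syntactic, and pragmatic features of business English.

The Nottingham Business English Corpus is a business English corpus built by the business English research centre of the University of Nottingham in the United Kingdom. It also collects many kinds of business English texts and is used mainly to study language use and discourse structure in business English. Its distinguishing feature is purpose-built corpus software that lets users search and analyse the texts according to defined criteria, so researchers can use the corpus in whatever way suits their research purpose.

Both corpora are valuable resources for business English research. By analysing the texts in them, researchers can see how business English is actually used and so improve the teaching and learning of business English.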
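The Corpus of Contemporary American English

The Corpus of Contemporary American English is a database containing a large amount of English text, used to study and analyse language use in contemporary American English. It includes many types of text, such as news reports, fiction, academic papers, advertisements, and social media posts. These texts can be used to study language change, usage habits, and patterns of language use.

The corpus was built to help linguists, translators, teachers, and other language professionals understand and use English better. It can be used in developing language learning software, natural language processing systems, and machine translation systems. It can also be used to study social and cultural questions, such as how language use varies with gender, race, and class.

Building the corpus required a great deal of time and resources. Corpus construction usually involves several steps, including collecting texts, cleaning them, and annotating them, and maintaining and updating a corpus demands continuing effort and investment. The Corpus of Contemporary American English is therefore a very valuable resource for both research and teaching.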
Top 20,000 Words of the Contemporary English Corpus
Ten sample essays in total, for readers' reference.

Essay 1

Hello friends! Today I want to talk about the top 20,000 words in the contemporary English language. Whoa, that's a lot of words, right? But don't worry, I'm here to break it down for you in a fun and easy way.

So, what exactly are these top 20,000 words? Well, they are the most commonly used words in English. That means you're likely to hear or see these words a lot in books, TV shows, movies, and everyday conversations. Pretty cool, right?

Now, let's talk about some of these top words. We have simple words like "the", "and", "is", "you", "I", "he", "she", and "it". Several of these ("you", "I", "he", "she", and "it") are called pronouns, and they help us talk about people or things without using their names all the time. So, instead of saying "The dog is cute", we can say "It is cute".

Next, we have words like "to", "of", "in", "for", "on", and "with". These are called prepositions and they show how things are related to each other. For example, "I'm going to the park", "The cat is sleeping on the bed", or "She's playing with her toys".

Prepositions are like little connectors that help make our sentences more interesting.

Now, let's talk about some fun words like "happy", "sad", "funny", "cool", "smart", and "kind". These are called adjectives and they describe how things look, feel, or sound. For example, "She's a kind person", "He's a smart student", or "The movie was funny". Adjectives add color and emotion to our sentences.

And of course, we can't forget about action words like "run", "jump", "dance", "sing", "play", and "work". These are called verbs and they show what someone or something is doing. For example, "The dog is running fast", "She loves to sing", or "He's working hard". Verbs keep the action going in our sentences.

There are also words like "house", "car", "book", "school", "friend", and "family". These are called nouns and they name people, places, or things. For example, "I live in a house", "She drives a car", or "He's reading a book".
Nouns help us talk about the world around us.

So, there you have it! The top 20,000 words in the contemporary English language. Pretty neat, huh? By knowing and using these words, you'll be able to express yourself better and communicate with others more effectively. Keep learning and exploring the wonderful world of words! Stay curious and have fun with the English language. Bye for now!

Essay 2

Once upon a time, there was a super cool list called the Top 20000 Contemporary English Corpus Words. This list had all the words that people use every day in English. It was like a treasure trove of words waiting to be explored and used in all sorts of ways.

One day, a group of friends decided to check out this list and see what all the fuss was about. They gathered around their computer and started scrolling through the words. There were words like "fun", "happy", "exciting", "play", "jump", and many more. They were so amazed at all the words they found on the list.

The friends decided to challenge each other to use the words from the list in different sentences. They had to come up with creative ways to incorporate the words into their everyday conversations. It was like a fun game that tested their vocabulary skills and creativity.

One of the friends said, "I feel so happy when I play with my friends at the park. We jump and run around, having so much fun together." Another friend added, "I love to read exciting books that take me on adventures to faraway places. It's like escaping reality for a little while."

The friends continued to explore the list of words and found even more interesting ones like "surprise", "magic", "dream", "laugh", and "shine". They were fascinated by how each word had its own unique meaning and could be used in different contexts.

As they used the words in their conversations, they noticed how their language skills improved.
They were able to express themselves more clearly and creatively, making their conversations more engaging and fun.

The friends realized that the Top 20000 Contemporary English Corpus Words were like a magic wand that helped them unlock a whole new world of possibilities in their communication. They were grateful for the list and promised to continue exploring and learning new words every day.

And so, the friends continued their journey of discovery and growth, armed with the power of words from the Top 20000 list. They knew that with each word they learned and used, they were becoming better communicators and storytellers.

And they lived happily ever after, speaking confidently and eloquently with the help of their trusty list of words. The end.

Essay 3

Hey guys! Today I'm gonna tell you about the top 20000 most common English words! I know it sounds like a lot, but don't worry, I'll break it down for you.

So, these words are the ones that people use the most in everyday conversations. They are super important because they help us communicate with each other. Whether we're talking to our friends, our teachers, or even just reading a book, these words come in handy.

Some of the top 20000 words are things we use all the time, like "hello," "goodbye," "please," and "thank you." These words make our conversations polite and friendly. Other words are things we see every day, like "house," "car," "dog," and "cat." They help us describe the world around us.

There are also words that help us talk about our feelings, like "happy," "sad," "angry," and "excited." And let's not forget about the words that help us ask questions like "who," "what," "when," and "where."

Learning these words can help us have better conversations and understand what other people are saying. So next time you hear a word you don't know, don't be afraid to ask what it means.
You might just add a new word to your list of top 20000 English words!

Keep practicing and soon enough you'll be a pro at using these common words in your everyday conversations. Have fun learning and happy chatting, everyone!

Essay 4

Hey guys! Today I want to talk about the top 20000 contemporary English language corpus words. Sounds fancy, right? But don't worry, I'll explain everything in a fun way that even a little kid can understand!

So, what exactly are these top 20000 words? Well, they are basically just a list of common words that are frequently used in the English language. These words are important because they form the foundation of our communication. Without them, we wouldn't be able to express ourselves properly or understand what others are saying.

Some of these words are super basic, like "apple" or "dog". You probably use these words every day without even thinking about it. But there are also some more complicated words on the list, like "synonym" or "metaphor". These words help us to express more complex ideas and emotions.

One cool thing about the top 20000 words is that they are constantly changing and evolving. New words are being added all the time, especially with the rise of technology and social media. Words like "selfie" and "emoji" are now part of our everyday vocabulary, even though they didn't exist a few years ago.

Learning these top 20000 words is important because it can help us to become better communicators. The more words we know, the more accurately we can express ourselves and understand others. Plus, it's just fun to learn new words and expand your vocabulary!

So, next time you come across a word that you don't know, don't be afraid to look it up and add it to your own personal list of top 20000 words. Who knows, maybe you'll even impress your friends and teachers with your newfound vocabulary!

That's all for now, guys. I hope you enjoyed learning about the top 20000 words with me. Keep on reading, talking, and learning new words every day.
See you next time!

Essay 5

Hey guys! Today I want to talk to you about the top 20000 words in contemporary English language! It may sound like a lot, but don't worry, we can learn some of these together!

First, let's talk about some common words we use every day. Words like "hello", "goodbye", "please" and "thank you" are important for being polite and friendly.

Next, we have words that describe things, like "dog", "cat", "ball" and "book". These words help us talk about the world around us.

Then, there are words that describe actions, like "run", "jump", "eat" and "sing". These words are fun to use when we're playing games or telling stories.

We also use words to talk about feelings, like "happy", "sad", "angry" and "excited". It's important to share our feelings with others so they know how we're doing.

In addition, we have words that help us connect ideas, like "and", "but", "because" and "so". These words help us tell stories and explain things to others.

There are also words that give more details, like "big", "small", "fast" and "slow". These words help us paint a picture in our minds.

Finally, there are words that show ownership, like "my", "your", "his" and "her". These words help us talk about who something belongs to.

Learning these words will help us communicate better with others, both in person and in writing. So let's keep practicing and using these words every day!

I hope you learned something new today about the top 20000 words in contemporary English language. Keep practicing and soon you'll be a pro at using them all! Thanks for listening!

Essay 6

Title: My Favorite Top 20000 Words from the Contemporary English Corpus

Hey there! Today I want to share with you some really cool words that are part of the top 20000 words from the Contemporary English Corpus. These words are super important and used a lot in everyday English, so I think it's really awesome to know them!

Let's start with the word "love".
Love is such a beautiful word that means caring about someone deeply and wanting to make them happy. It's all about spreading kindness and positivity in the world. I love to use the word "love" because it makes me feel warm and fuzzy inside.

Next up is the word "happy". Being happy is one of the best feelings in the world. It's when you feel joyful, content, and at peace with yourself and others. I love to see people smiling and laughing because it shows that they are happy.

Another word I really like is "friend". Friends are the best! They are the people who support you, make you laugh, and always have your back. Having good friends is so important because they make our lives so much better.

One word that is really cool is "awesome". When something is awesome, it means it's really amazing and impressive. Like when you see a beautiful sunset or win a game, you can say, "That was awesome!"

A word that is often used in school is "learn". Learning is so important because it helps us grow and understand the world better. Whether we're learning math, science, or even a new hobby, it's always cool to learn new things.

One word that is used a lot in stories and movies is "adventure". Adventures are exciting journeys or experiences that take us to new places and give us unforgettable memories. I love going on adventures with my friends and family.

The word "kind" is another word that I really like. Being kind means being friendly, generous, and considerate towards others. It's so important to be kind to everyone we meet because it makes the world a better place.

One word that is really fun to say is "banana". Bananas are delicious fruits that are full of vitamins and energy. They are my favorite snack to eat during the day because they are tasty and good for me.

The word "dream" is a word that is full of possibilities. Dreams are our hopes, goals, and desires for the future.
It's important to have dreams because they inspire us to work hard and achieve great things.

Another word that is really important is "family". Family is a group of people who love and care for each other. They are always there for us through good times and bad. I love spending time with my family because they make me feel safe and loved.

Lastly, the word "create" is a word that I think is really cool. To create means to make something new or original. Whether it's drawing, writing, or building, creating things is so much fun and brings out our creative side.

I hope you enjoyed learning about some of my favorite words from the top 20000 words in the Contemporary English Corpus. Remember, words are powerful and can help us express our thoughts and feelings in so many different ways. Keep exploring new words and using them in your everyday life! Keep learning and growing every day. Bye for now!

Essay 7

Hey guys, have you ever heard of the top 20000 words in contemporary English language? It's pretty cool and there are some really interesting words in there! Let me tell you all about it.

So, these top 20000 words are like the most commonly used words in English. That means we use them a lot when we talk or write. It's important to know these words because they help us to communicate better and understand things more easily.

Some of the words in the top 20000 list are really simple, like "hello" and "goodbye". We use these words all the time when we greet someone or say farewell. Then there are words like "happy" and "sad" which describe our feelings. It's fun to learn how to express our emotions with words!

There are also words that describe things around us, like "tree" and "car". These words help us to talk about the world we see every day. And don't forget about words that help us to ask questions, like "who" and "what". Asking questions is a great way to learn more about something or someone.

Now, let's talk about some cool words that are in the top 20000 list.
Have you ever heard of the word "penguin"? It's a bird that can't fly but it swims really well. Penguins are so cute and funny to watch! Another interesting word is "butterfly". It's a beautiful insect that flies around with colorful wings. Have you ever seen a butterfly up close? They are so pretty!

There are also some words that are a little harder to understand, like "prestigious" and "metamorphosis". These words sound fancy but they are actually pretty cool once you know what they mean. "Prestigious" means something is very respected or admired, while "metamorphosis" means a big change or transformation. It's fun to learn new words and their meanings!

Learning new words from the top 20000 list can be really exciting. It helps us to expand our vocabulary and become better at expressing ourselves. So, let's keep exploring the world of words and have fun with language! Have a great day, everyone!

Essay 8

Hey everyone! Today I'm going to talk about the top 20000 words in the contemporary English language! It's super duper cool and interesting to learn about all these words that people use every day.

So the first word on the list is "the". It's a really common word that we use all the time. It's like when we say "the dog" or "the cat". It helps us know which thing we're talking about.

Next up is "I". We use this word to talk about ourselves. Like when we say "I want to play" or "I like ice cream".

Oh, and don't forget about "you"! It's a word we use to talk to someone else. Like when we say "Can you pass me the crayons?" or "You are my friend".

There are so many words on this list that we use every day. Like "love", "happy", "play", "go", "eat" and so many more. It's so cool to see all the words that make up the English language.

I hope you enjoyed learning about the top 20000 words in the contemporary English language. It's pretty neat to see all the words that we use every day without even thinking about it. Keep on learning and exploring new words!
Bye!

Essay 9

Today I'm gonna tell you all about the top 20000 words in the contemporary English language! Cool, right? So buckle up and get ready for a fun ride through the wonderful world of words!

First off, what exactly is a "word"? Well, a word is a unit of language that carries meaning. It's like a building block in the vast structure of a language. English has thousands and thousands of words, but some are used more frequently than others. These are the top 20000 words in the language.

So, why are these words important? Well, they are the ones that are most commonly used in everyday conversation, writing, and reading. By knowing these words, you can better understand and communicate with others in English.

Let's start with some basic words that you probably use all the time without even realizing it. Words like "the," "and," "I," "you," and "he" are among the most frequently used words in English. These are the building blocks of sentences and are essential for communication.

Next, we have words that help us describe things. Adjectives like "big," "small," "hot," and "cold" allow us to paint a picture with our words. We can also use verbs like "run," "jump," "eat," and "sleep" to talk about actions. These words help us bring our stories and experiences to life.

Then, there are words that connect our ideas and thoughts. Words like "because," "although," "and," and "but" help us link different parts of a sentence or paragraph. These words are crucial for creating a coherent and logical flow in our writing and speaking.

Of course, we can't forget about words that express emotions and feelings. Words like "happy," "sad," "angry," and "surprised" allow us to convey how we're feeling to others.
These words help us connect with others on a deeper level and build meaningful relationships.

As we continue to explore the top 20000 words in English, we come across a wide range of vocabulary, from common words like "house," "car," and "dog" to more advanced words like "phenomenon," "equilibrium," and "perspective." Each word has its own unique meaning and role in the language.

By expanding your vocabulary and learning new words, you can become a more effective communicator and express yourself more clearly and accurately. So, don't be afraid to dive into the world of words and discover the endless possibilities that language has to offer.

In conclusion, the top 20000 words in the contemporary English language are like a treasure trove waiting to be explored. By mastering these words, you can unlock the full potential of the language and open up new opportunities for learning and growth. So, keep on expanding your vocabulary and never stop exploring the wonderful world of words!

Essay 10

Hey guys, today I'm gonna tell you all about the top 20,000 words in the contemporary English language! That's a lot of words, so let's dive right in and see what cool words we can learn.

First up, we have words like "the," "and," "is," and "it." These are called the most common words in English because we use them all the time. Then there are words like "you," "that," "he," and "she." These are pronouns that we use to talk about people and things.

Next, we have words like "love," "happy," "sad," and "angry." These are all emotions that we feel and express through our words. Words like "friend," "family," "home," and "school" are all about the people and places that are important to us.

Now, let's talk about some fun words like "banana," "candy," "pizza," and "ice cream." These are all yummy foods that we love to eat! Words like "dog," "cat," "bird," and "fish" are all about animals that we see and love.

Moving on, we have words like "run," "jump," "play," and "dance."
These are all actions that we do to have fun and stay active. Words like "big," "small," "tall," and "short" describe the size and shape of things.

Let's not forget about words like "beautiful," "ugly," "smart," and "silly." These are all adjectives that describe how things look and behave. Words like "fast," "slow," "loud," and "quiet" describe the speed and volume of things.

And finally, we have words like "hello," "goodbye," "please," and "thank you." These are all polite words that we use to be kind and respectful to others. So remember, using words is a super fun way to express ourselves and communicate with others.

So there you have it, guys! The top 20,000 words in the contemporary English language. I hope you learned some new words and had fun exploring the wonderful world of vocabulary. Keep on learning and growing your language skills, and who knows, maybe one day you'll be a vocabulary expert too! Thanks for listening, bye-bye!
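Distinguishing English Synonyms with the COCA Corpus: The Case of Compulsory and Mandatory
校园英语 (English on Campus) / Language and Culture Studies, p. 217
Guo Qiyuan and Jin Kai, School of Foreign Languages, Chengdu University of Technology

Abstract: Taking compulsory and mandatory as an example, this paper uses a corpus-based approach and the Corpus of Contemporary American English (COCA) to distinguish English synonyms in terms of their frequency distribution across registers, their collocational features, and their syntactic structures, combining qualitative and quantitative methods. The study offers an effective method and perspective for English teaching and practice and can serve as a reference for corpus-based teaching of English synonyms.

Keywords: COCA corpus; synonyms; collocational features; colligation

Statistics indicate that synonyms make up more than 60% of all English words. The traditional way of learning them relies on dictionaries, the teacher's own experience, and the student's internalization. Such methods are broad and unspecific. Distinguishing synonyms with the COCA corpus, by contrast, is fine-grained and specific.

I. Background: corpus linguistics

In the second half of the 20th century, corpus linguistics became a new approach to distinguishing synonyms, offering authentic language, large amounts of data, and fast retrieval. In linguistics, a corpus is a large collection of texts. The texts in it have usually been curated and carry a defined format and markup, and the term refers specifically to digitized corpora stored on computers. Corpora are the basic resource of corpus linguistics and the main resource of empiricist approaches to language study. They are used in dictionary compilation, language teaching, traditional linguistic research, and statistical or example-based research in natural language processing.

Corpora can be divided into four types:
(1) Heterogeneous: no specific collection principle; all kinds of language data are collected widely and stored as they are.
(2) Homogeneous: only language data of a single kind of content is collected.
(3) Systematic: language data is collected according to principles and proportions fixed in advance, so that the corpus is balanced and systematic and can represent the facts of language use within a given scope.
(4) Specialized: only language data for one particular purpose is collected.

COCA, the Corpus of Contemporary American English, is one of the important corpus tools available today. It can be used online free of charge and is evenly balanced across five sections: SPOK, FIC, MAG, NEWS, and ACAD.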
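Comparing how two synonyms are distributed across the five sections usually means converting raw hit counts into frequencies per million words, so that sections of different sizes can be compared directly. The sketch below illustrates only that arithmetic. Every number in it is hypothetical: the section sizes and hit counts are invented for the example and are not COCA figures, and `per_million` is an illustrative helper, not part of any COCA interface.

```python
def per_million(raw_count, section_tokens):
    """Normalised frequency: hits per one million tokens."""
    return raw_count * 1_000_000 / section_tokens


# Hypothetical section sizes in tokens (NOT real COCA sizes).
section_tokens = {
    "SPOK": 120_000_000,
    "FIC": 110_000_000,
    "MAG": 115_000_000,
    "NEWS": 110_000_000,
    "ACAD": 100_000_000,
}

# Hypothetical raw hit counts (NOT real COCA results).
raw_hits = {
    "compulsory": {"SPOK": 240, "FIC": 110, "MAG": 460, "NEWS": 550, "ACAD": 1500},
    "mandatory": {"SPOK": 1200, "FIC": 220, "MAG": 1150, "NEWS": 2200, "ACAD": 1800},
}

normalised = {
    word: {sec: per_million(n, section_tokens[sec]) for sec, n in hits.items()}
    for word, hits in raw_hits.items()
}

for word, by_section in normalised.items():
    row = "  ".join(f"{sec}={value:.2f}" for sec, value in by_section.items())
    print(f"{word:<11}{row}")
```

With real counts taken from a corpus query, the same table would show in which registers each word is relatively more frequent, which is the kind of register comparison the abstract describes.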
II. Research object and tools

1. Dictionary definitions of compulsory and mandatory. The Oxford Advanced Learner's English-Chinese Dictionary (7th edition) defines compulsory as "that must be done because of a law or a rule". Its phrases and examples include compulsory education / schooling and compulsory redundancies.
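Building the iWriteBaby Corpus of Chinese Learner English

The iWriteBaby corpus was designed by Xu Jiajin of Beijing Foreign Studies University, who also carried out the sorting and proofreading of the texts. Throughout its construction the corpus received financial and technical support from Beijing Waiyan Online Digital Technology Co., Ltd. and Huizhi Mingde (Beijing) Education Technology Co., Ltd. The overall design was guided by Professor Liang Maocheng.

3. Online search platform for the iWriteBaby corpus

Current stand-alone corpus software can hardly handle the 8-million-word iWriteBaby corpus. We have therefore deployed the corpus on the "Corpus Cloud" (语料云) online platform. The platform provides the functions of stand-alone corpus tools such as WordSmith, AntConc, and BFSU PowerConc, for example word lists, concordance analysis, and collocation. Corpus Cloud is the web implementation of BFSU PowerConc for the era of big data (Xu Jiajin & Jia Yunlong 2013; …).

… the iWriteBaby corpus is iWriteBaby version 1.0. It contains 52,855 English essays by learners, totalling 8,299,066 tokens (a word is defined as [a-zA-Z0-9-]+). The essays come from 69 universities across China (the ratio of key universities to ordinary institutions is about 1:10), located in 48 cities in 23 provinces, municipalities, and autonomous regions. The students are spread over 154 academic disciplines, and the essays cover more than 1,000 topics.

Choosing "Word list generation" (词表生成) from the "Tools" menu of Corpus Cloud creates a word frequency list for the iWriteBaby corpus. Figure 1 shows the most frequent words in iWriteBaby. The corpus size shown in the word list result is 8,293,751 words, slightly different from the total given above, because the system defines a word differently from the way we do. When using the cloud platform, the corpus size and all other frequencies should consistently be taken from the figures the system reports.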
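The small discrepancy between the two totals comes from the definition of a word. The following minimal sketch shows how the token pattern quoted above, `[a-zA-Z0-9-]+`, and a different pattern give different corpus sizes for the same text. The alternative pattern `[A-Za-z]+` is only a hypothetical stand-in, since the text does not say how the cloud platform actually tokenizes, and the sample sentence is invented.

```python
import re
from collections import Counter

# Word definition quoted in the text for the iWriteBaby totals.
SOURCE_TOKEN = re.compile(r"[a-zA-Z0-9-]+")
# Hypothetical alternative definition (letters only); an assumption for
# illustration, not the platform's documented behaviour.
ALT_TOKEN = re.compile(r"[A-Za-z]+")


def word_list(text, pattern):
    """Return a frequency list (Counter) of lower-cased tokens."""
    return Counter(token.lower() for token in pattern.findall(text))


sample = "A well-known e-mail service had 3 outages; the e-mail team fixed them."

source_counts = word_list(sample, SOURCE_TOKEN)
alt_counts = word_list(sample, ALT_TOKEN)

print("tokens under the quoted definition:", sum(source_counts.values()))
print("tokens under the alternative definition:", sum(alt_counts.values()))
print(source_counts.most_common(3))
```

Under the quoted definition, hyphenated forms such as "e-mail" and the digit "3" each count as one token, while the letters-only pattern splits the hyphenated forms and drops the digit. Differences of this kind are why frequencies should all be taken from one system rather than mixed.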
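English Listening Corpora

An English listening corpus is an electronic collection of language material gathered according to defined sampling criteria; it can represent a language, a variety of a language, or a genre. Such corpora matter for learning English listening because they give learners authentic language material and help them understand the structure and usage of the language.

An English listening corpus contains a large amount of listening material, such as film clips, news broadcasts, speeches, and classroom discussions. By listening to this material, learners can improve their listening skills, pick up idiomatic English expressions, and strengthen their understanding of and feel for the language.

A listening corpus can also help learners understand the characteristics and regularities of English. By analysing the language data in the corpus, learners can find the frequent words, phrases, and sentence patterns of English and see how the language is used in different contexts. These findings help learners use English more effectively and improve their speaking and writing.

In short, an English listening corpus is a very useful learning resource: it provides authentic language material, helps learners understand the structure and usage of the language, and improves their listening skills and overall English ability.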
Large English Corpus Resources That Can Be Used Free of Charge
Collected links to commonly used corpus resources (语料天涯)
1. BNC-World Simple Search ☆☆☆
But no more than 50 hits will be displayed, with a fixed amount of context.
2. Brown, LOB, BNC sampler ☆☆☆
Here are a few links for searching corpora online, including monolingual corpora like Brown, LOB, and BNC sampler and also some parallel English-Chinese corpora.
English:
Parallel:
3. Collins Cobuild Corpus Concordance Sampler☆☆☆☆☆
The Collins WordbanksOnline English corpus is composed of 56 million words of contemporary written and spoken text.
4. New BNC interface - VIEW: ☆☆☆☆☆
5. Samples (about 2 million words) from the British National Corpus: both written and spoken ☆☆☆
The Brown Corpus and many others - native, learner...
Go to
6. MICASE ☆☆☆☆
There are currently 152 transcripts (totaling 1,848,364 words) available at the site.
7. CLEC online concordancing ☆☆☆☆
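CLEC contains more than one million words written by five types of students: secondary school students, College English Band 4 and Band 6 students, and lower-year and upper-year English majors. Learner errors in the texts are tagged.
For an introduction to the corpus, its error tagset and some statistics, see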
8. Business Letter Corpus Online KWIC Concordancer ☆☆☆
1 MILLION WORDS BUSINESS LETTER CORPUS (US & UK) AND OTHER CORPORA
9. Virtual Language Centre ☆☆☆
The Starr Report, Brown, LOB, The Times (Jan, Feb, Mar) 3 files, SCMP, Business & Economy, Computing etc
10. Time Magazine archive ☆☆☆, 1923-2007 (100+ million words)
and more at
11. Just the word
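Most of the services listed above are online concordancers: they show each hit of a search word with a fixed amount of context on either side, and some cap the number of hits (the first entry mentions a limit of 50). The following is a minimal local sketch of that key-word-in-context (KWIC) idea. The function name `kwic`, its parameters, and the sample text are assumptions made for illustration and do not reproduce any of the listed tools.

```python
import re


def kwic(text, keyword, width=30, max_hits=50):
    """Return up to `max_hits` key-word-in-context lines for `keyword`.

    Each line shows `width` characters of left and right context around
    a whole-word, case-insensitive match.
    """
    pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", re.IGNORECASE)
    lines = []
    for match in pattern.finditer(text):
        if len(lines) >= max_hits:
            break  # mimic a display limit on the number of hits
        left = text[max(0, match.start() - width):match.start()]
        right = text[match.end():match.end() + width]
        lines.append(f"{left:>{width}} [{match.group(0)}] {right:<{width}}")
    return lines


# Invented sample text for the demonstration.
sample_text = (
    "Corpus tools help teachers. A corpus is a collection of texts. "
    "Corpus size matters less than corpus design."
)

for line in kwic(sample_text, "corpus", width=20, max_hits=3):
    print(line)
```

Padding the left context to a fixed width keeps the search word aligned in one column, which is what makes recurring patterns to its left and right easy to spot.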
Corpus-related books available in bookshops
Austermühl, F. 2001. Electronic Tools for Translators《译者的电子工具》. Manchester: St. Jerome Publishing. (Chinese reprint: Foreign Language Teaching and Research Press)
Biber, Douglas, Stig Johansson, Geoffrey Leech, Susan Conrad & Edward Finegan. 1999. Longman Grammar of Spoken and Written English. Longman Publications Group. (Chinese reprint: Foreign Language Teaching and Research Press)
Biber, Douglas, Susan Conrad & Randi Reppen. 1998. Corpus Linguistics. Cambridge: Cambridge University Press. (Chinese reprint: Foreign Language Teaching and Research Press)
Granger, S. et al. (eds.). 2003. Corpus-based Approaches to Contrastive Linguistics and Translation Studies《基于语料库的语言对比和翻译研究》. Amsterdam: Rodopi. (Chinese reprint: Foreign Language Teaching and Research Press)
Gries, Stefan Thomas. 2004. Multifactorial Analysis in Corpus Linguistics: A Study of Particle Placement. Beijing: Peking University Press. (Chinese reprint: Peking University Press)
Hunston, Susan. 2002. Corpora in Applied Linguistics. Cambridge: Cambridge University Press. (Chinese reprint: World Publishing Corporation)
Kennedy, Graeme. 1998. An Introduction to Corpus Linguistics. London: Longman. (Chinese reprint: Foreign Language Teaching and Research Press)
Nattinger, James R. & Jeanette S. DeCarrico. 1992. Lexical Phrases and Language Teaching. Oxford: Oxford University Press. (Chinese reprint: Shanghai Foreign Language Education Press)
Sinclair, John. 1991. Corpus, Concordance, Collocation. Oxford: Oxford University Press. (Chinese reprint: Shanghai Foreign Language Education Press)
Thomas, Jenny & Mick Short. 1996. Using Corpora for Language Research. London: Pearson Education. (Chinese reprint: Foreign Language Teaching and Research Press)
Zanettin, F., et al. (eds.). 2003. Corpora in Translator Education《语料库与译者培养》. Manchester: St. Jerome Publishing. (Chinese reprint: Foreign Language Teaching and Research Press)
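Cai Jinting. 2003. 《语言因素对英语过渡语中使用一般过去时的影响》 [The Influence of Linguistic Factors on the Use of the Simple Past in English Interlanguage]. Beijing: Foreign Language Teaching and Research Press.
He Anping (ed.). 2004. 《语料库在外语教育中的应用:理论与实践》 [Applying Corpora in Foreign Language Education: Theory and Practice]. Guangzhou: Guangdong Higher Education Press.
He Anping. 2004. 《语料库语言学与英语教学》 [Corpus Linguistics and English Teaching]. Beijing: Foreign Language Teaching and Research Press.
School of Foreign Studies, South China Normal University (ed.). 2005. 《语料库语言学的研究与应用》 [Research and Applications in Corpus Linguistics]. Changchun: Northeast Normal University Press.
Huang Changning & Li Juanzi. 2002. 《语料库语言学》 [Corpus Linguistics]. Beijing: The Commercial Press.
Pu Jianzhong. 2003. 《学习者动词行为:类联接、搭配及词块》 [Learner Verb Behaviour: Colligation, Collocation and Chunks]. Kaifeng: Henan University Press.
Wang Jianxin. 2005. 《计算机语料库的建设与应用》 [Building and Using Computer Corpora]. Beijing: Tsinghua University Press.
Wang Kefei et al. 2004. 《双语对应语料库研制与应用》 [Development and Application of Bilingual Parallel Corpora]. Beijing: Foreign Language Teaching and Research Press.
Wang Lifei, Liang Maocheng et al. 2007. 《计算机辅助第二语言研究方法与应用》 [Computer-assisted Second Language Research: Methods and Applications]. Beijing: Foreign Language Teaching and Research Press.
Wei Naixing. 2002. 《词语搭配的界定与研究体系》 [Defining Collocation and a Framework for Its Study]. Shanghai: Shanghai Jiao Tong University Press.
Wei Naixing, Li Wenzhong, Pu Jianzhong et al. 2005. 《语料库应用研究》 [Studies in Corpus Applications]. Shanghai: Shanghai Foreign Language Education Press.
Wen Qiufang, Wang Lifei & Liang Maocheng. 2005. 《中国学生英语口笔语语料库》 [Spoken and Written English Corpus of Chinese Learners]. Beijing: Foreign Language Teaching and Research Press. [Includes the SWECCL corpus on CD]
Yang Dafu. 2000. 《英语错误型式分析》 [An Analysis of English Error Patterns]. Xi'an: Shaanxi People's Publishing House.
Yang Huizhong & Gui Shichun. 2003. 《中国学习者英语语料库》 [Chinese Learner English Corpus]. Shanghai: Shanghai Foreign Language Education Press. [Includes the CLEC corpus on CD]
Yang Huizhong & Wei Naixing. 2005. 《中国学习者英语口语语料库建设与研究》 [Building and Studying a Spoken English Corpus of Chinese Learners]. Shanghai: Shanghai Foreign Language Education Press. [Includes the COLSEC corpus on CD]
Yang Huizhong et al. (eds.). 2005. 《基于CLEC语料库的中国学习者英语分析》 [A CLEC-based Analysis of Chinese Learners' English]. Shanghai: Shanghai Foreign Language Education Press.
Yang Huizhong (ed.). 2002. 《语料库语言学导论》 [An Introduction to Corpus Linguistics]. Shanghai: Shanghai Foreign Language Education Press.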
Sunday, May 31, 2020.