Recent Developments in the Emu Speech Database System

合集下载

专业英语四级模拟试卷177(题后含答案及解析)

专业英语四级模拟试卷177(题后含答案及解析)

专业英语四级模拟试卷177(题后含答案及解析)题型有:1. DICTATION 2. LISTENING COMPREHENSION 3. CLOZE 4. GRAMMAR & VOCABULARY 5. READING COMPREHENSION 6. WRITINGPART I DICTATION (15 MIN)Directions: Listen to the following passage. Altogether the passage will be read to you four times. During the first reading, which will be read at normal speed, listen and try to understand the meaning. For the second and third readings, the passage will be read sentence by sentence, or phrase by phrase, with intervals of 15 seconds. The last reading will be read at normal speed again and during this time you should check your work. You will then be given 2 minute 1.正确答案:Opportunity Opportunities don’t come often. They come every once in a while. Very often, they come quietly and go by without being noticed. Therefore, it’s advisable that you should value and treat them with care. When an opportunity comes, it brings a promise but never realizes it on its own. If you intend to fulfill one of your ambitions, you must work hard. Otherwise, you will take no advantage of opportunities when they visit you. The difference between a successful man and one who does not lies only in the way treating opportunities. The successful person always makes adequate preparations to meet opportunities. The loser, on the other hand, works little and just waits to see them pass by. There are plenty of opportunities for everyone in our society, but only those who are prepared adequately and qualified highly can make use of them to achieve their purpose. Chance favors the minds that are prepared.PART II LISTENING COMPREHENSION (20 MIN)Directions: In Sections A, B and C you will hear everything ONCE ONLY. Listen carefully and then answer the questions that follow. Mark the correct answer to each question on your answer sheet.SECTION C NEWS BROADCASTDirections: In this section, you will hear several news items. Listen to them carefully and then answer the questions that follow.听力原文:The cook who feeds British workers on a North Sea oil rig has beaten off a challenge by 5,000 other cooks to reach the finals of the International Indian Chef of the Year competition. Rajiv Pathak was one of the eight finalists announced on Saturday. The contestants will have three hours to prepare and present their ideal four-dish Indian meal in Edinburgh on February 22.When did the head chef Rajiv Pathak was announced as one of the finalists?A.It was three hours ago.B.It was in the last week.C.It was on Saturday.D.It was on February 25.正确答案:C听力原文:European leaders, trying to end their bitter dispute over Iraq, warned Saddam Hussein on Monday he faces a “last chance” to disarm, but gave no deadline and said U.N. weapons inspectors must have more time to finish their work. The statement came at the end of a European Union emergency summit on the crisis with Baghdad. Diplomats insisted they had healed the rift (裂口;空隙;分歧) over U.S. calls for military action. But significant divisions remained, with some states saying the United Nations could still disarm Iraq peacefully. “War is not inevitable. Force should be used only as a last resort. It is for the Iraqi regime to end this crisis by complying fully with the demands of the Security Council,” the 15 nations said in the joint declaration. That was seen as a setback for Germany, which has opposed war under any circumstances.3.European leaders trying to give Saddam HusseinA.a last resort.B.a warning.C.a deadline.D.a setback.正确答案:B解析:新闻的第一句话就是European leaders, trying to end their bitter dispute over Iraq, warned Saddam Hussein on Monday。

2024年高考英语最后一卷(全国卷)(解析版)

2024年高考英语最后一卷(全国卷)(解析版)

2024年高考英语最后一卷【全国卷】英语(考试时间:120分钟试卷满分:150分)注意事项:1.答卷前,考生务必将自己的姓名、考生号等填写在答题卡和试卷指定位置上。

2.回答选择题时,选出每小题答案后,用铅笔把答题卡对应题目的答案标号涂黑。

如需改动,用橡皮擦干净后,再选涂其他答案标号。

回答非选择题时,将答案写在答题卡上。

写在本试卷上无效。

3.考试结束后,将本试卷和答题卡一并交回。

第一部分听力(共两节,满分30 分)第一节(共5小题;每小题1.5分,满分7.5分)听下面5段对话。

每段对话后有一个小题,从题中所给的A、B、C三个选项中选出最佳选项。

听完每段对话后,你都有10秒钟的时间来回答有关小题和阅读下一小题。

每段对话仅读一遍。

1.Where are the old cases?A.In the boxes.B.In the bookcase.C.In the drawers.【答案】C【原文】M: Do you like the way I organized the files in the bookcase, Ms. Stanford?W: Yes, you’ve done a good job of organizing them, but what did you do with the old cases in the yellow boxes? M: I moved those into the drawers, since we don’t use them very often.2.What animal does the woman own?A.A mouse.B.A dog C.A cat.【答案】B【原文】W: Watch what happens when I place some cheese on the edge of the wall just here…M: Oh, my goodness! Is that a mouse that just grabbed it?W: Yes! And the dog kept watching from his bed!M: Maybe it’s time to get a cat!3.Where does this conversation take place?A.In a house.B.In a park.C.In a forest【答案】C【原文】M: How much further is this walk?W: Not long. We just have to walk past that big house and then through a park.M: We’ve been walking through the forest for ages now.4.What can we learn about the woman?A.She found a great job.B.She is popular in college.C.She won the student election.【答案】B【原文】W: I’m putting my name forward for the upcoming student election. I’m hoping to be the first student union president from Asia at the university.M: That’s fantastic news , and you’d do a great job. I think you have a great chance of winning as everybody likes you!5.What are the speakers talking about?A.Making a birthday cake.B.Going to a birthday party.C.Repairing the broken clock.【答案】B【原文】M: I thought I’d set my alarm clock, but I didn’t hear it ring!W: Oh, no. And Ashley’s birthday party is going to start in a few minutes.M: I better get there in a hurry before everyone eats all the birthday cake.W: You and cake?! Let’s not forget whose birthday it is.第二节(共15小题;每小题1.5分,满分22.5分)听下面5段对话或独白。

2022年考研考博-考博英语-华南师范大学考试全真模拟易错、难点剖析B卷(带答案)第29期

2022年考研考博-考博英语-华南师范大学考试全真模拟易错、难点剖析B卷(带答案)第29期

2022年考研考博-考博英语-华南师范大学考试全真模拟易错、难点剖析B卷(带答案)一.综合题(共15题)1.单选题I have never laid()to being an expert in this field.问题1选项A.reclaimB.exclamationC.affirmationD.claim【答案】D【解析】考查固定搭配。

lay claim to “声称……的权力,自称有知识等”。

句意:我从不认为自己是这个领域的专家。

结合句意,故D为正确答案。

2.单选题The accuracy of scientific observations and calculations is always at the mercy of the scientist's timekeeping methods. For this reason scientists are interested in devices that give promise of more precise timekeeping.In their search for precision, scientists have turned to atomic clocks that depend on various vibrating atoms or molecules to supply their “ticking”. This is possible because each kind of atom or molecule has its own characteristic rate of vibration. The nitrogen atom in ammonia, for example: vibrates or “ticks” 24 billion times a second.One such atomic clock is so accurate that it will probably lose no more than a second in 3000 years. It will be of great importance in fields such as astronomical observation and long-range navigation. The heart of this Atomichron is a cesium atom that -vibrates 9.2 billion times a second when heated to the temperature of boiling water.An atomic clock that operates with an ammonia molecule may be used to check the accuracy of predictions based on Einstein’s relativity theories, according to which a clock in motion and a clock at rest should keep time differently. Placed in an orbiting satellite moving at a speed of 18, 000 miles an hour, the clock could broadcast its time reading to a ground station, where they would be compared with the readings on a similar model. Whatever differences developed would be checked against the differences predicted.1.Scientists expect that the atomic clock will be().2.According to the passage, which of the following statements is true?3.It could be inferred from the passage that().4.The word “vibration” in the second paragraph is closest in meaning to().5.Which of the following can best summarize the passage?问题1选项A.absolutely accurateB.more precise.C.heat-resistantD.different in vibration问题2选项A.Temperature changes affect the vibration rate of ammonia molecules.B.The rate of vibration of an atom does not vary.C.Einstein’s relativity theories are based on the vibration of atoms.D.Temperature changes may affect the speed at which atoms travel.问题3选项A.precise timekeeping is not possible at allB.scientists expect to disprove Einstein's relativity theoriesC.atomic clocks will be important in space flightD.clock in motion and a dock at rest keep time differently.问题4选项A.pulseB.precisionC.movementD.accuracy问题5选项A.Peacetime use of the atomsB.Atoms and moleculesC.The satellite timekeepersD.The role of the clock in the astronomy【答案】第1题:B第2题:A第3题:C第4题:A第5题:A【解析】1.细节事实题。

面向汉维机器翻译的双语关联度优化模型

面向汉维机器翻译的双语关联度优化模型

收稿日期:2018 08 22;修回日期:2018 10 24 基金项目:国家自然科学基金资助项目(U1703133);中科院西部之光项目(2017 XBQNXZ A 005);中国科学院青年创新促进会的资助项目(2017472);新疆维吾尔自治区重大科技专项项目(2016A03007 3);新疆维吾尔自治区高层次人才引进工程项目(Y839031201) 作者简介:潘一荣(1992 ),女,天津人,博士研究生,主要研究方向为自然语言处理、机器翻译;李晓(1957 ),男(通信作者),博导,研究员,硕士,主要研究方向为多语种信息处理、信息系统研究与开发(xiaoli@ms.xjb.ac.cn);杨雅婷(1985 ),女,副研究员,硕导,博士,主要研究方向为多语种信息处理技术;董瑞(1985 ),男,助理研究员,博士研究生,主要研究方向为多语种信息处理.面向汉维机器翻译的双语关联度优化模型潘一荣1,2,3,李 晓1,3 ,杨雅婷1,3,董 瑞1,3(1.中国科学院新疆理化技术研究所,乌鲁木齐830011;2.中国科学院大学,北京100049;3.新疆民族语音语言信息处理实验室,乌鲁木齐830011)摘 要:针对汉语—维吾尔语的统计机器翻译系统中存在的语义无关性问题,提出基于神经网络机器翻译方法的双语关联度优化模型。

该模型利用注意力机制捕获词对齐信息,引入双语短语间的语义相关性和内部词汇匹配度,预测双语短语的生成概率并将其作为双语关联度,以优化统计翻译模型中的短语翻译得分。

在第十一届全国机器翻译研讨会(CWMT2015)汉维公开机器翻译数据集上的实验结果表明,与基线系统相比,在使用较小规模的训练数据和词汇表的条件下,所提方法可以同时有效地提高短语级别和句子级别的机器翻译任务性能,分别获得最高2.49和0.59的BLEU值提升。

关键词:维吾尔语;神经网络机器翻译;注意力机制;词对齐;生成概率中图分类号:TP391 文献标志码:A 文章编号:1001 3695(2020)03 019 0726 05doi:10.19734/j.issn.1001 3695.2018.08.0625BilingualrelatednessoptimizationmodelforChinese UyghurmachinetranslationPanYirong1,2,3,LiXiao1,3 ,YangYating1,3,DongRui1,3(1.XinjiangTechnicalInstituteofPhysics&Chemistry,ChineseAcademyofSciences,Urumqi830011,China;2.UniversityofChineseAcademyofSciences,Beijing100049,China;3.XinjiangLaboratoryofMinoritySpeech&LanguageInformationProcessing,Urumqi830011,China)Abstract:FocusedontheissueofsemanticindependenceinChinese Uyghurstatisticalmachinetranslationsystem,thispaperproposedabilingualrelatednessoptimizationmodelbasedonneuralmachinetranslationmethod.Themodelutilizedtheatten tionmechanismtocapturewordalignmentinformationaswellasintroducedbilingualphrasesemanticrelevanceandinnerwordcorrelationtopredicttheconditionalprobabilityofbilingualphrasepair.Andittooktheprobabilityasbilingualrelatednesstooptimizethephrasetranslationscoresinstatisticaltranslationmodel.Experimentalresultsonthe11thChinaWorkshoponMachineTranslation(CWMT2015)Chinese Uyghurpublicmachinetranslationdatasetsshowthattheproposedapproachcanachieveobviousimprovementsbothinthephrase levelandthesentence levelmachinetranslationtasks,whichoutperformsthebaselinesystemwitharelativesmall scaletrainingdataandvocabulary.ThehighestBLEUpointoftheproposedalgorithmgains2.49and0.59respectively.Keywords:Uyghur;neuralnetworkmachinetranslation;attentionmechanism;wordalignment;conditionalprobability0 引言在基于短语的统计机器翻译(statisticalmachinetranslation,SMT)[1]系统中,翻译模型对从平行语料库中抽取的双语短语进行建模,主要包括短语翻译概率、词汇化权重等参数,这些参数作为特征函数并结合对数线性方法,以此训练机器翻译系统,从而获取最优权重分布,在解码时以搜索最有可能的翻译选项,实现双语转换过程。

2011年12月六级听力原文及答案 2

2011年12月六级听力原文及答案 2

2011 年12 月大学英语六级考试Section ADirections: In this section, you will hear 8 short conversations and2 long conversations. At the end of each conversation, one or more questions will be asked about what was said. Both the conversation and the questions will be spoken only once.After each question there will be a pause. During the pause,you must read the four choices marked A), B),C)and D), and decide which is the best answer.Then mark the corresponding letter on Answer Sheet 2with a single line through the centre.11.M: I don’t know what to do. I have to drive to Chicago next Friday for my cousin’s wedding, but I have got a Psychology test to prepare for.W: Why don’t you record your notes so you can study on the way?Q: What does the woman suggest the man do?12.M:Professor Wright,you may have to find another student to play this role,the lines are so long and I simply can’t remember them all.W: Look, Tony. It is still a long time before the first show. I don’t expect you to know all the lines yet. Just keep practicing.Q: What do we learn from the conversation?13.M: Hello, this is Dr. Martin from the Emergency Department. I have a male patient with a fractured ankle.W: Oh, we have one bed available in ward 3, send him here and I will take care of him.Q: What are the speakers talking about?14.W: Since Simon will graduate this May, the school paper needs a new editor. So if you are interested, I will be happy to nominate you.M: Thanks for considering me. But the baseball team is starting up a new season. And I’m afraid I have a lot on my hands.Q: What does the man mean?15. W: Have you heard the news that Jame Smeil has resigned his post as prime minister?M: Well, I got it from the headlines this morning. It’s reported that he made public at this decision at the last cabinet meeting.Q: what do we learn about Jame Smeil?16. W: The morning paper says the space shuttle is taking off at 10 a.m. tomorrow.M:Yeah,it’s just another one of this year’s routine missions.The first mission was undertaken a decade ago and broadcast live then worldwide.Q: what can we infer from this conversation? 17. M: We do a lot of camping in the mountains. What would you recommend for two people? W:You’d probably be better off with the four real drive vehicle.We have several off-road trucks in stock, both new and used.Q: Where does the conversation most probably take place?18. W: I hear you did some serious shopping this past weekend.M: Yeah, the speakers of my old stereo finally gave out and there was no way to repair them.Q: What did the man do over the weekend?Conversation OneW: Now, could you tell me where the idea for the business first came from?M: Well, the original shop was opened by a retired printer by the name of Gruby.Mr Gruby being left-handed himself, thought of the idea to try to promote a few products for left-handers.W: And how did he then go about actually setting up the business?M:Well,he looked for any left-handed products that might already be on the market which were very few.And then contacted the manufactures with the idea of having products produced for him, mainly in the scissors range to start with.W: Right. So you do commission some part of your stock.M: Yes, very much so. About 75 percent of our stock is specially made for us.W: And the rest of it?M:Hmm,the rest of it now,some 25,30years after Mr.Gruby’s initial efforts,there are more left-handed product actually on the market.Manufactures are now beginning to see that there is a market for left-handed products.W: And what’s the range of your stock?M: The range consists of a variety of scissors from children scissors to scissors for tailors, hairdressers etc. We also have a large range of kitchen ware.W: What’s the competition like? Do you have quite a lot of competition?M:There are other people in the business now in specialists,but only as mail-order outlets.But we have a shop here in central London plus a mail-order outlet.And we are without any doubt the largest supplier of the left-handed items.Q19: What kind of business does the man engaged in?Q20: What does the man say about his stock of products?Q21: What does the man say about other people in his line of business?Conversation TwoM: Can we make you an offer? We would like to run the campaign for four extra weeks.W:well,can we summarize the problem from my point of view?First of all,the campaign was late.It missed two important trade affairs.The ads also did not appear into key magazines.As a result, the campaign failed. Do you accept that summary of what happened?M: well, the delay wasn’t entirely our fault. You did in fact make late changes to the specifications of the advertisements.W:Uh,actually,you were late with the initial proposals so you have very little time and in fact, we only asked for small changes.M: Well whatever, can we repeat our offer to run the campaign for 4 extra weeks?W: That’s not really the point. The campaign missed two key trade affairs. Because of this, we are asking you either to repeat the campaign next year for free, or we only pay 50% of the fee for this year.M:Could we suggest a 20%reduction to the fee together with the four week sustention to the campaign.W: We are not happy. We lost business.M: I think we both made mistakes. The responsibility is on both sides.W: Ok, let’s suggest a new solution. How about a 40% cut in fee, or a free repeat campaign?M: Well, let’s take a break, we’re not getting very far. Perhaps we should think about this.22: What do we learn about the man’s company?23: Why was the campaign delayed according to the man?24: What does the woman propose as a solution to the problem?25: What does the man suggest they do at the end of the conversation?Section BPassage OneThe University of Tennessee’s Walters Life Sciences building, is a model animal facility, spotlessly clean,careful in obtaining prior approval for experiments from an animal care committee.Of the 15,000mice house there in a typical year,most give their lives for humanity. These are good mice and as such won the protection of the animal care committee.At any given time however some mice escape and run free.These mice are pests.They can disrupt experiments with the bacteria organisms they carry.They are bad mice and must be captured and destroyed. Usually,this is accomplished by means of sticky traps, a kind of fly paper on which they become increasingly stuck.But the real point of the cautionary tale,says animal behaviorist Herzau,is that the labels we put on things can affect our moral responses to ing stick traps or the more deadly snap traps would be deemed unacceptable for good mice.Yet the killing of bad mice requires no prior approval.Once the research animal hits the floor and becomes an escapee,says Herza,its moral standard is instantly diminished.In Herzau’s own home,there was more ironic example when his young son’s pet mouse Willy died recently,it was accorded a tearful ceremonial burial in garden.Yet even as they mourned Willy,says Herzau,he and his wife were setting snap traps to kill the pest mice in their kitchen with the bare change in labels from pet to pest,the kitchen mice obtained totally different moral standardsQuestions:26, What does the passage say about most of the mice used for experiments?27, Why did the so-called bad mice have to be captured and destroyed?28, When are mice killed without prior approval?29, Why does the speaker say what the Herzau’s did at home is ironical?Passage TwoThere are roughly three New Yorks.There is,first,the New York of the man or woman who was born here,who takes the city for granted and accepts its size and its turbulence as natural and inevitable.Second,there is the New York of the commuter —the city that is swallowed up by locusts each day and spat out each night. Third, there is the New York of the person who was born somewhere else and came to New York in quest of something.Of these three trembling cities the greatest is the last,the city of final destination,the city that has a goal.It is this third city that accounts for New York's high-strung disposition,its poetical deportment,its dedication to the arts, and its incomparable muters give the city its tidal restlessness;natives give it solidity and continuity;but the settlers give it passion.And whether it is a farmer arriving from Italy to set up a small grocery store in a slum,or a young girl arriving from a small town in Mississippi to escape the indignity of being observed by her neighbors,or a boy arriving from theCorn Belt with a manuscript in his suitcase and a pain in his heart,it makes no difference:each embraces New York with the intense excitement of first love,each absorbs New York with the fresh eyes of an adventurer, each generates heat and light to dwarf the Consolidated Edison Company.Questions30 to 32 are based on the passage you have just heard.30. What does the speaker say about the natives of New York?31. What does the speaker say commuters give to New York?32. What do we learn about the settlers of New York?Passage Three“If you asked me television is unhealthy”,I said to my roommate Walter,as I walked into the living room. “While you are sitting passively in front of the TV set, your muscles are turning to fat, your complexion is fading,and your eyesight is being ruined.”“Shh~”Walter put his finger to his lips, “This is an intriguing murder mystery.”“Really?” I replied.“But you know,the brain is destroyed by TV viewing.Creativity is killed by that box.And people are kept from communicating with one another.From my point of view,TV is the cause of the declining interest in school and the failure of our entire educational system. ”“Ah ha,I can’t see your point.”Walter said softly.“But see?The woman on the witness stand in this story is being questioned about the murder that was committed one hundred years ago. ”Ignoring his enthusiastic description of the plot, I went on with my argument.“As I see it,” I explained, “not only are most TV programs badly written and produced, but viewers are also manipulated by the mass media.As far as I am concerned,TV watchers are cut off from reality from nature,from the other people,from life itself!I was confident in my ability to persuade.After a short silence, my roommate said, “Anyway, I’ve been planning to watch the football game.I am going to change the channel.”“Don’t touch that dial!” I shouted, “I wanted to find out how the mystery turns out!”I am not sure I got my point to cross.Questions 33- 35 are based on the passage you have just heard.33. As the speaker walked into the living room, what was being shown on TV?34. What does the speaker say about watching television?35. What can we say about the speaker?Section C Compound DictationIn the past,one of the biggest disadvantages of machines has been their inability to work on a micro scale.For example,doctors did not have devices allowing them to go inside the human body to detect health problems or to perform delicate surgery.Repair crews did not have a way of identifying broken pipes located deep within a high-rise apartment building.However, that’s about to change.Advances in computers and biophysics have started a micro miniature revolution that allows scientists to envision and in some cases actually build microscopic machines.These devices promise to dramatically change the way we live and work.Micromachines already are making an impact.At Case Western Reserve University in Cleveland, Ohio, research scientists have designed a 4-inch silicon chip that holds 700 tiny primitive motors. At Lucas Nova Sensor in Fremont, California, scientists have perfected the world’s first microscopic blood-pressure sensor.Threaded through a person’s blood vessels,the sensor can provide blood pressure readings at the valve of the heart itself.Although simple versions of miniature devices have had an impact, advanced versions are still several years away.Auto manufacturers, for example, are trying to use tiny devices that can sense when to release an airbag and how to keep engines and breaks operating efficiently .Some futurists envision nanotechnology also being used to explore the deep sea in small submarine, or even to launch finger-sized rockets packed with micro miniature instruments.“There is an explosion of new ideas and applications,”So,when scientists now think about future machines doing large and complex tasks,they’re thinking smaller than ever before.Listening Comprehension 短对话听力答案11.Q: What does the woman suggest the man do?【答案】A) Listen to the recorded notes while driving.12.Q: What do we learn from the conversation?【答案】C) The man lacks confidence in playing the part.13.Q: What are the speakers talking about?【答案】A) Arranging a bed for a patient.14.Q: What does the man mean?【答案】A) He is too busy to accept more responsibility.Section A11.Q: What does the woman suggest the man do?【答案】A) Listen to the recorded notes while driving.12.Q: What do we learn from the conversation?【答案】C) The man lacks confidence in playing the part.13.Q: What are the speakers talking about?【答案】A) Arranging a bed for a patient14.Q: What does the man mean?【答案】A) He is too busy to accept more responsibility.15.Q: what do we learn about Jame Smeil?【答案】C) He has left his position in the government.16.Q: what can we infer from this conversation?【答案】D) The man is well informed about the space shuttle missions.17.Q: Where does the conversation most probably take place?【答案】A) At a car renting companySection BPassage One【材料评析】本篇文章主要是讲述人们对待老鼠不同的道德态度。

语料库

语料库
15
3 语料库的设计
语料库三方面 A. 语料本身
属性 规模 领域
体裁 时代 语体 语种
语言层次

百万词级 | 千万词级 | 亿万词级 | … 政治 | 经济 | 体育 | 心理学 | …
文学 | 应用文 | 新闻 | …
共时 | 历时 书面语 | 口语 单语 | 双语 | 多语 双语平行语料库 | 双语比较语料库 语音(音节,韵律) | 语法(词,句,…)
11
第二代语料库
建于1980年代,由英国Birmingham大学 与Collins出版社合作完成,规模达2000 万词次,基于该语料库出版的Collins Cobuild词典(1987)受到了广泛的好评
COBUILD语料库 Longman语料库
千万词级 词典编纂 - 应用导向
建于1980年代,包括三个语料库: LLELC语料库(Longman/Lancaster英语语料库) LSC语料库(Longman口语语料库) LCLE(Longman英语学习语料库) 目标是编撰英语学习词典,为外国人学习英语服 务,词典规模达5000万词次
7
London-Lund英语口语语料库部分标记
标记
含义
#
语调群的结束 (end of tone group)
^
语音开始 (onset)
/
上升型核心语调 (rising nuclear tone)
\
下降型核心语调 (falling nuclear tone)
^
先升后降型核心语调 (rise-fall nuclear tone)
检索工具 | 人机界面 | 数据接口 | … 16
语料的选取
精品原则 有影响力原则 随机挑选原则 高流通度原则 典型性原则 易于获得原则 具有统计样本意义原则 符合语言规范原则

Robust-forensic-...

Current BiologyMagazineCurrent Biology 28, R1–R16, January 8, 2018 © 2017 Elsevier Ltd. R13Robust forensic matching of confi scated horns to individualpoached African rhinocerosCindy Harper 1,2,*, Anette Ludwig 1, Amy Clarke 1, Kagiso Makgopela 1, Andrey Yurchenko 2, Alan Guthrie 1, Pavel Dobrynin 2, Gaik Tamazian 2,Richard Emslie 3, Marile van Heerden 4, Markus Hofmeyr 1,5, Roderick Potter 6, Johannes Roets 7, Piet Beytell 8, Moses Otiende 9, Linus Kariuki 9,Raoul du Toit 10, Natasha Anderson 10, Joseph Okori 11, Alexey Antonik 2, Klaus-Peter Koepfl i 2,12, Peter Thompson 1,and Stephen J. O’Brien 2,13Black and white rhinoceros (Diceros bicornis and Ceratotherium simum ) are iconic African species that are classifi ed by the International Union for the Conservation of Nature (IUCN) as Critically Endangered and Near Threatened (/), respectively [1]. At the end of the 19th century, Southern white rhinoceros (Ceratotherium simum simum ) numbers had declined to fewer than 50 animals in the Hluhluwe-iMfolozi region of the KwaZulu-Natal (KZN) province of South Africa, mainly due to uncontrolled hunting [2,3]. Efforts by the Natal Parks Board facilitated an increase in population to over 20,000 in 2015 through aggressive conservation management [2].Black rhinoceros (Diceros bicornis ) populations declined from several hundred thousand in the early 19th century to ~65,000 in 1970 and to ~2,400 by 1995 [1] withsubsequent genetic reduction, also due to hunting, land clearances and later poaching [4]. In South Africa, rhinoceros poaching incidents have increased from 13 in 2007 to 1,215 in 2014 [1]. This has occurred despite strict trade bans on rhinoceros products and strict enforcement in recent years.The signifi cant increase in illegal killing of African rhinoceros and the involvement of transnationalorganised criminal syndicates in horn traffi cking has met with increased law enforcement efforts to apprehend, successfully prosecute and sentence traffi ckers and poachers with the aim of reducing poaching. In Africa, wildlife rangers, law enforcement offi cials and genome scientists have instituted a DNA-based individual identifi cation protocol usingcomposite short tandem repeat (STR) genotyping of rhinoceros horns,rhinoceros tissue products and crime scene carcasses to link confi scated evidence to specifi c poaching incidents for support of criminalinvestigations. This method has been used extensively and documented in the RhODIS® (Rhinoceros DNA Index System) database of confi scated horn and living rhinoceros genotypes (http://rhodis.co.za ), eRhODIS™applications to collect field and forensic sample data and RhODIS® biospecimen collection kits. These are made available to trained RhODIS® certifi ed offi cials to fulfi ll chain of custody requirements providing a pipeline to connect illegally traffi cked rhinoceros products to individual poached rhinoceros victims. This study applies a panel of 23 STR (microsatellite) loci to genotype 3,968 individual rhinoceros DNA specimens from distinct white and black rhinoceros populations [5]. We assessed the population genetic structure of these (Supplemental information) and applied them to forensic match analyses of specifi c DNA profi les in more than 120 criminal cases to date.Four methods were applied to support forensic matching of confi scated tissue evidence to crime scenes: fi rst, furthercharacterization and optimization of STR panels informative for rhinoceros species; second, development and application of the RhODIS® database containing genotypes and demographic information of more than 20,000 rhinoceros acquisitions; third, analysis of the populationgenetic structure of white and black rhinoceros species, subspecies and structured populations; and fourth, computation of match probability Correspondencestatistics for specifi c profi les derived from white and black rhinoceroses. We established a reference database consisting of 3,085 genotypes of white rhinoceros (C. simum ) and 883 black rhinoceros (D. bicornis ) sampled since 2010 which provide the basis for robust match probability statistics.The effects of historic range contractions or expansions, migration, translocation andpopulation fragmentation caused by poaching and habitat reduction on rhinoceros population genetic structure have been reported but are limited [6–8]. Southern whiterhinoceros are traditionally considered panmictic and comprising a single subspecies, C. s. simum , as a result of the severe founder effect in the late 19th century [2]. Black rhinoceros are generally subdivided into three modern subspecies, D.b. bicornis , D.b. michaeli and D.b. minor [8]. Population structure of white and black rhinoceros based upon three different analyses (Supplemental information) affi rmed the partition of white versus black rhinoceros species plus the separation of the three black rhinoceros subspecies. The STRUCTURE algorithm revealed a fi ne grain distinctiveness between black rhinoceros D.b. minor populations from Zimbabwe and KwaZulu-Natal (KZN), South Africa and also indicates that black rhinoceros in the Kruger National Park (KNP) are comprised of a mix of KZN and Zimbabwe rhinoceros as expected, since KNP black rhinoceros founders originated from these two locales [9].For forensic match applications, we calculated allele frequencies for all polymorphic unlinked loci for white (3,085 genotypes) and black rhinoceros (883 genotypes). These estimates and other STR locus statistics were calculated for each rhinoceros species. Population differentiation (F ST ) between white and black rhinoceros subspecies supports the recognition of theSouthern white rhinoceros subspecies (C. s. simum ), and three blackrhinoceros subspecies, D.b. bicornis , D.b. michaeli and D.b. minor , with signifi cant partitioning of the Zimbabwe versus KZN D.b. minor populations in the present Africanrhinoceros populations.Current BiologyMagazineR14 Current Biology 28, R1–R16, January 8, 2018Over 5,800 rhinoceros crime cases have been submitted to RhODIS ® since 2010 and in excess of 120 case reports relating carcass material to evidence items (horn, tissue, blood stains and other confi scated materials) have been provided to investigators. Table 1 summarizes nine of these rhinoceros crime cases which have been concluded in court. These are illustrative of where DNA matches were made and the use of thisevidence for prosecution, conviction and sentencing of perpetrators of rhinoceros crimes. Table 1 includes case sample details, species identifi ed and match probability calculated using the RhODIS ® reference database. Thesuccessful prosecution, conviction and sentencing of suspects in South Africa and other countries affi rm the utilityof the RhODIS ®approach in criminal prosecutions of the perpetrators of illegal rhinoceros trade and provide an international legal precedent for prosecution of rhinoceros crimes using a robust forensic matching of confi scated evidence items to specifi c wildlife crime scenes.SUPPLEMENTAL INFORMATIONSupplemental Information includingexperimental procedures, one figure and one table can be found with this article online at https:///10.1016/j.cub.2017.11.005.Table 1. Summary of nine prosecuted cases of rhinoceros crime. Samples were successfully matched using composite STR genotyping with cumulative match probability calculated using a conservative Theta ( ) of 0.1. Details of case with matching evi-dence items, location of poaching incident, species and subspecies identifi ed, cumulative match probability, status of the case (conviction date: sentence) and the nationalities of the accused are provided for six South African cases and single cases from Kenya, Namibia and Singapore. (KNP – Kruger National Park, SA – South Africa, ORTIA – OR Tambo International Airport, HiP – Hluhluwe-iMfolozi Park, OPC – Ol Pejeta Conservancy, ENP – Etosha National Park). a and b refer to match probability calculations for specifi c white and black rhinoceros summarised in Supple-mental information.REFERENCES1. Emslie, R.H., Milliken, T., Talukdar, B., Ellis,S., Adcock, K., and Knight, M.H. (2016). African and Asian Rhinoceroses - Status,Conservation and Trade. In A Report from the IUCN Species Survival Commission (IUCN SSC) African and Asian Rhino SpecialistGroups and TRAFFIC to the CITES Secretariat pursuant to Resolution Conf. 9.14 (Rev. CoP15).2. Player, I. (2013). The White Rhino Saga,(Johannesburg: Jonathan Ball Publishers). 3. Walker, C., and Walker, A. (2012). The RhinoKeepers, (Johannesburg: Jacana Media). 4. Milliken, T., and Shaw, J. (2012). The SouthAfrica – Viet Nam Rhino Horn Trade Nexus: A deadly combination of institutional lapses, corrupt wildlife industry professionals and Asian crime syndicates. TRAFFIC, Johannesburg, South Africa.5. Harper, C.K., Vermeulen, G.J., Clarke, A.B.,De Wet, J.I., and Guthrie, A.J. (2013).Extraction of nuclear DNA from rhinoceros horn and characterization of DNA profi ling systems for white (Ceratotherium simum ) and black (Diceros bicornis ) rhinoceros. Forensic Sci. Int. Genet. 7, 428–433.6. Anderson-Lederer, R.M., Linklater, W.L., andRitchie, P .A. (2012). Limited mitochondrial DNA variation within South Africa’s black rhino (Diceros bicornis minor ) population and implications for management. Afr. J. Ecol. 50, 404–413.7. Kotzé, A., Dalton, D.L., Du Toit, R.,Anderson, N., and Moodley, Y. (2014). Genetic structure of the black rhinoceros (Diceros bicornis ) in south-eastern Africa. Conserv. Genet. 15, 1479–1489.8. Moodley, Y., Russo, I.R.M., Dalton, D.L., Kotzé,A., Muya, S., Haubensak, P ., Bálint,B., Munimanda, G.K., Deimel,C., Setzer, A.,et al . (2017). Extinctions, genetic erosion and conservation options for the black rhinoceros (Diceros bicornis ). Sci. Rep. 7, 41417.9. Hall-Martin, A. (1988). Conservation of the blackrhino: the strategy of the National Parks Board of South Africa. Quagga 1, 12–17.1Faculty of Veterinary Science, University of Pretoria, Onderstepoort 0110, South Africa. 2Theodosius Dobzhansky Center for Genome Bioinformatics, St. Petersburg State University, St. Petersburg, Russia 199004. 3IUCN SSC African Rhino Specialist Group, Hilton 3245, South Africa. 4National Prosecuting Authority, Silverton 0184, South Africa. 5Veterinary Wildlife Services, South African National Parks, Skukuza, South Africa. 6Ezemvelo KZN Wildlife, Queen Elizabeth Park, Pietermaritzburg 3201, South Africa. 7South African Police Service, Stock Theft and Endangered Species Unit, Pretoria 0001, South Africa. 8Ministry of Environment and Tourism, Windhoek, Namibia. 9Kenya Wildlife Service, Nairobi 00100, Kenya. 10Lowveld Rhino Trust, Harare, Zimbabwe. 11WWF: African Rhino Programme, Cape Town, South Africa. 12Smithsonian Conservation Biology Institute, 3001 Connecticut Ave NW,Washington, DC 20008, USA. 13Guy Harvey Oceanographic Center, Nova Southeastern University, 8000 North Ocean Drive, Ft Lauderdale, FL 33004, USA.*E-mail: ******************.za。

清华大学电子工程系


2012-4-21 Liu Jia
11
The State of the Art
Robust Speech Recognition
Microphone array Normalization Channel normalization by cepstral mean subtraction RASTA (Relative Spectra) Vocal tract length normalization for speaker variations PMC (Parallel Model Combination) Speech enhancement technology Speech enhancement + PMC At present no good effective methods
S
Discriminative Training
Minimum Classification Error Rate (MCE) Minimum Discrimination Information (MDI) Maximum Mutual Information (MMI)
F ( λ ) = log
State-specific-based confusion classes are collected. The state-specific linear discriminant transformation is used. The discrimination of the acoustic models are improved by using discriminant analysis.
( w ,τ ,t )
2012-4-21 Liu Jia

美国关岛国际机场及其办公室均接获一


Centauri/
Your assignment, translate this to Arcturan:
1a. ok-voon ororok sprok . 1b. at-voon bichat dat . 2a. ok-drubel ok-voon anok plok sprok . 2b. at-drubel at-voon pippat rrat dat . 3a. erok sprok izok hihok ghirok . 3b. totat dat arrat vat hilat . 4a. ok-voon anok drok brok jok . 4b. at-voon krat pippat sat lat . 5a. wiwok farok izok stok . 5b. totat jjat quat cat . 6a. lalok sprok izok jok stok . 6b. wat dat krat quat cat . farok crrrok hihok yorok clok kantok ok-yurp 7a. lalok farok ororok lalok sprok izok enemok . 7b. wat jjat bichat wat dat vat eneat . 8a. lalok brok anok plok nok . 8b. iat lat pippat rrat nnat . 9a. wiwok nok izok kantok ok-yurp . 9b. totat nnat quat oloat at-yurp . 10a. lalok mok nok yorok ghirok clok . 10b. wat nnat gat mat bat hilat . 11a. lalok nok crrrok hihok yorok zanzanok . 11b. wat nnat arrat mat zanzanat . 12a. lalok rarok nok izok hihok mok . 12b. wat nnat forat arrat vat gat .

动态系统中的变化率:第二语言发展中的时段

66现代交际• 2017年13期动态系统中的变化率:第二语言发展中的时段周玥(中国海洋大学外国语学院山东青岛266100)摘要:在动态系统理论视域下,二语习得是一个动态发展的非线性复杂系统。

第二语言发展发生于不同但相互作用的时段中,从几十年的生命跨度到几毫秒的大脑活动,均可作为语言发展的时段之一。

由于时段之间的交互作用,研究者常基于多个重点时段,对语言现象进行研究。

研究发现,在二语发展这一动态系统中,含有多个子系统,各子系统有自己的时段和变化率,而各时段间的交互作用是语言发展的核心之一Q基于时段,对语言学习者的学习动机进行研究,实际意义重大。

关键词:第二语言发展时段交互作用变化率动态系统中图分类号:H319文献标识码:A文章编号:1009-5349 (2017) 13-0066-03一、时段与时间的分形性时段和时间窗口是两个不同概念。

时段指的是发展过程中的间隔尺寸;从宏观角度出发,着眼于一生的发展变化,以众多时刻作为样本。

而时间窗口指的是研究的持续时间。

比如,研究人的一生,时间窗口是贯穿其终生的一整段时期,而时段可用于查验不同时期的发展变化。

譬如在一项研究中,研究者着眼于研究学习者在两年间语音发展的情况(时间窗口),并每周评估其表现情况(时段)。

(一)时段对于人类发展的研究而言,仅有一小部分时段是与之相关的。

针对时段,Lem ke曾总结了一份极具实用性的概要(图1),结果表明与人类生存相关的时段在10-5秒到109秒之间。

Lem ke着眼于研究在学校这一背景下,时段是如何组合及产生交互作用的。

其中,尤其是10-1-107这一段时段是与之密切相关的。

(二)时间的分形性其他相关文章中曾提及,人类一生中,语言系统的子系统在所有的时段中,均是不断发展的。

时间具有分形性,从某种意义上说,时间是无标度的。

这意味着尽管研究者可着眼于以年或毫秒,或以在此之间的其他单位作为时段,但没有任何尺度可用来界定语言的发展。

  1. 1、下载文档前请自行甄别文档内容的完整性,平台不提供额外的编辑、内容补充、找答案等附加服务。
  2. 2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
  3. 3、如文档侵犯您的权益,请联系客服反馈,我们会尽快为您处理(人工客服工作时间:9:00-18:30)。

Recent Developments in the Emu Speech Database SystemLasse Bombien1, Steve Cassidy2, Jonathan Harrington1, Tina John1, Sallyanne Palethorpe31. Institut für Phonetik und Sprachliche Kommunikation, University of Munich, Germany.2. Department of Computing, Macquarie University, Sydney.3. Macquarie Centre for Cognitive Science, Macquarie University, Sydney.AbstractThe Emu Speech Database system is a set of software tools developed to supportresearch in acoustic phonetics and other corpus based speech research. This paperdescribes recent updates that have been made to the latest release of the software.1.IntroductionThe EMU Speech Database System, which has been developed over a number of years (e.g., McVeigh & Harrington 1992; Harrington, Cassidy, Fletcher and McVeigh, 1993; Harrington & So, 1994; Cassidy, 1999; Cassidy & Harrington, 1996, 2001; Harrington, Cassidy, John and Scheffers., 2003) is an integrated set of tools for creating, querying and analysing annotated speech corpora. In common with many speech software systems, it includes facilities for annotating speech data from synchronized waveform and spectrographic displays, as well as a library of routines for digital speech processing. It differs from these, however, in allowing hierarchical and autosegmental annotations and in providing a query language for extracting annotations from these structures and their associated signal files. Since 1988, when it was first developed as ‘Acoustic Phonetics in S’ at CSTR, Edinburgh University (Taylor, Caley, Black and King., 1999), EMU has provided a transparent interface to a powerful graphical and statistical programming language (first ‘S’, then ‘Splus’ and since the mid 1990s, the R programming environment - R Development Core Team, 2006). Another distinguishing feature of EMU is its organization of a speech database in a so-called template file, which specifies the physical location of the signal files and the types of hierarchical and autosegmental relationships between sets of labels. The template file was included to facilitate the sharing of speech corpora across different users, working at different sites, and possibly on different platforms. Another feature, which is unique to EMU, is the possibility of building various kinds of labeling structures automatically using the Tcl scripting language (Welch, Jones and Hobbs, 2003). More recently, an interface has been built to Praat, a computational system for doing phonetics (Boersma & Weenink, 2001), allowing Praat labeling structures to be converted into EMU and vice-versa (Harrington et al., 2003).2.Current EMU structureIn the last three years, EMU 2.0 has been provided with a new internal structure (Fig. 1) that allows individual parts of the system to be modified and tested independently. Both the front-end and the back-end have been designed in a modular way and can be extendedwithout much effort.Figure 1: the new EMU 2.0 structure.In earlier versions, the EMU back-end consisted of three parts providing:(1) the database functionality,(2) signal processing (trackdata)(3) signal display (padgraph).These parts were integrated into the Tcl interpreter differently depending on the platform, e.g., by loading shared libraries on Windows while *nix (all unix derivates) used custom interpreters with built-in functionality. Subsequently, Snack (Sjölander, 2000) was used for displaying waveform and spectrographic data. One of the advantages of Snack is that it is easily extendable to support, for example, the SSFF file format frequently used in EMU; another is that it provides the facility for zoomable spectrograms. Accordingly, the newEMU back-end consists of the platform independent Tcl extensions, emucore (database functionality) and snack. Two more extensions are planned as part of the back-end:(1) In order to facilitate the use of Praat from within EMU, an extension tclpraat has been developed. This extension is used to send commands to Praat via sendpraat (Boersma and Weenink, 2006)(2) So far, the emu application tkassp for signal processing has made use of the assp tools (advanced speech signal processing) by invoking subprocesses. As the assp tools are currently being rewritten as a library supporting 64 bit systems, a new extension tclassp will be used in future rather than calling subprocesses. The assp library will also be used to add support for the SSFF file format in snack.The EMU front-end has undergone significant changes, too. While in earlier releases, all applications had to be invoked separately, EMU 2.0 includes a front-end dbemu (see Fig. 2) which interfaces with all current EMU applications and provides an overview of all accessible databases and their utterances. dbemu constitutes the new core of all applications while the former core, the EMU labeller, is now one application among others in the EMU system. All EMU applications are separate Tcl packages which can share their functionality. While all of the applications can be launched from within dbemu, most of them can still be started on their own.Figure 2: the new EMU 2.0 frontend – thedbemu interface.A major problem for collaborative work between groups of researchers on the same database was in how to store the template file and the speech data and annotations. For EMU 2.0, template files can be stored anywhere that the computer has access to. Thus, neither the speech data and annotations nor the template file needs to be stored on the machine that runs the EMU system. The various paths to the template files are accessible and can be defined using a new gui emu-conf, while the location of the different databases is viewable directly from the dbemu database list. The new internal structure of the EMU system removes the limitations of the earlier system on the size of signals that can be annotated with EMU.From dbemu there is an interface to a gui GTemplateEditor for creating template files in which the user fills in information about the location of the signal and annotation files of the database, provides information about the type of annotation structure, and specifies both the types of display that should appear when the database is accessed as well as any Tcl scripts for building annotation structures. After a database template has been written, it shows up as the corresponding database name in dbemu.3.Signal ProcessingWhile in other speech processing systems, parametric data such as formants and pitch are often calculated on the fly, EMU makes use of the assp tools or their graphical interface tkassp for calculating derived signal files. The assp tools include routines for filtering, and for pitch, formant, and spectral analysis. A library with these functions is currently in development to replace the present command line programs. tkassp can be launched either in standalone mode, or else dbemu can be used to pass a list of utterances from the database to tkassp for signal processing. tkassp offers multiple types of input selection: single files, directories, file and directory lists, EMU utterance lists, and EMU segment lists. The output files can be stored by either specifying a single output directory but also by maintaining the directory structure of the input. The latter is specifically useful for complex database structures. The output files of tkassp can easily be added to the database as additional tracks using the database template. There is also a tcl-script for conversion of raw EPG3 (electropalatographic) files to the SSFF format that can be interpreted by EMU. Finally, EMU allows both formant and fundamental frequency tracks to be manually corrected with a mouse and updated.4.AnnotationLoading the database causes all that database’s utterances to appear. These utterances can be opened by the EMU labeller tool, in which annotation is supported by the Tcl/Tk programming language (Welch et al., 2003), or else they can be opened by the other integrated tools wavesurfer and praat.The EMU labeller supports an annotation model that allows hierarchical relations between labels. EMU distinguishes timeless annotation levels from time-bounded Event and Segment annotation levels. All levels can have hierarchical relations to each other; this enables rich annotation structures (Cassidy and Harrington, 2001; Harrington et al., 2003).Further development will introduce new annotation views that are more suitable to larger scale annotations that are needed, for example, in dialogue analysis, while retaining the same ability to integrate detailed phonetic or prosodic annotations into a hierarchical structure.All annotation levels, their hierarchical relationships, view options and relationship to time are defined in the database template using a graphical template editor (GTemplateEditor). The template editor also allows legal labels to be specified as well as labels to be decomposed into features.In previous versions, it was possible to edit EMU annotations only using the EMU labeller, but in this most recent version, all time-dependent levels can be edited in Praat(Boersma and Weenink, 2001) and in WaveSurfer(Sjölander & Beskow, 2005) without losing the hierarchical structure of the EMU annotation.WhileWaveSurfer innately supports the label format produced by ESPS, Waves+ and EMU, for Praat the label files are converted to textgrids (and back) on the fly. Label levels that are not directly anchored in time, i.e. that inherit their times from other labels, are necessarily disregarded when editing in Praat/WaveSurfer(since they can only deal with time-bound labels). However, on quitting Praat/WaveSurfer, the EMU hierarchy will be updated and where necessary rearranged.The EMU system has provision for including custom Tcl scripts for automatic annotation (AutoBuild-Scripts) or more complex modules by appropriate definitions in the database template. Some predefined scripts are integrated in the EMU system as well as two complete modules: epgdisplay for palatogram visualization and hier2sigview for displaying timeless annotation levels in the signal view of the EMU labeler.AutoBuild-Scripts are used to automatically build hierarchical relations between annotation levels using temporal alignment, dictionaries etc. A new graphical tool, the AutoBuild wizard, facilitates an easy deployment of AutoBuild-Scripts over an entire database or selected parts of it, eliminating the need to load each utterance for this task. Any AutoBuild-Script can be used from within the wizard, that is, not necessarily the one defined in the database template.5.Querying the databaseThe EMU query language (Cassidy & Bird, 2000) enables researchers to isolate speech segments based on both sequential and hierarchical context. There can be simple queries such as ‘return all segments from the Phonetic level’ but also complex queries like ‘return all segments from the Phonetic level that precede a vowel nasal sequence in intermediate phrase-initial words following the definite article’.The EMU 2.0 Query Tool provides a graphical query tool queryGUI that assists the user by creating a query string, thus removing the need for the user to learn the syntax and semantics of the query language itself. With a few simple mouse clicks, a query string can be created and sent to the EMU Query Tool that runs the queries over the chosen utterances of the database.Each query results in an utterance list where all query matching segments are listed with their onset and offset times and the utterance they belong to. For these segments, signals can be extracted from within the EMU Query Tool. All the results can be saved to files for further analysis. The EMU Query Tool and thus the queryGUI are accessible from the dbemu interface.6.Analysis- R statisticalEnvironmentThe EMU system provides a package within the R programming language for statistical computing. R is similar to the S language and environment which was developed at Bell Laboratories (formerly AT&T, now Lucent Technologies) by John Chambers and colleagues. The EMU-R package enhances R by including the facility for querying annotations and extracting signal data from EMU databases (see Harrington, forthcoming, for numerous examples of formant, spectral, and EPG analyses in R). Querying can be done either directly from within R or by loading stored output files of the graphical query tool. Query results and signal data can be further manipulated, displayed and statistically evaluated in R. Additional functions for plotting and signal processing are also included. In future, it will be possible to make use of R's Tcl/Tk interface, in order to launch (graphical) EMU tools from within R.7.ConclusionsEMU 2.0 provides a user-friendly access to all EMU functions using graphical user interfaces, a new internal structure in the programming code, and the integration of other third part tools Praat and Wavesurfer. EMU runs on Windows and UNIX platforms and can be downloaded from the EMU sourceforge website () together with some databases (see also Harrington, forthcoming). Current development includes completing the port to the Macintosh OS X platform.8.ReferencesBoersma, P. and Weenink D. (2001), Praat, a system for doing phonetics by computer, Tech. Report 132, Inst.Phonetic Sciences, Univ. Amsterdam. Retrieved on 2006-11-21 from /Boersma, P. and Weenink D. (2006), Sendpraat: sending messages to a Praat shell program(Code Version February17, 2006) [Computer Program],Retrieved on 2006-11-21 fromhttp://www.fon.hum.uva.nl/praat/sendpraat.html. Cassidy, S. (1999), Compiling multi-tiered speech databases into the relational model: experiments with the EMU system. In Proc.Eurospeech, 2238-2242.Budapest, Hungary.Cassidy, S. and Bird S. (2000), Querying databases of annotated speech. In Proc. Eleventh Australasian Database Conference, 22, 12-20. Canberra, Australia. Cassidy, S. and Harrington, J. (1996). EMU: an enhanced hierarchical speech database management system. In Proc. Sixth Australian International Conference on Speech Science and Technology, 361-366. Adelaide, Australia.Cassidy, S. and J. Harrington (2001), Multi-level annotation in the Emu speech database management system, Speech Communication, 33, 61-77. Harrington, J. (forthcoming). The Phonetic Analysis of Speech Corpora.Blackwell. Retrieved on 2006-11-21 from http://www.phonetik.uni-muenchen.de/~jmh/research/emupapers/pasc.htm Harrington J., Cassidy S., Fletcher J. and McVeigh A.(1993), The mu+ system for corpusbased speech research, Computer Speech and Language, 7, 305-331. Harrington, J., Cassidy, S., John, T. and Scheffers, M.(2003). Building an interface between EMU and Praat:a modular approach to speech database analysis.International Congress of Phonetic Sciences.Barcelona, Spain.Harrington, J. & So, L. (1994). Some design criteria in segmenting and labelling a database of spoken Cantonese. In Proc. Fifth Australian InternationalConference on Speech Science and Technology. 215-220. Perth, Australia.McVeigh, A. and Harrington, J. (1992). The mu+ system for speech database analysis. In Proc. Fourth Australian International Conference on Speech Science and Technology. 548-553. Brisbane, Australia R Development Core Team (2006).R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing.Sjölander, K. (2000), The Snack sound extension for Tcl/Tk, Retrieved on 2006-11-21 fromhttp://www.speech.kth.se/snack/Sjölander, K. and Beskow, J. (2005). WaveSurfer - an open source speech tool,Retrieved on 2006-11-21 fromhttp://www.speech.kth.se/wavesurfer/Taylor, P., Caley R., Black A. W., and King S. (1999).Edinburgh Speech Tools Library, System Documentation Edition1.2. CSTR, University of Edinburgh.Welch, B. B., Jones K., and Hobbs J. (2003). Practical Programming in Tcl and Tk (4 ed.) Englewood Cliffs, N.J.: Prentice Hall.。

相关文档
最新文档