Interaction with texts Information retrieval as information-seeking behavior

合集下载

综合教程 Unit6 Is an Only Child a Lonely Child?

综合教程 Unit6 Is an Only Child a Lonely Child?

Detailed reading
Is an Only Child a Lonely Child?
1. Many couples, who already have one healthy, happy child, are facing a dilemma, namely, the issue of whether or not to have a second child. They have demanding jobs and limited time and financial resources, but they are also very keen to ensure that their only child does not become a lonely child. So, what are the pros and cons of having a second child? Is an only child a lonely child? That is what so many couples are very much concerned about and eager to understand.
Reading aloud Cultural information Audiovisual supplement
2. Are you from a one-child family? Can you imagine living with so many brothers and sisters? Open.
Text analysis Structural analysis
Structural analysis
1. Divide the text into parts by completing the table.

如何利用机器学习解决文本情感分析问题

如何利用机器学习解决文本情感分析问题

如何利用机器学习解决文本情感分析问题机器学习是一种使用算法和统计模型来训练计算机系统,使其能够从大量数据中自动学习和改进的方法。

在过去的几年里,机器学习被广泛应用于各种领域,包括文本情感分析。

文本情感分析是指对文本进行解析和理解,以确定其中所包含的情感和情绪。

为了解决文本情感分析问题,机器学习可以被用来建立模型以自动识别和分类文本中的情感。

以下是一些利用机器学习解决文本情感分析问题的方法:1. 构建情感词典:情感词典是一个包含了大量情感词汇的词典,每个词汇都与一种情感相关联。

构建情感词典的方法可以利用机器学习算法,通过对大量文本数据进行训练,自动找到与情感相关的词汇。

通过这种方式,可以快速判断一段文本的情感倾向。

2. 使用监督学习进行分类:监督学习是一种常见的机器学习方法,在文本情感分析中也被广泛应用。

该方法通过使用已标记的训练数据,即带有情感标签的文本,训练一个分类模型。

这样的模型可以根据文本的特征(例如词汇、语法结构等)来预测文本的情感。

3. 应用神经网络:神经网络在近年来在机器学习领域取得了显著的进展,其在文本情感分析中也有很好的应用。

通过使用神经网络模型,可以更好地捕捉文本之间的复杂关系,提高情感分析模型的准确性。

例如,可以使用循环神经网络(RNN) 或者卷积神经网络 (CNN) 来对文本进行建模,并输出相应的情感分类结果。

4. 结合词向量技术:词向量是一种将词汇嵌入到一个向量空间中的技术,以便计算机能够更好地理解和处理文本。

利用词向量技术,可以将文本转化为数值表示,然后应用各种机器学习算法进行情感分析。

常用的词向量技术包括 Word2Vec 和GloVe 等。

这些技术能够捕捉到词汇之间的语义和上下文关系,对于情感分析任务的准确性和效果有着积极的影响。

总之,机器学习是解决文本情感分析问题的一种有效方法。

通过构建情感词典、使用监督学习进行分类、应用神经网络以及结合词向量技术,我们可以建立准确并高效的文本情感分析模型。

characterized interactions -回复

characterized interactions -回复

characterized interactions -回复Characterized interactions refer to the various forms of exchanges and relationships between individuals, groups, or even countries that are marked by distinct traits or qualities. These interactions can take place in various spheres of life, such as personal, social, economic, or political. In this article, we will explore the concept of characterized interactions and provide a step-by-step analysis of their different aspects.Step 1: Understanding Characterized Interactions Characterized interactions are essentially interactions that are distinguished by specific traits or features. These traits can include the nature of the relationship, the intensity of the interaction, the purpose or goal of the interaction, and the impact it has on the parties involved.Step 2: Types of Characterized InteractionsThere are numerous types of characterized interactions, depending on the context in which they occur. Some common types include: 1. Personal interactions: These interactions occur between individuals on a personal level, such as family members, friends, or romantic partners. They are characterized by emotional closeness,trust, and shared experiences.2. Social interactions: These interactions occur within a society or community and can be both formal and informal. Examples include interactions at social gatherings, in educational institutions, or at workplaces. They are characterized by social norms, expectations, and hierarchies.3. Economic interactions: These interactions occur in the realm of business and trade. They involve transactions, negotiations, and collaborations between individuals, companies, or even countries. They are characterized by economic motives, competition, and mutual benefits.4. Political interactions: These interactions take place in the political arena and involve interactions between governments, political parties, or international organizations. They are characterized by power dynamics, diplomacy, and negotiation.Step 3: Factors Influencing the Characterization of Interactions Several factors can influence the characterization of interactions. These factors can shape the nature, intensity, and outcome of the interactions. Some key factors include:1. Culture: Cultural norms and values play a significant role in shaping interactions. Different cultures may prioritize differentvalues, such as individualism, collectivism, or hierarchy, which can impact the way interactions are conducted.2. Power dynamics: Power imbalances can significantly affect the characterization of interactions. Interactions between individuals or groups with unequal power may involve domination, oppression, or exploitation.3. Context: The context in which interactions take place, such as the environment, time, or specific circumstances, can have a significant impact on their characterization. For example, interactions during a crisis may be characterized by urgency or cooperation.4. Communication: Effective communication is crucial for successful interactions. The way information is exchanged, interpreted, and understood can shape the dynamics and outcomes of interactions.Step 4: Importance of Characterized Interactions Characterized interactions play a vital role in human society and have several important implications:1. Building relationships: Interpersonal interactions are essential for building and maintaining relationships. These interactions lay the foundation for trust, empathy, and mutual understanding.2. Social cohesion: Interactions within communities foster a sense of belonging and social cohesion. They allow individuals to bond,collaborate, and work towards common goals.3. Economic development: Economic interactions drive trade, innovation, and growth. They facilitate the exchange of goods, services, and knowledge, creating opportunities for economic development.4. Conflict resolution: Political interactions can be instrumental in resolving conflicts and fostering peace. Dialogue, negotiation, and diplomacy are essential tools in addressing differences and promoting cooperation.Step 5: Challenges in Characterized InteractionsWhile characterized interactions have significant benefits, they also face challenges that can hinder their effectiveness:1. Cultural differences: In a globalized world, interactions often involve individuals from diverse cultural backgrounds. These cultural differences can lead to misunderstandings or conflicts if not managed effectively.2. Power inequalities: Unequal power dynamics can create barriers to effective interactions. Marginalized groups may struggle to assert themselves or have their voices heard.3. Miscommunication: Inadequate communication can lead to misunderstandings, mistrust, or misinterpretation of intentions.Clear and effective communication is essential for successful interactions.4. Conflict of interests: Different parties may have conflicting interests or goals, leading to disagreements or competition. Effective negotiation and compromise are necessary to navigate these challenges.In conclusion, characterized interactions are a fundamental part of human life in various spheres, including personal, social, economic, and political. Understanding and effectively navigating these interactions require considering factors such as culture, power dynamics, context, and communication. While characterized interactions have several benefits, they also face challenges that need to be addressed for successful outcomes. By recognizing the importance of characterized interactions and actively working towards positive and inclusive interactions, individuals and societies can foster understanding, collaboration, and growth.。

机器学习与数据挖掘笔试面试题

机器学习与数据挖掘笔试面试题
What is a decision tree? What are some business reasons you might want to use a decision tree model? How do you build a decision tree model? What impurity measures do you know? Describe some of the different splitting rules used by different decision tree algorithms. Is a big brushy tree always good? How will you compare aegression? Which is more suitable under different circumstances? What is pruning and why is it important? Ensemble models: To answer questions on ensemble models here is a :
Why do we combine multiple trees? What is Random Forest? Why would you prefer it to SVM? Logistic regression: Link to Logistic regression Here's a nice tutorial What is logistic regression? How do we train a logistic regression model? How do we interpret its coefficients? Support Vector Machines A tutorial on SVM can be found and What is the maximal margin classifier? How this margin can be achieved and why is it beneficial? How do we train SVM? What about hard SVM and soft SVM? What is a kernel? Explain the Kernel trick Which kernels do you know? How to choose a kernel? Neural Networks Here's a link to on Coursera What is an Artificial Neural Network? How to train an ANN? What is back propagation? How does a neural network with three layers (one input layer, one inner layer and one output layer) compare to a logistic regression? What is deep learning? What is CNN (Convolution Neural Network) or RNN (Recurrent Neural Network)? Other models: What other models do you know? How can we use Naive Bayes classifier for categorical features? What if some features are numerical? Tradeoffs between different types of classification models. How to choose the best one? Compare logistic regression with decision trees and neural networks. and What is Regularization? Which problem does Regularization try to solve? Ans. used to address the overfitting problem, it penalizes your loss function by adding a multiple of an L1 (LASSO) or an L2 (Ridge) norm of your weights vector w (it is the vector of the learned parameters in your linear regression). What does it mean (practically) for a design matrix to be "ill-conditioned"? When might you want to use ridge regression instead of traditional linear regression? What is the difference between the L1 and L2 regularization? Why (geometrically) does LASSO produce solutions with zero-valued coefficients (as opposed to ridge)? and What is the purpose of dimensionality reduction and why do we need it? Are dimensionality reduction techniques supervised or not? Are all of them are (un)supervised? What ways of reducing dimensionality do you know? Is feature selection a dimensionality reduction technique? What is the difference between feature selection and feature extraction? Is it beneficial to perform dimensionality reduction before fitting an SVM? Why or why not? and Why do you need to use cluster analysis? Give examples of some cluster analysis methods? Differentiate between partitioning method and hierarchical methods. Explain K-Means and its objective? How do you select K for K-Means?

英文文章情感分析技术研究

英文文章情感分析技术研究

英文文章情感分析技术研究随着互联网技术的发展,我们面对各种信息获取途径,其中以网络文章最为方便快捷。

而英文文章作为全球通用语言之一,在网络文章中也占有很大的比例。

但是,在每天阅读大量英文文章时,我们需要花费较多的时间去理解文章的情感倾向,这也成为了许多人的困扰。

因此,如何进行英文文章情感分析成为了一个热门话题。

英文文章情感分析技术的研究,不仅可以节省读者大量的时间,还可以在商业领域、投资领域、社交媒体分析、电子商务和在线广告等方面得到广泛的应用。

一、英文文章情感分析的概念和意义英文文章情感分析,是通过自然语言处理技术,对文本内容进行识别、处理和分析,从中获取相关情感信息。

主要包括对文章的语气、情感倾向等进行分析,从而帮助人们更快、更好地理解文章的内涵,确定文章的立场和情感倾向。

英文文章情感分析技术在人工智能领域中的应用十分广泛。

在金融投资领域,可以帮助投资者更好地分析市场情况,从而做出更明智的投资决策;在商业领域,可以对顾客评论进行分析,从而改进产品设计和营销策略;在社交媒体分析方面,可以帮助企业更好地了解用户需求,提高品牌曝光率等。

二、英文文章情感分析技术的基本原理英文文章情感分析技术的基本原理包括自然语言处理、文本分类、情感词典、机器学习等技术。

具体流程如下:(1)语言处理:首先,需要对文章文本进行处理并提取有效特征。

如,删除中文字符、拼写纠错、词干提取、去除停用词等。

(2)文本分类:然后根据分类算法进行有效的文本分类,如支持向量机、决策树、朴素贝叶斯等。

(3)情感词典:情感词典是一套用于情感分析的自然语言词典,包括积极、消极、中性等情绪词汇。

情感词典的建立,主要是通过专门的调研和采集人工标注数据集,从而建立起符合各类情感分类的词库。

(4)机器学习:最后是对类别进行机器学习,通过不断训练提升算法的准确性,从而输出更有意义和正确的分类。

三、英文文章情感分析技术的研究进展英文文章情感分析技术在最近几年内得到了快速的发展,下面对几个具有代表性的例子进行介绍。

active vocabulary英文定义

active vocabulary英文定义

主题:活跃词汇的英文定义及其重要性活跃词汇是指在日常交流和书面表达中频繁使用的词汇,它们可以帮助我们更准确、更流利地表达自己的思想和观点。

活跃词汇在语言能力的培养和提高中起着至关重要的作用。

下面将简要阐述活跃词汇的英文定义及其重要性。

一、活跃词汇的英文定义1. Active vocabularyActive vocabulary refers to the words and phrases that a person can readily access and use in spoken and writtenmunication. These are the words that a person isfortable using and that form the core of their everyday language repertoire. Active vocabulary differs from passive vocabulary, which includes words that a person understands but does not regularly use in their own speech or writing.2. Importance of active vocabularyActive vocabulary is essential for effectivemunication in any language. When a person has a strong active vocabulary, theyare able to express themselves clearly and confidently. This not only enhances their ability tomunicate with others, but also improves their overall language proficiency.二、活跃词汇的重要性1. Facilitates clear expressionHaving a robust active vocabulary allows individuals to articulate their thoughts and ideas with precision. When people are able to call upon a wide range of words and phrases, they can convey their message more effectively, avoiding ambiguity and misunderstanding. This is particularly crucial in professional settings, academic pursuits, and everyday conversations.2. Enhances language fluencyA strong active vocabulary contributes to fluency in speaking and writing. When individuals are familiar with a broad array of words, they are able tomunicate more smoothly and coherently. This fluency lends a sense of authority and sophistication to their language use, enabling them to engage in meaningful conversations and produce well-crafted written work.3. Supportsprehension and retentionExpanding one's active vocabulary stimulates mental agility and enhances cognitive abilities. When individuals are able to actively use and understand a wide range of words, they are better equipped toprehend and ret本人n new information. This not only benefits their language skills, but also extends to other areas of learning and knowledge acquisition.4. Fosters intellectual growthA rich active vocabulary is a sign of intellectual prowess and curiosity. When individuals actively engage with language and strive to expand their vocabulary, they demonstrate amitment to self-improvement and intellectual growth. This pursuit of linguistic mastery often correlates with a broader interest in learning and a more sophisticated approach to critical thinking.三、培养活跃词汇的方法1. Reading extensivelyOne of the most effective ways to enrich one's active vocabulary is through extensive reading. Exposure to diverse texts, ranging from literature and non-fiction to news articles and academic journals, introduces individuals to new words and expressions. Regular reading helps to internalize these words and make them part of one's active vocabulary.2. Engaging in meaningful conversationsActively engaging in conversations with a variety of people provides ample opportunities to practice and expand one's active vocabulary. By participating in discussions on various topics, individuals can learn new words and phrases, as well as observe how they are used in context. This reinforces their understanding and usage of these words in their ownmunication.3. Using vocabulary-building toolsUtilizing vocabulary-building tools, such as flashcards, word games, and language learning apps, can 本人d in the acquisition of new words. These tools allow individuals to actively engage with and reinforce their knowledge ofvocabulary, making it easier to ret本人n and use these words in their everydaymunication.4. Seeking feedback and correctionOpenness to feedback and correction from peers, teachers, or language experts can also help individuals refine and expand their active vocabulary. Receiving guidance on the appropriate usage of words and expressions encourages continuous improvement and reinforces the acquisition of new vocabulary.总结:活跃词汇对于语言表达能力的提高至关重要。

2019年考研英语一真题(20200731150050).pdf

2019年英语(一)考研真题Section Ⅰ Use of EnglishDirections:Read the following text. Choose the best word(s) for each numbered blank and markA, B, C or D on the ANSWER SHEET. (10 points)Today we live in a world where GPS systems, digital maps, and other navigation apps are available on our smart phones. 1 of us just walk straight into the woods without a phone. But phones 2 on batteries, and batteries can die faster than we realize. 3 you get lost without aphone or a compass, and you 4 can’t find north, a few tricks to help you navigate 5 to civilization, one of which is to follow the land...When you find yourself well 6 a trail, but not in a completely 7 area, you have to answertwo questions: Which 8 is downhill, in this particular area And where is the nearest watersource Humans overwhelmingly live in valleys, and on supplies of fresh water. 9 , if you head downhill, and follow any H2O you find, you should 10 see signs of people.you may be 11 If you’ve explored the area before, keep an eye out for familiar sights—how quickly identifying a distinctive rock or tree can restore your bearings.Another 12 : Climb high and look for signs of human habitation. 13 , even in denseforest, you should be able to 14 gaps in the tree line due to roads, train tracks, and otherpaths people carve 15 the woods. Head toward these 16 to find a way out. At night, scan the horizon for 17 light sources, such as fires and streetlights, then walk toward the glow of light pollution.frequent, look for the 19 we leave18 , assuming you’re lost in an area humans tend toon the landscape. Trail blazes, tire tracks, and other features can 20 you to civilization.1. [A]Some [B]Most [C]Few [D]All2. [A]put[B]take[C]run [D]come3. [A]Since [B] If [C] Though [D]Until4. [A]formally [B] relatively [C] gradually [D] literally5. [A] back [B] next [C] around [D] away6. [A]onto [B]off[C]across [D]alone7. [A]unattractive[B] uncrowded [C]unchanged [D]unfamiliar8. [A] site[B]point [C]way [D]place9. [A] So [B] Yet [C]Instead [D]Besides10. [A]immediately [B] intentionally [C]unexpectedly [D] eventually11. [A]surprised [B]annoyed [C]frightened [D]confused12. [A] problem [B]option [C]view [D]result13. [A] Above all [B]In contrast [C] On average [D] For example14. [A]bridge [B]avoid [C]spot [D]separate15. [A] from [B] through [C]beyond [D] under16. [A] posts [B]links [C]shades [D]breaks17. [A] artificial [B] mysterious [C] hidden [D] limited18. [A] Finally [B] Consequently [C] incidentally [D] Generally19. [A] memories [B] marks [C] notes [D] belongings20. [A] restrict [B] adopt [C] lead [D] exposeSection Ⅱ Reading ComprehensionPart ADirections:Read the following four texts. Answer the questions below each text by choosing A, B,C or D. Mark your answers on the ANSWER SHEET. (40 points)Text 1Financial regulations in Britain have imposed a rather unusual rule on the bosses of bigbanks. Starting next year, any guaranteed bonus of top executives could be delayed 10 yearsif their banks are under investigation for wrongdoing. The main purpose o f this “clawback” rule is to hold bankers accountable for harmful risk-taking and to restore public trust infinancial institution. Yet officials also hope for a much larger benefit: more long termdecision-making not only by banks but also bu all corporations, to build a stronger economyfor future generations.“Short-termism” or the desire for quick profits, has worsened in publicly tradedcompanies, says the Bank of England’s top economist. Andrew Haldane. He quotes a giant ofclassical economies, Alfred Marshall, in describing this financial impatience as acting like“Children who pick the plums out of their pudding to eat them at once” rather than pu them aside to be eaten last.The average time for holding a stock in both the United States and Britain, he notes, hasdropped from seven years to seven months in recent decades. Transient investors, whodemand high quarterly profits from companies, can hinder a firm’s efforts to invest in long-term research or to build up customer loyalty. This has b een dubbed “quarterlycapitalism”.In addition, new digital technologies have allowed more rapid trading of equities,quicker use of information, and thus shortens attention spans in financial markers.seems to be a predominance of short-term thinking at the expense of long-t erm investing,” said Commissioner Daniel Gallagher of the US Securities and Exchange Commission in speechthis week.In the US, the Sarbanes-Oxley Act of 2002 has pushed most public companies to defer performance bonuses for senior executives by about a year, slightly helping reduce“short-termism.” In its latest survey of CEO pay, The Wall Street Journal finds thatsubstantial part” of executive pay is now tied to performance.-termism,” such as changes in the tax Much more could be done to encourage “longcode and quicker disclosure of stock acquisitions. In France, shareholders who hold onto acompany investment for at least two years can sometimes earn more voting rights in acompany.Within companies, the right compensation design can provide incentives for executivesto think beyond their own time at the company and on behalf of all stakeholders. Britainnew rule is a reminder to bankers that society has an interest in their performance, not justfor the short term but for the long term.21. According to Paragraph 1, one motive in imposing the new rule is the_________.A. enhance banker’s sense of responsibilityB. help corporations achieve larger profitsC. build a new system of financial regulationD. guarantee the bonuses of top executives22. Alfred Marshall is quoted to indicate_________.A. the conditions for generating quick profitsB. governments ’ impatience in decision -makingC. the solid structure of publicly traded companiesD. “short -termism ” in economics activi ties23. It is argued that the influence of transient investment on public companies canbe__________.A. indirectB. adverseC. minimalD. temporary24. The US and France examples are used to illustrate____________.A. the obstacles to preventing “short -termism ”.B. the significance of long-term thinking.C. the approaches to promoting “long -termism ”.D. the prevalence of short-term thinking.25. Which of the following would be the best title for the textA. Failure of Quarterly CapitalismB. Patience as a Corporate VirtueC. Decisiveness Required of Top ExecutivesD. Frustration of Risk-taking BankersText 2Grade inflation--the gradual increase in average GPAs(grade-point averages) over thepast few decades —is often considered a product of a consumer era in higher education, inwhich students are treated like customers to be pleased. But another, related force —a policyoften buried deep in course catalogs called “grade forgiveness ”—is helping raise GPAs.Grade forgiveness allows students to retake a course in which they received a low grade,and the most recent grade or the highest grade is the only one that counts in calculating astudent’s overall GPA.The use of this little-known practice has accelerated in recent years, as colleges continueto do their utmost to keep students in school (and paying tuition) and improve theirgraduation rates. When this practice first started decades ago, it was usually limited tofreshmen, to give them a second chance to take a class in their first year if they struggled intheir transition to college-level courses. But now most colleges save for many selectivecampuses, allow all undergraduates, and even graduate students, to get their low gradesforgiven.College officials tend to emphasize that the goal of grade forgiveness is less about thegrade itself and more about encouraging students to retake courses critical to their degreeprogram and graduation without incurring a big penalty. “Untimely,” said Jack Miner, Ove more success because they retake aState University’s registrar,“we see students achiecourse and do better in subsequent contents or master the content that allows them tograduate on time.”That said, there is a way in which grade forgiveness satisfies colleges’ own ne For public institutions, state funds are sometimes tied partly to their success on metrics suchas graduation rates and student retention—so better grades can, by boosting figures likethose, mean more money. And anything that raises GPAs will likely make students—who, atthe end of the day, are paying the bill—feel they’ve gotten a better value for their tuitiondollars, which is another big concern for colleges.Indeed, grade forgiveness is just another way that universities are responding tor education. Since students and parents expect a collegeconsumers’ expectations for highedegree to lead a job, it is in the best interest of a school to turn out graduates who are asqualified as possible—or at least appear to be. On this, students’ and colleges’ incenti seem to be aligned.26. What is commonly regarded as the cause of grade inflationA. The change of course catalogs.B. Students’ indifference to GPAS.C. Colleges’ neglect of GPAS.D. The influence of consumer culture.27. What was the original purpose of grade forgivenessA. To help freshmen adapt to college learning.B. To maintain colleges’ graduation rates.C. To prepare graduates for a challenging future.D. To increase universities’ income from tuition.28. According to Paragraph 5,grade forgiveness enable colleges to_________.A. obtain more financial supportB. boost their student enrollmentsC. improve their teaching qualityD. meet local governments’ needs29. What does the phrase “to be aligned”(Line 5,most probably meanA. To counterbalance each other.B. To complement each other.C. To be identical with each other.D. To be contradictory to each other.30. The author examines the practice of grade forgiveness by________.A. assessing its feasibilityB. analyzing the causes behind itC. comparing different views on itD. listing its long-run effectsText 3This year marks exactly two countries since the publication of Frankenstein; or, The Modern Prometheus, by Mary Shelley. Even before the invention of the electric light bulb, the author produced a remarkable work of speculative fiction that would foreshadow many ethical questions to be raised by technologies yet to come.Today the rapid growth of artificial intelligence (AI) raises fundamental questions:is intelligence, identify, orconsciousness What makes humans humans”What is being called artificial general intelligence, machines that would imitate the wayhumans think, continues to evade scientists. Yet humans remain fascinated by the idea ofrobots that would look, move, and respond like humans, similar to those recently depictedon popular sci-fi TV series such as “Westworld” and “Humans”.Just how people think is still far too complex to be understood, let alone reproduced,n a situation wheresays David Eagleman, a Stanford University neuroscientist. “We are just ithere are no good theories explaining what consciousnesss actually is and how you couldever build a machine to get there.”But that doesn’t mean crucial ethical issues involving AI aren’t at hand. The com of autonomous vehicles, for example, poses thorny ethical questions. Human driverssometimes must make split-second decisions. Their reactions may be a complex combinationof instant reflexes, input from past driving experiences, and what their eyes and ears tellthem in that mo ment. AI “vision” today is not nearly as sophisticated as that of humans. Andto anticipate every imaginable driving situation is a difficult programming problem.Whenever decisions are based on masses of data, “you quickly get into a lot of ethical questions,” notes Tan Kiat How, chief executive of a Singapore-based agency that is helpingthe government develop a voluntary code for the ethical use of AI. Along with Singapore,other governments and mega-corporations are beginning to establish their own guidelines.Britain is setting up a data ethics center. India released its AI ethics strategy this spring.On June 7 Google pledged not to “design or deploy AI” that would cause “overa or to develop AI-directed weapons or use AI for surveillance that would violate internationalnorms. It also pledged not to deploy AI whose use would violate international laws or humanrights.While the statement is vague, it represents one starting point. So does the idea thatdecisions made by AI systems should be explainable, transparent, and fair.To put it another way: How can we make sure that the thinking of intelligent machinesreflects humanity’s highest values Only then will they be useful servants and not-of-control monster.Frankenstein’s out31. Mary Shelley’s novel Frankenstein is mentioned because itA. fascinates AI scientists all over the world.B. has remained popular for as long as 200 years.C. involves some concerns raised by AI today.D. has sparked serious ethical controversies.32. In David Eaglema n’s opinion, our current knowledge of consciousnessA. helps explain artificial intelligence.B. can be misleading to robot making.C. inspires popular sci-fi TV series.D. is too limited for us to reproduce it.33. The solution to the ethical issues brought by autonomous vehiclesA. can hardly ever be found.B. is still beyond our capacity.C. causes little public concern.D. has aroused much curiosity.34. The author’s attitude toward Google’s pledge is one ofA. affirmation.B. skepticism.C. contempt.D. respect.35. Which of the following would be the best title for the textA. AI’s Future: In the Hands of Tech GiantsB. Frankenstein, the Novel Predicting the Age of AIC. The Conscience of AI: Complex But InevitableD. AI Shall Be Killers Once Out of ControlText 4States will be able to force more people to pay sales tax when they make online purchases under a Supreme Court decision Thursday that will leave shoppers with lighter wallets but is a big financial win for states.Thursday overruled a pair of decades-old decisions that The Supreme Court’s opinionstates said cost them billions of dollars in lost revenue annually. The decisions made it moredifficult for states to collect sales tax on certain online purchases.The cases the court overturned sai d that if a business was shipping a customer’s purchase to a state where the business didn’t have a physical presence such as a warehouse or office, the business didn’t have to collect sales tax for the state. Customers were generally responsible for paying the sales tax to the state themselves if they weren’t charged it, bu most didn’t realize they owed it and few paid.Justice Anthony Kennedy wrote that the previous decisions were flawed. “Each ye physical presence rule becomes further removed from economic reality and results insignificant revenue losses to the States,” he wrote in an opinion joined by four other justices.-term prosperity and hasKennedy wrote that the rule “limited states’ ability to seek longprevented market participants from competing on an even playing field.”The ruling is a victory for big chains with a presence in many states, since they usuallycollect sales tax on online purchases already. Now, rivals will be charging sales tax where theyen collecting sales tax nationwide because they typicallyhadn’t before. Big chains have behave physical stores in whatever state a purchase is being shipped to. , with its network of warehouses, also collects sales tax in every state that charges it, though third-party sellerswho use the site don’t have to.Until now, many sellers that have a physical presence in only a single state or a fewstates have been able to avoid charging sales taxes when they ship to addresses outsidethose states. Sellers that use eBay and Etsy, which provide platforms for smaller sellers, alsohaven’t been collecting sales tax nationwide. Under the ruling Thursday, states can pass lawsrequiring out-of-state sellers to collect the state’s sales tax from customers and send it to the state.Retail trade groups praised the ruling, saying it levels the playing field for local andonline businesses. The losers, said retail analyst Neil Saunders, are online-only retailers,especially smaller ones. Those retailers may face headaches complying with various statesales tax laws. TheSmall Business & Entrepreneurship Council advocacy group said in a a statement,businesses and internet entrepreneurs are not well served at all by this decision.”36. The Supreme Court decision Thursday willns with statesA. Dette business’ relutioB. put most online business in a dilemmaC. make more online shoppers pay sales taxD. forces some states to cut sales tax37. It can be learned from paragraphs 2 and 3 that the overruled decisionsA. have led to the dominance of e-commerceB. have cost consumers a lot over the yearsC. were widely criticized by online purchasesD. were considered up favorable by states38. According to Justice Anthony Kennedy, the physical presence rule hasA. hindered economic developmentB. brought prosperity to the countryC. harmed fair market competitionD. boosted growth in states revenue39. Who are most likely to welcome the Supreme Court rulingA. Internet entrepreneursB. Big-chair ownersC. Third-party sellersD. Small retailers40. In dealing with the Supreme Court decision Thursday, the authorA. gives a factual account of it and discusses its consequencesB. describes the long and complicated process of its makingC. presents its main points with conflicting views on themD. cities some saces related to it and analyzes their implicationsPart BDirections:The following paragraphs are given in a wrong order. For Questions 41-45, you arerequired to reorganize these paragraphs into a coherent article by choosing from the listA-G and filling them into the numbered boxes. Paragraph C and F have been correctlyplaced. Mark your answers on ANSWER SHEET. (10 points)A. These tools can help you win every argument-not in the unhelpful sense of beatingyour opponents but in the better sense of learning about the issues that divide people.Learning why they disagree with us and learning to talk and work together with them. If wereadjust our view of arguments—from a verbal fight or tennis game to a reasoned exchangethrough which we all gain mutual respect, and understanding—then we change the verynature of what it means to “win” an argument.B. Of course, many discussions are not so successful. Still, we need to be careful not toaccuse opponents of bad arguments too quickly. We need to learn how to evaluate themproperly. A large part of evaluation is calling out bad arguments, but we also need to admitgood arguments by opponents and to apply the same critical standards to ourselves. Humilityrequires you to recognize weakness in your own arguments and sometimes also to acceptreasons on the opposite side.C. None of these will be easy but you can start even if others refuse to. Next time youstate your position, formulate an argument for what you claim and honestly ask yourselfwhether your argument is any good. Next time you talk with someone who takes a stand, askthem to give you a reason for their view. Spell out their argument fully and charitably. Assessits strength impartially. Raise objections and listen carefully to their replies.D. Carnegie would be right if arguments were fights, which is how we often think ofthem. Like physical fights, verbal fights can leave both sides bloodied. Even when you win,you end up no better off. Your prospects would be almost as dismal if arguments were evenjust competitions-like, say, tennis games. Paris of opponents hit the ball back and forth untilone winner emerges from all who entered. Everybody else loses. This kind of thinking is whyso many people try to avoid arguments, especially about politics and religion.E. In his 1936 work How to Win Friends and Influence People , Dale Carnegie wrote:“there is only one way…to get thebest of an argument-and that is to avoid it. “This aversion to arguments is common, but it depends on a mistaken view of arguments that causes profound problems for our personaland social lives- and in many ways misses the point of arguing in the first place.F. These views of arguments also undermine reason. If you see a conversation as a fightor competition, you can win by cheating a s long as you don’t get caught. You will be happy to convince people with bad arguments. You can call their views stupid, or joke about howignorant they are. None of these tricks will help you understand them, their positions or theissues that divide you, but they can help you win-in one way.G. There is a better way to win arguments. Imagine that you favor increasing theminimum wage in our state, and I do not. If you yell, “yes,” and I yell. “No,” n learns anything. We neither understand nor respect each other, and we have no basis forcompromise or cooperation. In contrast, suppose you give a reasonable argument: thatfull-time workers should not have to live in poverty. Then I counter with another reasonableargument: that a higher minimum wage will force businesses to employ fewer people for lesstime. Now we can understand each other’s positions and recognize our shared values, sincewe both care about needy workers.41→42→F→43→44→C→45Part CDirections:Read the following text carefully and then translate the underlined segments intoChinese. Your translation should be written neatly on the ANSWER SHEET. (10 points) It was only after I started to write a weekly column about the medical journals, andbegan to read scientific papers from beginning to end, that I realised just how bad much ofthe medical literature frequently was. I came to recognise various signs of a bad paper: thekind of paper that purports to show that people who eat more than one kilo of broccoli aweek were times more likely than those who eat less to suffer late in life from perniciousanaemia. (46) There is a great deal of this kind of nonsense in the medical journals which,when taken up by broadcasters and the lay press, generates both health scares andshort-lived dietary enthusiasms.Why is so much bad science published A recent paper, titled “The Natural Selection Bad Science”, published on the Royal Society’s open science website, attempts to answer th intriguing and important question. It says that the problem is not merely that people do badscience, but that our current system of career advancement positively encourages it. What isimportant is not truth, but publication, which has become almost an end in itself. There hasbeen a kind of inflationary process at work: (47) nowadays anyone applying for a researchpost has to have published twice the number of papers that would have been required forthe same post only 10 years ago. Never mind the quality, then, count the number.(48) Attempts have been made to curb this tendency, for example, by trying toincorporate some measure of quality as well as quantity into the assessment of an applicantpapers. This is the famed citation index, that is to say the number of times a paper has beenquoted elsewhere in the scientific literature, the assumption being that an important paperwill be cited more often than one of small account. (49) This would be reasonable if it werenot for the fact that scientists can easily arrange to cite themselves in their futurepublications, or get associates to do so for them in return for similar favours.Boiling down an individual’s output to simple metrics, such as number of publications or journal impacts, entails considerable savings in time, energy and ambiguity. Unfortunately,the long-term costs of using simple quantitative metrics to assess researcher merit are likelyto be quite great. (50) If we are serious about ensuring that our science is both meaningfuland reproducible, we must ensure that our institutions encourage that kind of science.Section Ⅲ WritingPart A51. Directions:Suppse you are working for the “Aiding rurd Primary School” project of your universi Write an email to answer the inquiry from an international student volunteer, specifyingdetails of the project.Do not sign your own name at the end of the email. Use “Li Ming” instead.(10 poin Part B52. Directions:Write an essay of 160—200 words based on the following pictures. In your essay, youshould1) describe the pictures briefly,2) interpret the meaning, and3) give your comments.You should write neatly on the ANSWER SHEET. (20 points)。

新技能英语高级教程unit3

Unit 3: Advanced English Skills1. IntroductionEnglish is a widely spoken language with over 1.5 billion speakers worldwide. It is the official language in 67 countries and serves as a lingua franca in many more. As such, mastering advanced English skills is essential for individuals seeking to excel in their careers andmunicate effectively on a global scale. Unit 3 of the advanced English skills course focuses on enhancing students' proficiency in speaking, listening, reading, and writing.2. Speaking SkillsIn this section, students will learn advanced techniques for public speaking and presentations. Emphasis is placed on articulation, tone, and body language. Furthermore, students will practice delivering impromptu and prepared speeches on various topics. By the end of the unit, students should be able tomunicate with eloquence and confidence in diverse settings.3. Listening SkillsImproving listeningprehension is crucial for effectivemunication. This section of the course hones students' ability to understandfast-paced speech, accents, and colloquial expressions. Students will also develop strategies for active listening, note-taking, and summarizingplex information. Through extensive listening exercises, students will be adept at following lectures, discussions, and presentations.4. Reading SkillsUnit 3 delves into advanced readingprehension strategies. Students will engage withplex texts, including literature, academic articles, and news reports. They will learn to analyze and interpret challenging vocabulary, sentence structures, and literary devices. Additionally, students will enhance their critical thinking skills by evaluating diverse perspectives presented in written materials.5. Writing SkillsThe writingponent of the course focuses on honing students' ability to produce well-structured and cohesive essays, reports, and creative pieces. Students will have the opportunity to refine their grammar, vocabulary, and stylistic choices. They will also practice crafting persuasive arguments, synthesizing information, and incorporating evidence to support their ideas. By the end of the unit, students will be capable of producingsophisticated written works.6. Vocabulary ExpansionExpanding one's vocabulary is integral to achieving fluency in any language. In this unit, students will be exposed to advanced English vocabulary related to academic, professional, and everyday contexts. They will learn to use context clues, word roots, and affixes to infer the meanings of unfamiliar words. Additionally, students will explore strategies for memorizing and ret本人ning new vocabulary.7. Grammar and SyntaxMastery of advanced grammar and syntax is pivotal in conveying nuanced meanings and ideas. This section of the course focuses on refining students' understanding ofplex grammatical structures, verb tenses, and sentence patterns. Students will also practice applying advanced syntax rules to their writing and speaking tasks, enabling them to express themselves with precision and clarity.8. Cultural CompetenceEffectivemunication in English goes beyond linguistic proficiency—it also requires an understanding of culturalnuances and customs. In this unit, students will explore cultural differences inmunication styles, social etiquette, and nonverbal cues. They will develop sensitivity to cultural diversity and learn to adapt their language use to various cultural contexts.9. ConclusionUnit 3 of the advanced English skills course equips students with the tools and techniques to excel in their English language proficiency. By honing their speaking, listening, reading, and writing abilities, students will g本人n the confidence to navigate diverse linguistic and cultural landscapes. With dedication and practice, students will emerge from this unit as proficientmunicators capable of engaging with the globalmunity in English.。

外研版高中英语选择性必修第三册课后习题 Unit 4 第四单元测评卷

第四单元测评第一部分听力(共两节,满分30分)第一节(共5小题;每小题1.5分,满分7.5分)听下面5段对话。

每段对话后有一个小题,从题中所给的A、B、C三个选项中选出最佳选项。

听完每段对话后,你都有10秒钟的时间来回答有关小题和阅读下一小题。

每段对话仅读一遍。

1.Where does the conversation probably take place?A.At a teahouse.B.At a supermarket.C.At a restaurant.2.What are the speakers doing at the moment?A.Watching a match.B.Looking at a poster.C.Deciding to join a team.3.How many days does the woman work in a week?A.About four days.B.Nearly five days.C.Almost six days.4.When was the wedding?A.In June.B.In January.C.In July.5.What’s the probable relationship between the two speakers?A.Classmates.B.Neighbours.C.Colleagues.第二节(共15小题;每小题1.5分,满分22.5分)听下面5段对话或独白。

每段对话或独白后有几个小题,从题中所给的A、B、C三个选项中选出最佳选项。

听每段对话或独白前,你将有时间阅读各个小题,每小题5秒钟;听完后,各小题将给出5秒钟的作答时间。

每段对话或独白读两遍。

听第6段材料,回答第6、7题。

6.Where does the conversation probably take place?A.In the street.B.In the schoolyard.C.In the library.7.Where is the woman going?A.To the library.B.To the registration office.C.To the lecture theatre.听第7段材料,回答第8、9题。

参考文献(人工智能)

参考文献(人工智能)曹晖目的:对参考文献整理(包括摘要、读书笔记等),方便以后的使用。

分类:粗分为论文(paper)、教程(tutorial)和文摘(digest)。

0介绍 (1)1系统与综述 (1)2神经网络 (2)3机器学习 (2)3.1联合训练的有效性和可用性分析 (2)3.2文本学习工作的引导 (2)3.3★采用机器学习技术来构造受限领域搜索引擎 (3)3.4联合训练来合并标识数据与未标识数据 (5)3.5在超文本学习中应用统计和关系方法 (5)3.6在关系领域发现测试集合规律性 (6)3.7网页挖掘的一阶学习 (6)3.8从多语种文本数据库中学习单语种语言模型 (6)3.9从因特网中学习以构造知识库 (7)3.10未标识数据在有指导学习中的角色 (8)3.11使用增强学习来有效爬行网页 (8)3.12★文本学习和相关智能A GENTS:综述 (9)3.13★新事件检测和跟踪的学习方法 (15)3.14★信息检索中的机器学习——神经网络,符号学习和遗传算法 (15)3.15用NLP来对用户特征进行机器学习 (15)4模式识别 (16)4.1JA VA中的模式处理 (16)0介绍1系统与综述2神经网络3机器学习3.1 联合训练的有效性和可用性分析标题:Analyzing the Effectiveness and Applicability of Co-training链接:Papers 论文集\AI 人工智能\Machine Learning 机器学习\Analyzing the Effectiveness and Applicability of Co-training.ps作者:Kamal Nigam, Rayid Ghani备注:Kamal Nigam (School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, knigam@)Rayid Ghani (School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213 rayid@)摘要:Recently there has been significant interest in supervised learning algorithms that combine labeled and unlabeled data for text learning tasks. The co-training setting [1] applies todatasets that have a natural separation of their features into two disjoint sets. We demonstrate that when learning from labeled and unlabeled data, algorithms explicitly leveraging a natural independent split of the features outperform algorithms that do not. When a natural split does not exist, co-training algorithms that manufacture a feature split may out-perform algorithms not using a split. These results help explain why co-training algorithms are both discriminativein nature and robust to the assumptions of their embedded classifiers.3.2 文本学习工作的引导标题:Bootstrapping for Text Learning Tasks链接:Papers 论文集\AI 人工智能\Machine Learning 机器学习\Bootstrap for Text Learning Tasks.ps作者:Rosie Jones, Andrew McCallum, Kamal Nigam, Ellen Riloff备注:Rosie Jones (rosie@, 1 School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213)Andrew McCallum (mccallum@, 2 Just Research, 4616 Henry Street, Pittsburgh, PA 15213)Kamal Nigam (knigam@)Ellen Riloff (riloff@, Department of Computer Science, University of Utah, Salt Lake City, UT 84112)摘要:When applying text learning algorithms to complex tasks, it is tedious and expensive to hand-label the large amounts of training data necessary for good performance. This paper presents bootstrapping as an alternative approach to learning from large sets of labeled data. Instead of a large quantity of labeled data, this paper advocates using a small amount of seed information and alarge collection of easily-obtained unlabeled data. Bootstrapping initializes a learner with the seed information; it then iterates, applying the learner to calculate labels for the unlabeled data, and incorporating some of these labels into the training input for the learner. Two case studies of this approach are presented. Bootstrapping for information extraction provides 76% precision for a 250-word dictionary for extracting locations from web pages, when starting with just a few seed locations. Bootstrapping a text classifier from a few keywords per class and a class hierarchy provides accuracy of 66%, a level close to human agreement, when placing computer science research papers into a topic hierarchy. The success of these two examples argues for the strength of the general bootstrapping approach for text learning tasks.3.3 ★采用机器学习技术来构造受限领域搜索引擎标题:Building Domain-specific Search Engines with Machine Learning Techniques链接:Papers 论文集\AI 人工智能\Machine Learning 机器学习\Building Domain-Specific Search Engines with Machine Learning Techniques.ps作者:Andrew McCallum, Kamal Nigam, Jason Rennie, Kristie Seymore备注:Andrew McCallum (mccallum@ , Just Research, 4616 Henry Street Pittsburgh, PA 15213)Kamal Nigam (knigam@ , School of Computer Science, Carnegie Mellon University Pittsburgh, PA 15213)Jason Rennie (jr6b@)Kristie Seymore (kseymore@)摘要:Domain-specific search engines are growing in popularity because they offer increased accuracy and extra functionality not possible with the general, Web-wide search engines. For example, allows complex queries by age-group, size, location and cost over summer camps. Unfortunately these domain-specific search engines are difficult and time-consuming to maintain. This paper proposes the use of machine learning techniques to greatly automate the creation and maintenance of domain-specific search engines. We describe new research in reinforcement learning, information extraction and text classification that enables efficient spidering, identifying informative text segments, and populating topic hierarchies. Using these techniques, we have built a demonstration system: a search engine forcomputer science research papers. It already contains over 50,000 papers and is publicly available at ....采用多项Naive Bayes 文本分类模型。

  1. 1、下载文档前请自行甄别文档内容的完整性,平台不提供额外的编辑、内容补充、找答案等附加服务。
  2. 2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
  3. 3、如文档侵犯您的权益,请联系客服反馈,我们会尽快为您处理(人工客服工作时间:9:00-18:30)。

Interaction with Texts: Information Retrieval asInformation-Seeking BehaviorNicholas J. BelkinSchool of Communication, Information & Library StudiesRutgers UniversityNew Brunswick, NJ 08903 USAbelkin@AbstractWe present an analysis of information retrieval as an information-seeking activity, sup-porting people's inteactions with text. This analysis suggests that some assumptions underlying the standard model of information retrieval are inappropriate, and we sug-gest alternative assumptions and discuss their implications for information retrieval system design. It is proposed that information retireval is most properly considered as information-seeking behavior, that the central process of information retrieval is user interaction with text, and that the user is the central component of the information retrieval system. Possible ways to incorporate this view in the design of information retrieval systems are discussed.1. IntroductionFrom our experience of the everyday world, we see that people are in consistent, if not constant interaction with texts1 of a variety of types, for a variety of purposes. We read newspapers, we watch television, we go to picture galleries, we use libraries, we engage with advisory services; in order to know about our world, to be entertained, to understand, to learn, to become informed, to do our work, to help us solve our problems. An essential feature of all of these activities is precisely our interaction with these texts. By interaction, we mean that, in such activities, people are not just passive recipients of messages, but rather active seekers of texts, and active constructors of meaning from these texts. They look for texts of potential interest, they make judgments about the usefulness or interest of texts by engaging with them, they interpret texts in order to understand them. Thus, our engagement with texts and our interpretations of them are central to our being able to use them for our goals, whatever they may be.At times, we engage in behaviors stimulated by our desire to manage a problem (Belkin, Seeger & Wersig, 1983), resolve a problematic situation (Schutz & Luck-mann, 1973; Wersig, 1979), respond to a gap in our life-path (Dervin, 1983), or re-solve an 'anomalous state of knowledge' (Belkin, 1980). Such activities are usually characterized as information-seeking behaviors, in which we actively engage with texts, or collections of texts, or people who give us access to texts, in order to find and1Text in this paper is to be taken in the semiotic sense of an ordered collection of signs, which includes information-bearing objects in any mode or medium.be able to use information which can help us in problem management, etc. There is clearly a wide variety of such information seeking behaviors, such as asking a col-league for advice, browsing through journals to keep up-to-date, searching in a library for some specific information, and so on. It is important to note that, despite this range of forms that information-seeking behaviors take, they are all interactions with texts, and all share the general characteristics of such interactions, described in the previous paragraph.2In the situations which lead to information-seeking behaviors, our interactions with texts often take place within the context of systems, social and technical, which are constructed to support us in accomplishing the goals of the behaviors. Such systems include, for instance, libraries, information centers, advisory services, and information retrieval (IR) systems. In this paper, we are concerned explicitly with the issue of how one might best characterize the IR problem, and IR systems, in order to support people in their information-seeking behaviors most effectively.2. 'Standard' Information RetrievalInformation retrieval (IR) has most usually been construed as the problem of selecting texts from a database in response to some more-or-less well-specified query. From this point of view, the major issues of concern to IR have been the representation of texts, and of queries, and techniques for the comparison of text and query representation. This 'standard' view of IR is perhaps most succinctly represented in the diagram of Figure 1.From Figure 1, we see that the standard view of IR has comparison of text surrogate and query as its central process. This comparison in turn depends upon the two repre-sentation processes. In practice, most emphasis has been placed upon the process of text representation, with relatively little work having been done on the development of representation schemes specific to information needs or problems. Concern with repre-sentation of the information need has typically arisen after the process of judgment, which is typically to be performed by the user, as an estimate of the potential relevance of the text to the information need. The results of the judgement process are then used by the system to modify the query, or, occasionally to modify the text representations. This process of query modification, or 'relevance feedback', is perceived, in the stan-dard view of IR, as an attempt to gain the 'best' possible representation of the user's query, or 'information need', in order to select the texts most likely to be relevant to that query ; that is, to improve the representation so that the comparison process will work most effectively. It is important to recognize that, in this model of IR, the person involved in IR is seen as a user of the system, standing outside of it. Involvement of the user with the IR system is minimal, and interaction (in the form of the judgment process) is seen as ancillary to, and only in support of, the representation and comparison processes.2In those information-seeking behaviors where there is direct interaction with another person, as in advisory situations, we can consider the dialogue itself, including the advice, as a textFigure 1. A model of the standard view of information retrieval Underlying this view of IR are two strong assumptions. The first is that there exists some static information need associated with the user, which is, at least in principle, specifiable. The second is that the most appropriate way to address that need is to search for, and select, the text or texts which can best resolve that need. The force of these assumptions is, as we see from the discussion of the standard view, to devalue or even ignore the significance of interaction of the 'user' with the texts; and, to provide support for only one form of information-seeking behavior, that associated with searching for some well-specified information item. Additionally, through the privi-leged position of comparison and representation, and the assignment of responsibility for these activities to the system, the standard view of IR leads to strong control by the system of the entire IR process, and the consequent lack of power or control by the user.There is a long history of research and practice in IR within this 'paradigm'. Indeed, one is tempted to say that almost all IR research and practice has been based on these assumptions, or this general viewpoint. There have been, however, some notable ex-ceptions taken to this approach to IR, which, in addition to their intrinsic worth, are of interest in terms of what they have recognized as problematic in the standard model,and how they have addressed these problems.For instance, Oddy's (1977) system THOMAS was a response to the assumptions of a static, specifiable information need. THOMAS's goal was to allow users of the system to engage in information seeking without query specification. It is of some interest in this context to note that this goal was accomplished by making the user's interaction (especially judgment) with the texts the central process of the system. The user's con-ception of her/his information need was assumed to be dynamic, and the means of re-solving it was through a browsing-like information-seeking behavior. Croft & Thomp-son (1987) incorporated support for two different information-seeking behaviors within their system, I3R, namely, browsing and search by specification. Additionally, I3R en-couraged the user to participate actively in the system, by eliciting and incorporating the user's knowledge in the representation processes. Bates (1989) has suggested an IR system design based on a concept of information needs which change as a person inter-acts with texts in an information-seeking episode. She has also proposed a model, in Bates (1990), in which control of the IR process can be varied between the system and the user. In both of these papers, and in others, Bates has argued forcefully, eloquently and persuasively against the assumptions underlying the standard view of IR. Ingw-ersen (1992) has suggested a design model for IR systems in which the user's interac-tion with the components of the system, and interaction among those components, is central. In this model, the IR system is construed as an intermediary, whose function-ing is dependent upon interaction with the user, and in particular on knowledge about the user's interactions with the texts. This is, in essence, a dialogue-based model of IR. These examples of attempts to respond to the constraints of the assumptions of the standard model of IR demonstrate that the constraints are indeed real, with real, and unfortunate consequences. They, and others like them, also demonstrate that it is pos-sible to address at least some of the problems raised by these assumptions, and suggest both general strategies and specific techniques for the development of a more general model of IR. From this work, we see that, while not wholly incorrect, the assumptions underlying the standard view of IR are untenable as general principles, and constrain IR unrealistically to support of only one kind of information-seeking behavior. Building upon the work summarized above, this paper attempts to establish an alterna-tive view of IR. The basis of the alternative approach which is presented here will be to embed IR within the general context of human interaction with texts, specifically re-ferring to information-seeking behaviors. We will attempt, in particular, to develop a view of IR in which the user is the central component in the system, and interaction is the central process. Such a model of IR would not reject the work which has been done in the standard paradigm, but rather incorporate it as support for particular classes of in-teraction, and information-seeking behaviors.3. Information Retrieval as Information-Seeking BehaviorWe begin by considering the assumptions about the IR situation of the standard model, in what sense they have failed, and why. The assumption of a static, specifiable infor-mation need is problematic from the point of view of interaction with texts, becausesuch interaction necessarily implies interpretation, and interpretation necessarily im-plies change in the interpreter. If the information-seeker's knowledge changes by virtue of engagement with text, then it is at least possible, if not necessary, that that person's view of the situation or condition that led to engagement in information-seeking behav-ior also changes. Considering this from the point of view of Schutz and Luckmann (1973), any modification of one's knowledge, in the condition of a problematic situa-tion, is itself a modification of the problematic situation. Or, a change in one's state of knowledge, by virtue of having engaged with a text, will be reflected in some change in the anomalous state of knowledge (Belkin, 1980) that led one to engage in information-seeking behavior. All this implies that the assumption of a single information need, constant (although progressively better represented) through the course of an informa-tion-seeking episode, is inadequate to the information-seeking situation in general. There is by now a substantial literature, from both theoretical and empirical perspec-tives, on the non-specifiability of information 'needs' (e.g. Bates, 1989; Belkin, Oddy & Brooks, 1982; Belkin, Seeger & Wersig, 1983; Oddy, 1977; Taylor, 1968). Most such arguments depend upon concepts concerning the cognitive state of the person engaging in information-seeking behavior, but aspects of this issue can also be addressed from the point of view of interaction with texts (e.g. Bates, 1989). The argument follows from the dynamic nature of the problematic situation, and, in particular, from the un-predictability of the result of the interpretation of a text, from the point of view of the IR system; and, the unpredictability of what texts will be offered, and what they will of-fer, from the point of view of the information seeker. Thus, interaction with texts im-plies at least the possibility of an unpredictable, and therefore unspecifiable, change in the condition which led to the interaction in the first place (e.g. the information 'need'). The second assumption of standard IR, that of a single form of information-seeking be-havior, is clearly inadequate. There is substantial documentation, empirical and theo-retical, of a variety of information-seeking behaviors in which people engage. Belkin, et al. (1990), Ellis (1989), Hancock-Beaulieu (1990)and Kuhlthau (1991) all have demonstrated that, in the course of information-seeking episodes, people change from one kind of interaction to another, and in the course of problem resolution, people en-gage in different types of interactions with texts, according to different goals, knowl-edge, intentions, and so on. These different goals, such as, for instance, learning about the structure of some group of texts, lead to behaviors which cannot be supported effectively through the specification of a query and the retrieval of a relevant text. Thus, an IR system or model which depends solely upon representation and comparison of well-specified queries with texts cannot support behaviors suited to such goals, such as, for instance, browsing. Both I3R (Croft & Thompson, 1987) and THOMAS (Oddy, 1977), as well as hypertext systems such as that described by Frisse (1988), demonstrate that the support of browsing-like information-seeking requires active intervention and interaction by the user with the texts, and the representations of texts and their relationships, which the standard model of IR cannot support. The usual response has been to build parallel systems to support different information-seeking behaviors; another, which we support here, is to begin from a model of IR in which the range of information-seeking behaviors can be integrated.Our alternative assumptions to those of standard IR, derived from the concept of infor-mation-seeking behavior as human interaction with texts, are thus the following:1.That information-seeking is inherently an interactive process, and that that processis characterized by the general features of people's interactions with texts;2.That the goal of IR systems is to support the range of information-seeking behav-iors.From these assumptions, it follows that the central process for IR is interaction of user with text, in support of a wide variety of information-seeking behaviors and associated goals, relevant to a different problematic situations. It also follows that the information seeking is the central component of IR, since it is that person's interaction with texts which drives the entire IR system. This, in turn, means that an adequate model of IR, under these assumptions, requires that the user be an active participant in the IR sys-tem, rather than a passive recipient of and reactor to output from the system. In short, these conditions suggest that it is appropriate to model that which the user does in the course of interaction in an IR system, as information-seeking behavior. In this way, the processes in which the person engages, the entities with which the person interacts, and the salient characteristics of the person engaged in information-seeking can be identi-fied and treated in an integrated manner, responsive to the variety of goals and condi-tions of interaction.4. Implementing Information Retrieval as Information-Seeking BehaviorIt is now reasonable to ask what a model of IR based on our alternative assumptions might look like, or even, how one could begin to implement such ideas in IR systems. Unfortunately, we cannot offer here definitive answers to these questions, as, for in-stance, in terms of a formal alternative model of IR. These are still goals subject to fur-ther research. Instead, we discuss some suggestions for potential ways to take account of this viewpoint, and how these could be (or, in some cases, have been) realized in IR system design. In particular, we consider:•how to incorporate the user as the central component of the IR system;•how to identify and support different information-seeking behaviors and different information-seeking goals; and,•how to make user interaction with texts the central process of IR.4.1 Incorporating the User in the IR SystemIncorporating the user in the IR system has been a major component of several different models of IR. Belkin, Oddy & Brooks (1982), for instance, suggested that the user should be a part of the IR system, in order for the rest of the system to be able to understand, and take account of, the situation that led the user to engage in the system, and in order to take advantage of the user's response to texts offered for interaction. Their means for accomplishing this was two-fold. First, they made representation of the person's anomalous state of knowledge the primary process in the system. Thisincluded incorporating in the representation some aspects of the person's goals and situation in general. Second, their system design required the user to interact with offered texts, and to inform the rest of the system of the results of the interpretation, with specific reference to whether, and in what ways, the text was useful.Belkin, Seeger & Wersig (1983) took a somewhat different approach to incorporation of the user in the IR system, pointing out that for an IR system to be effective in sup-porting a person in problem management, it is necessary for the built part of the system (the 'information provision mechanism - IPM) to play the role of an intelligent 'inter-mediary' between the user and the texts with which the user would interact. In this model, the primary effort of the IPM is to understand the characteristics of the user, and the user's problematic situation, which would be relevant for identifying texts which might be appropriate for the user to interact with. This understanding was itself to be gained through dialogue-like interaction with the user, based on the functions taking place in equivalent human-human information interactions. Ingwersen (1992) takes this approach several steps further, stressing more explicitly the intermediary role, and identifying a number of new functionalities which support the user in interaction with texts, and the rest of the system in identifying appropriate texts.Bates (1990) has shown that control is a crucial issue in IR interaction, pointing out that under the standard model the IR system assumes almost total control of the interaction, by doing the representation, comparison and modification automatically, and without reference to the user. She suggests that, although there may be situations in which the user would like to surrender control in this way, there are other situations, defined, for instance, by the nature of the information-seeker's goals, or preferences for interaction style, in which the user would prefer to more directly control the course of the interaction, through, for instance, direct interaction with texts, direct choice of which texts to interact with, and so on. This leads to a concept of degree of control associated with particular kinds of interaction, and suggestions as to how to implement a system with mixed control. Who is in control is of course dependent on the roles and responsibilities of the participants to the interaction.Overall, then, these various approaches to incorporating the user in the IR system sug-gest that treating IR as an interaction between user and intermediary (or advisor), in which both parties have responsiblities, and over which both parties can exercise con-trol, is a reasonable strategy for making the user the central component of IR.4.2 Identifying and Supporting Information-Seeking BehaviorsThe problems of identifying and supporting different information-seeking behaviors have been addressed by a number of researchers, including Ellis (1989) and Hancock-Beaulieu (1990). Here, we discuss our own approach to this issue, as used in the design of two different IR systems. Belkin, Marchetti & Cool (1993) have suggested that information-seeking behaviors (or, in their terms, information-seeking strategies -ISSs) can be characterized according to some small set of dimensions or facets which define a space of ISSs. Figure 2 is a summary of the four facets which they tentativelysuggest are at least necessary to account for the range of ISSs which have been observed. These facets, goal of the interaction, method of interaction, mode of retrieval and type of resource interacted with, have been identified through observation and classification of information-seeking behaviors in a variety of settings. Their suggestion is that any ISS can be characterized according to its values on these facets, or dimensions. Thus, the explicit combination of poles of each facet, in Figure 2, leads to a set of 16 prototypical ISSs.ISS METHOD GOAL MODE RESOURCESc S L S R S I M1x x x x2x x x x3x x x x4x x x x5x x x x6x x x x7x x x x8x x x x9x x x x10x x x x11x x x x12x x x x13x x x x14x x x x15x x x x16x x x xMethod: Sc = Scan; S = SearchGoal: L = Learn; S = SelectMode: R = Recognize; S = SpecifyResource: I = Information; M = Meta-informationFigure 2. Information Seeking Strategies (after Belkin, Marchetti & Cool, 1993) Belkin, Marchetti & Cool (1993) point out that, in the course of a single information-seeking episode, a person may engage in several such ISSs, moving from one to an-other as knowledge, goals, intentions, and so on, change, through the course of the in-teraction. They suggest that, by characterizing ISSs according to their underlying di-mensions, both support for the single ISS currently being engaged in, and support for movement from one ISS to another can be specified within a single IR system design. They describe an interface, BRAQUE, which does provide such support, allowing the user to move one ISS to another according to the user's (self-defined) current goals and knowledge, especially in response to the results of interaction within the system. Belkin & Cool (1993), and Belkin, Cool, Stein & Thiel (1993), starting from the same concept of dimensions of ISSs, suggest further that each region of the ISS space could have associated with it, not only a general information-seeking behavior, but a proto-typical interaction or dialogue structure, which takes place between the user and the rest of the IR system. They propose, on this basis, an IR system design which provides dialogue patterns for the different ISSs, and sequences of ISSs, based on a library of specific cases of real interactions, organized according to the protoypical ISS struc-tures. This allows for explicit interactive, and mixed-initiative support for specific in-formation-seeking behaviors, seen as complex combinations of ISSs, which change as the user changes in her/his interaction with the texts.A key problem arising in these approaches to identification and support of information-seeking behavior, is that they are explicitly concerned just with behavior, and have little to say about why a person might engage in one form of information-seeking in any particular situation. In order to be able to provide support other than just offering the user a choice of support mechanisms (for instance, predicting appropriate ISSs in specific circumstances), it is necessary to establish relations between the behaviors and some other characteristics of the user, such as the person's goals or problematic situation. This, in turn, suggests that we need some classification of such factors, and a means to establish how they are affected by other aspects of the situation in which the user is embedded. Dervin (1983), in her classification of 'information gaps' provides one possible approach to this issue. Belkin & Marchetti (1990) have suggested cognitive task analysis as another means for identifying relations between user characteristics and specific information-seeking behaviors. Brajnik , Guida & Tasso (1988), and Daniels (1986) have suggested that the construction of specific kinds of models of the user, by other parts of the system, could be used to establish such relations. And the types of studies discussed in section 4.1 also address this issue, in terms of identifying what it is important to know about the user, in order to support the user in interaction in the IR system. But it must be said that, at the moment, no-one has succeeded in firmly establishing such relations, and that this issue is definitely still a matter for further research.4.3 Interaction with Text as the Central Process of IRThe discussion in the previous two sections rather clearly suggests how one might also make the user's interaction with texts the central process in IR. Thinking about partici-pation roles and responsibilites in the interaction, or at least making mixed control pos-sible, as suggested by Bates (1990), is a clear starting point. Promoting and supporting direct access to texts, and direct manipulation of texts, and direct response to texts, whether guided, as in Oddy's (1977) THOMAS, or user-directed, as in Belkin, Marchetti & Cool's (1993) BRAQUE, is another. Promoting movement among infor-mation-seeking behaviors, as for instance, by the dialogue structures suggested by Belkin, Cool, Stein & Thiel (1993), or supporting such movement by providing tools for it, as in BRAQUE, also serves to make the user's interaction central. The facilities in I3R (Croft & Thompson, 1987) for supporting browsing, and especially for eliciting and using the user's knowledge in support of the overall interaction, also are means to address this goal.These are all examples of how to move the concept of the person's interaction with texts, including all of the interpretation issues, to the fore. They have in common thatrepresentation and comparison processes, although clearly necessary, are used to sup-port the user's interaction. This is in stark contrast to the standard model of IR, in which interaction, in the form of the judgement process, is construed as support for rep-resentation and comparison. Thus, making user interaction with text the central process of IR seems to require allowing user control of the overall interaction, basing representation upon the results of user interpretation of the interaction, and using the comparison process to support interaction, by suggestion, rather than as a means to a definitive answer.5. ConclusionFrom our analysis of the conditions which lead to the IR situation, we conclude that IR is most properly considered as a form of information-seeking behavior, in which the user's interaction with text is the central phenomenon, to which the IR system must re-spond, and which the IR system must support. The explict consequences of this view are that: the goal of the IR system is to support the user in her/his entire range of infor-mation-seeking behaviors; the user must be considered the central component of the IR system; and, interaction (both user's interaction with texts, and other interactions which support it) is the central process of IR. From this follows, that the role of the representation and comparison processes in IR are in support of interaction, and, that control of the IR interaction must be mixed between the participants.Although we do not present a model of IR based on these conclusions, we have shown, by way of example, how various aspects of their consequences can be applied to IR models, and especially, to the design of IR systems. Taken together, the various ap-proaches to the different problems raised by this view of IR do suggest a general strat-egy for IR system design which might eventually evolve into a more formal model. This strategy begins by considering the IR system as an explicitly interactive system, among the user, the textual resources, and a mediating mechanism. The structure of this interaction is dialogue based, with mixed initiative. Each participant in the system has explict roles and responsibilities. The mediating mechanism and textual resource provide suppport for the user to engage in several information-seeking behaviors, with movement among the behaviors available both on user instigation and system guidance. The results of the user's interaction with the texts can be used directly by the person for modifying the interaction, and by the mediating mechanism, to provide suggestions for guiding the interaction. The texts of the system are available for direct engagement by the user, and direct manipulation by the user, as well as for comparison processes engaged in by the mediating mechanism.In general, this strategy aims to empower the user in the information-seeking interac-tion, by establishing the user's goals and interactions with text as primary, and the rest of the IR system components as supportive of them. We hope that IR systems designed on these principles would be systems which would be not only effective, but also plea-surable, in helping people to interact with text.。

相关文档
最新文档