Ranking Scientific Publications Using a Simple Model of Network Traffic
西班牙国家科学与技术!

Index
1. Introduction 2. Spanish R&D&Innovation system 3. Spanish strategy and National Plan for Scientific, Technical and Innovative Research. 4. Spanish Excellence Initiatives in Research. 5. Public Research Organizations. 6. Funding Agency for Spanish Business Innovation: CDTI & CHINEKA programme.
Structure of Spanish Strategy
Specific objectives • Institutional reinforcement • Sustanaibility and usage of Research Infrastructures • Promotion of cutting-edge knowledge. • Promotion of Emerging Technologies Specific objectives • Mobillity & development of Research Career • Training and Capacitation on R&D • Allocation of Human Resources
D. G. FOR INNOVATION AND COMPETITIVENESS CSIC
CENTER FOR INDUSTRIAL TECHNOLOGICAL DEVELOPMENT (CDTI) PUBLIC RESEARCH ORGANISATIONS
Searching the Scientific Literature

Chapter10Searching the Scientific LiteratureThe history of science knows scores of instanceswhere an investigator was in the possessionof all the important facts for a new theory,but simply failed to ask the right questions.Ernst Mayr Before you carry out even yourfirst experiment in the lab,you would be well advised to spend some time in the library doing a thorough lit-erature search of your research topic.Perhaps you worked on a project in this samefield as an undergraduate,or think you are familiar with thefield because it is related to other work you have done.Even if you have some knowledge of the literature on your project,you shouldn’t skip this step.The investment of time in the library will pay off many times over in the lab.Y ou don’t want to risk doing work that someone else has already done,or going down the same worn path that others have traveled before you.Science is not carried out in a vacuum.It is about steady forward progress over long periods of time and wise graduate students will take the time to read and benefit from the researchfindings of their predecessors.As you embark on your literature search,you may feel quickly over-whelmed by the pile of papers you accumulate,so keep in mind that is impossible to read all the research ever published in your area.Being selective about what you read is key to getting a thorough overview of a particularfield,without drowning in too much information.But whatever you do and however you decide to go about it,do not skip this step.Y ou will live to regret it.Getting started in the libraryGet comfortable with the layout of the library and with the research tools available at your institute.Introduce yourself to the reference librarian(s)and explain that you want to carry out a literature search76Mastering your PhDon your thesis topic.These individuals are great sources of information and are there to help you in your search.Y ou’ll be spending a lot of time in the library so take the time to become familiar with all its services.What kinds of scientific literature exist and which ones will be most important to you?Broadly speaking,scientific literature can be divided into two types of articles:peer-reviewed and popular.Most of the articles published in scientific journals,both primary (original research)and secondary(review articles)have gone through a stringent process called peer-review.Before an editor will accept a paper for publication,he/she will send it out for review to at least two experts in thefield.The identity of the reviewers is always kept secret from the author so that any comments made on the manuscript will be impartial.When the paper comes back with comments from the referees,it is the author’s responsibility to correct any errors or discrepancies in interpretation before the paper can be accepted for publication.This process,while not infallible,insures that most articles are scientifically sound and as free from error as possible.Articles published in popular scientific magazines are not subject to peer review,and are therefore not always reliable sources of information.Secondary literature is published in the form of review articles. As the name suggests,these articles are often very comprehensive in nature and provide an overview of the scientificfindings in a particular field over a particular period of time.Reviews do not present new and original data,they are compilations of other people’s work,but very often written by a big name scientist in thefield.Review articles can be a goldmine of information and will significantly help you with your literature search by cutting down on the amount of time you spend on searching out individual(primary)research articles.Using the InternetBy now you have already identified the scope of your research project (see Chapter1:Getting Started).Take advantage of the breadth and scope of the Internet and do an electronic search on your research topic(use key words wisely,or this type of search can get quickly out10Searching the Scientific Literature77 of hand).Start by searching popular and comprehensive databases such as MedLine,PubMed,GeoRef,and ScienceDirect.Y our univer-sity library should have a subscription to some of these and other databases that are password protected.Download and print out the articles that are the most pertinent to your research.As you become more involved in your search,you will start to a get a feel for the im-portant researchers in thefield.Mark down their names and research institutions.Another excellent way to get started with your literature search is to read recent review articles published on your topic.Think of this as a bit of a short cut.Someone,somewhere has done much of the work before you and compiled at least a part of your literature search in a comprehensive review article that may contain up to200references.Another tip is to photocopy the earliest papers in thefield.These papers represent the seminal work in your area of inquiry.It is critical to know how thefield started,which experiments were done,and who the principal players were.As you search databases,identify review articles and key publications,you will begin to create a chronological picture of your research topic.It’s very important to have an under-standing of the early stages of the inquiry into your topic.As you read the papers chronologically up to the present,you will develop an understanding of how the current knowledge of your particular researchfield developed.Surely infits and starts as science tends to go,but as more research is carried,more pieces of the puzzle will have beenfilled in.Perhaps now you are beginning to understand why this process is essential.Y ou need to be familiar with all the work that has been done on your topic,not just as a tool for learning,but to avoid repeating work that others have done before.Imagine the graduate student who spends six months doing a series of experiments only to discover(be-latedly)that someone else has done them ten years ago.This happens more often than you think,so don’t let it happen to you.Making good use of the Science Citation IndexThe Science Citation Index(SCI®)provides access to current and past author abstracts and cited references from some3700science and78Mastering your PhDtechnical journals.This important tool can help you weed through literally hundreds of research articles tofind the most cited,and hopefully,the most important articles in yourfield.Once you’ve gath-ered a solid collection of articles,you will need to scan through them and summarize and record the pertinent information.How best to or-ganize all this information?We suggest that you keep a written record for the purpose of building your bibliography.For each article you’ve selected,write down the author,title,name of journal and year of publication.Then jot down a few words about each of the following:1.Statement of the problem2.Hypothesis3.Theories and assumptions4.Research methods5.Data collection tools/procedures6.Research design7.Methods8.Interpretation of data(did data support or reject the hypothesis?)9.Conclusions/suggestions for future researchIf this sounds daunting and like a massive amount of work,just re-member that investing time now in a proper literature search will save you vast amounts of time later on when you start writing up your own research articles(see Chapter12),not to mention your thesis. Y ou won’t want to spend days and weeks in the library hunting down papers orfinding out(oops!)that you’ve duplicated experiments that someone else has already done,just when you’re getting ready to write everything up.How do you know when your literature searchis successful and complete?Y ou’ll know you’ve made a comprehensive literature search when you have performed the following tasks:10Searching the Scientific Literature79 1.Identified the most recent articles(last10years,plus seminalarticles)on your research topic.2.Skimmed each article and prepared a brief summary of each one.3.Assessed each article for the strengths and weaknesses of the ex-perimental setup,methods and procedures used,data collection and analysis.It is up to you to develop an organized method for storing and retriev-ing this information.Many people copy each paper and then attach a cover sheet with the summary and assessment points to it.It may also be wise to record this information on your computer and type the name of the author,journal,etc.of each article in standard format. This will save you oceans of time when you go about writing up your own articles and have to refer to these references.Libraries are great places to spend time.They can offer a much needed refuge from the lab and help you place your own work into context by reading about the work of others.Don’t feel you’re wasting your time if youfind yourself in the library when others are in the lab. Remind yourself that you might just know something they don’t.。
长大后做个科学家英语作文

长大后做个科学家英语作文Growing Up to Be a Scientist.From a tender age, the allure of the unknown and the boundless possibilities of science captivated me. I marveled at the intricate machinations of the natural world, eager to unravel its secrets and push the boundaries of human knowledge. As I embarked on my educational journey,my passion for science only intensified.In the realm of physics, I reveled in the elegance of Newton's laws and the awe-inspiring tapestry of quantum mechanics. I delved into the cosmos, exploring themysteries of black holes and the vast expanse of our universe. The allure of biology equally enthralled me, as I delved into the intricate workings of living organisms and the wonders of genetic engineering.Beyond the classroom, I immersed myself inextracurricular activities that fueled my scientificcuriosity. I participated in science fairs, where I showcased my research projects and shared my discoveries with peers and mentors. I joined science clubs and robotics teams, where I collaborated with like-minded individuals and developed invaluable practical skills.Inspired by the groundbreaking advancements made by scientists throughout history, I recognized the profound impact science has on society. I witnessed firsthand how scientific innovations have transformed medicine, transportation, and communication. I aspired to contribute to this legacy, to make my own mark on the world through scientific discovery.As I transitioned to higher education, I pursued a degree in science, where I delved deeper into the complexities of my chosen field. I immersed myself in cutting-edge research, working alongside renownedscientists and contributing to the body of scientific knowledge. Through sleepless nights spent poring over data and countless hours of experimentation, I honed my analytical skills, problem-solving abilities, and criticalthinking capacity.Upon completing my formal studies, I embarked on a career as a scientist. I joined a research laboratory whereI could pursue my passion for scientific discovery and makea meaningful contribution to the advancement of knowledge. My research focuses on unraveling the molecular basis of complex diseases, with the ultimate goal of developing novel therapies to alleviate human suffering.The path of a scientist is not without its challenges. There are setbacks, disappointments, and moments of self-doubt. However, the allure of the unknown and the potential for scientific breakthroughs drive me forward. I am fueled by an unyielding curiosity and an unwavering belief in the power of science to transform the world for the better.As I continue my journey as a scientist, I am guided by a profound sense of responsibility. I recognize the immense impact that scientific advancements can have on society, and I am committed to using my knowledge and skills for the benefit of humanity. I strive to disseminate my findingsthrough scientific publications and public outreach initiatives, ensuring that the fruits of my research extend beyond the confines of the laboratory.Moreover, I am passionate about mentoring and inspiring the next generation of scientists. I volunteer my time teaching science workshops to students, sharing my love of science and encouraging them to pursue their own scientific aspirations. I believe that by fostering a culture of scientific literacy and inquiry, we can cultivate a future generation of innovators and problem-solvers who will continue to push the boundaries of human knowledge and make the world a better place.。
biomaterials research jci分区 -回复

biomaterials research jci分区-回复Biomaterials Research in JCI 分区: A Comprehensive AnalysisIntroductionBiomaterials research has gained significant attention in recent years due to its potential applications in various fields such as medicine, tissue engineering, and drug delivery systems. As a result, researchers widely publish their findings in established scientific journals to disseminate knowledge and advance the field. Among the many journals dedicated to biomaterials research, the JCI 分区(Journal Citation Reports) ranking system provides a valuable metric to assess the impact and quality of these publications. In this article, we will explore the JCI 分区rankings for biomaterials research, discussing the factors considered in the evaluation process and the significance of publishing in high-ranking journals.Understanding JCI 分区RankingsThe JCI 分区ranking system categorizes journals into quartiles (Q1-Q4), with Q1 being the highest impact factor journals, and Q4 representing the lowest impact factor journals. Impact factor is ametric that quantifies the average number of citations received by articles published in a specific journal within a given period. The JCI 分区ranking system classifies journals based on subject categories, and each category can have different quartile ranges. For example, biomaterials research falls under the "Materials Science, Biomaterials" category. Understanding the rankings in this category will help researchers assess the quality and impact of biomaterials research journals.Factors Considered in the Evaluation ProcessThe JCI 分区ranking system considers several factors when evaluating journals. The primary factor is the number of citations received by articles published in the journal within a defined period. The citation count reflects the influence and impact of the research published in the journal. Additionally, the system takes into account the number of articles published, the number ofself-citations, and the citation network of the journal. These factors ensure a comprehensive assessment of a journal's quality and impact within a specific field, in this case, biomaterials research.Significance of Publishing in High-Ranking JournalsPublishing in high-ranking journals in biomaterials research has several significant benefits. Firstly, it provides researchers with a platform to showcase their work to a wider audience, including experts and peers in the field. High-ranking journals often have a rigorous peer-review process, ensuring that only high-quality research is accepted for publication. This, in turn, enhances the credibility and visibility of the research. Additionally, high-ranking journals tend to have a larger readership, which increases the likelihood of the research being cited by other researchers. A higher citation count indicates that the research is influential and contributes to the advancement of the field. This, in turn, opens doors for collaboration opportunities and grants. Moreover, publishing in high-ranking journals allows researchers to gain recognition and establish themselves as experts in their respective biomaterials research subfields.ConclusionBiomaterials research plays a crucial role in advancing various scientific fields, and publishing research findings in reputablejournals is essential to disseminate knowledge and make a significant impact. The JCI 分区ranking system provides a valuable metric to assess the impact and quality of biomaterials research publications. Researchers in the field can use this ranking system to choose suitable journals for publication, enhancing the credibility and visibility of their work. By understanding the factors considered in the evaluation process and the significance of publishing in high-ranking journals, researchers can actively contribute to the growth and development of biomaterials research.。
gopubmed

Gopubmed1. IntroductionGopubmed is a web-based application that provides a convenient interface to access PubMed, a comprehensive database of scientific publications in the field of biomedicine. Developed using the Go programming language, Gopubmed aims to simplify the search and retrieval of scientific articles by leveraging the extensive capabilities of the PubMed database.2. FeaturesGopubmed offers several key features that enhance the user experience and make it easier to access relevant scientific publications:2.1 Search FunctionalityGopubmed allows users to perform advanced searches across the PubMed database based on various criteria such as author, title, keyword, and publication date. The search results are displayed in a user-friendly format, providing essential details about each publication, including the title, authors, abstract, and publication date.2.2 Article DetailsUpon selecting a specific publication from the search results, Gopubmed retrieves and displays detailed information about the article. This includes the abstract, keywords, MeSH terms (Medical Subject Headings), publication type, and other relevant metadata. This feature helps users gather comprehensive information about an article before deciding to further explore its content.2.3 Saved SearchesGopubmed allows users to save their frequently used searches for future reference. This feature enables users to quickly repeat a search without having tore-enter the search criteria. Saved searches can be easily managed and edited, providing flexibility and convenience to the users.2.4 Export and SharingGopubmed provides the ability to export search results and individual articles in various formats including plain text, CSV, and BibTeX. This feature enables users to save, share, and cite articles according to their requirements. The export function further enhances the utility of Gopubmed as a research tool.2.5 User ManagementGopubmed offers user management capabilities, allowing users to create accounts, log in, and personalize their experience. The user accounts enable features such as saving searches, managing preferences, and organizing collections of articles. This functionality adds a level of customization and personalization to the Gopubmed experience.3. BenefitsGopubmed offers several benefits to users in the scientific community:3.1 Easy Access to PubMedPubMed is a vast and well-respected repository of scientific articles, but navigating and searching through the database can be challenging. Gopubmed simplifies the process by providing a user-friendly interface and advanced search options, making it easier for researchers to access the information they need.3.2 Efficient Retrieval of Relevant ArticlesBy leveraging the advanced search capabilities of the PubMed database, Gopubmed enables users to retrieve the most relevant scientific articles based on their specific criteria. This saves time and effort by filtering out irrelevant publications and delivering focused results.3.3 Enhanced Organization and CollaborationWith features such as saved searches, export options, and user accounts, Gopubmed allows users to efficiently organize their research, collaborate with colleagues, and share findings. This promotes knowledge sharing and facilitates collaboration within the scientific community.3.4 Customization and PersonalizationThrough user accounts, Gopubmed enables users to personalize their search experience, save preferences, and create collections of articles. This customization feature enhances usability and tailors the application to the specific needs and interests of individual users.4. ConclusionGopubmed is a powerful web application designed to simplify access to the PubMed database, offering advanced search options, article details, and export capabilities. With its user-friendly interface and personalized features, Gopubmed aims to enhance the research workflow, enabling users to efficiently find and retrieve relevant scientific publications. Whether you are an academic researcher,medical professional, or science enthusiast, Gopubmed can be a valuable tool in your journey for knowledge discovery.Note: This document is a fictional representation and does not describe an actual product or service.。
中考英语阅读:Top universities in China

中考英语阅读:Top universities inChina中考英语阅读:Top universities in China(Words: about 240; Time: 3 minutes )Peking University, Tsinghua University and Zhejiang University have retained their places as the top three Chinese universities, according to a ranking guide released on May 14.The university table, which was compiled by the Wuhan-based Research Centre for China Science Evaluation in Central Chinas Hubei Province, evaluates teaching quality, staff-student rate, scientific research and school reputation.Tsinghua University, Peking University and Shanghai Jiaotong University are the top three science schools, while Peking University, Renmin University of China and Beijing Normal University are listed the best three in humanities (人文学科) and social science research.Unlike the previous two annual reports, thisyears guide also gave rankings for 192 specific subjects.Peking University ranked top in philosophy, Chinese literature, history, medical science and natural science. Tsinghua was the best place for engineering, Renmin University of China top choice for economics and law, Beijing Normal University best in education and psychology, while Zhejiang University was the top destination for computer studies.Qiu Junping, the centers director, said this years judgment began in November of 2005 and included 887 colleges on the Chinas mainland. Statistics used are from four major sources: government figures, international and domestic databases, government and university websites and authoritative publications.“The guide offers students a full range of information on the best schools in China,” he said. “It is an objective and detailed guide for students choosing full-time degree courses, and it also serves as a reference to government pol icy makers.”Help:retain v. to keep something; to continue to havesomething 保持;继续拥有compile v. to produce a book, list, report, etc. by bringing together different items, articles, etc. 编写Try this:1. In the comprehensive survey, _______________ is listed at the top of the rankings.2. According to the passage, if you want to study computer science, _________________ is the best choice.3. According to the passage, if you want to study both philosophy and literature, the top choice is__________________.(Key: 1. Peking University 2. Zhejiang University 3. Peking University)。
esi学科的排名方法

esi学科的排名方法ESI (Essential Science Indicators) is a database that provides access to a rich array of citation and publication data to measure the scientific performance and impact of institutions, countries, and journals. ESI evaluates over 22,000 journals from around the world, and its rankings are widely used as a benchmark for assessing the research quality and impact of different scientific disciplines.ESI学科排名是根据学科领域的学术研究产出和影响力进行评估的一个重要指标。
它可以帮助研究人员和机构全面了解各种学科领域的发展状况,有助于评估和比较不同学科领域的研究质量和影响力。
The ranking method used by ESI takes into account various factors such as the number of publications, citations, and the impact of the publications in a particular field. These factors are used to calculate the citation impact and other metrics to determine the ranking of different disciplines. ESI's ranking method is based on rigorous and transparent methodologies that are constantly reviewed and updated to ensure accuracy and reliability.ESI的排名方法是基于学科领域的学术文献数量、引用次数以及影响力等因素进行综合考量的。
图书馆的英语单词

图书馆的英语单词图书馆有保存人类文化遗产、开发信息资源、参与社会教育等职能。
那样你知道图书馆的英语单词是什么吗?目前跟我们一起学习关于图书馆的英语常识吧。
图书馆的英语单词librarybibliotheca图书馆的英语例句暴乱者放火烧了当地的图书馆。
The rioters torched the local library.图书馆搬迁的艰巨任务The mammoth task of relocating the library这个图书馆藏书10万卷。
This library has 100000 volumes.那城市有自身的公用图书馆和公园。
The town has its own public library and public gardens.那个图书馆从上午9点到下午6点开放。
The library is open from 9 to 6.我喜爱在图书馆学习。
I like to study in the library.我要去图书馆还这些书。
Im just going to check in these books at the library.还有一些则需要需要亲自光顾图书馆分支机构。
Some others require visiting the library branch in person.那个图书馆吸引了成千上万的学者和研?a href='http:///yangsheng/kesou/' target='_blank'>咳嗽薄?/p> The library attracts thousands of scholars and researchers.我在去图书馆的路上经过了那家商店。
I passed the store on my way to the library.图书馆的英语句子带翻译我经常到图书馆,包括学校图书馆或公共图书馆借阅书籍。
- 1、下载文档前请自行甄别文档内容的完整性,平台不提供额外的编辑、内容补充、找答案等附加服务。
- 2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
- 3、如文档侵犯您的权益,请联系客服反馈,我们会尽快为您处理(人工客服工作时间:9:00-18:30)。
a r X i v :p h y s i c s /0612122v 1 [p h y s i c s .s o c -p h ] 13 D e c 2006Ranking Scientific Publications Using a Simple Model of Network TrafficDylan Walker 1,2,Huafeng Xie 2,3,Koon-Kiu Yan 1,2,Sergei Maslov 21Department of Physics and Astronomy,Stony Brook University,Stony Brook,New York,11794,USA2Department of Condensed Matter Physics and Materials Science,Brookhaven National Laboratory,Upton,New York 11973,USA3New Media Lab,The Graduate Center,CUNY,New York,NY 10016,USA (Dated:February 2,2008)To account for strong aging characteristics of citation networks,we modify Google’s PageRank algorithm by initially distributing random surfers exponentially with age,in favor of more recent publications.The output of this algorithm,which we call CiteRank,is interpreted as approximate traffic to individual publications in a simple model of how researchers find new information.We develop an analytical understanding of traffic flow in terms of an RPA-like model and optimize parameters of our algorithm to achieve the best performance.The results are compared for two rather different citation networks:all American Physical Society publications and the set of high-energy physics theory (hep-th)preprints.Despite major differences between these two networks,we find that their optimal parameters for the CiteRank algorithm are remarkably similar.Due to their rapid growth and large size,many in-formation networks have become untenable to navigate without some sort of ranking scheme.This is particu-larly evident in the example of the World Wide Web,a network of pages connected by hyperlinks.A successful solution to the problem of ranking the Web is Google’s PageRank algorithm [1].Another class of information networks that could benefit from such a ranking method are citation networks.These networks are comprised of scientific publications connected by citation links.Current methods of ranking publications based on the total number of citations received are rather crude.They are too “democratic”in treating all citations as equal and ignoring differences in importance of citing papers.One of the advantages of Google’s PageRank algorithm is that it implicitly accounts for the importance of the citing article in a self-consistent fashion.Authors of [2]proposed using the PageRank algorithm to improve the formula used to calculate the impact factor of scientific journals.In [3]some of us directly applied this algorithm to individual papers published in all American Physical Society journals.This allowed us to discover a set of highly influential papers (“scientific gems”)that would be undervalued based on just their number of citations.However,there exist significant differences between the World Wide Web and citation networks that suggest a modification of the original PageRank algorithm.The most important difference is that,unlike hyperlinks,ci-tations cannot be updated after publication.This makes aging effects [4,5]in citation networks much more pro-nounced than in the WWW.The other consequence is the inherent time-arrow present in the topology of cita-tion networks,due to the constraint that a paper may only cite earlier works.This significantly alters the spec-tral properties of the adjacency matrix which lie at the heart of the PageRank algorithm.In particular,the ab-sence of directed loops means that the adjacency matrixcan have only zero eigenvalues.The success of the PageRank algorithm can be at-tributed,in part,to its ability to capture the behavior of people randomly browsing the network of web pages.Indeed,the PageRank of a given web page can be inter-preted as the predicted traffic (quantified e.g.,by the rate of downloads)for that page if every WWW user follows a random path of (on average)1/αhyperlinks starting from a randomly selected webpage.The assumption that a typical web-surfer starts at a randomly selected webpage might be not completely unreasonable for the WWW,but it needs to be modified for citation networks.As all of us know,researchers typically start “surfing”scientific publications from a rather recent publication that caught their attention on a daily update of a preprint archive or a recent volume of a journal.Thus a more realistic model for the traffic along the citation network should take into account that researchers preferentially start their quests from recent papers and progressively get to older and older papers with every step.In this work we introduce the CiteRank algorithm,an adaptation of the PageRank algorithm to citation net-works.Our algorithm simulates the dynamics of a large number of researchers looking for new information.Ev-ery researcher,independent of one another,is assumed to start his/her search from a recent paper or review and to subsequently follow a chain of citations until satisfied.Explicitly,we define the following two-parameter CiteR-ank model of such a process,allowing one to estimate the traffic T i (τdir ,α)to a given paper i .A recent paper is selected randomly from the whole population with a probability that is exponentially discounted according to the age of the paper,with a characteristic decay time of τdir .At every step of the path,with probability αthe researcher is satisfied and halts his/her line of inquiry.With probability (1−α)a random citation to an adja-cent paper is followed.The predicted traffic,T i (τdir ,α),2to a paper is proportional to the rate at which it is vis-ited if a large number of researchers independently follow such a simple-minded process.While we interpret the output of the CiteRank algo-rithm as the traffic,its utility ultimately lies in the ability to successfully rank publications.High CiteRank traffic to a publication denotes its high relevance in the context of currently popular research directions,while the PageR-ank number is more of a “lifetime achievement award”[3].It is fruitful to compare the CiteRank traffic to a paper,T i ,with the more traditional method of ranking publica-tions,the number of citations received.Indeed,the two are highly correlated;a result easily understood on the basis that the larger the number of citations a paper has,the more likely it will be visited by a researcher via one of the incoming links.However,the more refined CiteRank algorithm sur-passes the conventional ranking,by number of citations,in its characterization of relevancy on two accounts.Like the original PageRank algorithm [1][2],in CiteRank,the popularity of papers is calculated in a self-consistent fash-ion:The effect of a citation from a more popular paper is greater that that of a less popular one.A citation from a paper that is “highly visible”will contribute more to the visibility of the cited paper.Furthermore,the age of a citing paper is intrinsically accounted for.The effect of a recent citation to a paper is greater than that of an older citation to the same paper.New citations indicate the relevancy of a paper in the context of current lines of research.An algorithmic description of the aforementioned model can be understood as follows.The transfer ma-trix associated with the citation network is W ij =1/k outjif j cites i and 0otherwise,where k outj is the out-degree of the jth paper.Let ρi ,the probability of initially se-lecting the i th paper in a citation network,be given by ρi =e −age i /τdir .The probability that the researcher will encounter a paper by initial selection alone is given by ρ.Similarly,the probability of encountering the paper after following one link is (1−α)W · ρ.The CiteRank traffic of the paper is then defined as the probability of encountering it via paths of any length:T =I · ρ+(1−α)W · ρ+(1−α)2W 2· ρ+···(1)Practically,we calculate the CiteRank traffic on all pa-pers in our dataset by taking successive terms in the above expansion to sufficient convergence (<10−10of the average value).In order to assess the viability of this ranking scheme and to select optimal parameters (τdir ,α),we need a quantitative measure of its performance on real cita-tion networks.Two real citation networks are evaluated.Hep-th :An archive snapshot of the “high energy physics theory”archive from April 2003(preprints ranging from 1992to 2003).This dataset,containing around 28,000papers and 350,000citation links,was downloaded from [6].Physrev :Citation data between journals publishedby the American Physical Society [7].This dataset con-tains around 380,000papers and 3,100,000citation linksranging from 1893to 2003.Of course,evaluating the performance of any ranking scheme is a delicate,but often necessary,matter.One way to select the best performing αand τdir is to optimize the correlation between the predicted traffic,T i (τdir ,α)and the actual traffic (e.g.,downloads).Unfortunately,the actual traffic data for scientific publications are not readily available for these networks.However,it is rea-sonable to assume that traffic to a paper is positively correlated with the number of new citations it accrues over a recent time interval,∆k in .For lack of better in-tuition we first assume a linear relationship between ac-tual traffic and number of recent citations accrued.This corresponds to a simple-minded scenario in which every researcher downloading a paper will,with a small proba-bility,add it to the citation list of the manuscript he/she is writing [8].In order to compare CiteRank with actual citation accrual,we constructed an historical snapshot of the networks.In both cases,the most recent 10percent of papers are pruned from the network.The CiteRank traffic,T i ,of the remaining 90percent of the papers is then evaluated and correlated with their actual accrual of new citations,∆k in ,originating at the most recent 10percent of papers.It is important to note the qualitative0.20.40.60.20.30.40.50.6FIG.1:The Pearson (linear)correlation coefficient between the number of recent citations accrued (∆k in )and CiteRank traffic (T i )is calculated over the parameter space of the Cit-eRank model for the hep-th (A)and physrev (B)network.Both networks exhibit peaks in correlation coefficient in the α-τdir plane.The highest correlation is achieved for α=0.48,τdir =1year in the hep-th network and α=0.50,τdir =2.6years,in the physrev network.and quantitative differences between the two citation net-works considered.The Physical Review citation network (physrev)is comprised of a large number (∼400,000)of peer-reviewed publications acquired over a period close to a hundred years.The high-energy physics archive citation network (hep-th)is comprised completely of a3 much smaller number(∼28000)of electronically sub-mitted publication preprints,with no associated form ofpeer review.Despite these significant differences in thenature of the networks considered,the general features oftheir correlation contours are outstandingly similar.Inboth cases,a single sharp peak in correlation is evidentfor particular values of the parameters.The value of theoptimal parameters for both networks are:hep-th:α=0.48,τdir=1yearphysrev:α=0.50,τdir=2.6yearsRemarkably,the value ofαis nearly the same for tworather different networks considered here and is in agree-ment with that proposed in[3]on purely empiricalgrounds.The difference in optimal parameterτdir for these networks is in agreement with the common-sense expectation of faster response time(and hence faster ag-ing of citations)in preprint archives compared to peer-reviewed publications.Another feature of Fig.1is that,in both networks,large values of the correlation coefficient are concentrated along a diagonally-positioned ridge.In other words,the best choice ofαfor a givenτdir seems to rise linearly withτdir,a behavior that will be revisited later in this text.The resultant CiteRank traffic and corresponding ranking for the two citation networks can be accessed here[9].While the correlation contour plots shown in Fig.1 are a promising indication that the CiteRank model of traffic provides a good zero-order approximation to the actual traffic along a citation network,they are to some extent predicated on the assumption of a linear relation-ship between actual traffic and∆k in.One might readily ask how this model fares in the absence of such an as-sumption.While the assumption of a linear relationship may be unreasonable,a positive,monotonic relationship between these quantities is certainly expected.There is a statistical correlation method precisely adapted for such a situation,namely,the Spearman rank correlation.Un-der this relaxed correlation measure,only the rank of T i are correlated with the rank of∆k in.Numerical changes in T i that do not lead to reordering have no effect on the value of the rank correlation coefficient.Another ratio-nale for using rank correlations is that our ultimate goal is ranking publications,not modeling the traffic.Thus, we are currently not interested in individual T i’s,but only in their relative values.Spearman correlation con-tour plots are constructed for both networks and shown in Fig.2.The optimal values for both networks are: hep-th:α=0.31,τdir=1.6yearphysrev:α=0.55,τdir=8yearsThese results roughly confirm the prediction ofα∼0.5 from Fig.1,however there is a more appreciable discrep-ancy inτdir between linear and rank correlation for both networks.In both panels of Fig.1,over a broad range of parame-ters,the optimal value ofα(τdir)for a given value ofτdir0.450.50.550.60.350.40.450.50.55FIG.2:The Spearman rank correlation coefficient between recent citations accrued(∆k in)and CiteRank traffic(T i)for the hep-th(A)and physrev(B)network.Both networks ex-hibit similar behavior.There are more extended regions of good correlation relative to the linear correlation contours of fig. 1.This broadening is expected as a consequence of the more relaxed correlation measure.The highest rank corre-lation occurs forα=0.31,τdir=1.6years,in the hep-th network andα=0.55,τdir=8years,in the physrev net-work.is positively correlated withτdir.This is an indication that these two parameters are entangled.In fact,this is to be expected as it is some admixture of the two param-eters which leads to the exposure of a given paper to the researcher.An intuitive picture of this entanglement can be understood in terms of the penetration depth,which is a measure of how far back in time a random surfer follow-ing rules of the CiteRank algorithm is likely to get.The penetration depth is affected by bothτdir-the average age of the initial paper at which he/she started follow-ing the chain of citations,and1/α-the mean number of steps on this chain of citations.For smallτdir and large α,the penetration depth is small,implying that only very recent papers receive traffic.On the other hand,for large τdir and smallα,the penetration depth is very large,in-dicating that most of the traffic is directed towards older papers.To better understand howαandτdir influence the age distribution of CiteRank traffic,we performed the fol-lowing quantitative analysis.Let T tot(t)denote the to-tal CiteRank model traffic to papers written exactly t years ago.As described by Eq.1,two distinct pro-cesses contribute to T tot(t).Thefirst is the“direct”traf-fic T dir(t)due to the initial selection of papers in this age group,which is proportional to exp(−t/τdir)[11]. The second is the“indirect”traffic T ind(t)arriving via one of the incoming citation links.The latter is given byT ind(t)=(1−α) ∞t T tot(t′)P c(t′,t)dt′,where P c(t′,t)is the fraction of citations originating from papers of age t′that cite papers of age t.It should be noted that P c(t′,t)is an empirical distribution and,as such,is a measured property of the citation network under consid-eration.According to[5]and our ownfindings,P c(t′,t) is reasonably well approximated by the exponential form 11912),Series II(1913-1969),and Series III(1970-present).This latter series includes thefive topical sec-tions:Phys.Rev.A,B,C,D,and E.Also included are Phys.Rev.Lett.,Rev.Mod.Phys.,and Phys.Rev.Spe-cial Topics,Accelerators and Beams.[8]It should be noted that we make no attempt to modelnetwork growth in this paper.[9]/~maslov/citerank/[10]J.Jensen,A.Mackintosh,Rare Earth Magnetism:Struc-tures and Excitations,155,Clarendon Press,Oxford, 1991.[11]Precisely speaking T dir(t)in the CiteRank model is givenby N p(t)–the number of papers of age t–multipliedby the exponential probability of selection exp(−t/τdir).Since N p(t)itself often has approximately exponentialform with time constantτp,τdir used in the follow-ing equations should be“renormalized”to˜τdir=τdir·τp/(τp+τdir).However,τp is usually rather large(∼28years in the PhysRev network).Thus except for verylargeτdir’s this renormalization can be safely ignored.[12]M.V.Simkin,V.P.Roychowdhury,Complex Syst.14,269(2003).[13]The apparent disagreement in the tail involves profounddips due to the World War II and I[5],which of coursecannot be explained by any theoretical model.。