Comprehensive Data Warehouse Glossary
(Full Version) Key Database Terms (Chinese-English)

Vocabulary summary (the more specialized database vocabulary is essentially the content of the review items at the end of each chapter, listed briefly here; if you really have no time to read the book, you should at least recognize these words):

1. Database systems (数据库系统): database system (DS, 数据库系统), database management system (DBMS, 数据库管理系统)
2. Relations and relational databases (关系和关系数据库): table (表) = relation (关系), column (列) = attribute (属性), domain (域), atomic domain (原子域), row (行) = tuple (元组), relational database (关系数据库), relation schema (关系模式), relation instance (关系实例), database schema (数据库模式), database instance (数据库实例)
3. Keys (码): super key (超码), candidate key (候选码), primary key (主码), foreign key (外码), referencing relation (参照关系), referenced relation (被参照关系)
4. Relational algebra (关系代数): selection (选择), projection (投影), natural join (自然连接), Cartesian product (笛卡尔积), set operations (集合运算), union (并), intersection (交), set difference (except/minus, 集合差), rename (更名), assignment (赋值), outer join (外连接), grouping (分组), tuple relational calculus (元组关系演算)
5. Components of SQL (SQL组成):
DDL (数据库模式定义语言), data definition language; keyword: create
DML (数据操纵语言), data manipulation language; keywords: insert, delete, update
DCL (数据库控制语言), data control language; keywords: grant, revoke
DQL (数据库查询语言), data query language; keyword: select
6. The SQL language (SQL语言): DDL, DML, DCL, DQL, SQL query structure (SQL查询结构), aggregate functions (聚集函数), nested subqueries (嵌套子查询), exists as an operator (exists运算符), unique as an operator (unique运算符), scalar subquery (标量子查询), assertion (断言), index/indices (索引), catalogs (目录), authorization (授权), all privileges (所有权限), granting (授予), revoking (收回), grant option, trigger (触发器), stored procedure (存储过程), stored function (存储函数)
7. Table structure (表结构相关): integrity constraints (完整性约束), domain constraints (域约束), referential integrity constraints (参照完整性约束)
8. Database design, E-R model (数据库设计,ER模型): entity-relationship data model (实体-联系数据模型), E-R diagram (E-R图), composite attribute (复合属性), single-valued and multivalued attributes (单值和多值属性), derived attribute (派生属性), binary relationship set (二元联系集), degree of relationship set (联系集的度), mapping cardinality (映射基数), 1-1, 1-m, and m-n relationship sets (one-to-one 一对一, one-to-many 一对多, many-to-many 多对多), participation (参与), partial or total participation (部分参与或全部参与), weak entity sets (弱实体集), discriminator attributes (分辨符属性), specialization and generalization (特化和概化)
9. Functional dependency theory (函数依赖理论): functional dependency (函数依赖), normalization (规范化), lossless-join (or lossless) decomposition (无损连接分解), first normal form (1NF, 第一范式), third normal form (3NF, 第三范式), Boyce-Codd normal form (BCNF, BC范式), R satisfies F (R满足F), F holds on R (F在R上成立), dependency preservation (保持依赖), trivial (平凡的), closure of a set of functional dependencies (函数依赖集的闭包), closure of a set of attributes (属性集闭包), Armstrong's axioms (Armstrong公理), reflexivity rule (自反律), augmentation rule (增广律), transitivity rule (传递律), restriction of F to Ri (F在Ri上的限定), canonical cover (正则覆盖), extraneous attributes (无关属性), decomposition algorithm (分解算法)
10. Transactions (事务): transaction (事务), ACID properties (ACID特性), concurrency control system (并发控制系统), recovery system (故障恢复系统), transaction state (事务状态), active (活动的), partially committed (部分提交的), failed (失败的), aborted (中止的), committed (提交的), terminated (已结束的), schedule (调度), conflict of operations (操作冲突), conflict equivalence (冲突等价), conflict serializability (冲突可串行化), serializability order (可串行化顺序), cascading rollback (级联回滚), locking protocol (封锁协议), shared-mode lock (S-lock, 共享锁), exclusive-mode lock (X-lock, 排他锁), compatibility (相容性), two-phase locking protocol (两阶段封锁协议), intention lock (意向锁), timestamp (时间戳), recovery scheme (恢复机制), log (日志), log-based recovery (基于日志的恢复), deferred modification (延迟的修改), immediate modification (立即的修改), checkpoint (检查点)

Review terms by chapter:

数据库系统 (DBS) database system
数据库系统应用 database-system applications
文件处理系统 file-processing system
数据不一致性 data inconsistency
一致性约束 consistency constraint
数据抽象 data abstraction
实例 instance
模式 schema
物理模式 physical schema
逻辑模式 logical schema
物理数据独立性 physical data independence
数据模型 data model
实体-联系模型 entity-relationship (E-R) model
关系数据模型 relational data model
基于对象的数据模型 object-based data model
半结构化数据模型 semistructured data model
数据库语言 database language
数据定义语言 data-definition language
数据操纵语言 data-manipulation language
查询语言 query language
元数据 metadata
应用程序 application program
规范化 normalization
数据字典 data dictionary
存储管理器 storage manager
查询处理器 query processor
事务 transaction
原子性 atomicity
故障恢复 failure recovery
并发控制 concurrency control
两层和三层数据库体系结构 two-tier and three-tier database architectures
数据挖掘 data mining
数据库管理员 (DBA) database administrator

表 table
关系 relation
元组 tuple
空值 null value
数据库模式 database schema
数据库实例 database instance
关系模式 relation schema
关系实例 relation instance
码 keys
超码 super key
候选码 candidate key
主码 primary key
外码 foreign key
参照关系 referencing relation
被参照关系 referenced relation
属性 attribute
域 domain
原子域 atomic domain
参照完整性约束 referential integrity constraint
模式图 schema diagram
查询语言 query language
过程化语言 procedural language
非过程化语言 nonprocedural language
关系运算 operations on relations
选择元组 selection of tuples
选择属性 selection of attributes
自然连接 natural join
笛卡尔积 Cartesian product
集合运算 set operations
关系代数 relational algebra

SQL查询结构 SQL query structure
select子句 select clause
from子句 from clause
where子句 where clause
自然连接运算 natural join operation
as子句 as clause
order by子句 order by clause
相关名称(相关变量,元组变量) correlation name (correlation variable, tuple variable)
集合运算 set operations: union, intersect, except
空值 null values
真值"unknown" truth value "unknown"
聚集函数 aggregate functions: avg, min, max, sum, count; group by; having
嵌套子查询 nested subqueries
集合比较 set comparisons: {<, <=, >, >=} {some, all}; exists; unique
lateral子句 lateral clause
with子句 with clause
标量子查询 scalar subquery
数据库修改 database modification
删除 deletion
插入 insertion
更新 updating
参照完整性 referential integrity
参照完整性约束 referential-integrity constraint, 或子集依赖 subset dependency
可延迟的 deferrable
断言 assertion

连接类型 join types
内连接和外连接 inner and outer join
左外连接、右外连接和全外连接 left, right, and full outer join
natural连接条件、using连接条件和on连接条件 natural, using, and on join conditions
视图定义 view definition
物化视图 materialized views
视图更新 view update
事务 transactions
提交 commit work
回滚 rollback work
原子事务 atomic transaction
完整性约束 integrity constraints
域约束 domain constraints
唯一性约束 unique constraint
check子句 check clause
参照完整性 referential integrity
级联删除 cascading deletes
级联更新 cascading updates
断言 assertions
日期和时间类型 date and time types
默认值 default values
索引 index
大对象 large object
用户定义类型 user-defined types
域 domains
目录 catalogs
模式 schemas
授权 authorization
权限 privileges: 选择 select, 插入 insert, 更新 update, 所有权限 all privileges
授予权限 granting of privileges
收回权限 revoking of privileges
授予权限的权限 privilege to grant privileges (grant option)
角色 roles
视图授权 authorization on views
执行授权 execute authorization
调用者权限 invoker privileges
行级授权 row-level authorization

JDBC
ODBC
预备语句 prepared statements
访问元数据 accessing metadata
SQL注入 SQL injection
嵌入式SQL embedded SQL
游标 cursors
可更新的游标 updatable cursors
动态SQL dynamic SQL
SQL函数 SQL functions
存储过程 stored procedures
过程化结构 procedural constructs
外部语言例程 external language routines
触发器 trigger
before和after触发器 before and after triggers
过渡变量和过渡表 transition variables and tables
递归查询 recursive queries
单调查询 monotonic queries
排名函数 ranking functions: rank, dense rank, partition by
分窗 windowing
联机分析处理 (OLAP) online analytical processing
多维数据 multidimensional data
度量属性 measure attributes
维属性 dimension attributes
转轴 pivoting
数据立方体 data cube
切片和切块 slicing and dicing
上卷和下钻 rollup and drill down
交叉表 cross-tabulation

Chapter 7
实体-联系数据模型 entity-relationship data model
实体和实体集 entity and entity set
属性 attribute
域 domain
简单和复合属性 simple and composite attributes
单值和多值属性 single-valued and multivalued attributes
空值 null value
派生属性 derived attribute
超码、候选码以及主码 super key, candidate key, and primary key
联系和联系集 relationship and relationship set
二元联系集 binary relationship set
联系集的度 degree of relationship set
描述性属性 descriptive attributes
超码、候选码以及主码 super key, candidate key, and primary key
角色 role
自环联系集 recursive relationship set
E-R图 E-R diagram
映射基数 mapping cardinality
一对一联系 one-to-one relationship
一对多联系 one-to-many relationship
多对一联系 many-to-one relationship
多对多联系 many-to-many relationship
参与 participation
全部参与 total participation
部分参与 partial participation
弱实体集和强实体集 weak entity sets and strong entity sets
分辨符属性 discriminator attributes
标识联系 identifying relationship
特化和概化 specialization and generalization
超类和子类 superclass and subclass
属性继承 attribute inheritance
单和多继承 single and multiple inheritance
条件定义的和用户定义的成员资格 condition-defined and user-defined membership
不相交概化和重叠概化 disjoint and overlapping generalization
全部概化和部分概化 total and partial generalization
聚集 aggregation
UML
UML类图 UML class diagram

Chapter 8
E-R模型和规范化 E-R model and normalization
分解 decomposition
函数依赖 functional dependencies
无损分解 lossless decomposition
原子域 atomic domains
第一范式 (1NF) first normal form (1NF)
合法关系 legal relations
超码 super key
R满足F R satisfies F
F在R上成立 F holds on R
Boyce-Codd范式 (BCNF) Boyce-Codd normal form (BCNF)
保持依赖 dependency preservation
第三范式 (3NF) third normal form (3NF)
平凡的函数依赖 trivial functional dependencies
函数依赖集的闭包 closure of a set of functional dependencies
Armstrong公理 Armstrong's axioms
属性集闭包 closure of attribute sets
F在Ri上的限定 restriction of F to Ri
正则覆盖 canonical cover
无关属性 extraneous attributes
BCNF分解算法 BCNF decomposition algorithm
3NF分解算法 3NF decomposition algorithm
多值依赖 multivalued dependencies
第四范式 (4NF) fourth normal form (4NF)
多值依赖的限定 restriction of a multivalued dependency
投影-连接范式 (PJNF) project-join normal form (PJNF)
域-码范式 (DKNF) domain-key normal form (DKNF)
泛关系 universal relation
唯一角色假设 unique-role assumption
去规范化 denormalization
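Several of the terms above (closure of attribute sets, Armstrong's axioms, super key) fit together in one small computation. The following is a minimal sketch, not taken from the source: the function names `attribute_closure` and `is_superkey`, the schema R(A, B, C, D), and the dependency set F = {A → B, B → C} are all hypothetical and chosen only for illustration.

```python
def attribute_closure(attrs, fds):
    """Return the closure of `attrs` under the functional dependencies `fds`.

    `fds` is a list of (lhs, rhs) pairs, each side a set of attribute names.
    Repeatedly applies any dependency whose left side is already contained
    in the closure, until nothing new can be added.
    """
    closure = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if set(lhs) <= closure and not set(rhs) <= closure:
                closure |= set(rhs)
                changed = True
    return closure


def is_superkey(attrs, schema, fds):
    """A set of attributes is a super key if its closure covers the schema."""
    return attribute_closure(attrs, fds) >= set(schema)


# Hypothetical schema R(A, B, C, D) with F = {A -> B, B -> C}.
FDS = [({"A"}, {"B"}), ({"B"}, {"C"})]

print(sorted(attribute_closure({"A"}, FDS)))  # ['A', 'B', 'C']
print(is_superkey({"A", "D"}, "ABCD", FDS))   # True
print(is_superkey({"A"}, "ABCD", FDS))        # False
```

In this example A alone is not a super key because D never appears in its closure, while {A, D} is.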
Common English Vocabulary in the Digital Field

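The following are some common English terms in the digital field:
① Algorithm (算法): a well-defined set of steps or rules for solving a problem.
② Hyperlink (超链接): a link that lets users navigate to other web pages or resources.
③ Internet (互联网): a network formed by interconnecting many computer networks.
④ E-commerce (电子商务): conducting business activities by electronic means, including online shopping and payment.
⑤ E-mail (电子邮件): an electronic message sent over the Internet, typically containing text, images, or video.
⑥ Web browser (网页浏览器): a tool for browsing web pages on the Internet.
⑦ World Wide Web (万维网): a system of interlinked web pages, usually accessed through a browser.
⑧ URL (Uniform Resource Locator, 统一资源定位符): an address that identifies a resource on the Internet.
⑨ FTP (File Transfer Protocol, 文件传输协议): a protocol for transferring files over the Internet.
⑩ Network (网络): a system of computers, devices, and data transmission lines interconnected to share resources, communicate, and exchange information.
⑪ Server (服务器): a computer that provides services, typically storing data or serving web pages.
⑫ Client (客户端): a computer that uses the services provided by a server, usually the user's own computer.
⑬ Database (数据库): a system for storing, managing, and retrieving data.
⑭ Software Development Kit (软件开发工具包): a collection of tools and libraries for developing software applications.
⑮ Operating System (操作系统): the system that manages a computer's hardware and software resources.
⑯ Application (应用程序): a software program that performs a specific task.
⑰ Cloud computing (云计算): a model that uses the network to deliver and manage data and applications from central remote servers.
⑱ Virtual reality (虚拟现实): a simulated environment created with computer technology, in which users can interact.
⑲ Artificial intelligence (人工智能): technology that simulates human intelligence, including machine learning and natural language processing.
⑳ Data science (数据科学): an interdisciplinary field that combines statistics, computer science, and domain expertise to analyze data quantitatively and extract insights.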
50 Technical Terms You Should Know About Big Data

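1. Big Data (大数据) - refers to data sets that are massive in scale, complex, and constantly changing. Such data accumulates and is generated continuously in every field, and it covers structured, semi-structured, and unstructured data.
2. Data Mining (数据挖掘) - the process of automatically discovering and extracting useful information from big data. It uses techniques such as statistics, pattern recognition, and machine learning to help interpret data and uncover hidden patterns and regularities.
3. Cloud Computing (云计算) - a model for delivering computing resources and services over the Internet. Big data usually requires very large computing and storage capacity, and cloud computing provides elastic, reliable resources to meet that need.
4. Data Warehouse (数据仓库) - a centralized system for storing and managing structured data. Its data has been cleaned and integrated so that users can run complex analyses and queries.
5. Data Lake (数据湖) - a centralized storage system that holds data of all types and formats. Unlike a data warehouse, a data lake does not require the data model and structure to be defined in advance, so it can handle complex analysis needs more flexibly.
6. Hadoop - an open-source distributed computing framework for processing large data sets. It is built around the MapReduce programming model and distributes and processes data efficiently.
7. MapReduce - a parallel programming model for processing large data sets. It splits the data into many small chunks, distributes them to multiple compute nodes for parallel processing, and finally merges the results and returns them.
8. Spark - a fast, general-purpose, high-level big data processing engine. It supports in-memory computation and can perform complex processing and analysis on large data sets.
9. Data Visualization (数据可视化) - the process of presenting data as charts, graphics, and other visual forms. It helps users understand and analyze data and discover latent information and insights.
10. Data Cleaning (数据清洗) - the process of handling and correcting errors, missing values, and inconsistencies in data. Cleaned data is more accurate and reliable, which benefits later analysis and applications.
11. Data Integration (数据集成) - the process of merging data from different sources into a single unified data set.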
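The MapReduce entry above describes three steps: split the data into chunks, process the chunks independently, and merge the results. A minimal sketch of that flow for word counting follows. It runs in a single process purely for illustration; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the sample chunks are hypothetical and are not the API of Hadoop or any other framework, which would run the map and reduce steps on separate nodes.

```python
from collections import defaultdict
from itertools import chain


def map_phase(chunk):
    """Map step: emit a (word, 1) pair for every word in one chunk."""
    return [(word.lower(), 1) for word in chunk.split()]


def shuffle(pairs):
    """Shuffle step: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce step: combine the values of each key into one result."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(chunks):
    # Each chunk is mapped independently; a real cluster would do this
    # on different nodes in parallel.
    mapped = chain.from_iterable(map_phase(chunk) for chunk in chunks)
    return reduce_phase(shuffle(mapped))


chunks = ["big data big", "data lake", "big lake"]
print(word_count(chunks))  # {'big': 3, 'data': 2, 'lake': 2}
```

Because each call to `map_phase` depends only on its own chunk, the map work can be spread across machines without coordination; only the shuffle step needs to bring pairs with the same key together.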
Glossary of Data Warehouse Terms

A

Access Path
The path chosen by a database management system to retrieve the requested data.

Access Provider
A company that provides its customers a service whereby they can access the Internet. The user normally connects to the access provider's computer via a modem using a dial-up connection.

Active Attack
A persistent security assault by someone trying to gain restricted access by altering data. There are multiple techniques, decryption for example, which can be used to lead the attack.

Active Server Pages (ASP)
Active server pages are a set of software components that run on a Web server and allow Web developers to build dynamic Web pages.

Activity-Based Costing (ABC)
Activity-based costing (ABC) is an information system that maintains and processes data on a firm's activities and products. It identifies the activities performed, traces cost to these activities, and then uses various cost drivers to trace the cost of activities to products.

Activity-Based Management (ABM)
Activity-based management (ABM) is the use of the activity-based costing tool by process owners to control and improve their operations. Because process analysis is conducted in the building of an activity-based cost model, management knows its business much better and can consequently evaluate value-added and non-value-added activities. Because a certain volume of work produces a certain outcome, "what if" analysis can be conducted to determine what resources are required if operations are scaled back or expanded.

Ad Clicks
Also called clickthroughs. The number of times a user "clicks" on an online ad, often measured as a function of time ("ad clicks per day").

Ad Hoc Query
Any query that cannot be determined prior to the moment the query is issued. A query that consists of dynamically constructed SQL, which is usually constructed by desktop-resident query tools.

Ad Hoc Query Tool
An end-user tool that accepts an English-like or point-and-click request for data and constructs an ad hoc query to retrieve the desired result.

Administrative Data
In a data warehouse, the data that helps a warehouse administrator manage the warehouse. Examples of administrative data are user profiles and order history data.

Aggregate Data
Data that is the result of applying a process to combine data elements. Data that is taken collectively or in summary form.

Aggregator
An e-commerce business model in which the Web site sells products or services that it does not produce or warehouse. An aggregator creates an environment where multiple providers (sellers) must compete on terms determined by the user.

Alerts
A notification from an event that has exceeded a pre-defined threshold.

Analyst
Someone who creates views for analytic interpretation of data, performs calculations and distributes the resulting information in the form of reports.

Analytic Applications
Packaged software that meets three distinct conditions: process support, separation of function, and time-oriented, integrated data. Analytic applications expand the reach of business intelligence to an extended user base, packaging these technologies in a business context.

Analytics
The process and techniques for the exploration and analysis of business data to discover and identify new and meaningful information and trends that allow for analysis to take place.

Applet
A small Java program that can be embedded in an HTML page. Applets cannot access certain resources on local computers, such as files and serial devices, and are prohibited from communicating with most other computers across a network.

Application Service Provider (ASP)
ASPs provide the infrastructure needed to deliver reliable application access, including enterprise applications, hardware platforms, operating systems, database systems and network hardware, as well as the technical expertise to make it all work, for a monthly service charge.

ASCII
American Standard Code for Information Interchange. A seven-bit code for character representation, often stored in eight bits with the eighth bit used for parity.

ASP
Application Service Provider. A company that offers access over the Internet to application programs and related services that would otherwise have to be located on customers' own personal or enterprise computers.

Atomic Data
Data elements that represent the lowest level of detail. For example, in a daily sales report, the individual items sold would be atomic data, while rollups such as invoice and summary totals from invoices are aggregate data.

Attribute
A field represented by a column within an object (entity). An object may be a table, view or report. An attribute is also associated with an SGML (HTML) tag used to further define the usage.

Authorization Request
A request initiated by a consumer to access data for which the consumer does not presently have access privileges.

Authorization Rules
Criteria used to determine whether or not an individual, group, or application may access reference data or a process.

Availability
User access to applications and/or data stores that reside and execute on computing systems accessing information that resides in files and databases supported by an organization's various operating environments.

B

B2B
Business-to-business commerce conducted over the Web.

B2C
Business-to-consumer commerce conducted over the Internet. It links consumers to commercial entities in one-way networks.

Balanced Scorecard
A comprehensive, top-down view of organizational performance with a strong focus on vision and strategy. In 1992 the founding fathers of the Balanced Scorecard, Drs. Robert Kaplan and David Norton, debuted their methodology in the Harvard Business Review. Then, in 1996, they released The Balanced Scorecard: Translating Strategy into Action, the so-called bible of the Balanced Scorecard.

Balanced Scorecard Collaborative
A professional services firm dedicated to the worldwide awareness, use, enhancement and integrity of the balanced scorecard as a value-added management process.

Balanced Scorecard Collaborative Certification
An industry-standard certification offered to software providers whose balanced scorecard applications meet the functional standards of Kaplan and Norton. These are applications that will enable end users to achieve the benefits of the balanced scorecard management process.

Baldrige Criteria for Performance Excellence
Criteria providing a systems perspective for understanding performance management. They reflect validated, leading management practices against which an organization can measure itself. With their acceptance nationally and internationally as the model for performance excellence, the criteria represent a common language for communication among organizations for sharing best practices.

Banner
A picture or graphic that stretches horizontally across a Web page. Banners can be used to title the Web page, start or separate different sections, create links to other Web pages, or provide a place for advertisements.

Banner Advertising
A marketing mechanism consisting of strips of advertisements that are sporadically positioned on a Web page and are extremely popular on the World Wide Web. These types of ads generally take up a considerable amount of bandwidth and are sometimes disturbing to the Web user.

Base Tables
The normalized data structures maintained in the target warehousing database. Also known as the detail data.

Basel II New Accord (Basel 2, New Accord)
A set of banking standards that will regulate finance and banking for internationally active banks, including those in the countries of the European Union. The Basel Committee on Banking Supervision is tasked with the goal of completing the New Accord by mid-year 2004, with implementation to take effect in member countries by year-end 2006. To that end, work already has begun in a number of countries on draft rules that would integrate Basel capital standards with national capital regimes. Basel II is focused specifically on global banks and financial institutions and ensures liquidity of those institutions for the protection of public trust.

Benchmarking
A point of reference for measurement.

Benefit Segmentation
The process of grouping customers into market segments according to the benefits they seek from the product. Refers to their needs and wants only.

Best Practices
A case study considered to be a good example of a business discipline.

Bidirectional Extracts
The ability to extract, cleanse and transfer data in two directions among different types of databases, including hierarchical, networked and relational databases.

Braking Mechanism
A software mechanism that prevents users from querying the operational database once transaction loads reach a certain level.

Bricks and Mortar
Refers to businesses that exist in the real world as opposed to just the cyber world, such as bricks-and-mortar retail outlets, bricks-and-mortar warehouses, etc.

Browser
The generic term for software programs that retrieve, display and print information from the World Wide Web. The most popular browsers are Microsoft Internet Explorer, Netscape Navigator and Mosaic. Mosaic was the first browser to introduce graphics. Previously, users were only allowed to view the text of Web pages. When this glossary was written, Microsoft Internet Explorer was the most popular browser in the world.

Bulk Data Transfer
A software-based mechanism designed to move large data files. It supports compression, blocking and buffering to optimize transfer times.

Business Activity Monitoring (BAM)
BAM is a business solution that is supported by an advanced technical infrastructure that enables rapid insight into new business strategies, the reduction of operating cost by real-time identification of issues, and improved process performance.

Business Architecture
One of the four layers of an information systems architecture. A business architecture describes the functions a business performs and the information it uses.

Business Continuity
The degree to which an organization may achieve uninterrupted stability of systems and operational procedures.

Business Data
Information about people, places, things, business rules, and events, which is used to operate the business. It is not metadata. (Metadata defines and describes business data.)

Business Drivers
The people, information, and tasks that support the fulfillment of a business objective.

Business Intelligence (BI)
Business intelligence is an environment in which business users receive data that is reliable, consistent, understandable, easily manipulated and timely. With this data, business users are able to conduct analyses that yield overall understanding of where the business has been, where it is now and where it will be in the near future. Business intelligence serves two main purposes. It monitors the financial and operational health of the organization (reports, alerts, alarms, analysis tools, key performance indicators and dashboards). It also regulates the operation of the organization, providing two-way integration with operational systems and information feedback analysis.

Business Intelligence Platform
A foundation of enabling tools and technologies necessary for the development and deployment of business intelligence and business performance management applications.

Business Intelligence Service Provider (BISP)
A natural extension of the ASP: the application of data warehousing and business intelligence (BI) methodologies and technologies to the ASP model. BISPs tie into information systems behind a corporation's firewall, providing traditional data warehouse and analytic application capabilities for Internet-based e-businesses, especially e-commerce Web sites, and are hosted off site.

Business Intelligence Software
A category of software that enables companies to access, analyze and share information to understand how the business is performing and to improve decision making.

Business Intelligence Tools
The tools and technologies used to access and analyze business information. They include online analytical processing (OLAP) technologies, data mining and advanced analytics; end-user tools for ad hoc query and analysis; enterprise-class query, analysis and reporting, including dashboards for performance monitoring; and production reporting against all enterprise data sources.

Business Model
A view of the business at any given point in time. The view can be from a process, data, event or resource perspective, and can be the past, present or future state of the business.

Business Performance Calibration (BPC)
The continuous, near real-time forecasting and analysis of related performance metrics to achieve balanced performance, i.e., efficient growth and the optimal management of resources.

Business Performance Intelligence (BPI)
A subset of the BI market that involves planning and budgeting, Balanced Scorecard performance management and activity-based costing.

Business Performance Management (BPM)
Applications that help direct modeling or scenario exploration activities. Rather than simply exploring what happened and why, the application can help the user consider the implications of alternative courses of action before they become operational. Performance management suggests an explicit relationship to action, and modeling is the key link to do this.

Business Performance Measurement
Applications that provide support for specific KPIs (key performance indicators) and enable a business to measure its performance. This is often coupled with comparative information from industry sources, so a company can compare its performance against that of others in its industry. Business performance measurement applications support the analysis phase of the business improvement cycle.

Business Transaction
A unit of work acted upon by a data capture system to create, modify, or delete business data. Each transaction represents a single-valued fact describing a single business event.

C

C-Commerce (Collaborative Commerce)
A business strategy that motivates value-chain partners with a common business interest to generate value through sharing information at all phases of the business cycle (from product development to distribution).

C2B
The financial interaction, initiated by a consumer, between a consumer and a business.

Cache
Pronounced "cash".
The storage of recently visited sites and data, which can be accessed from computer memory instead of contacting the server each time you return to the site. This speeds up access, but does not reflect any changes made to the site while it is in the cache. On rapidly changing sites you may need to click the reload button in order to read the most recent changes.

Call Center
The part of an organization that handles inbound/outbound communications with customers.

Campaign Management
Detailed tracking, reporting and analysis that provides precise measurements regarding current marketing campaigns, how they are performing and the types of leads they attract.

Cartesian Product
A Cartesian join produces a Cartesian product. A Cartesian join joins every row of one table to every row of another table. You can also get one by joining every row of a table to every row of itself.

Cascading Style Sheet (CSS)
Cascading style sheets is a style sheet language that enables authors and users to attach style (fonts, spacing and aural cues) to structured documents, including HTML and XML applications.

CASE
Computer Aided Software Engineering.

CASE Management
The management of information between multiple CASE "encyclopedias," whether the same or different CASE tools.

Catalog
A component of a data dictionary that contains a directory of its DBMS objects as well as attributes of each object.

Cell
A data point defined by one member of each dimension of a multidimensional structure. Often, potential cells in multidimensional structures are empty, leading to "sparse" storage.

Central Warehouse
A database created from operational extracts that adheres to a single, consistent, enterprise data model to ensure consistency of decision-support data across the corporation. A style of computing where all the information systems are located and managed from a single physical location.

Change Data Capture
The process of capturing changes made to a production data source. Change data capture is typically performed by reading the source DBMS log. It consolidates units of work, ensures data is synchronized with the original source, and reduces data volume in a data warehousing environment.

Churn
Describes customer attrition. A high churn rate implies high customer disloyalty.

Classic Data Warehouse Development
The process of building an enterprise business model, creating a system data model, defining and designing a data warehouse architecture, constructing the physical database, and lastly populating the warehouse's database.

Clicks and Mortar
A business that has successfully integrated its online existence with its offline, real-world existence. For example, a retail store that allows customers to order products online or purchase products at its store location.

Clickthrough
The percentage of advertisements or other content a user clicks on or chooses to view.

Client
A software program used to contact and obtain data from a server software program on another computer. Each client program is designed to work with one or more specific kinds of server programs, and each server requires a specific kind of client.

Client/Server
A distributed technology approach where the processing is divided by function. The server performs shared functions: managing communications, providing database services, etc. The client performs individual user functions: providing customized interfaces, performing screen-to-screen navigation, offering help functions, etc.

Client/Server Architecture
A networked environment where a smaller system such as a PC interacts with a larger, faster system. This allows the processing to be performed on the larger system, which frees the user's PC. The larger system is able to connect and disconnect from the clients in order to process the data more efficiently.

Client/Server Processing
A form of cooperative processing in which the end-user interaction is through a programmable workstation (desktop) that must execute some part of the application logic over and above display formatting and terminal emulation.

Collection
A set of data that resulted from a DBMS query.

COM+
Provides an enterprise development environment, based on the Microsoft Component Object Model (COM), for creating component-based, distributed applications.

Component Object Model (COM)
The Component Object Model is an object-based programming specification, designed to provide object interoperability through sets of predefined routines called interfaces.

Common Object Request Broker Architecture (CORBA)
Common object request broker architecture is the Object Management Group (OMG) vendor-independent architecture and infrastructure, which computer applications use to work together over networks.

Communications Integrity
An operational quality that ensures transmitted data has been accurately received at its destination.

Consolidation
The process that takes data from different systems and entities, possibly in disparate formats, and combines and aggregates that information to create a unified view.

Consumer
An individual, group or application that accesses data/information in a data warehouse.

Consumer Profile
Identification of an individual, group or application and a profile of the data they request and use: the kinds of warehouse data, the physical relational tables needed, and the required location and frequency of the data (when, where, and in what form it is to be delivered).

Content Management
The processes and workflows involved in organizing, categorizing, and structuring information resources so that they can be stored, published, and reused in multiple ways. A content management system (CMS) is used to collect, manage and publish content, storing the content either as components or whole documents, while maintaining the links between components. It may also provide for content revision control.

Continuous Availability
A protocol, associated execution and ready state of functionality that virtually guarantees computing-system operational continuity in any downtime event. Continuous availability concerns itself with 1) the recovery of applications, data and data transactions committed up to the moment of system loss; and 2) seamless, 24x7 system availability that offsets any planned or unplanned downtime event.

Control Data
Data that guides a process. For example, indicators, flags, counters and parameters.

Cookies
Cookies are text files that are stored on the client's hard drive. When a browser requests a document, the Web server creates a fragment of data, which is sent to the browser and stored on the client's computer. Afterward, when the browser requests another document, the cookie is sent with the request. Cookies are very similar to the caller ID boxes that have become so popular, in that they provide telemarketers with such relevant information as the consumer's name, address, and previous purchase payment record.

Cooperative Processing
A style of computer application processing in which the presentation, business logic, and data management are split among two or more software services that operate on one or more computers. In cooperative processing, individual software programs (services) perform specific functions that are invoked by means of parameterized messages exchanged between them.

Copy Management
The process of copying all or part of the data in a source environment to a target environment.

Corporate Performance Management
An umbrella term used to describe the methodologies, metrics, processes and systems used to monitor and manage the business performance of an enterprise.

Cost Benefit Analysis
The analysis of the business benefit realized by the cost of expenditure on some resource, tool, or application development.

Critical Success Factors
Key areas of activity in which favorable results are necessary for a company to reach its goal.

CRM
Customer Relationship Management.

Crosstab
A process or function that combines and/or summarizes data from one or more sources into a concise format for analysis or reporting.

Cube
A data cube is a multidimensional structure that contains an aggregate value at each point, i.e., the result of applying an aggregate function to an underlying relation. Data cubes are used to implement online analytical processing (OLAP).

Currency Date
The date the data is considered effective. It is also known as the "as of" date or temporal currency.

Customer Relationship Management
The idea of establishing relationships with customers on an individual basis, then using that information to treat different customers differently. Customer buying profiles and churn analysis are examples of decision support activities that can affect the success of customer relationships.

Cyber Marketing
This term refers to any type of Internet-based promotion. This includes Web sites, targeted e-mail, Internet bulletin boards, sites where customers can dial in and download files, and sites that engage in Internet commerce by offering products for sale over the Internet. The term doesn't have a strict meaning, though, and many marketing managers use it to cover any computer-based marketing tools.

D

Dashboard
An application or custom user interface that organizes and presents information in a way that is easy to read. The information may be integrated from multiple components into a unified display. A dashboard helps monitor individual, business unit and organizational performance and processes for a greater understanding of the business.

Data
Items representing facts, text, graphics, bit-mapped images, sound, or analog or digital live-video segments. Data is the raw material of a system, supplied by data producers and used by information consumers to create information.

Data Access Tools
An end-user oriented tool that allows users to build SQL queries by pointing and clicking on a list of tables and fields in the data warehouse.

Data Acquisition
Identification, selection and mapping of source data to target data. Detection of source data changes, data extraction techniques, timing of data extracts, data transformation techniques, frequency of database loads and levels of data summary are among the difficult data acquisition challenges.

Data Analysis and Presentation Tools
Software that provides a logical view of data in a warehouse. Some create simple aliases for table and column names; others create data that identify the contents and location of data in the warehouse.

Data Appliance
A combination of hardware, software, DBMSs and storage, all under one umbrella. A black box that yields high performance in both speed and storage, making the BI environment simpler and more useful to the users.

Data Consumer
An individual, group, or application that receives data in the form of a collection. The data is used for query, analysis, and reporting.

Data Custodian
The individual assigned the responsibility of operating systems, data centers, data warehouses, operational databases, and business operations in conformance with the policies and practices prescribed by the data owner.

Data Dictionary
A database about data and database structures. A catalog of all data elements, containing their names, structures, and information about their usage. A central location for metadata. Normally, data dictionaries are designed to store a limited set of available metadata, concentrating on the information relating to the data elements, databases, files and programs of implemented systems.

Data Directory
A collection of definitions, rules and advisories of data, designed to be used as a guide or reference with the data warehouse.
The directory includes definitions, examples, relations, functions and equivalents in other environments.Data ElementThe most elementary unit of data that can be identified and described in a dictionary or repository which cannot be subdivided.Data Extraction SoftwareSoftware that reads one or more sources of data and creates a new image of the data.Data Flow DiagramA diagram that shows the normal flow of data between services as well as theflow of data between data stores and services.Data IntegrationPulling together and reconciling dispersed data for analytic purposes that organizations have maintained in multiple, heterogeneous systems. Data needs to be accessed and extracted, moved and loaded, validated and cleaned, and standardized and transformed.Data LoadingThe process of populating the data warehouse. Data loading is provided by DBMS-specific load processes, DBMS insert processes, and independent fastload processes.Data ManagementControlling, protecting, and facilitating access to data in order to provide information consumers with timely access to the data they need. The functions provided by a database management system.Data Management SoftwareSoftware that converts data into a unified format by taking derived data to create new fields, merging files, summarizing and filtering data; the process of reading data from operational systems. Data Management Software is also known as data extraction software.Data MappingThe process of assigning a source data element to a target data element.。
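As an illustration of the Data Mapping entry (assigning a source data element to a target data element), the following is a minimal sketch in Python. The field names (`CUST_NO`, `customer_id`, and so on) and the `FIELD_MAP` structure are hypothetical and only serve to show the idea.

```python
from typing import Any, Callable

# Hypothetical mapping rules: target field -> (source field, conversion function).
FIELD_MAP: dict[str, tuple[str, Callable[[Any], Any]]] = {
    "customer_id": ("CUST_NO", int),
    "customer_name": ("CUST_NM", str.strip),
    "signup_date": ("REG_DT", lambda s: f"{s[:4]}-{s[4:6]}-{s[6:]}"),
}


def map_record(source: dict[str, Any]) -> dict[str, Any]:
    """Build one target record by applying every mapping rule to a source record."""
    return {
        target: convert(source[source_field])
        for target, (source_field, convert) in FIELD_MAP.items()
    }


source_row = {"CUST_NO": "00042", "CUST_NM": "  Ada Lovelace ", "REG_DT": "20240131"}
target_row = map_record(source_row)
print(target_row)
```

Keeping the rules in one table-like structure makes the mapping itself a piece of metadata that can be reviewed separately from the code that applies it.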
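Data Warehouse Technical Vocabulary

access: the operation of seeking, reading, or writing data on a storage unit.
access method: a technique used to transfer a physical record to or from a mass storage device.
access pattern: the general sequence in which a data structure is accessed (for example, from tuple to tuple, from record to record, from segment to segment).
accuracy: a qualitative assessment of freedom from error, or a quantitative measure of the magnitude of error, expressed as a function of relative error.
ad hoc processing: one-time-only, casual access and manipulation of data using parameters never used before, usually done in a heuristic, iterative manner.
after image: the snapshot of data placed in a log when a transaction completes.
agent of change: a motivating force too large to be resisted, usually the aging of systems, changes in technology, radical changes in requirements, and so on.
algorithm: a set of statements organized to solve a problem in a finite number of steps.
analytical processing: using the computer to produce analysis for management decisions, usually involving trend analysis, drill-down analysis, statistical analysis, profiling, and so on.
application: a group of interrelated algorithms and data that supports the requirements of an organization or enterprise.
application database: a collection of data organized to support a specific application.
archival database: a collection of data of a historical nature.
As a rule, archival data is not updated.
Each unit of archival data relates to a point in time in the past.
artifact: a design technique used to represent referential integrity in the DSS environment.
atomic: (1) data stored in a data warehouse; (2) the lowest level of process analysis.
atomic database: a database made up of primarily atomic data; a data warehouse; a DSS foundation database.
atomic-level data: data with the lowest level of granularity.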
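To make the "after image" entry above concrete, here is a small sketch of a toy key-value store that writes a snapshot of its data to a log each time a transaction completes. The class name `LoggedStore` and its methods are hypothetical and not tied to any real DBMS.

```python
import copy


class LoggedStore:
    """Toy key-value store that logs an after image for every transaction."""

    def __init__(self) -> None:
        self.data: dict[str, int] = {}
        self.log: list[dict[str, int]] = []

    def run_transaction(self, updates: dict[str, int]) -> None:
        # Apply all updates of the transaction, then snapshot the result.
        self.data.update(updates)
        # deepcopy so later transactions cannot alter an already logged image.
        self.log.append(copy.deepcopy(self.data))


store = LoggedStore()
store.run_transaction({"balance": 100})
store.run_transaction({"balance": 70, "fee": 5})
print(store.log)
```

Each log entry reflects the state after one transaction, which is what allows a recovery process to reapply committed work.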
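Data Warehouse Terminology

1. ESB stands for Enterprise Service Bus.
It is the product of combining traditional middleware technology with XML, Web services, and related technologies.
The ESB provides the most basic connection hub in a network and is an essential element in building an enterprise's nervous system.
The arrival of the ESB changed traditional software architecture. It can offer a cheaper solution than traditional middleware products, and it removes the technical differences between applications so that different application servers work in concert, enabling communication and integration between services.
Functionally, the ESB provides event-driven and document-oriented processing modes and a distributed runtime management mechanism. It supports content-based routing and filtering, can transport complex data, and offers a set of standard interfaces.
2. ETL: short for Extract-Transform-Load, the process of extracting, transforming, and loading data.
3. DW: Data Warehousing. According to Bill Inmon's definition, "a data warehouse is a subject-oriented, integrated, nonvolatile, time-variant database system used mainly for decision support."
4. Metadata: data that describes data.
In data warehouse construction, it refers to key data such as data source definitions, target definitions, and transformation rules.
5. ETL is the core process of building a data warehouse. A data warehouse system has an inherent weakness: it grows out of business systems, and the data it stores comes from transaction-processing business systems and external data sources.
The source data within an enterprise lacks a uniform standard, because its business systems were built in different periods, against different backgrounds, for different applications, and by different vendors; their data structures, storage platforms, and system platforms are therefore highly heterogeneous.
As a result, the data is hard to turn into useful information, and the inconsistency of the raw data lowers the credibility of decisions based on it.
6. ETL is the core and soul of BI/DW. It integrates data under uniform rules and raises its value; it is the process that moves data from the data sources into the target data warehouse, and an important step in implementing a data warehouse.
If the warehouse's model design is the blueprint of a building and the data is the bricks, then ETL is the construction of the building.
The hardest parts of a project are user requirements analysis and model design, while ETL rule design and implementation carry the largest workload, about 60% to 80% of the whole project; this is a common view drawn from many projects in China and abroad.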
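The extract, transform, and load steps described above can be sketched with the Python standard library. This is only an illustrative sketch: the CSV layout, the `fact_orders` table, and the cleaning rules (skip rows with a missing amount, upper-case the currency code) are assumptions made for the example.

```python
import csv
import io
import sqlite3

# Hypothetical raw export from a source system, with inconsistent formatting.
RAW_CSV = """order_id,amount,currency
1, 10.50 ,usd
2,,usd
3,7.25,USD
"""


def extract(text: str) -> list[dict[str, str]]:
    """Extract: read the raw rows from the source."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows: list[dict[str, str]]) -> list[tuple[int, float, str]]:
    """Transform: clean values and apply one uniform format."""
    cleaned = []
    for row in rows:
        amount = row["amount"].strip()
        if not amount:
            continue  # assumption: rows without an amount are rejected
        cleaned.append((int(row["order_id"]), float(amount), row["currency"].strip().upper()))
    return cleaned


def load(rows: list[tuple[int, float, str]], conn: sqlite3.Connection) -> None:
    """Load: write the cleaned rows into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS fact_orders "
        "(order_id INTEGER PRIMARY KEY, amount REAL, currency TEXT)"
    )
    conn.executemany("INSERT INTO fact_orders VALUES (?, ?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(transform(extract(RAW_CSV)), conn)
loaded = conn.execute(
    "SELECT order_id, amount, currency FROM fact_orders ORDER BY order_id"
).fetchall()
print(loaded)
```

Most of the code sits in `transform`, which is consistent with the point above that rule design and implementation make up the bulk of ETL work.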
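Data Warehouse Terms Explained

A data warehouse is a subject-oriented, integrated, nonvolatile collection of data aimed directly at end users, used to support enterprise decision making, analysis, and decision support systems.
A data warehouse is an independent data storage and management system. Its goal is to integrate, clean, process, and model the data of the various departments of an enterprise, providing a consistent, trustworthy, and easy to access and understand body of data that helps users analyze data and make business decisions.
The following are some important concepts and terms related to data warehouses:
1. Data integration: consolidating data from different data sources, both internal and external, into the data warehouse.
2. Data cleaning: a series of operations that remove erroneous, duplicated, missing, and inconsistent parts of the data in order to improve data quality.
3. Data processing: transforming, aggregating, calculating, and extracting data to meet users' specific needs and analytical goals.
4. Subject: a category or domain of data organized according to the enterprise's business needs, such as sales, human resources, or supply chain.
5. Metadata: data that describes data, including its origin, structure, definition, and relationships.
Metadata is very important for managing and using a data warehouse.
6. Dimension: an attribute that describes a subject in the data warehouse, such as time, geographic location, product, or customer; dimensions are used for analysis and querying.
7. Measure: data in the warehouse that can be quantified and compared, such as sales amount, profit, or number of customers.
8. Star schema: a common data warehouse modeling technique in which a central table (the fact table) is joined to the multiple dimension tables that surround it.
9. Granularity: the level of detail of the facts recorded in the data warehouse, such as daily, monthly, or yearly sales.
10. OLAP (Online Analytical Processing): a technology for fast querying and analysis of multidimensional data, presenting results through pivot tables, charts, and reports.
11. ETL (Extract, Transform, Load): the core process of a data warehouse, used to extract data from source systems and load it into the warehouse after transformation and processing.
12. Decision support system: an information system that uses the data in the warehouse together with analysis tools to help management make decisions.
A data warehouse plays an important role in an enterprise: it provides consistent, accurate data that helps decision makers analyze data and make decisions.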
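The star schema, granularity, and measure concepts listed above can be illustrated with a small in-memory SQLite database. The table and column names (`fact_sales`, `dim_date`, `dim_product`) and the sample rows are hypothetical; the sketch only shows one fact table joined to two dimension tables, with daily-grain facts rolled up to a monthly grain.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- Dimension tables describe the subject (time, product).
    CREATE TABLE dim_date (date_key INTEGER PRIMARY KEY, day TEXT, month TEXT);
    CREATE TABLE dim_product (product_key INTEGER PRIMARY KEY, name TEXT);
    -- The fact table stores the measure at daily granularity.
    CREATE TABLE fact_sales (
        date_key INTEGER REFERENCES dim_date(date_key),
        product_key INTEGER REFERENCES dim_product(product_key),
        sales_amount REAL
    );
    INSERT INTO dim_date VALUES
        (1, '2024-01-30', '2024-01'),
        (2, '2024-01-31', '2024-01'),
        (3, '2024-02-01', '2024-02');
    INSERT INTO dim_product VALUES (1, 'widget'), (2, 'gadget');
    INSERT INTO fact_sales VALUES (1, 1, 100.0), (2, 1, 50.0), (2, 2, 30.0), (3, 1, 20.0);
    """
)

# Roll the daily facts up to a coarser (monthly) granularity per product.
monthly = conn.execute(
    """
    SELECT d.month, p.name, SUM(f.sales_amount)
    FROM fact_sales AS f
    JOIN dim_date AS d ON d.date_key = f.date_key
    JOIN dim_product AS p ON p.product_key = f.product_key
    GROUP BY d.month, p.name
    ORDER BY d.month, p.name
    """
).fetchall()
print(monthly)
```

Storing facts at the finest grain and aggregating at query time keeps the detail available; a warehouse may also precompute coarser grains for speed.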
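Data Warehouse Word Roots Summary

The following are some common data warehouse word roots and terms:
1. Data: facts, observations, or information; the basic building block of a data warehouse.
2. Warehouse: the place where data is stored; here it means the data warehouse itself.
3. Model: a representation of the relationships and patterns among data; it can be a logical model, a physical model, and so on.
4. Dimension: a descriptive attribute of the data, such as time, location, or product.
5. Measure: a numeric value that quantifies a business indicator, such as sales amount or profit.
6. Aggregation: summarizing fine-grained data along some dimension to support analysis and reporting.
7. Extraction: the process of pulling data from source systems into the data warehouse.
8. Transformation: cleaning, organizing, and processing the extracted data so that it meets the warehouse's requirements.
9. Loading: writing the transformed data into the data warehouse.
10. Star Schema: a common data model that describes data relationships through one central fact table and multiple dimension tables.
11. Snowflake Schema: a model that builds on the star schema by further normalizing the dimension tables to reduce redundancy and storage space.
12. ETL (Extract, Transform, Load): the whole process of extracting data from source systems, transforming it, and loading it into the data warehouse.
13. OLAP (Online Analytical Processing): multidimensional analysis and querying of the data in the warehouse.
14. Data Mining: using algorithms and techniques to discover hidden patterns and regularities in large volumes of data.
15. Data Quality: characteristics of data such as accuracy, completeness, and consistency.
These roots and terms are very common in the data warehouse field; knowing them helps in understanding and applying data warehouse knowledge and techniques.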
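As one possible reading of the Data Quality item above, the sketch below computes two simple indicators for a batch of records: a completeness ratio (rows whose required fields are all filled) and the number of duplicate keys as a basic consistency check. The function name `quality_report` and the chosen indicators are assumptions for illustration, not a standard definition.

```python
from typing import Any


def quality_report(rows: list[dict[str, Any]], required: list[str], key: str) -> dict[str, Any]:
    """Return simple completeness and duplicate-key indicators for a batch of rows."""
    total = len(rows)
    # A row counts as complete when every required field is present and non-empty.
    complete = sum(
        all(row.get(field) not in (None, "") for field in required) for row in rows
    )
    keys = [row.get(key) for row in rows]
    return {
        "rows": total,
        "completeness": complete / total if total else 1.0,
        "duplicate_keys": len(keys) - len(set(keys)),
    }


sample = [
    {"id": 1, "name": "a"},
    {"id": 2, "name": ""},
    {"id": 2, "name": "b"},
    {"id": 3, "name": "c"},
]
report = quality_report(sample, required=["id", "name"], key="id")
print(report)
```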
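Word | Abbreviation | Meaning | Mark
Abbreviation | Abbr | short name
Abnormal | Abnm | abnormal
Account | Acct | account
Accountant | Accnt | accountant
Accrued | Acr | accrued interest
Accumulate | Accu | cumulative
Accumulation | Accum | accumulation (reserve) fund | X
Action | Actn | action
Activity | Actvt | activity
Actual | Actl | actual
Address | Addr | address
Adjusted | Adj | adjusted
ADRatio | ADRatio | price change percentage
Aggregate | Aggr | aggregate, summary
Allocation | Alloc | allocation
Allotment | Allot | allotment (successful subscription draw)
Amount | Amt | monetary amount
AmountPerTransaction | APT | amount per transaction
Analysis | Analys | analysis
And | n | and
Announce | Anno | announce
Application | App | application (request)
Apply | Aply | subscribe (purchase application)
Approval | Aprv | approval
Arbitrage | Arb | arbitrage
Area | Area | area
AShare | A | A-share | X
AssetManagement | AssetMan | asset management
Attribute | Attr | attribute
Auction | Auc | auction
Authorized | Auth | authorized
Average | Avg | average | X
Balance | Bal | balance
Bandwidth | BW | bandwidth | X
Bid | Bid | bid (order submission)
BigDeal | BgDl | block trade
Birthday | Bir | birthday
Bond | Bond | bond
BondRepurchase | BondR | bond repurchase
Bonds | Bond | bond
Branch | Branch | branch office
BShare | B | B-share | X
Business | Biz | business
Buy | Buy | buy
Buyout | BO | buyout | X
Calculation | Calc | calculation
Calendar | Cldr | calendar
Capital | Cap | capital | X
CapitalValue | CapVal | total market capitalization | X
Cash | Cash | cash
Catalog | Ctlg | classification | X
Category | Ctgr | category | X
Certified | Cert | certified
Change | Chg | change | X
Check | Chk | check | X
Checking | Chk | check
Child | Cld | child | X
Clearing | Clg | clearing
Close | Cls | close
ClosePrice | ClosePrice | closing price
Code | Code | code
Company | Com | company
Constituent | Cnst | constituent | X
Contactor | Contactor | contact person | X
Continue | C | continuous
Continuous | Cont | continuous | X
Conversion | Convs | conversion
Convert | Cvt | convert
ConvertibleBond | CvtBd | convertible bond
Convertible | Cvt | convertible
Corresponding | Corspd | corresponding, related
CorporateBond | CBond | corporate bond
Count | Count | count
Counterparty | Ctpy | counterparty
Country | Cntry | country | X
Coupon | Cpn | coupon (face)
Create | Crt | create | X
CSRC | CSRC | China Securities Regulatory Commission
Cumulative | Cmltv | cumulative | X
Currency | Curr | currency
Current | Currt | current
Cycle | Cyc | cycle | X
Daily | Dly | daily
Date | Date | date
Dealer | Dler | primary dealer | X
Degree | Degr | education level | X
Delete | Del | delete
Deleted | Del | delete
Deletion | Del | delete
Delist | Delist | delist
Denomination | Dnom | nominal
Department | Dept | department
Derivative | Derv | derivative | X
Description | Desc | description
Designated | Dsgnt | designated
Destination | Dest | target/object | X
Detail | Dtl | detail
Developer | Devp | developer
Difference | Diff | difference
Dimension | DMN | dimension
Direction | Dir | direction
Discount | Dscnt | discount
Distribution | Distrb | distribution
District | District | prefecture or city
Dividend | Div | dividend
Document | Doc | document
Domain | Domn | jurisdiction | X
Duration | Dur | duration (bond)
Dynamic | Dynm | dynamic
Effective | Eff | effective
Efficiency | Effcy | efficiency
Employee | Emp | employee
English | Eng | English
Entry | Ent | entry
Equities | Equ | equity
ETF | ETF | ETF
Exchange | Exch | exchange
Executed | Exe | executed
Expense | Exps | expense
Experience | Expr | experience
Expiration | Expr | expiry
Expire | Expire | expiry
Expiry | Expr | expiry
External | Ext | external
FIFO | FIFO | first in, first out
Final | Final | end of period
Finance | Fin | finance
Financial | Fin | finance
First | Fst | first
FirstLevelMarket | FLM | primary market
Flag | Flg | flag
Foreign | Fori | foreign
Forward | Fwd | forward price adjustment | X
Frequency | Freq | frequency
Frozen | Frzn | frozen | X
Fund | Fund | fund
FundCompany | FundCompany | fund company
Funds | Fund | fund
Gender | Sex | gender
Group | Group | group
Guarantor | Gutr | guarantor
HighPrice | High | highest price
Hold | Hld | hold
Holding | Hld | hold
Identifier | ID | unique identifier
Income | Incm | income
Increase | Incr | increase
Index | Indx | index
Indicator | Ind | indicator, flag
Industry | Indstr | industry
Information | Info | information
Initial | Init | initial
Institute | Inst | institution
Instrument | Instr | instrument
Interest | Intrst | interest
Internal | Inter | internal
Interval | Intv | interval
Investor | Invst | investor
Issue | Iss | issuance
Last | Lst | last
LastClose | LstCls | previous close
Legal | Leg | legal
Legal Person | LP | legal person | X
LegalPersonRep | LPR | legal representative | X
Letter | Lett | letter
License | Lic | license
LIFO | LIFO | last in, first out
LimitDown | LD | limit-down
LimitUp | LU | limit-up
Listing | Lst | listing
Location | Loc | location
LowPrice | Low | lowest price
Maintenance | Mtn | maintenance
Major | Major | major, large
Management | Mgmt | management | X
Manager | Mngr | manager
Margin | Magn | margin (deposit)
Market | Mkt | securities market | X
MarketValue | MktVal | market value
Master | Master | main
Match | Mtch | matching
Mature | Mature | maturity
Maturity | Matur | maturity
Maximum | Max | maximum
Member | Mem | member
Membership | Mem | membership
Method | Meth | method
Middle | Mid | middle
Minimum | Min | minimum
Minor | Minor | minor, small
Minute | Minu | minute
Mnemonic | Nmnc | abbreviation
Mobile | Mob | mobile phone
Multiplier | Multp | multiplier
Name | Name | name
Negotiable | Nego | tradable (in circulation)
NetAssetValue | NAV | net asset value
Network | Net | communication network | X
Nontrade | Nontrd | non-trade
Number | Num | count
Online | Olin | online
Open | Open | account opening
OpenPrice | Open | opening price
Operation | Oper | operation; main business
Order | Ord | order submission
Organization | Org | organization
Paid-in | Paid | paid-in
PartialFilled | PrtFil | partially filled
Participant | Prtp | participant
Password | Pswd | password
Payment | Pay | payment
Percentage | Percn | percentage share
Period | Peri | period
Person | Prsn | person
Physical Assets | PA | physical assets | X
Placement | Plcmt | targeted placement | X
PN | PN | price-to-book ratio | X
Portfolio | Portfo | investment portfolio
Position | Pos | job position
Prefix | Pfx | prefix
Premium | Prm | premium
Pressure | Psure | pressure
Previous | Prev | previous
Price | Prc | price
Principal | Prcpl | principal, main
Process | Proc | process
Product | Prodt | product
Profit | Profit | profit
ProfitAndLoss | PnL | profit and loss
Proprietary | Prop | proprietary trading
Province | Prov | province or municipality
Public | Pub | make public
Publish | Pub | publish
Qualification | Quaf | qualified
Quantity | Qty | quantity
Quarter | Quarter | quarter
Quotation | Quot | market quotes
Raised | Rsd | raised (funds)
Ratio | Ratio | ratio
Recommendatory | Recom | recommender
Recover | Rcov | price adjustment | X
Redemption | Redp | redemption
Region | Rgn | region
Register | Reg | register
Registration | Regst | registration
Re-Issuance | ReIss | additional offering
Remaining | Rem | remaining
Replace | Rplc | replace
Report | Rept | report
Representative | Rep | representative
Repurchase | Repur | repurchase
Restriction | Restr | restriction
Result | Result | result
ReturnOnAssets | ROA | return on assets
ReturnOnEquity | ROE | return on equity
Right | Rgt | right
Scoring | Score | score
Seat | Seat | trading seat
Second | Sec | second
SecondLevelMarket | SLM | secondary market
Sector | Sector | sector | X
Securities | Sec | financial instruments
Security | Sec | security
Segment | Sgmt | segment
Sell | Sell | sell
Senior | Seni | senior
Sequence | Seq | sequence number | X
Service | Svs | service
Settlement | Stl | settlement
Share | Share | shares
ShareHold Concentration | ShHld Conce | shareholding concentration | X
ShareHolder | ShareHlder | shareholder
Short | Sht | brief
Special | Specl | special
Specification | Spec | specification
Sponsor | Spon | sponsor
Start | Strt | start
State Owned | SO | state-owned | X
Statistic | Stat | statistics
StatisticSeat | StSeat | statistical seat
Status | Stat | status
Stock | Stock | stock
Structure | Strut | structure
Submitter | Sbmt | submit
Subscribe | Subscr | sign up (contract)
Subscription | Subscr | subscription (purchase application)
Success | Suc | success
Suffix | Sfx | suffix
Summary | Sum | summary | X
SurrogateKey | SurKey | surrogate key
Surveillance | Srvl | surveillance
Suspend | Suspd | suspend
Suspension | Suspen | trading suspension
Swing | Swing | price amplitude | X
Tbond | Tbond | treasury bond
TechnicalPost | TechPost | professional title
Telephone | Tel | telephone
TimeConsume | TC | elapsed time
TopN | TopN | top-N ranking | X
Total | Tot | total
Tradable | Trd | tradable
Trade | Trd | trade
Trading | Trd | trade
Traffic | Trfc | network traffic | X
Transaction | Tran | transaction
Transactions | Trans | number of transactions
Transfer | Trfr | transfer | X
Trustee | Trst | custody | X
Turnover | Turnover | circulation volume / turnover
Type | Type | type
Underlying | Udly | underlying
Underwriter | Underwriter | underwriter | X
Uneffective | Uneff | invalid
Unit | Unit | unit
Unmature | UMature | not yet matured
Update | Upd | update
Volume | Vol | quantity
VolumePerTransaction | VPT | quantity per transaction
Warrant | WT | warrant | X
Warrants | WT | warrant
Week | Week | week
Weight | Wgt | weight
Weighted | Wgt | weighted
Withdraw | Withdraw | cancel
Year | Year | year
Yearly | Yly | yearly
Zipcode | Zip | postal code | X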
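A word-root list like the one above is typically used to build consistent column or field names. The sketch below shows one way this might work, using a handful of roots copied from the list; the function `build_column_name` and the underscore separator are assumptions for illustration, since the source does not state a joining convention.

```python
# A few word roots taken from the list above (word -> abbreviation).
ROOTS = {
    "Account": "Acct",
    "Amount": "Amt",
    "Average": "Avg",
    "Balance": "Bal",
    "Price": "Prc",
    "Total": "Tot",
    "Trade": "Trd",
}


def build_column_name(words: list[str], sep: str = "_") -> str:
    """Join the registered abbreviations of the given words into one name."""
    missing = [word for word in words if word not in ROOTS]
    if missing:
        # Refuse to invent abbreviations for words that are not registered.
        raise KeyError(f"no registered abbreviation for: {', '.join(missing)}")
    return sep.join(ROOTS[word] for word in words)


print(build_column_name(["Account", "Balance"]))
print(build_column_name(["Total", "Trade", "Amount"]))
```

Rejecting unregistered words, rather than abbreviating them ad hoc, is what keeps names consistent across tables.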