A Quantitative Method for Quality Evaluation of Web Sites and Applications

合集下载

做方法确认不可不知的31个概念（中英文对照）

做方法确认不可不知的31个概念（中英文对照）1.方法确认Method Validation实验室通过试验，提供客观有效证据证明特定检测方法满足预期的用途。

注：方法确认应当建立方法的性能特性和使用的限制条件，并识别影响方法性能的因素及影响程度，确定方法所适用的基体，以及方法的正确度和精密度。

(ISO/IEC指南99：2007，2.45)2.方法验证Method Verification实验室通过核查，提供客观有效证据证明满足方法规定的要求。

(ISO/IEC指南99：2007，2.44)3.实验室内方法确认In-house Method Validation在一个实验室内，在合理的时间间隔内，用一种方法在预定条件下对相同或不同测试样品进行的分析实验，以证明特定检测方法满足预期的用途。

4.实验室间方法确认Interlaboratory MethodValidation在两个或多个实验室之间实施的方法确认。

实验室依照预定条件用相同方法对相同样品的测定，以证明特定检测方法满足预期的用途。

5.定性方法Qualitative Method根据物质的化学、生物或物理性质对其进行鉴定的分析方法。

6.定量方法Quantitative Method测定被分析物的质量或质量分数的分析方法，可用适当单位的数值表示。

7.确证方法Confirmatory Method能提供全部或部分信息，并明确定性，在必要时可在关注的浓度水平上进行定量的方法。

8.筛选方法Screening Method具有处理大量样品的能力，用于检测一种物质或一组物质在所关注的浓度水平上是否存在的方法。

这些方法用于筛选大量样品可能的阳性结果，并用来避免假阴性结果。

此类方法所获得的检测结果通常为定性结果或半定量结果。

9.容许限Permitted Limit，PL对某一定量特性规定和要求的物质限值，如：最大残留限、最高允许浓度或其它最大容许量等。

10.关注浓度水平Level of Interest对判断样品中物质或分析物是否符合法规规定和要求的有决定性意义的浓度（如：容许限浓度）11.选择性 Selectivity测量系统按规定的测量程序使用并提供一个或多个被测量的测得的量值时，每个被测量的值与其他被测量或所研究的现象、物体或物质中的其他量无关的特性。

qualitative analysis method

Step 2: Data Collection
Once the research design is established, the researcher proceeds to collect data. Qualitative data collection methods often involve face-to-face interviews, focus groups, participant observation, or document analysis. These methods enable researchers to gather rich,detailed, and context-specific information. It is essential to establish rapport with participants, maintain ethical standards, and incorporate reflexivity during data collection to ensure the validity and reliability of the data.
qualitative analysis method
"Qualitativploration"
Introduction:
Qualitative analysis is a research method that aims to understand human behavior, experiences, and perspectives through non-numerical data. It involvesexamining and interpreting qualitative data, such as interviews, observations, or textual materials, to gain insights and generate theories. In this article, we will explore the steps involved in qualitative analysis and how it contributes to social sciences research.

定量的英文单词

定量的英文单词单词：quantitative1.1词性：形容词1.2中文释义：数量的；定量的；与数量有关的。

1.3英文释义：Relating to, measuring, or measured by the quantity of something rather than its quality.1.4相关词汇：quantity（名词，数量）、quantify（动词，量化）、quantitatively（副词，数量上；定量地）。

2. 起源与背景2.1词源：源于拉丁语“quantitas”，表示“数量”。

2.2趣闻：在科学研究领域，定量分析是非常重要的手段。

例如在化学实验中，定量研究物质的组成和反应比例，这有助于准确地理解化学过程并开发新的化学产品。

3. 常用搭配与短语3.1短语：- quantitative analysis：定量分析例句：Quantitative analysis is crucial in economic research.翻译：定量分析在经济研究中至关重要。

- quantitative data：定量数据例句：The researchers collected a large amount of quantitative data for their study.翻译：研究人员为他们的研究收集了大量的定量数据。

- quantitative method：定量方法例句：We need to use a quantitative method to solve this problem.翻译：我们需要使用一种定量方法来解决这个问题。

4. 实用片段(1). "I'm doing a project on the economic development of this city. I need to focus on the quantitative factors such as GDP growth rate and employment numbers."翻译：“我正在做一个关于这个城市经济发展的项目。

定量研究方法(Quantitative Research Method)

什么是定量研究？定量研究一般是为了对特定研究对象的总体得出统计结果而进行的。

定性研究具有探索性、诊断性和预测性等特点，它并不追求精确的结论，而只是了解问题之所在，摸清情况，得出感性认识。

定性研究的主要方法包括：与几个人面谈的小组访问，要求详细回答的深度访问，以及各种投影技术等。

在定量研究中，信息都是用某种数字来表示的。

在对这些数字进行处理、分析时，首先要明确这些信息资料是依据何种尺度进行测定、加工的，史蒂文斯（S. S. Stevens）将尺度分为四种类型，即名义尺度、顺序尺度、间距尺度和比例尺度。

[编辑]定量研究的四种测定尺度及特征名义尺度所使用的数值，用于表现它是否属于同一个人或物。

顺序尺度所使用的数值的大小，是与研究对象的特定顺序相对应的。

例如，给社会阶层中的上上层、中上层、中层、中下层、下下层等分别标为“5、4、3、2、1”或者“3、2.5、2、1.5、1”就属于这一类。

只是其中表示上上层的5与表示中上层的4的差距，和表示中上层的4与表示中层的3的差距，并不一定是相等的。

5、4、3 等是任意加上去的符号，如果记为 100、50、10 也无妨。

间距尺度所使用的数值，不仅表示测定对象所具有的量的多少，还表示它们大小的程度即间隔的大小。

不过，这种尺度中的原点可以是任意设定的，但并不意味着该事物的量为“无”。

例如，O°C 为绝对温度273°K，华氏32°F。

名义尺度和顺序尺度的数值不能进行加减乘除，但间距尺度的数值是可以进行加减运算的。

然而，由于原点是任意设定的，所以不能进行乘除运算。

例如，5℃和 10℃之间的差，可以说与15℃和20℃之间的差是相同的，都是5°C。

但不能说 20℃就是比5℃高4倍的温度。

比例尺度的意义是绝对的，即它有着含义为“无”量的原点0。

长度、重量、时间等都是比例尺度测定的范围。

比例尺度测定值的差和比都是可以比较的。

例如：5分钟与10 分钟之间的差和10分钟与15分钟之间的差都是5 分钟，10 分钟是2分钟的5倍。

Quantitative Methods for Interlaboratory Testing –

•Bartlett’s method •Levene’s method •Graphical methods for demonstrating and comparing variances
7/30-8/10/2001
Uncertainty Workshop by Carl Lee
8
Module Ten: Planning Experiments – the general consideration and Comparative Study for more than
Quantitative Methods for Measuring and Assessing Uncertainties in Testing Process
and Outcomes
(Workshop from July 30 to August 10, 2001 at National Bureau of Standards & Metrology and
• The concept and Procedure for performing the one sample t-test
•Comparative Study for Inter-laboratory Testing : two-group cases
•Designing experiments for two-sample comparative study
•Data transformation techniques
•Post-Hoc Analysis and Sum of square decomposition
•Contrasts, simultaneous comparison, pair-wise comparison, comparison with control

07Quantitative Methods For Decision Makers

An interval estimate provides more information about a population characteristic than does a point estimate
Such interval estimates are called confidence intervals
a confidence interval provides
additional information about
variability
Lower Confidence Limit
Point Estimate
Upper Confidence Limit
Width of confidence interval
population standard deviation is 0.35 ohms.
Solution:
XZ σ n
2.20 1.96 (0.35/ 11)
2.20 0.2068
1.9932 2.4068
Interpretation
We are 95% confident that the true mean resistance is between 1.9932 and 2.4068 ohms
80% 90% 95% 98% 99% 99.8% 99.9%
Confidence Coefficient,
1
0.80 0.90 0.95 0.98 0.99 0.998 0.999
Z value
1.28 1.645 1.96 2.33 2.58 3.08 3.27
Confidence intervals
Point Estimate ± (Critical Value)(Standard Error)

资料二级易考点数量定量分析 quantitative methods

定量分析：Quantitative Methods （1Case ）题目内容形式：题目信息：Case 一般会先以一段文字介绍什么为dependent factor ，什么为independent factor ；然后会给一些表格，可能包括回归表，ANOVA table （可能不是表格，直接给数值），Durbin-Waston /Dicky-Fuller testing table ；紧随表格后可能会有一些关于表格数据得出的结论。

考察形式：题目主要是根据表格信息得出一些结论或者判断case 中的结论是否正确。

通常考察一个Case ，考题最多集中在Correlation Analysis 和Regression Analysis ；time-series 可能会涉及到1-2题。

易考点一：Correlation Analysis1.Sample Correlation Coefficient 的公式：考法：知道r,Sx 和Sy,求Cov(X ，Y)或者知道Sx,Sy 和Cov(X ，Y),求r （☆☆）r XY =0，no linear correlationr XY =1，完全正相关r XY =-1，完全负相关-1<r XY <0;0<r XY <1,Hypothesis testing of correlation2.Hypothesis testing of correlation:考法：知道n 和r ，求相关系数ρ的t 统计量，已经问是否线性相关（☆☆☆）H 0：ρ=0（两个变量相关系数为0，不相关）H 1：ρ≠0t-test ：212r n r t --=df =n -2（two-taild ）Decision rule ：If t >|t critical |，reject H 0YX XY S S Y X Cov r ),(=易考点二：Linear regression model（以multiple 为例，simple 的区别就是少了X 2这一项，☆☆☆）CoefficientStandard Errort-statisticP-value Intercept b 00b S t 0p 0X 1b 11b S t 1p 1X 2b 22b S t 2p 21.Hypothesis testing of regression （当检测出某项系数为0，就说明对应的自变量对于表示Y is not statistically significant ，以b 1举例）：H 0：Coefficient =0（b 1=0）H 1：Coefficient≠0Decision rule ：If p <significance level （α），reject H 0（如果5%的significance ，p 1小于0.05，则reject H 0；如果不给significance level ，p-value 很小也reject ）If t statistic >|t critical |，reject H 02.Predict Y ：22110X b X b b Y++=3.Confidence interval for a coefficient （for b 1）)(11b critical s t b⨯±易考点三：ANOVA analysis（simple regression 的情况就是k=1的情况，k 为自变量的个数，☆☆☆）ANOVA Degree of FreedomSum of SquareMean Square Regression k RSS MSR=RSS/k Residual n-k-1SSE MSE=SSE/(n-k-1)Totaln-1SST1.Coefficient of determination (R 2):R 2=RSS/SSTR 2判断的是有多少variation 被自变量所解释，if R 2=0.7735，意思为通过回归方程得出的dependent variable 有77.35%能被independent variable 所解释Adjusted R 2=)]1([(-1211R k n n -⨯---,Adjusted R 2≤R 2加入新的变量，R 2会增加；但如果增加的变量没有统计学意义的时候，Adjusted R 2不会增加，反而会减小。

04_QuantitativeMethod质量和过程

数据驱动的决策制定定分析定量分析方法的应用Dr. Kaibo WangDepartment o Industrial Engineering epa t e t of dust a g ee g Tsinghua University, Beijing 王凯波清华大学工业工程系kbwang@ /kbwang/很多人自认为是使用“数据”说话但实际上，很多决策靠“直觉”制定但实上很多决策靠直觉制定 Data Driven F D t Di Force科学的统计推理基本的统计分析基本的图表工具定量的脑力激荡工具定性的脑力激荡工具直觉 (5% companies) (15% companies) (30% companies) (30% companies) (15% companies) (5% companies)Highest level gLowest Level2面包师和科学家的故事科学家所看到的1st time2nd timeDistribution of Pie Diameters12Distribution of Pie Diameters14 1217.4, 18.5, 18.9, 19.6, 20.8, 19.0, 17.8, 18.4, 19.0, 20.5, 18.4, 18.5, 19.4, 19.1, 19 4 19 1 …20.5, 20.5, 20.7, 20.7, 20.7, 20.6, 21.0, 20.9, 21.7, 21.4, 21.4, 20.6, 20.9, 21.0… 20 9 21 010Frequency8 6 4 2 0Frequency16 17 18 19 20 2110 8 6 4 2 0 20.2 20.4 20.6 20.8 21.0 21.2 21.4 21.6BeforeAfter你有什么发现？3 4 4容差设计的基本模型过程学习 y=f(x) X 前序多阶段质量输出本阶段的设备参数过程的一般模型如何制定公差从对质量特性的要求出发，“反向”逆推对过程参数的要求第一步：确定质量特性的要求范围第步确定质量特性的要求范围第二步：确定“过程参数”和“质量特性”之间的函数关系（通过实验设计或者数据分析获得）第三步：确定过程参数的公差范围如果有多个过程参数，则函数关系更复杂56示例：确定染料浓度的公差QFD与标准制定QFD将制造过程标准控制与客户和市场建立连接，为过程控制提供了终极指导成品颜色深浅染料二染料染料一企业标准制定直接影响最常产品质量与客户满意度标准制定的核心是定量化分析过程输入与输出变量的函数关系客户要求范围-> 客户要求范围化学染料浓度78多阶段制造过程的质量特性传播多阶段制造过程• 设置参数 1工序2• 参数1 • 参数2质量参数数值• 参数1 • 参数2 • 参数3工序4• ….工序1工序3测量点多阶段的思想：一。

英语作文量化评价方式

英语作文量化评价方式Quantitative Evaluation Method for English CompositionI. IntroductionThe purpose of this document is to introduce a quantitative evaluation method for English compositions. This method aims to provide a standardized and objective assessment of students' writing skills based on predefined criteria. By utilizing this method, educators can effectively analyze and grade English compositions in a consistent and fair manner.II. Evaluation CriteriaThe following criteria will be considered when evaluating English compositions:1. Content: Assess the relevance, development, and coherency of the ideas presented in the composition.- Relevance: The content should directly address the given topic or prompt.- Development: The composition should provide sufficient supporting details and examples.- Coherency: The ideas should be logically organized and connected.2. Organization: Evaluate the structure and paragraphing of the composition.- Structure: The composition should have an introduction, body paragraphs, and a conclusion.- Paragraphing: Each paragraph should focus on a single idea and include appropriate topic sentences and supporting details.3. Vocabulary: Assess the variety and accuracy of the words and phrases used in the composition.- Variety: The composition should incorporate a diverse range of vocabulary.- Accuracy: The words and phrases should be used correctly and appropriately.4. Grammar and Syntax: Evaluate the correctness and effectiveness of grammatical structures and sentence formation.- Correctness: The composition should adhere to basic grammatical rules and sentence construction.- Effectiveness: The use of various sentence structures should enhance the overall readability and clarity of the composition.5. Mechanics: Assess the punctuation, capitalization, and spelling in the composition.- Punctuation: The composition should correctly use commas, periods, question marks, and other punctuation marks.- Capitalization: Proper capitalization rules should be followed for names, titles, and the beginning of sentences.- Spelling: Words should be spelled correctly, and American English spelling conventions should be used.III. Scoring RubricTo quantitatively evaluate English compositions, a scoring rubric will be used. The rubric will assign points to each of the evaluation criteria mentioned above. The total possible points will be allocated as follows:1. Content: 30 points2. Organization: 20 points3. Vocabulary: 20 points4. Grammar and Syntax: 20 points5. Mechanics: 10 pointsTotal: 100 pointsIV. Grading ScaleThe grading scale will be as follows:- 90-100 points: Excellent- 80-89 points: Good- 70-79 points: Fair- 60-69 points: Poor- Below 60 points: Needs ImprovementV. ImplementationEducators should follow these steps when implementing the quantitative evaluation method:1. Provide clear guidelines and instructions to students regarding the evaluation criteria and expectations.2. Use the scoring rubric to assess each composition consistently and objectively.3. Offer constructive feedback to students, highlighting their strengths and areas for improvement.4. Encourage students to reflect on their writing and use the feedback to enhance their future compositions.By utilizing this quantitative evaluation method, educators can ensure a fair and consistent assessment of English compositions, ultimately promoting students' writing skills and overall language proficiency.。

基于AHP-FCE_的深度学习能力评价模型的构建研究

Operations Research and Fuzziology 运筹与模糊学, 2023, 13(5), 4414-4427Published Online October 2023 in Hans. https:///journal/orfhttps:///10.12677/orf.2023.135441基于AHP-FCE的深度学习能力评价模型的构建研究周慧敏，陆海华南通大学理学院，江苏南通收稿日期：2023年7月13日；录用日期：2023年9月20日；发布日期：2023年9月27日摘要深度学习能力的评价是典型的多指标复杂型综合性评价，主要表现在四个维度：认知维度、思维维度、技能维度、情感维度。

层次分析法(AHP)可以人为控制某些指标的权重，对定性指标进行量化处理，使主观判断变为客观描述；模糊综合评价法(FCE)可以将定性评价转化为定量评价，对受到多种因素制约的深度学习能力进行总体评价，能够较好地解决模糊、难以量化的问题。

本研究将采用层次分析法确定深度学习能力的评价因素权重，利用模糊综合评价法建立评价模型，量化出深度学习能力评价的等级，最终进行指标体系构建和评价研究。

关键词层次分析法，模糊综合评价法，深度学习能力评价模型Research on the Construction of DeepLearning Ability Evaluation ModelBased on Analytic Hierarchy Processand Fuzzy Comprehensive EvaluationHuimin Zhou, Haihua LuSchool of Science, Nantong University, Nantong JiangsuReceived: Jul. 13th, 2023; accepted: Sep. 20th, 2023; published: Sep. 27th, 2023AbstractThe evaluation of deep learning ability is a typical multi-index and complex comprehensive evalu-周慧敏，陆海华ation. It is mainly manifested in the following four dimensions: cognitive dimension, dimension of thinking, skill dimension, emotional dimension. Analytic hierarchy process can artificially control the weight of some indicators, quantify qualitative indicators, and make subjective judgment into objective description. The fuzzy comprehensive evaluation method can transform the qualitative evaluation into the quantitative evaluation, and make the overall evaluation of the deep learning ability restricted by many factors, which can better solve the problems of fuzzy and difficult to quantify. In this study, Analytic hierarchy process will be used to determine the evaluation factor weight of deep learning ability, and fuzzy comprehensive evaluation method will be used to estab-lish the evaluation model, quantify the level of deep learning ability evaluation, and finally con-duct the index system construction and evaluation research.KeywordsAnalytic Hierarchy Process, Fuzzy Comprehensive Evaluation, Evaluation Model of Deep Learning AbilityThis work is licensed under the Creative Commons Attribution International License (CC BY 4.0)./licenses/by/4.0/1. 引言当今世界教育发展的主要趋势是：教育已经成为贯穿全体公民一生的终身教育，促进人全面发展的素质教育，突出学习者个性的创新教育。

1、下载文档前请自行甄别文档内容的完整性，平台不提供额外的编辑、内容补充、找答案等附加服务。
2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
3、如文档侵犯您的权益，请联系客服反馈,我们会尽快为您处理(人工客服工作时间：9:00-18:30)。

A Quantitative Method for Quality Evaluation of Web Sites andApplicationsLuis Olsina1, Gustavo Rossi2 31 GIDIS, Department of Informatics, Faculty of Engineering, at UNLPam,Calle 9 y 110, (6360) General Pico, La Pampa, ArgentinaE-mail olsinal@.ar2 LIFIA at UNLP, Calle 50 y 115, (1900) La Plata, Buenos Aires, Argentina3 CONICET - ArgentinaE-mail grossi@.arAbstract.In this paper, a quantitative evaluation strategy to assess the quality of Web sites and applications (WebApps) is discussed. We give an overview of the WebQEM (Web Quality Evaluation Method) and its supporting tool by presenting an E-commerce case study. The methodology is useful to systematically assess characteristics, subcharacteristics and attributes that influence product quality. We show that the implementation of the evaluation yields global, partial and elementary quality indicators that can help different stakeholders in understanding and improving the assessed product. Concluding remarks and in-progress research are finally presented.Keywords:Web Engineering, Quantitative Evaluation, WebQEM, Quality Characteristics and Attributes. 1. IntroductionThe Web is playing a central role in diverse application domains such as business, education, industry, and entertainment. As a consequence, there are increasing concerns about the ways in which WebApps are developed and the degree of quality delivered. Thus, there are compelling reasons for a systematic and disciplined use of engineering methods and tools for developing and evaluating Web sites and applications [9]. We need sound evaluation methods for obtaining reliable information about the product’s quality. These methods should identify which attributes and characteristics should be used to obtain meaningful indicators for assuring specific evaluation goals given a user viewpoint.It is widely known that the quality of software products can be described in terms of quality characteristics as defined in the ISO/IEC 9126-1 standard [5]. “However, the state of the art in software measurement is such that, in general, the direct measurement of these characteristics is not practical. What is possible is to assess these characteristics based on the measurement of lower abstraction attributes of the product” ([6], p.3). We consider an attribute as a direct or indirect measurable (tangible or abstract) property of an entity (a WebApps in our case). In addition, we can use a quality model (in the form of a quality requirement tree) in order to specify such characteristics, subcharacteristics and attributes. These quality, cost, or productivity requirements are often quoted as non-functional requirements in the literature.In this context, stakeholders should consider which are the characteristics and attributes that influence the product quality and quality in use (though ensuring product quality is not often sufficient to guarantee quality in use –however, this discussion is beyond the scope of this article). Specifically, there are some characteristics that influence product quality as those prescribed in the ISO 9126-1 standard, i.e., usability, functionality, reliability, efficiency, portability, and maintainability. In order to define and specify the quality requirement tree for a given assessment goal and user viewpoint, we should consider diverse attributes –e.g., Broken Links, Orphan Pages, Quick Access Pages, Table of Contents, Site Map, Links Colour Style Uniformity, Permanence of Main Controls, just to quote a few of them. It might also be admitted, however, that designing a rigorous non-functional requirement model that gives us a strong correlation between attributes and characteristics is a hard endeavour.In this paper, we present the Web Quality Evaluation Method (QEM) [14, 15] and some aspects of its supporting tool, WebQEM_Tool [16]. We show that, by using the methodology for assessment purposes, we can give recommendations both by controlling quality requirements in new Web development projects and by evaluating requirements in operational phases. We show that we can discover either absent features, or requirements poorly implemented, i.e., design and implementation drawbacks related to the interface, navigation, accessibility, search mechanisms, content, reliability and performance, among others.Though our method can be applied for assessing all aspects of Web sites and application, we focus on those that are perceived by the user (navigation, interface, reliability, etc.) instead of other product attributes (such as quality of code, design, etc). In this sense we emphasize the Web site characteristics and attributes from a general visitor viewpoint.The rest of this paper proceeds as follows. In Section 2, we describe the evaluation process to which WebQEM adheres. In Section 3, we discuss a comprehensible example in the field of E-commerce in order to illustrate both the methodology and the first version of the supporting tool. Finally, some concluding remarks and future work are drawn.2. Overview of the Evaluation Process in the WebQEM MethodologyThe WebQEM process steps are grouped in the following four major technical phases:1.Quality Requirements Definition and Specification;2.Elementary Evaluation (both Design and Implementation stages);3.Global Evaluation (both Design and Implementation stages);4.Conclusion of the Evaluation(regarding Recommendations).Figure 1, shows the evaluation process underlying the methodology including the phases, stages, main steps, inputs and outputs. This model is inspired in the ISO’s process model for evaluators [6]. We next give an overview of the major technical phases (the evaluation process has also planning and scheduling steps).2.1 Quality Requirements Definition and Specification.In this phase, evaluators must clarify the evaluation goals and the intended user viewpoint. They should select a quality model, for instance, the ISO-prescribed characteristics in addition to attributes customized to the Web domain. The relative importance of these components should be identified considering the WebApps audience and the extent of the coverage required.Regarding the user profile, at least three abstract evaluation views of quality may be defined, i.e., visitors, developers and managers views. For example, the visitor category can be decomposed in general and expert visitor subcategories.Thus, taking into accounts the domain and product descriptions, the agreed goals, and the selected user view (i.e., the explicit and implicit user needs), characteristics, subcharacteristics and attributes should be specified in a quality requirement tree. In the end of this phase, a quality requirement specification document is produced.2.2 The Elementary Evaluation.In this phase, two major stages are defined as depicted in Fig. 1: The design and the implementation of the elementary evaluation.For each measurable attribute Ai from the requirement tree, we can associate a variable Xi, which will take a numerical value from a direct or indirect metric. However, the value of this metric will not represent the level of satisfaction of this elementary requirement at all. For that reason, it is necessary to define an elementary criterion function that will result afterwards in an elementary indicator or preference value.For instance, let us consider the Broken Links attribute, which measure (count) links that lead to missing destination pages. A possible indirect metric is: X = #Broken_Links / #Total_Links_of_Site. Now, how do we interpret the measured value?; what are the best, worst and intermediate preferred values? The next formula represents a possible criterion function to determine the elementary quality preference EP:EP= 1 (or 100%) if X = 0; EP = 0 (or 0%) if X >= X max ;otherwise EP= (X max – X) / X max if 0 < X < X maxwhere X max is some agreed upper threshold such as 0.06So the elementary quality preference EP is frequently interpreted as the percentage of satisfied requirement for a given attribute, and it is defined in the range between 0, and 100% (so the scale type and the unit of metrics become normalized [20]). Furthermore, to ease the interpretation of preferences, we primarily group them in three acceptability levels, namely: unsatisfactory (from 0 to 40%), marginal (from 40 to 60%), and satisfactory (from 60 to 100%) –this is exemplified in Section 3.4.In the implementation stage, the selected metrics are applied to the Web application as shown in Fig. 1. Some values can be measured observationally, while others can be obtained automatically by using computerized tools.Figure 1. The evaluation processes underlying in the WebQEM methodology. The phases, stages, main processes, inputs and outputs are shown.2.3 The Global Evaluation Phase.Again, two major stages are defined: The design and the implementation of the partial/global quality evaluation. In the design stage, aggregation criteria and a scoring model should be selected. The goal of quantitative aggregation and scoring models is to make the evaluation process well structured, accurate, and comprehensible by evaluators. There are at least two type of models: for example those based on linear additive scoring models [2], and those based on nonlinear multi-criteria scoring models [1] where different attributes and characteristics relationships can be designed. In both cases, the relative importance of indicators is considered by means of weights. For example, if our procedure is based on a linear additive scoring model the aggregation and computing of partial/global indicators or preferences (P/GP), considering relatives weights (W) is based on the following formula:P/GP= (W1 EP1 + W2 EP2+ ... + W m EP m); (1)such that if the elementary preference (EP) is in the unitary interval range the following is held:0 <= EP i <= 1 ; or given a percentage scale, 0 <= EP i <= 100 ;and the sum of weights must fulfill that(W1 + W2 + ... + W m ) = 1; if W i > 0 ; to i = 1 ... m;The basic arithmetic aggregation operator for inputs is the plus (+, or A) connector. The above (1) expression cannot be used to model simultaneity or replaceability of inputs, among other limitations, as we discuss later.Therefore, once a scoring model has been selected, the aggregation process follows the hierarchical structure as defined in the non-functional requirement tree (as the one shown in table 2), from bottom to top. Applying a stepwise aggregation mechanism, a global schema can be obtained in the end. This model allows us to compute partial and global indicators in the implementation stage. The global quality preference represents ultimately the global degree of satisfaction in meeting the stated requirements.2.4 The Conclusion of the Evaluation.In this phase, the documentation of Web product components, the specification of quality requirements, metrics, criteria, elementary and final results are recorded. In the end, the strengths and weaknesses of the assessed product with regard to established goals and user viewpoint can be analyzed and understood by requesters and evaluators. Recommendations can be suggested and justified.2.5 Automating the Process using WebQEM_Tool.The evaluation and comparison processes require both methodological and technological support. We have developed a Web-based tool [16] that supports the administration of evaluation projects. It allows editing and relating non-functional requirements. For instance, in our E-commerce case study, more than ninety attributes have intervened [15]. Then, by means of the automatic or manual edition of elementary indicators, WebQEM_Tool allows to aggregate the elements in order to yield a schema and calculate a global quality indicator for each site. This enables evaluators to assess and compare the quality of Web products. The WebQEM_Tool relies on a Web-based hyperdocument model that supports traceability of evaluation aspects as we will see in Section 3.2. The results of an evaluation are shown through linked pages with textual, tabular and graphical information, which are dynamically generated from tables stored in the data layer.3. Putting WebQEM to WorkIn this section we discuss some issues related with an e-bookstore case study [15]. We have also made evaluations in the museum and academic domains; they can be seen in [12, 13].3.1 About WebApps Quality Requirements.There are many potential attributes, both general and domain specific that contribute to the quality of WebApps. Figure 2, shows a screenshot of an e-store home page (.ar) pointing just some few attributes generally available in this kind of sites. Table 2 documents a wider list of tailorable quality requirements considering a general visitor profile.The requirement tree shown in Table 2 is intended to be reusable among domains. For instance, the Usability characteristic is split up in subcharacteristics such as Global Site Understandability, Feedback and Help Features, and Interface and Aesthetic Features. The Functionality characteristic is decomposed in Searching and Retrieving Issues, Navigation and Browsing Issues, and Domain Specific Functionality and Content. This last component of the tree (where Functionality is the supercharacteristic) should be customized among domains, therefore is no totally intended for reuse. Table 1 outlines the schema we used in the e-bookstore study. We have identified five main components for e-stores (see also the categorization in [8]), namely: Product Information (2.3.1 coded), Purchase Features (2.3.2), Customer Features (2.3.3), Store Features (2.3.4), and Promotion Policies (2.3.5).Figure 2. A screenshot of Cúspide’s home page where some attributes are highlighted.Table 1. The Domain Specific Functionality and Content subcharacteristic for E-bookstore sites (the italic style represents direct or indirect measurable attributes).2.3 Domain Specific Functionality and Content (for E-bookstores)2.3.1 Product Information 2.3.1.1 Product Description 2.3.1.1.1 Basic Book Description 2.3.1.1.2 Book Content & Structure 2.3.1.1.2.1 Book’s Table of Contents 2.3.1.1.2.2 Content Description 2.3.1.1.3 Product Image 2.3.1.1.3.1 Image Availability 2.3.1.1.3.2 Zooming 2.3.1.2 Price Evaluation2.3.1.2.1 Price Comparison Availability 2.3.1.3 Product Rating Availability 2.3.1.4 Related Titles /Authors Recommendation2.3.1.5 Catalog Download Facility 2.3.2 Purchase Features 2.3.2.1 Purchase Mode 2.3.2.1.1 On -line2.3.2.1.1.1 Shopping Basket2.3.2.1.1.1.1 Shopping Basket Availability 2.3.2.1.1.1.2 Continue Buying Feedback 2.3.2.1.1.1.3 Edit/Recalculate Feature 2.3.2.1.1.2 Quick Purchase (1-click or similar)2.3.2.1.1.3 Checkout Features 2.3.2.1.1.3.1 Checkout Security 2.3.2.1.1.3.2 Canceling Feedback 2.3.2.1.2 Off-line2.3.2.1.2.1 Printable Checkout Form 2.3.2.1.2.2 Fax/TE/E-mail Purchase 2.3.2.2 Purchase Policies2.3.2.2.1 Purchase Cancellation Policy 2.3.2.2.2 Return Policy Information2.3.2.2.3 Shipping & Handling Information 2.3.2.2.4 Payment Policy Information 2.3.2.2.5 Resent Purchase (Gift service)2.3.3 Customer Features 2.3.3.1 E-subscriptions2.3.3.2 Customized Recommendations 2.3.3.3 Account Facility 2.3.3.3.1 Account Availability 2.3.3.3.2 Account Security 2.3.3.3.3 Account Configuration 2.3.3.3.3.1 Order History/Status 2.3.3.3.3.2 Account Settings 2.3.3.3.3.3 Address Book2.3.3.4 Customer Revision of a Book 2.3.4 Store Features2.3.4.1 Title Availability Rate 2.3.4.2 Store Ranking 2.3.4.2.1 The Top Books2.3.4.2.2 The Best Seller Books 2.3.5 Promotion Policies2.3.5.1 With-sale Promotion Availability 2.3.5.2 Appetizer Promotion Availability (Contests, Miles, etc.)Though the subtree of table 1 has been specified for the e-bookstore field, it can be easily seen that many of its parts are reusable for a more general e-commerce domain. For instance, in the case of the Purchase Features (2.3.2), we can see two main subfactors: Purchase Mode (2.3.2.1), and Purchase Policies (2.3.2.2). Regarding the Purchase Mode subcharacteristic, online and offline modes are feasible, though the former is becoming more popular as long as confidence in security was increasing [3]. For online purchase, the Shopping Basket , Quick Purchase , and Checkout features are modeled.As remarked elsewhere [18], the shopping basket mechanism is generally used to decouple the selection process from the checkout process of products or services. It is interesting to compare many of these criteria with existing navigation and interface patterns. We can easily argue that when we record and reuse design experiencewe can obtain valuable information for specifying quality attributes or subcharacteristics.Table 2.Tailorable quality requirement tree for a general visitor standpoint (the italic style represents direct or indirect measurable attributes)1. Usability .1.1 Global Site Understandability1.1.1 Global Organization Scheme1.1.1.1Table of Contents1.1.1.2 Site Map1.1.1.3 Global Indexes1.1.1.3.1Subject Index1.1.1.3.2 Alphabetical Index1.1.1.3.3 Chronological Index1.1.1.3.4 Geographical Index1.1.1.3.5 Other Indexes (by audience, by format, hybrid -like alphabetical and subject oriented)1.1.2 Quality of Labeling System1.1.3 Audience-oriented Guided Tour 1.1.3.1Conventional Tour1.1.3.2VR Tour1.1.4 Image Map (Metaphorical, Building, Campus, Floor and Room Imagemaps) 1.2 Feedback and Help Features1.2.1 Quality of Help Features1.2.1.1 Global Help (for first-time visitors) 1.2.1.2 Specific Help (for searching, checking out, etc.)1.2.2 Addresses Directory1.2.2.1 E-mail Directory1.2.2.2 Phone-Fax Directory1.2.2.3 Post mail Directory1.2.3 Link-based Feedback1.2.3.1 FAQ Feature1.2.3.2 What’s New Feature1.2.4 Form-based Feedback1.2.4.1 Questionnaire Feature1.2.4.2 Comments/Suggestions1.2.4.3 Subject-Oriented Feedback1.2.4.4 Guest Book1.3 Interface and Aesthetic Features 1.3.1 Cohesiveness by Grouping Main Control Objects1.3.2 Presentation Permanence and Stability of Main Controls 1.3.2.1Direct Controls Permanence (Main,Search, Browse Controls)1.3.2.2Indirect Controls Permanence1.3.2.3Stability1.3.3Style Issues1.3.3.1Links Color Style Uniformity1.3.3.2Global Style Uniformity1.3.4 Aesthetic Preference1.4 Miscellaneous Features1.4.1Foreign Language Support1.4.2 Website Last Update Indicator1.4.2.1 Global1.4.2.2 Scoped (per sub-site or page)1.4.3Screen Resolution Indicator2. Functionality .2.1 Searching and Retrieving Issues2.1.1 Website Search Mechanisms2.1.1.1Global Search2.1.1.2Scoped Search (e.g., MuseumCollections, Books, Academic Personnel)2.1.2 Retrieve Mechanisms2.1.2.1Level of Retrieving Customization2.1.2.2 Level of Retrieving Feedback2.2 Navigation and Browsing Issues2.2.1 Navigability2.2.1.1 Orientation2.2.1.1.1Indicator of Path2.2.1.1.2 Label of Current Position2.2.1.2 Average of Links per Page2.2.2 Navigational Control Objects2.2.2.1 Presentation Permanence andStability of Contextual (sub-site) Controls2.2.2.1.1 Contextual Controls Permanence2.2.2.1.2 Contextual Controls Stability2.2.2.2 Level of Scrolling2.2.2.2.1 Vertical Scrolling2.2.2.2.2 Horizontal Scrolling2.2.3 Navigational Prediction2.2.3.1Link Title (link with explanatory help)2.2.3.2Quality of Link Phrase2.2.4 Browse Mechanisms2.2.4.1Quick Browse Controls2.3 Domain Specific Functionality andContentNote: see, for example, the specification toe-bookstores in Table 1.3. Reliability .3.1 Non-deficiency3.1.1 Link Errors3.1.1.1 Broken Links3.1.1.2 Invalid Links3.1.1.3 Unimplemented Links3.1.2 Spelling Errors3.1.3 Miscellaneous Errors or Drawbacks3.1.3.1 Deficiencies or absent features dueto different browsers3.1.3.2 Deficiencies or unexpected results(e.g., non-trapped search errors, frameproblems, etc.) independent of browsers3.1.3.3Orphan Pages3.1.3.4Destination Nodes (unexpectedly)under Construction4. Efficiency .4.1 Performance behavior4.1.1 Quick Static Pages4.2 Accessibility4.2.1 Information Accessibility4.2.1.1 Support for text-only version4.2.1.2 Readability by deactivating theBrowser Image Feature4.2.1.2.1 Image Title4.2.1.2.2 Global Readability4.2.2 Window Accessibility4.2.2.1 Number of panes regarding frames4.2.2.2 Non-frame Version3.2 Designing and Implementing the Elementary Evaluation.As mentioned in Section 2.2, the evaluators should define, for each quantifiable attribute, the basis for the elementary evaluation criterion, and perform the measurement and preference mapping process.In order to record the information that is needed during the evaluation processes, we have defined a descriptive specification framework as exemplified in tables 3 and 4. Specific information about definition of attributes, subcharacteristics and characteristics as well as metrics, elementary preference criteria, scoring model components and calculations are recorded. (Notice that codes in the templates in tables 3 and 4 correspond to those shown in the requirement tree).Once evaluators have designed and implemented the elementary evaluation process they should be able to model attributes, subcharacteristics, and characteristics relationships. They should consider not only the relative importance of each attribute in the group but also if the attribute (or subcharacteristic) is a mandatory, an alternative or a neutral one. For this purpose, a robust aggregation and scoring model is needed as we discuss in the next subsection.Table 3.Template and example with the characteristic items. WebQEM_Tool uses this information.Title (code):Reliability (3)Type:CharacteristicFactor:QualitySubcharacteristic/s (code/s):Non-deficiency (3.1)Definition / Comments:“The capability of the software product to maintain a specified level ofperformance when used under specified conditions”[5]Model to determine the Global/Partial Computation:Nonlinear multi-criteria scoring model; specifically, the Logic Scoring of Preference model [1]Employed Tool/s:WebQEM_ToolArithmetic / Logic Operator: C - - (Note: The arithmetic or logic operator item for thesubcharacteristic and characteristic aggregation will be explained in thesubsection 3.3)Weight:0.2Calculated Preference Value/s: A set of values for Reliability, as shown in table 5.Table 4.Template and example with the attribute items.Code / Title: 3.1.1.1. Broken LinksType:AttributeHighest level characteristic:Reliability (3)Supercharacteristic/s (code):Link Errors (3.1.1)Definition / Comments It represents found links that lead to missing destination pages bothinternal and external static pages of a site (known also as dangling links).“Users get irritated when they attempt to go somewhere, only to get theirreward snatched away at the last moment by a 404 or otherincomprehensible error message”. /alertbox/980614.htmlTemplate of Metric and Parameters:Note: The metric and parameters item links another template with information of the selected metric criterion, the expected and planned values, measurement dates among other fields. For instance, the metric criterion is:X = #Broken_Links / #Total_Links_of_Site.For each e-store in the field study, we got the respective X valueData Collection Type:Note: The data collection type item records whether the data are gatheredmanually or automatically and the kind of the employed tool (if that isdone automatically, as for the Broken Links attribute).Employed Tool/s:our Website MA tool, among othersElementary Preference Function:EP= 1 (or 100%) if X = 0;EP = 0 (or 0%) if X >= X max ;otherwise EP= (X max – X) / X max if 0 < X < X maxX max = 0.06Weight:0.5Elementary Preference Value/s:Cúspide’s site yielded an elementary preference of 99.83 %, Amazon,98.40 %, Barnes and Noble 97.45 %, Borders, 76.34, and Díaz de Santos60.07 %3.3 Designing and Implementing the Partial/Global Evaluation.In these stages (see Fig. 1), an aggregation and scoring model should be selected and applied. The hierarchically grouped attributes, subcharacteristics, and characteristics will then be related by arithmetic or logic operators accordingly.As commented in section 2.3, we can choose between (1) a linear additive scoring model; and (2) a nonlinearmulti-criteria scoring model.The additive scoring model cannot be used to model simultaneity or replaceability of inputs; they are not useful to express for example simultaneous satisfaction of several requirements as inputs. Additivity assumes that insufficient presence of a specific attribute (input) can always be compensated by sufficient presence of any other attribute. Furthermore, additive models are unable to model mandatory requirements; i.e., the total absence of a necessary attribute or subcharacteristic cannot be well compensated by means of the high presence of others. Instead, if we use a nonlinear multi-criteria scoring model we can deal with simultaneity, neutrality, replaceability, and other input relationships using aggregation operators based on the weighted power means mathematical model [1]. This model, so-called Logic Scoring of Preferences (LSP), is a generalization of the additive-scoring model, and can be expressed as follow:P/GP(r)= (W1 EP r1 + W2 EP r 2+ ... + W m EP r m)1/ r ; (2)where -∞ <= r <= +∞ ; P/GP(- ∞)= min (EP1 , EP2 , ... , EP m) and;P/GP(+∞) = max (EP1 , EP2, ... , EP m);The power r is a parameter (a real number) selected in order to achieve the desired logical relationship and intensity of polarization of the aggregation function. If P/GP(r) is closer to the minimum then such a criterion specifies the requirement for the simultaneity of inputs. Conversely, if it is closer to the maximum then it specifies the requirement for the replaceability of inputs.As the reader may see, the formula (2) is additive when r = 1, which models the neutrality relationship; i.e., the formula remains the same as in the first additive model. In addition, (2) is supra-additive for r > 1 which models the disjunction or replaceability of inputs. And it is sub-additive for r < 1, (with r != 0) which models the conjunction or simultaneity of inputs.In the case study, the use of this last model was selected. A seventeen-level approach of conjunction-disjunction operators was used, as defined by Dujmovic [1]. Each operator in the model corresponds to a particular value of the r parameter. When r = 1 the operator is tagged with A (or the + sign). The C or conjunctive operators range from weak (C-) to strong (C+) quasi-conjunction functions, i.e., from decreasing values of r, starting from r < 1.In general, the conjunctive operators imply that, a low quality of an input preference can never be well compensated by a high quality of some other input to output a high quality preference (in other words, a chain is as strong as its weakest link). Conversely, disjunctive operators (D operators) imply that a low quality of an input preference can always be compensated by a high quality of some other input.Figure 3. Once the weights and operators were agreed, and the schema checked, the WebQEM_Tool could yield the partial and global preferences as shown in the right-side pane.In order to design the LSP aggregation schema, the following key basic questions (which are part of the Global Preference Criteria Definition task, in Fig. 1), should be answered : (1) what is the kind of relationship among this group of related attributes/subcharacteristic, etc.?, is it either a conjunctive, or disjunctive or neutral relationship?; (2) what is the level of intensity of the logic operator from a weak to strong conjunctive/disjunctive polarization?; (3) what is the relative importance or weight of each element into the group?The WebQEM_Tool allows evaluators to select the aggregation and scoring model. When using the additive scoring model, the aggregation operator is A for all the tree composites (subcharacteristics and characteristics). Instead, if evaluators select the LSP model, the operator must be indicated for each subcharacteristic and characteristic. Fig. 3, shows a partial view of the enacted schema for as generated by our tool.3.4 Analyzing and Recommending.Once the final execution of the evaluation was performed and agreed, decision-makers are ready to analyze the results and draw the conclusions.Table 5, shows the final values for Usability, Functionality, Reliability and Efficiency characteristics and the global Quality indicator in our case study in the e-commerce domain. It also shows the Domain Specific Functionality and Content subcharacteristic (2.3 in Table 1) for the Functionality characteristic.The colored quality bars at the right side in Fig. 4, indicate the acceptability levels; evaluators can easily see the quality level each e-bookstore has reached. For instance, a scoring within a gray bar shows the need for improvement actions; meanwhile, an unsatisfactory rating level may indicate that urgent change actions must be performed. A scoring within a green bar can be interpreted as a satisfactory quality for the analyzed feature. This e-bookstore study was deeply discussed in [15].Table 5.Summary of partial and global preferences for each e-bookstore assessed in late 1999. (For legibility reasons, this table is not shown in the tool screenshot as in Fig. 4)Characteristic and Sub-characteristics Amazon B&N Cúspide Díaz Stos Borders1. Usability76.1682.6275.9356.0972.672. Functionality83.1580.1261.6928.6461.452.1 Searching and Retrieving Issues1001009142.6772.062.2 Navigation and Browsing Issues70.7169.8573.2564.1251.952.3 Domain Specific Functionality and Content81.9976.5345.8114.4261.552.3.1 Product Information63.7242.2040.6410.2015.982.3.2 Purchase Features91.7684.8467.7217.1181.922.3.3 Customer Features100852028.08652.3.4 Store Features10096.8071.2033.6093.572.3.5 Promotion Policies601004001003. Reliability99.4499.1190.9778.5191.664. Efficiency96.8874.5490.1786.0190.90 Global Quality Preference86.8182.9575.5250.3774.86Figure 4.The screenshots show the diverse type of information (as textual, tabular and graphical) yielded by the WebQEM_Tool. The right side shows a graph with the final rankings to each evaluated e-bookstore.。