Generalized Additive Models

合集下载

GAMens 1.2.1 文档说明书

Package‘GAMens’October12,2022Title Applies GAMbag,GAMrsm and GAMens Ensemble Classiﬁers forBinary ClassiﬁcationVersion1.2.1Author Koen W.De Bock,Kristof Coussement and Dirk Van den PoelMaintainer Koen W.De Bock<********************>Depends R(>=2.4.0),splines,gam,mlbench,caToolsDescription Implements the GAMbag,GAMrsm and GAMens ensembleclassiﬁers for binary classiﬁca-tion(De Bock et al.,2010)<doi:10.1016/j.csda.2009.12.013>.The ensemblesimplement Bagging(Breiman,1996)<doi:10.1023/A:1010933404324>,the Random Sub-space Method(Ho,1998)<doi:10.1109/34.709601>,or both,and use Hastie and Tibshirani's(1990,ISBN:978-0412343902)generalized additive models(GAMs)as base classiﬁers.Once an ensemble classiﬁer has been trained,it canbe used for predictions on new data.A function for cross validation is alsoincluded.License GPL(>=2)RoxygenNote6.0.1NeedsCompilation noRepository CRANDate/Publication2018-04-0517:12:34UTCR topics documented:GAMens (2)GAMens.cv (5)predict.GAMens (7)Index101GAMens Applies the GAMbag,GAMrsm or GAMens ensemble classiﬁer to adata setDescriptionFits the GAMbag,GAMrsm or GAMens ensemble algorithms for binary classiﬁcation using gen-eralized additive models as base classiﬁers.UsageGAMens(formula,data,rsm_size=2,autoform=FALSE,iter=10,df=4,bagging=TRUE,rsm=TRUE,fusion="avgagg")Argumentsformula a formula,as in the gam function.Smoothing splines are supported as nonpara-metric smoothing terms,and should be indicated by s.See the documentationof s in the gam package for its arguments.The GAMens function also providesthe possibility for automatic formula speciﬁcation.See’details’for more infor-mation.data a data frame in which to interpret the variables named in formula.rsm_size an integer,the number of variables to use for random feature subsets used in the Random Subspace Method.Default is2.If rsm=FALSE,the value of rsm_sizeis ignored.autoform if FALSE(default),the model speciﬁcation in formula is used.If TRUE,the function triggers automatic formula speciﬁcation.See’details’for more infor-mation.iter an integer,the number of base classiﬁers(GAMs)in the ensemble.Defaults to iter=10base classiﬁers.df an integer,the number of degrees of freedom(df)used for smoothing spline estimation.Its value is only used when autoform=TRUE.Defaults to df=4.Itsvalue is ignored if a formula is speciﬁed and autoform is FALSE.bagging enables Bagging if value is TRUE(default).If FALSE,Bagging is disabled.Either bagging,rsm or both should be TRUErsm enables Random Subspace Method(RSM)if value is TRUE(default).If FALSE, RSM is disabled.Either bagging,rsm or both should be TRUE fusion speciﬁes the fusion rule for the aggregation of member classiﬁer outputs in the ensemble.Possible values are avgagg (default), majvote , w.avgagg orw.majvote .DetailsThe GAMens function applies the GAMbag,GAMrsm or GAMens ensemble classiﬁers(De Bock et al.,2010)to a data set.GAMens is the default with(bagging=TRUE and rsm=TRUE.For GAMbag, rsm should be speciﬁed as FALSE.For GAMrsm,bagging should be FALSE.The GAMens function provides the possibility for automatic formula speciﬁcation.In this case, dichotomous variables in data are included as linear terms,and other variables are assumed con-tinuous,included as nonparametric terms,and estimated by means of smoothing splines.To enable automatic formula speciﬁcation,use the generic formula[response variable name]~.in combi-nation with autoform=TRUE.Note that in this case,all variables available in data are used in the model.If a formula other than[response variable name]~.is speciﬁed then the autoform op-tion is automatically overridden.If autoform=FALSE and the generic formula[response variable name]~.is speciﬁed then the GAMs in the ensemble will not contain nonparametric terms(i.e.,will only consist of linear terms).Four alternative fusion rules for member classiﬁer outputs can be speciﬁed.Possible values are avgagg for average aggregation(default), majvote for majority voting, w.avgagg for weighted average aggregation,or w.majvote for weighted majority voting.Weighted approaches are based on member classiﬁer error rates.ValueAn object of class GAMens,which is a list with the following components:GAMs the member GAMs in the ensemble.formula the formula used tot create the GAMens object.iter the ensemble size.df number of degrees of freedom(df)used for smoothing spline estimation.rsm indicates whether the Random Subspace Method was used to create the GAMens object.bagging indicates whether bagging was used to create the GAMens object.rsm_size the number of variables used for random feature subsets.fusion_method the fusion rule that was used to combine member classiﬁer outputs in the en-semble.probs the class membership probabilities,predicted by the ensemble classiﬁer.class the class predicted by the ensemble classiﬁer.samples an array indicating,for every base classiﬁer in the ensemble,which observations were used for training.weights a vector with weights deﬁned as(1-error rate).Usage depends upon speciﬁca-tion of fusion_method.Author(s)Koen W.De Bock<********************>,Kristof Coussement<*********************> and Dirk Van den Poel<************************>ReferencesDe Bock,K.W.and Van den Poel,D.(2012):"Reconciling Performance and Interpretability in Customer Churn Prediction Modeling Using Ensemble Learning Based on Generalized Additive Models".Expert Systems With Applications,V ol39,8,pp.6816–6826.De Bock,K.W.,Coussement,K.and Van den Poel,D.(2010):"Ensemble Classiﬁcation based on generalized additive models".Computational Statistics&Data Analysis,V ol54,6,pp.1535–1546.Breiman,L.(1996):"Bagging predictors".Machine Learning,V ol24,2,pp.123–140.Hastie,T.and Tibshirani,R.(1990):"Generalized Additive Models",Chapman and Hall,London.Ho,T.K.(1998):"The random subspace method for constructing decision forests".IEEE Transac-tions on Pattern Analysis and Machine Intelligence,V ol20,8,pp.832–844.See Alsopredict.GAMens,GAMens.cvExamples##Load data(mlbench library should be loaded)library(mlbench)data(Ionosphere)IonosphereSub<-Ionosphere[,c("V1","V2","V3","V4","V5","Class")]##Train GAMens using all variables in Ionosphere datasetIonosphere.GAMens<-GAMens(Class~.,IonosphereSub,4,autoform=TRUE,iter=10)##Compare classification performance of GAMens,GAMrsm and GAMbag ensembles,##using4nonparametric terms and2linear termsIonosphere.GAMens<-GAMens(Class~s(V3,4)+s(V4,4)+s(V5,3)+s(V6,5)+V7+V8,Ionosphere,3,autoform=FALSE,iter=10)Ionosphere.GAMrsm<-GAMens(Class~s(V3,4)+s(V4,4)+s(V5,3)+s(V6,5)+V7+V8,Ionosphere,3,autoform=FALSE,iter=10,bagging=FALSE,rsm=TRUE)Ionosphere.GAMbag<-GAMens(Class~s(V3,4)+s(V4,4)+s(V5,3)+s(V6,5)+V7+V8,Ionosphere,3,autoform=FALSE,iter=10,bagging=TRUE,rsm=FALSE)##Calculate AUCs(for function colAUC,load caTools library)library(caTools)GAMens.auc<-colAUC(Ionosphere.GAMens[[9]],Ionosphere["Class"]=="good",plotROC=FALSE)GAMrsm.auc<-colAUC(Ionosphere.GAMrsm[[9]],Ionosphere["Class"]=="good",plotROC=FALSE)GAMbag.auc<-colAUC(Ionosphere.GAMbag[[9]],Ionosphere["Class"]=="good",plotROC=FALSE)GAMens.cv Runs v-fold cross validation with GAMbag,GAMrsm or GAMens en-semble classiﬁerDescriptionIn v-fold cross validation,the data are divided into v subsets of approximately equal size.Subse-quently,one of the v data parts is excluded while the remainder of the data is used to create a GAMens object.Predictions are generated for the excluded data part.The process is repeated v times.UsageGAMens.cv(formula,data,cv,rsm_size=2,autoform=FALSE,iter=10,df=4,bagging=TRUE,rsm=TRUE,fusion="avgagg")Argumentsformula a formula,as in the gam function.Smoothing splines are supported as nonpara-metric smoothing terms,and should be indicated by s.See the documentationof s in the gam package for its arguments.The GAMens function also providesthe possibility for automatic formula speciﬁcation.See’details’for more infor-mation.data a data frame in which to interpret the variables named in formula.cv An integer specifying the number of folds in the cross-validation.rsm_size an integer,the number of variables to use for random feature subsets used in the Random Subspace Method.Default is2.If rsm=FALSE,the value of rsm_sizeis ignored.autoform if FALSE(by default),the model speciﬁcation in formula is used.If TRUE,the function triggers automatic formula speciﬁcation.See’details’for more infor-mation.iter an integer,the number of base(member)classiﬁers(GAMs)in the ensemble.Defaults to iter=10base classiﬁers.df an integer,the number of degrees of freedom(df)used for smoothing spline estimation.Its value is only used when autoform=TRUE.Defaults to df=4.Itsvalue is ignored if a formula is speciﬁed and autoform is FALSE.bagging enables Bagging if value is TRUE(default).If FALSE,Bagging is disabled.Either bagging,rsm or both should be TRUErsm enables Random Subspace Method(RSM)if value is TRUE(default).If FALSE, rsm is disabled.Either bagging,rsm or both should be TRUE fusion speciﬁes the fusion rule for the aggregation of member classiﬁer outputs in the ensemble.Possible values are avgagg for average aggregation(default),majvote for majority voting, w.avgagg for weighted average aggregationbased on base classiﬁer error rates,or w.majvote for weighted majority vot-ing.ValueAn object of class GAMens.cv,which is a list with the following components:foldpred a data frame with,per fold,predicted class membership probabilities for the left-out observations.pred a data frame with predicted class membership probabilities.foldclass a data frame with,per fold,predicted classes for the left-out observations.class a data frame with predicted classes.conf the confusion matrix which compares the real versus predicted class member-ships,based on the class object.Author(s)Koen W.De Bock<********************>,Kristof Coussement<*********************> and Dirk Van den Poel<************************>ReferencesDe Bock,K.W.and Van den Poel,D.(2012):"Reconciling Performance and Interpretability in Customer Churn Prediction Modeling Using Ensemble Learning Based on Generalized Additive Models".Expert Systems With Applications,V ol39,8,pp.6816–6826.De Bock,K.W.,Coussement,K.and Van den Poel,D.(2010):"Ensemble Classiﬁcation based on generalized additive models".Computational Statistics&Data Analysis,V ol54,6,pp.1535–1546.Breiman,L.(1996):"Bagging predictors".Machine Learning,V ol24,2,pp.123–140.Hastie,T.and Tibshirani,R.(1990):"Generalized Additive Models",Chapman and Hall,London.Ho,T.K.(1998):"The random subspace method for constructing decision forests".IEEE Transac-tions on Pattern Analysis and Machine Intelligence,V ol20,8,pp.832–844.See Alsopredict.GAMens,GAMensExamples##Load data:mlbench library should be loaded!)library(mlbench)data(Sonar)SonarSub<-Sonar[,c("V1","V2","V3","V4","V5","V6","Class")]##Obtain cross-validated classification performance of GAMrsm##ensembles,using all variables in the Sonar dataset,based on5-fold##cross validation runsSonar.cv.GAMrsm<-GAMens.cv(Class~s(V1,4)+s(V2,3)+s(V3,4)+V4+V5+V6,SonarSub,5,4,autoform=FALSE,iter=10,bagging=FALSE,rsm=TRUE)##Calculate AUCs(for function colAUC,load caTools library)library(caTools)GAMrsm.cv.auc<-colAUC(Sonar.cv.GAMrsm[[2]],SonarSub["Class"]=="R",plotROC=FALSE)predict.GAMens Predicts from aﬁtted GAMens object(i.e.,GAMbag,GAMrsm orGAMens classiﬁer).DescriptionGenerates predictions(classes and class membership probabilities)for observations in a dataframe using a GAMens object(i.e.,GAMens,GAMrsm or GAMbag classiﬁer).Usage##S3method for class GAMenspredict(object,data,...)Argumentsobjectﬁtted model object of GAMens class.data data frame with observations to genenerate predictions for....further arguments passed to or from other methods.ValueAn object of class predict.GAMens,which is a list with the following components:pred the class membership probabilities generated by the ensemble classiﬁer.class the classes predicted by the ensemble classiﬁer.conf the confusion matrix which compares the real versus predicted class member-ships,based on the class object.Obtains value NULL if the testdata is unlabeled.Author(s)Koen W.De Bock<********************>,Kristof Coussement<*********************> and Dirk Van den Poel<************************>ReferencesDe Bock,K.W.and Van den Poel,D.(2012):"Reconciling Performance and Interpretability in Customer Churn Prediction Modeling Using Ensemble Learning Based on Generalized Additive Models".Expert Systems With Applications,V ol39,8,pp.6816–6826.De Bock,K.W.,Coussement,K.and Van den Poel,D.(2010):"Ensemble Classiﬁcation based on generalized additive models".Computational Statistics&Data Analysis,V ol54,6,pp.1535–1546.Breiman,L.(1996):"Bagging predictors".Machine Learning,V ol24,2,pp.123–140.Hastie,T.and Tibshirani,R.(1990):"Generalized Additive Models",Chapman and Hall,London.Ho,T.K.(1998):"The random subspace method for constructing decision forests".IEEE Transac-tions on Pattern Analysis and Machine Intelligence,V ol20,8,pp.832–844.See AlsoGAMens,GAMens.cvExamples##Load data,mlbench library should be loaded!)library(mlbench)data(Sonar)SonarSub<-Sonar[,c("V1","V2","V3","V4","V5","V6","Class")]##Select indexes for training set observationsidx<-c(sample(1:97,60),sample(98:208,70))##Train GAMrsm using all variables in Sonar dataset.Generate predictions##for test set observations.Sonar.GAMrsm<-GAMens(Class~.,SonarSub[idx,],autoform=TRUE,iter=10,bagging=FALSE,rsm=TRUE)Sonar.GAMrsm.predict<-predict(Sonar.GAMrsm,SonarSub[-idx,])##Load data mlbench library should be loaded!)library(mlbench)data(Ionosphere)IonosphereSub<-Ionosphere[,c("V1","V2","V3","V4","V5","V6","V7","V8","Class")]Ionosphere_s<-IonosphereSub[order(IonosphereSub$Class),]##Select indexes for training set observationsidx<-c(sample(1:97,60),sample(98:208,70))##Compare test set classification performance of GAMens,GAMrsm and##GAMbag ensembles,using using4nonparametric terms and2linear terms in the##Ionosphere datasetIonosphere.GAMens<-GAMens(Class~s(V3,4)+s(V4,4)+s(V5,3)+s(V6,5)+V7+V8,IonosphereSub[idx,],autoform=FALSE,iter=10,bagging=TRUE,rsm=TRUE)Ionosphere.GAMens.predict<-predict(Ionosphere.GAMens,IonosphereSub[-idx,])Ionosphere.GAMrsm<-GAMens(Class~s(V3,4)+s(V4,4)+s(V5,3)+s(V6,5)+V7+V8, IonosphereSub[idx,],autoform=FALSE,iter=10,bagging=FALSE,rsm=TRUE) Ionosphere.GAMrsm.predict<-predict(Ionosphere.GAMrsm, IonosphereSub[-idx,])Ionosphere.GAMbag<-GAMens(Class~s(V3,4)+s(V4,4)+s(V5,3)+s(V6,5)+V7+V8, IonosphereSub[idx,],autoform=FALSE,iter=10,bagging=TRUE,rsm=FALSE) Ionosphere.GAMbag.predict<-predict(Ionosphere.GAMbag, IonosphereSub[-idx,])##Calculate AUCs(for function colAUC,load caTools library)library(caTools)GAMens.auc<-colAUC(Ionosphere.GAMens.predict[[1]],IonosphereSub[-idx,"Class"]=="good",plotROC=FALSE)GAMrsm.auc<-colAUC(Ionosphere.GAMrsm.predict[[1]],Ionosphere[-idx,"Class"]=="good",plotROC=FALSE)GAMbag.auc<-colAUC(Ionosphere.GAMbag.predict[[1]],IonosphereSub[-idx,"Class"]=="good",plotROC=FALSE)Index∗classifGAMens,2GAMens.cv,5predict.GAMens,7∗modelsGAMens,2GAMens.cv,5predict.GAMens,7GAMens,2,6,8GAMens.cv,4,5,8predict.GAMens,4,6,710。

gam模型每个因子的回归系数

gam模型每个因子的回归系数-概述说明以及解释1.引言1.1 概述Generalized Additive Models (GAM) 是一种统计模型，它结合了广义线性模型（Generalized Linear Models, GLM）和非参数平滑技术，用于建模非线性关系。

相比传统的线性回归模型，GAM能更好地拟合非线性关系，并允许我们研究每个自变量对因变量的影响，同时控制其他自变量的效果。

GAM模型的核心思想是将因变量拟合为多个非线性函数的组合，每个自变量可以通过自适应平滑函数建模。

本文旨在介绍GAM模型中每个因子的回归系数，以及这些系数的含义和解释。

通过对每个因子的回归系数进行分析，我们可以深入理解GAM 模型在实际问题中的应用，以及每个因子对因变量的影响程度。

文章结构部分内容可以包括以下信息：1.2 文章结构本文主要分为引言、正文和结论三个部分。

在引言部分，我们将首先对GAM模型进行概述，简要介绍文章的结构和目的。

在正文部分，我们将详细介绍GAM模型的概念和每个因子的意义，重点讨论每个因子的回归系数及其意义。

最后，在结论部分，我们将对全文进行总结，展望未来研究方向，并得出结论。

通过这样的结构，我们将全面深入地探讨GAM 模型每个因子的回归系数，为读者提供全面的信息和深刻的认识。

1.3 目的本文旨在探讨GAM模型中每个因子的回归系数的意义和影响，通过深入分析每个因子在模型中的作用，帮助读者更好地理解GAM模型的应用和解释。

同时，也旨在为研究者和实践者提供一些有益的参考，以便他们在实际应用中更好地理解和解释GAM模型的结果，从而提高模型的准确性和可信度。

通过本文的研究，希望能为GAM模型的理论研究和实践应用提供一定的借鉴和参考。

2.正文2.1 GAM模型介绍部分：广义可加模型（Generalized Additive Model，GAM）是一种灵活的非参数统计模型，它可以用于建模因变量和自变量之间的非线性关系。

R语言实现广义加性模型GeneralizedAdditiveModels（GAM）入门

R语⾔实现⼴义加性模型GeneralizedAdditiveModels（GAM）⼊门转载请说明。

下⾯进⾏⼀个简单的⼊门程序学习。

先新建⼀个txt，叫做 Rice_insect.txt ，内容为：（⽤制表符Tab）Year Adult Day Precipitation1973 27285 15 387.31974 239 14 126.31975 6164 11 165.91976 2535 24 184.91977 4875 30 166.91978 9564 24 146.01979 263 3 24.01980 3600 21 23.01981 21225 13 167.01982 915 12 67.01983 225 17 307.01984 240 40 295.01985 5055 25 266.01986 4095 15 115.01987 1875 21 140.01988 12810 32 369.01989 5850 21 167.01990 4260 39 270.8 Adult为累计蛾量，Day为降⾬持续天数，Precipitation为降⾬量。

输⼊代码：library(mgcv) #加载mgcv软件包，因为gam函数在这个包⾥Data <- read.delim("Rice_insect.txt") #读取txt数据，存到Data变量中Data <- as.matrix(Data) #转为矩阵形式#查看Data数据：Data，查看第2列：Data[,2]，第2⾏：Data[2,]Adult<-Data[,2]Day<-Data[,3]Precipitation<-Data[,4]result1 <- gam(log(Adult) ~ s(Day)) #此时，Adult为相应变量，Day为解释变量summary(result1) #输出计算结果此时可以看到：Family: gaussianLink function: identityFormula:log(Adult) ~ s(Day)Parametric coefficients:Estimate Std. Error t value Pr(>|t|)(Intercept) 7.9013 0.3562 22.18 4.83e-13 ***---Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1Approximate significance of smooth terms:edf Ref.df F p-values(Day) 1.713 2.139 0.797 0.473R-sq.(adj) = 0.0471 Deviance explained = 14.3%GCV score = 2.6898 Scale est. = 2.2844 n = 18Day的影响⽔平p-value=0.473，解释能⼒为14.3%，说明影响不明显。

GAM并行化

Generalized additive models求解并行思路我们的目标是针对广义可加模型的经典求解方法，提出其并行化改造方案。

本文中针对改进的算法是Local Scoring Algorithm 。

Local Scoring Algorithm 主要步骤中，计算量集中在其调用Backfitting Algorithm 拟合修正后的可加模型这一步骤。

因此我们将细致分析Backfitting Algorithm 的执行步骤并提出并行化方案。

一、GAM 模型及串行化算法简介GAM 形式如下：1[|]()d j j j E Y =G f x α=⎛⎫=+ ⎪⎝⎭∑X x （1）其中Y 为响应变量（因变量），X 为自变量，α为线性误差，而()i f ⋅是称为平滑函数的一种非参数函数，G 为连接函数。

对于GAM 模型，我们主要是为了探寻响应变量Y 与自变量X 间的某种关系，这种关系可能是线性的，也可能是非线性的，也可能是既具有线性成分，又具有非线性成分的关系。

在GAM 中，每一个自变量i X 都是通过一个称为光滑函数的非参数函数()i f ⋅来建立其与响应变量的关系。

而连接函数G 的主要作用主要是将模型的适用范围进行推广，用以将可加模型的适用范围由高斯分布推广到指数分布族。

对于GAM 模型，我们这里使用经典算法Local Scoring Algorithm 与Backfiting Algorithm 的框架对模型进行求解、拟合。

其中Backfitting Algorithm 主要用于估计可加模型，其主要思想如下：根据GAM 模型的形式，我们可以观察到1[()|]()dj k k j k j j kE Y f X X f X α=≠--=∑ （2）其中，对于k ，1<=k<=d 。

主要通过迭代算法计算()j f ⋅，给定一组当前的ˆˆ{,}j f α，从给定的观测值{,}i y i x ，通过迭代计算局部残差1ˆˆ(),1,...,,dl i i l i l l kry f x i n α=≠=--=∑ 通过应用非参数平滑，来更新估计值ˆj f 。

generalized-additive-models算法

generalized additive models算法Generalized Additive Models (GAM), 或者广义可加模型，是统计学中一种常用的非参数回归方法。

它结合了广义线性模型（GLM）和非线性平滑方法，能够适应非线性、非正态分布和非常数方差的数据。

本文将详细介绍GAM算法，并一步一步回答与其相关的问题。

第一部分：GAM算法的介绍1.1 什么是广义可加模型？广义可加模型是一种广义线性模型的扩展形式，它可以处理非线性关系，且不需要假设预测变量之间的交互作用具有线性形式。

广义可加模型通过将预测变量的非线性部分表示为平滑函数的线性组合，从而实现对非线性关系的建模。

1.2 广义可加模型的优点有哪些？广义可加模型具有以下优点：- 不需要假设任何先验形式的数据分布- 可以处理非参数回归问题- 可以通过平滑函数拟合数据的非线性关系- 可以同时考虑多个预测变量的影响第二部分：GAM模型的建立步骤2.1 数据准备首先需要准备用于建模的数据集。

数据集应包含一个响应变量和一个或多个预测变量。

2.2 平滑函数的选择根据数据的特点选择适当的平滑函数，常见的平滑函数包括样条函数（splines）、局部回归（loess）和样条光滑（smoothing splines）等。

平滑函数的选择要考虑数据的特点以及模型的拟合程度。

2.3 模型的拟合与评估通过最小化损失函数来拟合模型，常用的损失函数包括最小二乘法（OLS）和广义最小二乘法（GLS）。

拟合完模型后，需要对模型进行评估，比较观察值和预测值之间的差异。

2.4 平滑度调整根据模型的拟合结果，根据需要调整平滑的程度，以达到最佳的拟合效果。

平滑度的调整可以通过调整平滑参数或者选择不同的平滑函数来实现。

第三部分：GAM模型的应用3.1 连续型响应变量的预测GAM模型在连续型响应变量的预测方面表现出色。

例如，可以使用GAM 模型预测一个人的年龄对其收入的影响，还可以预测某种化学物质的浓度与环境因素之间的关系。

generalized additive mixed modeling

generalized additive mixed modeling1. 引言1.1 概述在统计建模中，回归模型是一种常见的分析工具，用于研究变量之间的关系。

然而，传统的回归模型通常对数据的线性关系做出了限制，无法很好地拟合复杂的非线性关系。

为了解决这个问题，广义可加混合模型（Generalized Additive Mixed Modeling, GAMM）应运而生。

GAMM是一种灵活而强大的统计建模方法，它结合了广义可加模型（Generalized Additive Model, GAM）和混合效应模型（Mixed Effects Model）。

通过引入非线性平滑函数和随机效应，GAMM能够更准确地描述变量之间的复杂关系，并考虑到数据中可能存在的随机变异。

本文将详细介绍GAMM的理论基础、模型框架和参数估计方法。

同时，我们还将探讨GAMM在各个领域中的应用，并与传统回归模型以及混合效应模型进行比较和评估。

最后，我们将总结目前对于GAMM方法的认识，并提出未来研究方向。

1.2 文章结构本文共分为五个部分。

首先，在引言部分概述了GAMM的背景和研究意义。

接下来，第二部分将介绍GAMM的理论基础、模型框架和参数估计方法。

第三部分将详细探讨GAMM在生态学、社会科学和医学研究中的应用案例。

第四部分将与其他回归模型和传统混合模型进行比较，并对GAMM方法的优缺点及局限性进行讨论。

最后，在第五部分中，我们将总结全文的主要内容，并提出对未来研究方向的建议。

1.3 目的本文旨在全面介绍广义可加混合模型（GAMM）这一统计建模方法，以及其在不同领域中的应用。

通过对GAMM的理论基础、模型框架和参数估计方法进行详细描述，读者可以了解到该方法如何解决传统回归模型无法处理非线性关系问题的局限性。

同时，通过实际案例研究，读者可以进一步了解GAMM在生态学、社会科学和医学研究等领域中的应用效果。

此外，通过与其他回归模型和传统混合模型进行比较，本文还旨在评估GAMM方法的优势和局限性。

生物药剂学与药物动力学专业词汇

生物药剂学与药物动力学专业词汇※<A>Absolute bioavailability, F 绝对生物利用度Absorption 吸收Absorption pharmacokinetics 吸收动力学Absorption routes 吸收途径Absorption rate 吸收速率Absorption rate constant 吸收速率常数Absorptive epithelium 吸收上皮Accumulation 累积Accumulation factor 累积因子Accuracy 准确度Acetylation 乙酰化Acid glycoprotein 酸性糖蛋白Active transport 主动转运Atomic absorption spectrometry 原子吸收光谱法Additive 加和型Additive errors 加和型误差Adipose 脂肪Administration protocol 给药方案Administration route 给药途径Adverse reaction 不良反应Age differences 年龄差异Akaike’s information criterion, AIC AIC判据Albumin 白蛋白All-or-none response 全或无效应Amino acid conjugation 氨基酸结合Analog 类似物Analysis of variance, ANOVA ANOVA方差分析Anatomic Volume 解剖学体积Antagonism 拮抗作用Antiproliferation assays 抑制增殖法Apical membrane 顶端表面Apoprotein 载脂蛋白脱辅基蛋白Apparatus 仪器Apparent volume of distribution 表观分布容积Area under the curve, AUC 曲线下面积Aromatisation 芳构化Artery 动脉室Artifical biological membrane 人工生物膜Aryl 芳基Ascorbic acid 抗坏血酸维生素C Assistant in study design 辅助实验设计Average steady-state plasma drug concentration 平均稳态血浆药物浓度Azo reductase 含氮还原酶※<B>Backward elimination 逆向剔除Bacteria flora 菌丛Basal membrane 基底膜Base structural model 基础结构模型Basolateral membrane 侧底膜Bayesian estimation 贝易斯氏评估法Bayesian optimization 贝易斯优化法Bile 胆汁Billiary clearance 胆汁清除率Biliary excretion 胆汁排泄Binding 结合Binding site 结合部位Bioactivation 生物活化Bioavailability, BA 生物利用度Bioequivalence, BE 生物等效性Biological factors 生理因素Biological half life 生物半衰期Biological specimen 生物样品Biomembrane limit 膜限速型Biopharmaceutics 生物药剂学Bioequivalency criteria 生物等效性判断标准Biotransformation 生物转化Biowaiver 生物豁免Blood brain barrier, BBB BBB血脑屏障Blood clearance 血液清除率Blood flow rate-limited models 血流速度限速模型Blood flux in tissue 组织血流量Body fluid 体液Buccal absorption of drug 口腔用药的吸收Buccal mucosa 口腔粘膜颊粘膜Buccal spray formulation 口腔喷雾制剂※<C>Capacity limited 容量限制Carrier mediated transport 载体转运Catenary model 链状模型Caucasion 白种人Central compartment 中央室Characteristic 特点Chelate 螯合物Chinese Traditional medicine products 中药制剂Cholesterol esterase 胆固醇酯酶Chromatogram 色谱图Circulation 循环Classification 分类Clearance 清除率Clinical testing in first phase I期临床试验Clinical testing in second phase Ⅱ期临床试验Clinical testing in third phase Ⅲ期临床试验Clinical trial 临床试验Clinical trial simulation 临床实验计划仿真Clockwise hysteresis loop 顺时针滞后回线Collection 采集Combined administration 合并用药Combined errors 结合型误差Common liposomes, CL 普通脂质体Compartment models 隔室模型Compartments 隔室Competitive interaction 竞争性相互作用Complements 补体Complex 络合物Confidential interval 置信区间Conjugation with glucuronic acid 葡萄糖醛酸结合Controlled-release preparations 控释制剂Control stream 控制文件Conventional tablet 普通片Convergence 收敛Convolution 卷积Corresponding relationship 对应关系Corticosteroids 皮质甾体类Counter-clockwise hysteresis loop 逆时针滞后回线Countermeasure 对策Course in infusion period 滴注期间Covariance 协方差Covariates 相关因素Creatinine 肌酐Creatinine clearance 肌酐清除率Cytochrome P450, CYP450 细胞色素P450 Cytoplasm 细胞质Cytosis 胞饮作用Cytosol 胞浆胞液质※<D>Data File 数据文件Data Inspection 检视数据Deamination 脱氨基Deconvolution 反卷积Degree of fluctuation, DF DF波动度Delayed release preparations 迟释制剂Desaturation 降低饱和度Desmosome 桥粒Desulfuration 脱硫Detoxication 解毒Diagnosis 诊断Diffusion 扩散作用Dietary factors 食物因素Displacement 置换作用Disposition 处置Dissolution 溶解作用Distribution 分布Dosage adjustment 剂量调整Dosage form 剂型Dosage form design 剂型设计Dosage regimen 给药方案Dose 剂量dose-proportionality study 剂量均衡研究Dropping pills 滴丸Drug absorption via eyes 眼部用药物的吸收Drug binding 药物结合Drug concentration in plasma 血浆中药物浓度Drug Delivery System, DDS 药物给药系统Drug interaction 药物相互作用Drug-plasma protein binding ratio 药物—血浆蛋白结合率Drug-Protein Binding 药物蛋白结合Drug transport to foetus 胎内转运※<E>Efficient concentration range 有效浓度范围Efflux 外排Electrolyte 电解质Electro-spray ionization, ESI 电喷雾离子化Elimination 消除Elimination rate constant 消除速度常数Elongation 延长Emulsion 乳剂Endocytosis 入胞作用Endoplasmic reticulum 内质网Enterohepatic cycle 肠肝循环Enzyme 酶Enzyme induction 酶诱导Enzyme inhibition 酶抑制Enzyme-linked immunosorbent assays ELISA 酶联免疫法Enzymes or carrier-mediated system 酶或载体—传递系统Epithelium cell 上皮细胞Epoxide hydrolase 环化物水解酶Erosion 溶蚀Excretion 排泄Exocytosis 出胞作用Exons 外显子Experimental design 实验设计Experimental procedures 实验过程Exponential errors 指数型误差Exposure-response studies 疗效研究Extended least squares, ELS 扩展最小二乘法Extended-release preparations 缓控释制剂Extent of absorption 吸收程度External predictability 外延预见性Extraction ratio 抽取比Extract recovery rate 提取回收率Extrapolation 外推法Extravascular administration 血管外给药※<F>F test F检验Facilitated diffusion 促进扩散Factors of dosage forms 剂型因素Fasting 禁食Fibronectin 纤粘连蛋白First order rate 一级速度First Moment 一阶矩First order absorption 一级吸收First-order conditional estimation, FOCE 一级条件评估法First-order estimation, FO 一级评估法Fiest-order kinetics 一级动力学First pass effect 首过作用首过效应Fixed-effect parameters 固定效应参数Flavoprotein reductaseNADPH－细胞色素还原酶附属黄素蛋白还原酶Flow-through cell dissolution method 流室法Fluorescent detection method 荧光检测法Fraction of steady-state plasma drug concentration 达稳分数Free drug 游离药物Free drug concentration 游离药物浓度※<G>Gap junction 有隙结合Gas chromatography, GC 气相色谱法Gasrtointestinal tract, GI tract 胃肠道Gender differences 性别差异Generalized additive modeling, GAM 通用迭加模型化法Glimepiride 谷胱甘肽Global minimum 整体最小值Glomerular filtration 肾小球过滤Glomerular filtration rate, GFR 肾小球过滤率Glucuonide conjugation 葡萄糖醛酸结合Glutathione conjugation 谷胱甘肽结合Glycine conjugation 甘氨酸结合Glycocalyx 多糖—蛋白质复合体Goodness of Fit 拟合优度Graded response 梯度效应Graphic method 图解法Gut wall clearance肠壁清除率※<H>Half life 半衰期Health volunteers 健康志愿者Hemodialysis 血液透析Hepatic artery perfusion administration 肝动脉灌注给药Hepatic clearance, Clh 肝清除率Hierarchical Models 相同系列药物动力学模型High performance liquid chromatography, HPLC 高效液相色谱Higuchi equation Higuchi 方程Homologous 类似Human liver cytochrome P450 人类肝细胞色素P450 Hydrolysis 水解Hydroxylation 羟基化Hysteresis 滞后Hysteresis of plasma drug concentration 血药浓度滞后于药理效应Hysteresis of response 药理效应滞后于血药浓度※<I>Immunoradio metrec assays, IRMA 免疫放射定量法Incompatibility 配伍禁忌Independent 无关，独立Individual parameters 个体参数Individual variability 个体差异Individualization of drug dosage regimen 给药方案的个体化Inducer 诱导剂Induction 诱导Infusion 输注Inhibition 抑制Inhibitor 抑制剂Initial dose 速释部分Initial values 初始值Injection sites 注射部位Insulin 胰岛素Inter-compartmental clearance 隔室间清除率Inter-individual model 个体间模型Inter-individual random effects 个体间随机效应Inter-individual variability 个体间变异性Intermittence intravenous infusion 间歇静脉输液Internal predictability 内延预见性Inter-occasion random effects 实验间随机效应Intestinal bacterium flora 肠道菌丛Intestinal metabolism 肠道代谢Intra-individual model 个体内模型Intra-individual variability 个体内变异性Intramuscular administration 肌内给药Intramuscular injection 肌内注射Intra-peritoneal administration 腹腔给药Intravenous administration 静脉给药Intravenous infusion 静脉输液Intravenous injection 静脉注射Intrinsic clearance固有清除率内在清除率Inulin 菊粉In vitro experiments 体外试验In vitro–In vivo correlation, IVIVC 体外体内相关关系In vitro mean dissolution time, MDT vitro 体外平均溶出时间In vivo Mean dissolution time, MDT vivo 体内平均溶出时间Ion exchange 离子交换Isoform 异构体Isozyme 同工酶※<K>Kerckring 环状皱褶Kidney 肾※<L>Lag time 滞后时间Laplace transform 拉普拉斯变换Lateral intercellular fluid 侧细胞间隙液Lateral membrane 侧细胞膜Least detection amount 最小检测量Linearity 线性Linear models 线性模型Linear regression method 线性回归法Linear relationship 线性关系Lipoprotein 脂蛋白Liposomes 脂质体Liver flow 肝血流Local minimum 局部最小值Loading dose 负荷剂量Logarithmic models 对数模型Long circulation time liposomes 长循环脂质体Loo-Riegelman method Loo-Riegelman法Lowest detection concentration 最低检测浓度Lowest limit of quantitation 定量下限Lowest steady-state plasma drug concentration 最低稳态血药浓度Lung clearance 肺清除率Lymphatic circulation 淋巴循环Lymphatic system 淋巴系统※<M>Maintenance dose 维持剂量Mass balance study 质量平衡研究Masticatory mucosa 咀嚼粘膜Maximum likelihood 最大似然性Mean absolute prediction error, MAPE 平均绝对预测误差Mean absorption time, MAT 平均吸收时间Mean disintegration time, MDIT 平均崩解时间Mean dissolution time, MDT 平均溶出时间Mean residence time, MRT 平均驻留时间Mean sojourn time 平均逗留时间Mean squares 均方Mean transit time 平均转运时间Membrane-limited models 膜限速模型Membrane-mobile transport 膜动转运Membrane transport 膜转运Metabolism 代谢Metabolism enzymes 代谢酶Metabolism locations 代谢部位Metabolites 代谢物Metabolites clearance, Clm 代谢物清除率Method of residuals 残数法剩余法Methylation 甲基化Michaelis-Menten equation 米氏方程Michaelis-Menten constant 米氏常数Microbial assays 微生物检定法Microsomal P－450 mixed-function oxygenases 肝微粒体P-450混合功能氧化酶Microspheres 微球Microvilli 微绒毛Minimum drug concentration in plasma 血浆中最小药物浓度Mixed effects modeling 混合效应模型化Mixed-function oxidase, MFO 混合功能氧化酶Models 模型Modeling efficiency 模型效能Model validation 模型验证Modified release preparations 调释制剂Molecular mechanisms 分子机制Mono-exponential equation 单指数项公式Mono-oxygenase 单氧加合酶Mucous membrane injury 粘膜损伤Multi-compartment models 多室模型延迟分布模型Multi-exponential equation 多指数项公式Multifactor analysis of variance, multifactor ANOVA 多因素方差分析Multiple dosage 多剂量给药Multiple-dosage function 多剂量函数Multiple-dosage regimen 多剂量给药方案Multiple intravenous injection 多次静脉注射Myoglobin 肌血球素※<N>Naive average data, NAD 简单平均数据法Naive pool data, NPD 简单合并数据法Nanoparticles 纳米粒Nasal cavity 鼻腔Nasal mucosa 鼻粘膜National Institute of Health 美国国立卫生研究所Nephron 肾原Nephrotoxicity 肾毒性No hysteresis 无滞后Non-compartmental analysis, NCA 非隔室模型法Non-compartmental assistant Technology 非隔室辅助技术Nonionized form 非离子型Nonlinear mixed effects models, NONMEM 非线性混合效应模型Nonlinear pharmacokinetics 非线性药物动力学Non-linear relationship 非线性关系Nonparametric test 非参数检验※<O>Objective function, OF 目标函数Observed values 观测值One-compartment model 一室模型（单室模型）Onset 发生Open randomized two-way crossover design 开放随机两路交叉实验设计Open crossover randomized design 开放交叉随机设计Oral administration 口服给药Ordinary least squares, OLS 常规最小二乘法Organ 器官Organ clearance 器官清除率Original data 原始数据Osmosis 渗透压作用Outlier 偏离数据Outlier consideration 异常值的考虑Over-parameterized 过度参数化Oxidation 氧化Oxidation reactions 氧化反应※<P>Paracellular pathway 细胞旁路通道Parameters 参数Passive diffusion 被动扩散Pathways 途径Patient 病人Peak concentration 峰浓度Peak concentration of drug in plasma 血浆中药物峰浓度Poly-peptide 多肽Percent of absorption 吸收百分数Percent of fluctuation, PF 波动百分数Perfused liver 灌注肝脏Period 周期Peripheral compartments 外周室Peristalsis 蠕动Permeability of cell membrane 细胞膜的通透性P-glycoprotein, p-gp P-糖蛋白Phagocytosis 吞噬Pharmaceutical dosage form 药物剂型pharmaceutical equivalents 药剂等效性Pharmacokinetic models 药物动力学模型Pharmacokinetic physiological models 药物动力学的生理模型Pharmacological effects 药理效应Pharmacologic efficacy 药理效应Pharmacokinetics, PK 药物动力学Pharmacokinetic/pharmacodynamic link model 药物动力学-药效动力学统一模型Pharmacodynamics, PD 药效动力学Pharmacodynamic model 药效动力学模型Phase II metabolism 第II相代谢Phase I metabolism 第I相代谢pH-partition hypothesis pH分配假说Physiological function 生理功能Physiological compartment models 生理房室模型Physiological pharmacokinetic models 生理药物动力学模型Physiological pharmacokinetics 生理药物动力学模型Pigment 色素Physicochemical factors 理化因素Physicochemical property of drug 药物理化性质Physiological factors 生理因素Physiology 生理Physiological pharmacokinetic models 生理药物动力学模型Pinocytosis 吞噬Plasma drug concentration 血浆药物浓度Plasma drug concentration－time curve 血浆药物浓度-时间曲线Plasma drug-protein binding 血浆药物蛋白结合Plasma metabolite concentration 血浆代谢物浓度Plasma protein binding 血浆蛋白结合Plateau level 坪浓度Polymorphism 多态性Population average pharmacokinetic parameters 群体平均动力学参数Population model 群体模型Population parameters 群体参数Population pharmacokinetics 群体药物动力学Post-absorptive phase 吸收后相Post-distributive phase 分布后相Posterior probability 后发概率practical pharmacokinetic program 实用药代动力学计算程序Precision 精密度Preclinical 临床前的Prediction errors 预测偏差Prediction precision 预测精度Predicted values 拟合值Preliminary structural model 初始结构模型Primary active transport 原发性主动转运Principle of superposition 叠加原理Prior distribution 前置分布Prodrug 前体药物Proliferation assays 细胞增殖法Proportional 比例型Proportional errors 比例型误差Prosthehetic group 辅基Protein 蛋白质Pseudo-distribution equilibrium 伪分布平衡Pseudo steady state 伪稳态Pulmonary location 肺部Pulsatile drug delivery system 脉冲式释药系统※<Q、R>QQuality controlled samples 质控样品Quality control 质量控制Quick tissue 快分布组织RRadioimmuno assays, RIA 放射免疫法Random error model 随机误差模型Rapid intravenous injection 快速静脉注射Rate constants 速度常数Rate method 速度法Re-absorption 重吸收Receptor location 受体部位Recovery 回收率Rectal absorption 直肠吸收Rectal blood circulation 直肠部位的血液循环Rectal mucosa 直肠黏膜Reductase 还原酶Reduction 还原Reductive metabolism 还原代谢Reference individual 参比个体Reference product 参比制剂Relative bioavailability, Fr 相对生物利用度Release 释放Release medium 释放介质Release standard 释放度标准Renal 肾的Renal clearance, Clr 肾清除率Renal excretion 肾排泄Renal failure 肾衰Renal impairment 肾功能衰竭Renal tubular 肾小管Renal tubular re-absorption 肾小管重吸收Renal tubular secretion 肾小管分泌Repeatability 重现性Repeated one-point method 重复一点法Requirements 要求Research field 研究内容Reside 驻留Respiration 呼吸Respiration organ 呼吸器官Response 效应Residuals 残留误差Residual random effects 残留随机效应Reversal 恢复Rich Data 富集数据Ritschel one-point method Ritschel 一点法Rotating bottle method 转瓶法Rough surfaced endoplasmic reticulum 粗面内质网Routes of administration 给药途径※<S、T>SSafety and efficacy therapy 安全有效用药Saliva 唾液Scale up 外推Scale-Up/Post-Approval Changes, SUPAC 放大/审批后变化Second moment 二阶矩Secondary active transport 继发性主动转运Secretion 分泌Sensitivity 灵敏度Serum creatinine 血清肌酐Sigma curve 西格玛曲线Sigma-minus method 亏量法（总和减量法）Sigmoid curve S型曲线Sigmoid model Hill’s方程Simulated design 模拟设计Single-dose administration 单剂量（单次）给药Single dose response 单剂量效应Sink condition 漏槽条件Skin 皮肤Slow Tissue 慢分布组织Smooth surfaced endoplasmic reticulum 滑面内质网Soluble cell sap fraction 可溶性细胞液部分Solvent drag effect 溶媒牵引效应Stability 稳定性Steady-state volume of distribution 稳态分布容积Sparse data 稀疏数据Special dosage forms 特殊剂型Special populations 特殊人群Specialized mucosa 特性粘膜Species 种属Species differences 种属差异Specificity 特异性专属性Square sum of residual error 残差平方和Stagnant layer 不流动水层Standard curve 标准曲线Standard two stage, STS 标准两步法Statistical analysis 统计分析Statistical moments 统计矩Statistical moment theory 统计矩原理Steady state 稳态Steady state plasma drug concentration 稳态血药浓度Stealth liposomes, SL 隐形脂质体Steroid 类固醇Steroid-sulfatases 类固醇－硫酸酯酶Structure 结构Structure and function of GI epithelial cells 胃肠道上皮细胞的构造与功能Subcutaneous injections 皮下注射Subgroup 亚群体Subjects 受试者Sublingual administration 舌下给药Sublingual mucosa 舌下粘膜Subpopulation 亚群Substrate 底物Sulfate conjugation 硫酸盐结合Sulfation 硫酸结合Sum of squares 平方和Summation 相加Superposition method 叠加法Susceptible subject 易受影响的患者Sustained-release preparations 缓释制剂Sweating 出汗Synergism 协同作用Systemic clearance 全身清除率TTargeting 靶向化Taylor expansion 泰勒展开Tenous capsule 眼球囊Test product 试验制剂Therapy drug monitoring, TDM 治疗药物监测Therapeutic index 治疗指数Thermospray 热喷雾Three-compartment models 三室模型Though concentration 谷浓度Though concentration during steady state 稳态谷浓度Thromboxane 血栓素Tight junction 紧密结合Tissue 组织Tissue components 组织成分Tissue interstitial fluid 组织间隙Tolerance 耐受性Topping effect 尖峰效应Total clearance 总清除率Toxication and emergency treatment 中毒急救Transcellular pathway 经细胞转运通道Transdermal absorption 经皮肤吸收Transdermal drug delivery 经皮给药Transdermal penetration 经皮渗透Transport 转运Transport mechanism of drug 药物的转运机理Trapezoidal rule 梯形法Treatment 处理Trial Simulator 实验计划仿真器Trophoblastic epithelium 营养上皮层Two-compartment models 二室模型Two one sided tests 双单侧t检验Two period 双周期Two preparations 双制剂Two-way crossover bioequivalence studies 双周期交叉生物等效性研究Typical value 典型值※<U~Z>UUnwanted 非预期的Uniformity 均一性Unit impulse response 单位刺激反应Unit line 单位线Urinary drug concentration 尿药浓度Urinary excretion 尿排泄Urinary excretion rate 尿排泄速率VVagina 阴道Vaginal Mucosa 阴道黏膜Validation 校验Variance of mean residence time, VRT 平均驻留时间的方差Vein 静脉室Villi 绒毛Viscre 内脏Volumes of distribution 分布容积volunteers or patients studies 人体试验WWagner method Wagner法Wagner-Nelson method Wagner-Nelson法Waiver requirements 放弃（生物等效性研究）要求Washout period 洗净期Weibull distribution function Weibull分布函数Weighted Least Squares WLS加权最小二乘法Weighted residuals 加权残留误差XXenobiotic 外源物, 异生素ZZero Moment 零阶矩Zero-order absorption 零级吸收Zero-order kinetics 零级动力学Zero order rate 零级速度Zero-order release 零级释放。

6 孟生旺：广义线性模型—发展与应用

参数估计：
b(m + 1) = b(m ) + (X ⅱ + 2l *A SA)- 1[X ⅱ (y - m + l *A S 2I ] WX WG )
18
GLM的推广与应用的推广
• 分布假设的推广
– 过离散：
• 混合泊松分布：泊松-逆高斯，泊松-对数正态
– 零膨胀：
• 零膨胀模型
– 长尾：
• 对数正态，帕累托
11
• 模型比较模型比较：信息准则
A IC = − 2 l + 2 p B IC = − 2 l + p ln( n )
– AIC或BIC的值越小越好。 – 误差平方和的比较？
12
GLM的优缺点的优缺点 • 优点：
– 统计检验 – 处理相关性和交互作用（见下页） – 现成软件
• 缺点：
– 无法处理加法和乘法的混合模型 – 参数模型，函数形式有限 – 寻找交互项：耗时
yij p q ∑ nij α i f ( ) αi ˆ β j = f −1 i ∑ nij pα i q i
26
应用案例
• 来源： Ismail et al.(2007) 和Cheong et al.(2008) • 马来西亚车险汇总数据
分类变量保障类型水平综合险非综合险国内国外男性个人女性个人商务 0至1年 2至3年 4至5年 6年以上中部北部东部南部东马
28
广义线性模型的拟合结果比较
29
回归树的结果
30
模型的误差平方和比较
模型线性回归回归树泊松-逆高斯回归负二项回归泊松回归神经网络（1个神经元）神经网络（2个神经元）神经网络（3个神经元）误差平方和参数个数（SSE） 11 11 12 12 11 13 25 37 19.08 16.76 15.08 14.73 13.04 12.30 5.85 5.11 类R2 0.7274 0.7606 0.7846 0.7896 0.8138 0.8242 0.9165 0.9270

1、下载文档前请自行甄别文档内容的完整性，平台不提供额外的编辑、内容补充、找答案等附加服务。
2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
3、如文档侵犯您的权益，请联系客服反馈,我们会尽快为您处理(人工客服工作时间：9:00-18:30)。

of the tted curve. The right panel of gure 1 shows a cubic spline t to the data.
What value of did we use in gure 1? In fact it is not convenient to express the desired smoothness of f in terms of , as the meaning of
* *
*
*
*
*
*** *
* *
*
0.5
1.5
2.5
x
y
-1
0
1
*
* **
*
** * ** * **
** * **
*
*****
* * **
** **
*
** * ** **********
*
* ****
*
******
* *
*
*
*
*
*** *
* *
*
0.5
1.5
2.5
x
-1
Figure 1: Left panel shows a ctitious scatterplot of an outcome measure y plotted against a prognostic factor x. In the right panel, a scatterplot smooth has been added to describe the trend of y on x.
fቤተ መጻሕፍቲ ባይዱctor. of y on
We wish x. If we
to were
ta to
smooth nd the
curve curve
f (x) that
that summarizes simply minimizes
tPhe(ydie?pefn(dxein))c2e,
the result would be an interpolating curve that would not be smooth at all.
Department of Statistics and Division of Biostatistics, Stanford University, Stanford California 94305; trevor@
yDepartment of Preventive Medicine and Biostatistics, and Department of Statistics, University of Toronto; tibs@; tibs@
depends on the units of the prognostic factor x. Instead, it is possible to de ne an \e ective number of parameters" or \degrees of freedom" of a cubic spline smoother, and then use a numerical search to determine the value of to yield this number. In gure 1 we chose the e ective number of parameters to be 5. Roughly speaking, this means that the complexity of the curve is about the same as a polynomial regression of degrees 4. However, the cubic spline smoother \spreads out" its parameters in a more even manner, and hence is much more exible than a polynomial regression. Note that the degrees of freedom of a smoother need not be an integer.
Generalized additive models
Trevor Hastie
and
Robert Tibshirani y
1 Introduction
In the statistical analysis of clinical trials and observational studies, the identi cation and adjustment for prognostic factors is an important component. Valid comparisons of di erent treatments requires the appropriate adjustment for relevant prognostic factors. The failure to consider important prognostic variables, particularly in observational studies, can lead to errors in estimating treatment di erences. In addition, incorrect modeling of prognostic factors can result in the failure to identify nonlinear trends or threshold e ects on survival.
dataset. Fast and stable numerical procedures are available for computation
2
1
0
y
*
* **
*
*** ** * **
** * **
*
*****
* * **
** **
*
** * ** **********
*
* ****
*
******
IbtygPov(eyrin?s
the tradeo f(xi))2) and
between the goodness of t wiggliness of the function.
to the Larger
data (as values of
measured force f
to be smoother.
The above discussion tells how to t a curve to a single prognostic factor. With multiple prognostic factors, if xij denotes the value of the jth prognostic
This article describes exible statistical methods that may be used to identify and characterize the e ect of potential prognostic factors on an outcome variable. These methods are called \generalized additive models", and extend the traditional linear statistical model. They can be applied in any setting where a linear or generalized linear model is typically used. These settings include standard continuous response regression, categorical or ordered categorical response data, count data, survival data and time series.
We rst give some background on the methodology, and then discuss the
details of the logistic regression model and its generalization. Some related
developments are discussed in the last section.
1
opatdrheddeirticitvteoecrhmonofidqtuehleerse)fopmrlamocdePeslPtxhjxejje,
ects of prognostic factors xj in terms of a linear
where j with
Pthfej
j
(xj
are parameters. The generalized ) where fj is a unspeci ed (\non-
setting, and then indicate how it is used in generalized additive modeling.
Suppose that we have a scatterplot of points (xi; yi) like that shown in gure 1. Here y is a response or outcome variable, and x is a prognostic
For any value of , the solution to (1) is a cubic spline, i.e., a piecewise