计量经济学第四章作业参考答案

合集下载

《计量经济学》习题（第四章）

《计量经济学》习题（第四章）第四章习题⼀、单选题1、如果回归模型违背了同⽅差假定，最⼩⼆乘估计量＿＿＿＿A ．⽆偏的，⾮有效的 B.有偏的，⾮有效的C ．⽆偏的，有效的 D.有偏的，有效的2、Goldfeld-Quandt ⽅法⽤于检验＿＿＿＿A ．异⽅差性 B.⾃相关性C ．随机解释变量 D.多重共线性3、DW 检验⽅法⽤于检验＿＿＿＿A ．异⽅差性 B.⾃相关性C ．随机解释变量 D.多重共线性4、在异⽅差性情况下，常⽤的估计⽅法是＿＿＿＿A ．⼀阶差分法 B.⼴义差分法C ．⼯具变量法 D.加权最⼩⼆乘法5、在以下选项中，正确表达了序列⾃相关的是＿＿＿＿j i u x Cov D j i x x Cov C ji u u Cov B ji u u Cov A j i j i j i j i ≠≠≠≠≠=≠≠,0),(.,0),(.,0),(.,0),(.6、如果回归模型违背了⽆⾃相关假定，最⼩⼆乘估计量＿＿＿＿A ．⽆偏的，⾮有效的 B.有偏的，⾮有效的C ．⽆偏的，有效的 D.有偏的，有效的7、在⾃相关情况下，常⽤的估计⽅法＿＿＿＿A ．普通最⼩⼆乘法 B.⼴义差分法C ．⼯具变量法 D.加权最⼩⼆乘法8、White 检验⽅法主要⽤于检验＿＿＿＿A ．异⽅差性 B.⾃相关性C ．随机解释变量 D.多重共线性9、Glejser 检验⽅法主要⽤于检验＿＿＿＿A ．异⽅差性 B.⾃相关性C ．随机解释变量 D.多重共线性10、简单相关系数矩阵⽅法主要⽤于检验＿＿＿＿A ．异⽅差性 B.⾃相关性C ．随机解释变量 D.多重共线性2222)(.)(.)(.)(.σσσσ==≠≠i i i i x Var D u Var C x Var B u Var A12、所谓不完全多重共线性是指存在不全为零的数k λλλ,,,21 ，有＿＿＿＿1112211221221122.0.0..k k k k k x x x k k k k A x x x v B x x x C x x x v e D x x x v e v λλλλλλλλλλλλ++++=+++=∑?++++=++++=式中是随机误差项13、设21,x x 为解释变量，则完全多重共线性是＿＿＿＿0.(021.0.021.22121121=+=++==+x x e x D v v x x C e x B x x A 为随机误差项）14、⼴义差分法是对＿＿＿＿⽤最⼩⼆乘法估计其参数 11211211121121)()1(....-------+-+-=-++=++=++=t t t t t t t t t t t t t t t u u x x y y D u x y C u x y B u x y A ρρβρβρρρβρβρββββ15、在DW 检验中要求有假定条件，在下列条件中不正确的是＿＿＿＿A ．解释变量为⾮随机的 B.随机误差项为⼀阶⾃回归形式C ．线性回归模型中不应含有滞后内⽣变量为解释变量D.线性回归模型为⼀元回归形式16、在下例引起序列⾃相关的原因中，不正确的是＿＿＿＿A.经济变量具有惯性作⽤B.经济⾏为的滞后性C.设定偏误D.解释变量之间的共线性17、在DW 检验中，当d 统计量为2时，表明＿＿＿＿A.存在完全的正⾃相关B.存在完全的负⾃相关C.不存在⾃相关D.不能判定18、在DW 检验中，当d 统计量为4时，表明＿＿＿＿A.存在完全的正⾃相关B.存在完全的负⾃相关C.不存在⾃相关D.不能判定19、在DW 检验中，当d 统计量为0时，表明＿＿＿＿A.存在完全的正⾃相关C.不存在⾃相关D.不能判定20、在DW 检验中，存在不能判定的区域是＿＿＿＿A. 0﹤d ﹤l d ,4-l d ﹤d ﹤4B. u d ﹤d ﹤4-u dC. l d ﹤d ﹤u d ,4-u d ﹤d ﹤4-l dD. 上述都不对21、在修正序列⾃相关的⽅法中，能修正⾼阶⾃相关的⽅法是＿＿＿＿A. 利⽤DW 统计量值求出ρB. Cochrane-Orcutt 法C. Durbin 两步法D. 移动平均法22、在下列多重共线性产⽣的原因中，不正确的是＿＿＿＿A.经济本变量⼤多存在共同变化趋势B.模型中⼤量采⽤滞后变量C.由于认识上的局限使得选择变量不当D.解释变量与随机误差项相关23、在DW 检验中，存在正⾃相关的区域是＿＿＿＿A. 4-l d ﹤d ﹤4B. 0﹤d ﹤l dC. u d ﹤d ﹤4-u dD. l d ﹤d ﹤u d ,4-u d ﹤d ﹤4-l d24、逐步回归法既检验⼜修正了＿＿＿＿A ．异⽅差性 B.⾃相关性 C ．随机解释变量 D.多重共线性25、设)()(,2221i i i i i ix f u Var u x y σσββ==++=，则对原模型变换的正确形式为＿＿＿＿ )()()()(.)()()()(.)()()()(..212222122121i i i i i i i i i i i i i i i i i i i i i i i i x f u x f x x f x f y D x f u x f x x f x f y C x f u x f x x f x f y B u x y A ++=++=++=++=ββββββββ 26、在修正序列⾃相关的⽅法中，不正确的是＿＿＿＿A.⼴义差分法B.普通最⼩⼆乘法C.⼀阶差分法D. Durbin 两步法27、在检验异⽅差的⽅法中，不正确的是＿＿＿＿A. Goldfeld-Quandt ⽅法B. spearman 检验法C. White 检验法28、在DW 检验中，存在零⾃相关的区域是＿＿＿＿A. 4-l d ﹤d ﹤4B. 0﹤d ﹤l dC. u d ﹤d ﹤4-u dD. l d ﹤d ﹤u d ,4-u d ﹤d ﹤4-l d29．如果模型中的解释变量存在完全的多重共线性，参数的最⼩⼆乘估计量是（）A ．⽆偏的 B. 有偏的 C. 不确定 D. 确定的30. 已知模型的形式为u x y 21+β+β=,在⽤实际数据对模型的参数进⾏估计的时候,测得DW 统计量为0.6453,则⼴义差分变量是( )A. 1t t ,1t t x 6453.0x y 6453.0y ----B. 1t t 1t t x 6774.0x ,y 6774.0y ----C. 1t t 1t t x x ,y y ----D. 1t t 1t t x 05.0x ,y 05.0y ----31. 在具体运⽤加权最⼩⼆乘法时,如果变换的结果是x u x x x 1xy 21+β+β=,则Var(u)是下列形式中的哪⼀种?( )A. 2σxB. 2σ2x B. 2σx D. 2σLog(x)32. 在线性回归模型中,若解释变量1x 和2x 的观测值成⽐例,即有i 2i 1kx x =,其中k 为⾮零常数,则表明模型中存在( )A. 异⽅差B. 多重共线性C. 序列⾃相关D. 设定误差33. 已知DW 统计量的值接近于2，则样本回归模型残差的⼀阶⾃相关系数ρ近似等于( ) A. 0 B. –1 C. 1 D. 4⼆、多项选择1、能够检验多重共线性的⽅法有＿＿＿＿A.简单相关系数法B. DW检验法C. 判定系数检验法D. ⽅差膨胀因⼦检验E.逐步回归法2、能够修正多重共线性的⽅法有＿＿＿＿A.增加样本容量B.岭回归法C.剔除多余变量E.差分模型3、如果模型中存在异⽅差现象，则会引起如下后果＿＿＿＿A. 参数估计值有偏B. 参数估计值的⽅差不能正确确定C. 变量的显著性检验失效D. 预测精度降低E. 参数估计值仍是⽆偏的4、能够检验异⽅差的⽅法是＿＿＿＿A. gleiser检验法B. White检验法C. 图形法D. spearman检验法E. DW检验法F. Goldfeld-Quandt检验法5、如果模型中存在序列⾃相关现象，则会引起如下后果＿＿＿＿A. 参数估计值有偏B. 参数估计值的⽅差不能正确确定C. 变量的显著性检验失效D. 预测精度降低E. 参数估计值仍是⽆偏的6、检验序列⾃相关的⽅法是＿＿＿＿A. gleiser检验法B. White检验法C. 图形法D. DW检验法E. Goldfeld-Quandt检验法7、能够修正序列⾃相关的⽅法有＿＿＿＿A. 加权最⼩⼆乘法B. Durbin两步法C. ⼴义最⼩⼆乘法D. ⼀阶差分法E. ⼴义差分法8、Goldfeld-Quandt检验法的应⽤条件是＿＿＿＿A. 将观测值按解释变量的⼤⼩顺序排列B. 样本容量尽可能⼤C. 随机误差项服从正态分布D. 将排列在中间的约1/4的观测值删除掉9、在DW检验中，存在不能判定的区域是＿＿＿＿A. 0﹤d﹤l dB. u d﹤d﹤4-u dC. l d﹤d﹤u dD. 4-u d﹤d﹤4-l dE. 4-l d﹤d﹤4。

第四章计量经济学答案范文

第四章一元线性回归第一部分学习目的和要求本章主要介绍一元线性回归模型、回归系数的确定和回归方程的有效性检验方法。

回归方程的有效性检验方法包括方差分析法、t检验方法和相关性系数检验方法。

本章还介绍了如何应用线性模型来建立预测和控制。

需要掌握和理解以下问题：1 一元线性回归模型2 最小二乘方法3 一元线性回归的假设条件4 方差分析方法5 t检验方法6 相关系数检验方法7 参数的区间估计8 应用线性回归方程控制与预测9 线性回归方程的经济解释第二部分练习题一、术语解释1 解释变量2 被解释变量3 线性回归模型4 最小二乘法5 方差分析6 参数估计7 控制8 预测二、填空ξ，目的在于使模型更1 在经济计量模型中引入反映（）因素影响的随机扰动项t符合（）活动。

2 在经济计量模型中引入随机扰动项的理由可以归纳为如下几条：（1）因为人的行为的（）、社会环境与自然环境的（）决定了经济变量本身的（）；（２）建立模型时其他被省略的经济因素的影响都归入了（）中；（３）在模型估计时，（）与归并误差也归入随机扰动项中；（4）由于我们认识的不足，错误的设定了（）与（）之间的数学形式，例如将非线性的函数形式设定为线性的函数形式，由此产生的误差也包含在随机扰动项中了。

3 （）是因变量离差平方和，它度量因变量的总变动。

就因变量总变动的变异来源看，它由两部分因素所组成。

一个是自变量，另一个是除自变量以外的其他因素。

（）是拟合值的离散程度的度量。

它是由自变量的变化引起的因变量的变化，或称自变量对因变量变化的贡献。

（）是度量实际值与拟合值之间的差异，它是由自变量以外的其他因素所致，它又叫残差或剩余。

4 回归方程中的回归系数是自变量对因变量的（）。

某自变量回归系数β的意义，指的是该自变量变化一个单位引起因变量平均变化（）个单位。

5 模型线性的含义，就变量而言，指的是回归模型中变量的（）；就参数而言，指的是回归模型中的参数的（）；通常线性回归模型的线性含义是就（）而言的。

(完整word版)计量经济学第四章习题详解

第四章习题4.1 没有进行t检验,并且调整的可决系数也没有写出来，也就是没有考虑自由度的影响，会使结果存在误差.4.3200224430.3120332。

7 330.6200334195。

6135822.8 334。

6200446435.8159878.3 l347.7200554273.7183084.8 353.9200663376.9211923。

5 359。

2200773284。

6249529。

9 376.5200879526.5314045.4 398.7200968618。

4340902。

8 395。

9201094699.3401512.8 408。

92011113161.4472881.6 431.0一研究的目的和要求我们知道，商品进口额与很多因素有关，了解其变化对进出口产品有很大帮助。

为了探究和预测商品进口额的变化，需要定量地分析影响商品进口额变化的主要因素。

二、模型的设定及其估计经分析，商品进口额可能与国内生产总值、居民消费价格指数有关。

为此,考虑国内生产总值GDP、居民消费价格指数CPI为主要因素。

各影响变量与商品进口额呈正相关。

为此，设定如下形式的计量经济模型：=+ln+lnCP式中,亿元）；lnGDP为国内生产总值(亿元）；lnCPI为居民消费价格指数（以1985年为100）。

各解释变量前的回归系数预期都大于零。

为估计模型，根据上表的数据，利用EViews软件，生成Y、lnGDP、lnCPI等数据，采用OLS方法估计模型参数,得到的回归结果如下图所示:模型方程为：lnY=-3。

111486+1。

338533lnGDP-0.421791lnCPI（0。

463010）(0。

088610）(0。

233295）t= (—6。

720126) (15。

10582）（—1。

807975)=0.988051 =0.987055 F=992。

2582该模型=0.988051,=0。

987055,可决系数很高，F检验值为992.2582，明显显著。

计量经济学第二版第四章课后习题

第四章课后习题 4.1 解1）存在22βˆαˆ=且33βˆγˆ=。

因为2X 和3X 之间的相关系数为零，即2X 和3X 相互之间不存在线性关系，两者是相互独立的，所以分别一元回归和二元回归两者的系数都不会发生变化。

利用公式证明如下：2）会。

3）如第一问解释，22βˆαˆ=，33βˆγˆ=是成立的，所以存在)αˆ()βˆ(22Var Var =，)αˆ()βˆ(33Var Var =。

4.2 解：根据我对多重共线性的认识，我认为任何一种逐步回归都存在弊端。

根据课本上对多重共线性的定义，不仅包括解释变量之间精确的线性关系，还包括解释变量之间近似的线性关系。

而逐步回归法是通过逐步筛选并剔除引起多重共线性的变量。

所以在采用逐步回归法时，难免会出现一些不符合要求的变量被剔除的情况，此变量岁引起多重共线性，但其对被解释变量也有一定的影响，直接剔除就是忽略其的影响，使得回归结果不够精确。

误差增大。

4.3解：将数据输入到Eviews中，可得如下图所示：图1注释：X2表示国内生产总值GDP，X3表示居民消费价格指数CPI。

利用软件，采用最小二乘法进行回归，结果如下图所示：图2 建立回归模型如下：i t X X Y μβββ＋＋＋33221ln ln ln =1）从回归结果中，可知此模型的参数1β=﹣3.06015，2β=1.656675，3β=﹣1.0570542）利用软件求出lnx2和lnx3的相关系数，可得由上图可知lnx2，lnx3之间存在很强的线性相关性。

证实存在多重共线性。

根据题目要求分别进行三次回归：图3 图4 图5根据元以上回归结果可知，lnx2和lnx3在一元回归中分别对lny 影响显著，都通过了P 值法的检验；而在第三个回归结果中，可知lnx2和lnx3存在显著的线性关系，因为1C 没有通过P 值法检验，故修正后的模型为32ln 245971.2ln x x =。

故此题中存在严重的多重共线性。

计量经济学第四章练习题及参考解答

第四章练习题及参考解答4.1 假设在模型i i i iu X X Y +++=33221βββ中，32X X 与之间的相关系数为零，于是有人建议你进行如下回归：ii i i i i u X Y u X Y 23311221++=++=γγαα(1)是否存在3322ˆˆˆˆβγβα==且？为什么？ (2)111ˆˆˆβαγ会等于或或两者的某个线性组合吗？ (3)是否有()()()()3322ˆvar ˆvar ˆvar ˆvarγβαβ==且？练习题4.1参考解答：(1) 存在3322ˆˆˆˆβγβα==且。

因为()()()()()()()23223223232322ˆ∑∑∑∑∑∑∑--=iiiii iii iii x x x x x xx y x x y β当32X X 与之间的相关系数为零时，离差形式的032=∑i i x x有()()()()222223222322ˆˆαβ===∑∑∑∑∑∑iiiiiiii xx y x x x x y 同理有：33ˆˆβγ= (2) 111ˆˆˆβαγ会等于或的某个线性组合因为12233ˆˆˆY X X βββ=--，且122ˆˆY X αα=-，133ˆˆY X γγ=- 由于3322ˆˆˆˆβγβα==且，则 11222222ˆˆˆˆˆY Y X Y X X αααββ-=-=-=11333333ˆˆˆˆˆY Y X Y X X γγγββ-=-=-=则 1112233231123ˆˆˆˆˆˆˆY Y Y X X Y X X Y X X αγβββαγ--=--=--=+- (3) 存在()()()()3322ˆvar ˆvar ˆvar ˆvarγβαβ==且。

因为()()∑-=22322221ˆvarr x iσβ当023=r 时，()()()22222232222ˆvar 1ˆvar ασσβ==-=∑∑iixr x 同理，有()()33ˆvar ˆvar γβ=4.2在决定一个回归模型的“最优”解释变量集时人们常用逐步回归的方法。

计量经济学课后答案第四、五章（内容参考）

计量经济学课后答案第四、五章（内容参考）第四章随机解释变量问题1. 随机解释变量的来源有哪些?答：随机解释变量的来源有：经济变量的不可控，使得解释变量观测值具有随机性；由于随机干扰项中包括了模型略去的解释变量，而略去的解释变量与模型中的解释变量往往是相关的；模型中含有被解释变量的滞后项，而被解释变量本身就是随机的。

2．随机解释变量有几种情形? 分情形说明随机解释变量对最小二乘估计的影响与后果?答：随机解释变量有三种情形，不同情形下最小二乘估计的影响和后果也不同。

(1)解释变量是随机的，但与随机干扰项不相关；这时采用OLS估计得到的参数估计量仍为无偏估计量；(2)解释变量与随机干扰项同期无关、不同期相关；这时OLS估计得到的参数估计量是有偏但一致的估计量；(3)解释变量与随机干扰项同期相关；这时OLS估计得到的参数估计量是有偏且非一致的估计量。

3. 选择作为工具变量的变量必须满足那些条件?答：选择作为工具变量的变量需满足以下三个条件：(1)与所替代的随机解释变量高度相关；(2)与随机干扰项不相关；(3)与模型中其他解释变量不相关，以避免出现多重共线性。

4．对模型Y t =β+β1X1t+β2X2t+β3Yt-1+μt假设Yt-1与μt相关。

为了消除该相关性，采用工具变量法：先求Y t关于X1t与 X2t回归，得到Yt，再做如下回归：Y t =β+β1X1t+β2X2t+β3Y t?1-+μt试问：这一方法能否消除原模型中Yt的相关性? 为什么?解答：能消除。

在基本假设下，X1t，X2t与μt应是不相关的，由此知，由X1t 与X2t估计出的Yt应与μt不相关。

5．对于一元回归模型Y t =β+β1Xt*+μt假设解释变量Xt *的实测值Xt与之有偏误：Xt= Xt*+et,其中et是具有零均值、无序列相关，且与Xt不相关的随机变量。

试问：(1) 能否将X t= X t*+e t代入原模型，使之变换成Y t=β0+β1X t+νt后进行估计? 其中，νt为变换后模型的随机干扰项。

计量经济学第4章课后答案

17CHAPTER 4SOLUTIONS TO PROBLEMS4.2 (i) and (iii) generally cause the t statistics not to have a t distribution under H 0.Homoskedasticity is one of the CLM assumptions. An important omitted variable violates Assumption MLR.3. The CLM assumptions contain no mention of the sample correlations among independent variables, except to rule out the case where the correlation is one.4.3 (i) While the standard error on hrsemp has not changed, the magnitude of the coefficient has increased by half. The t statistic on hrsemp has gone from about –1.47 to –2.21, so now the coefficient is statistically less than zero at the 5% level. (From Table G.2 the 5% critical value with 40 df is –1.684. The 1% critical value is –2.423, so the p -value is between .01 and .05.)(ii) If we add and subtract 2βlog(employ ) from the right-hand-side and collect terms, we havelog(scrap ) = 0β + 1βhrsemp + [2βlog(sales) – 2βlog(employ )] + [2βlog(employ ) + 3βlog(employ )] + u = 0β + 1βhrsemp + 2βlog(sales /employ ) + (2β + 3β)log(employ ) + u ,where the second equality follows from the fact that log(sales /employ ) = log(sales ) – log(employ ). Defining 3θ ≡ 2β + 3β gives the result.(iii) No. We are interested in the coefficient on log(employ ), which has a t statistic of .2, which is very small. Therefore, we conclude that the size of the firm, as measured by employees, does not matter, once we control for training and sales per employee (in a logarithmic functional form).(iv) The null hypothesis in the model from part (ii) is H 0:2β = –1. The t statistic is [–.951 – (–1)]/.37 = (1 – .951)/.37 ≈ .132; this is very small, and we fail to reject whether we specify a one- or two-sided alternative.4.4 (i) In columns (2) and (3), the coefficient on profmarg is actually negative, although its t statistic is only about –1. It appears that, once firm sales and market value have been controlled for, profit margin has no effect on CEO salary.(ii) We use column (3), which controls for the most factors affecting salary. The t statistic on log(mktval ) is about 2.05, which is just significant at the 5% level against a two-sided alternative.18(We can use the standard normal critical value, 1.96.) So log(mktval ) is statistically significant. Because the coefficient is an elasticity, a ceteris paribus 10% increase in market value is predicted to increase salary by 1%. This is not a huge effect, but it is not negligible, either.(iii) These variables are individually significant at low significance levels, with t ceoten ≈ 3.11 and t comten ≈ –2.79. Other factors fixed, another year as CEO with the company increases salary by about 1.71%. On the other hand, another year with the company, but not as CEO, lowers salary by about .92%. This second finding at first seems surprising, but could be related to the “superstar” effect: firms that hire CEOs from outside the company often go after a small pool of highly regarded candidates, and salaries of these people are bid up. More non-CEO years with a company makes it less likely the person was hired as an outside superstar.4.7 (i) .412 ± 1.96(.094), or about .228 to .596.(ii) No, because the value .4 is well inside the 95% CI.(iii) Yes, because 1 is well outside the 95% CI.4.8 (i) With df = 706 – 4 = 702, we use the standard normal critical value (df = ∞ in Table G.2), which is 1.96 for a two-tailed test at the 5% level. Now t educ = −11.13/5.88 ≈ −1.89, so |t educ | = 1.89 < 1.96, and we fail to reject H 0: educ β = 0 at the 5% level. Also, t age ≈ 1.52, so age is also statistically insignificant at the 5% level.(ii) We need to compute the R -squared form of the F statistic for joint significance. But F = [(.113 − .103)/(1 − .113)](702/2) ≈ 3.96. The 5% critical value in the F 2,702 distribution can be obtained from Table G.3b with denominator df = ∞: cv = 3.00. Therefore, educ and age are jointly significant at the 5% level (3.96 > 3.00). In fact, the p -value is about .019, and so educ and age are jointly significant at the 2% level.(iii) Not really. These variables are jointly significant, but including them only changes the coefficient on totwrk from –.151 to –.148.(iv) The standard t and F statistics that we used assume homoskedasticity, in addition to the other CLM assumptions. If there is heteroskedasticity in the equation, the tests are no longer valid.4.11 (i) Holding profmarg fixed, n rdintensΔ = .321 Δlog(sales ) = (.321/100)[100log()sales ⋅Δ] ≈ .00321(%Δsales ). Therefore, if %Δsales = 10, n rdintens Δ ≈ .032, or only about 3/100 of a percentage point. For such a large percentage increase in sales,this seems like a practically small effect.(ii) H 0:1β = 0 versus H 1:1β > 0, where 1β is the population slope on log(sales ). The t statistic is .321/.216 ≈ 1.486. The 5% critical value for a one-tailed test, with df = 32 – 3 = 29, is obtained from Table G.2 as 1.699; so we cannot reject H 0 at the 5% level. But the 10% criticalvalue is 1.311; since the t statistic is above this value, we reject H0 in favor of H1 at the 10% level.(iii) Not really. Its t statistic is only 1.087, which is well below even the 10% critical value for a one-tailed test.1920SOLUTIONS TO COMPUTER EXERCISESC4.1 (i) Holding other factors fixed,111log()(/100)[100log()](/100)(%),voteA expendA expendA expendA βββΔ=Δ=⋅Δ≈Δwhere we use the fact that 100log()expendA ⋅Δ ≈ %expendA Δ. So 1β/100 is the (ceteris paribus) percentage point change in voteA when expendA increases by one percent.(ii) The null hypothesis is H 0: 2β = –1β, which means a z% increase in expenditure by A and a z% increase in expenditure by B leaves voteA unchanged. We can equivalently write H 0: 1β + 2β = 0.(iii) The estimated equation (with standard errors in parentheses below estimates) isn voteA = 45.08 + 6.083 log(expendA ) – 6.615 log(expendB ) + .152 prtystrA(3.93) (0.382) (0.379) (.062) n = 173, R 2 = .793.The coefficient on log(expendA ) is very significant (t statistic ≈ 15.92), as is the coefficient on log(expendB ) (t statistic ≈ –17.45). The estimates imply that a 10% ceteris paribus increase in spending by candidate A increases the predicted share of the vote going to A by about .61percentage points. [Recall that, holding other factors fixed, n voteAΔ≈(6.083/100)%ΔexpendA ).] Similarly, a 10% ceteris paribus increase in spending by B reduces n voteAby about .66 percentage points. These effects certainly cannot be ignored.While the coefficients on log(expendA ) and log(expendB ) are of similar magnitudes (andopposite in sign, as we expect), we do not have the standard error of 1ˆβ + 2ˆβ, which is what we would need to test the hypothesis from part (ii).(iv) Write 1θ = 1β +2β, or 1β = 1θ– 2β. Plugging this into the original equation, and rearranging, givesn voteA = 0β + 1θlog(expendA ) + 2β[log(expendB ) – log(expendA )] +3βprtystrA + u ,When we estimate this equation we obtain 1θ≈ –.532 and se( 1θ)≈ .533. The t statistic for the hypothesis in part (ii) is –.532/.533 ≈ –1. Therefore, we fail to reject H 0: 2β = –1β.21C4.3 (i) The estimated model isn log()price = 11.67 + .000379 sqrft + .0289 bdrms (0.10) (.000043) (.0296)n = 88, R 2 = .588.Therefore, 1ˆθ= 150(.000379) + .0289 = .0858, which means that an additional 150 square foot bedroom increases the predicted price by about 8.6%.(ii) 2β= 1θ – 1501β, and solog(price ) = 0β+ 1βsqrft + (1θ – 1501β)bdrms + u= 0β+ 1β(sqrft – 150 bdrms ) + 1θbdrms + u .(iii) From part (ii), we run the regressionlog(price ) on (sqrft – 150 bdrms ), bdrms ,and obtain the standard error on bdrms . We already know that 1ˆθ= .0858; now we also getse(1ˆθ) = .0268. The 95% confidence interval reported by my software package is .0326 to .1390(or about 3.3% to 13.9%).C4.5 (i) If we drop rbisyr the estimated equation becomesn log()salary = 11.02 + .0677 years + .0158 gamesyr (0.27) (.0121) (.0016)+ .0014 bavg + .0359 hrunsyr (.0011) (.0072)n = 353, R 2= .625.Now hrunsyr is very statistically significant (t statistic ≈ 4.99), and its coefficient has increased by about two and one-half times.(ii) The equation with runsyr , fldperc , and sbasesyr added is22n log()salary = 10.41 + .0700 years + .0079 gamesyr(2.00) (.0120) (.0027)+ .00053 bavg + .0232 hrunsyr (.00110) (.0086)+ .0174 runsyr + .0010 fldperc – .0064 sbasesyr (.0051) (.0020) (.0052) n = 353, R 2 = .639.Of the three additional independent variables, only runsyr is statistically significant (t statistic = .0174/.0051 ≈ 3.41). The estimate implies that one more run per year, other factors fixed,increases predicted salary by about 1.74%, a substantial increase. The stolen bases variable even has the “wrong” sign with a t statistic of about –1.23, while fldperc has a t statistic of only .5. Most major league baseball players are pretty good fielders; in fact, the smallest fldperc is 800 (which means .800). With relatively little variation in fldperc , it is perhaps not surprising that its effect is hard to estimate.(iii) From their t statistics, bavg , fldperc , and sbasesyr are individually insignificant. The F statistic for their joint significance (with 3 and 345 df ) is about .69 with p -value ≈ .56. Therefore, these variables are jointly very insignificant.C4.7 (i) The minimum value is 0, the maximum is 99, and the average is about 56.16. (ii) When phsrank is added to (4.26), we get the following:n log() wage = 1.459 − .0093 jc + .0755 totcoll + .0049 exper + .00030 phsrank (0.024) (.0070) (.0026) (.0002) (.00024)n = 6,763, R 2 = .223So phsrank has a t statistic equal to only 1.25; it is not statistically significant. If we increase phsrank by 10, log(wage ) is predicted to increase by (.0003)10 = .003. This implies a .3% increase in wage , which seems a modest increase given a 10 percentage point increase in phsrank . (However, the sample standard deviation of phsrank is about 24.)(iii) Adding phsrank makes the t statistic on jc even smaller in absolute value, about 1.33, but the coefficient magnitude is similar to (4.26). Therefore, the base point remains unchanged: the return to a junior college is estimated to be somewhat smaller, but the difference is not significant and standard significant levels.(iv) The variable id is just a worker identification number, which should be randomly assigned (at least roughly). Therefore, id should not be correlated with any variable in the regression equation. It should be insignificant when added to (4.17) or (4.26). In fact, its t statistic is about .54.23C4.9 (i) The results from the OLS regression, with standard errors in parentheses, aren log() psoda =−1.46 + .073 prpblck + .137 log(income ) + .380 prppov (0.29) (.031) (.027) (.133)n = 401, R 2 = .087The p -value for testing H 0: 10β= against the two-sided alternative is about .018, so that we reject H 0 at the 5% level but not at the 1% level.(ii) The correlation is about −.84, indicating a strong degree of multicollinearity. Yet eachcoefficient is very statistically significant: the t statistic for log()ˆincome β is about 5.1 and that forˆprppovβ is about 2.86 (two-sided p -value = .004).(iii) The OLS regression results when log(hseval ) is added aren log() psoda =−.84 + .098 prpblck − .053 log(income ) (.29) (.029) (.038) + .052 prppov + .121 log(hseval ) (.134) (.018)n = 401, R 2 = .184The coefficient on log(hseval ) is an elasticity: a one percent increase in housing value, holding the other variables fixed, increases the predicted price by about .12 percent. The two-sided p -value is zero to three decimal places.(iv) Adding log(hseval ) makes log(income ) and prppov individually insignificant (at even the 15% significance level against a two-sided alternative for log(income ), and prppov is does not have a t statistic even close to one in absolute value). Nevertheless, they are jointly significant at the 5% level because the outcome of the F 2,396 statistic is about 3.52 with p -value = .030. All of the control variables – log(income ), prppov , and log(hseval ) – are highly correlated, so it is not surprising that some are individually insignificant.(v) Because the regression in (iii) contains the most controls, log(hseval ) is individually significant, and log(income ) and prppov are jointly significant, (iii) seems the most reliable. It holds fixed three measure of income and affluence. Therefore, a reasonable estimate is that if the proportion of blacks increases by .10, psoda is estimated to increase by 1%, other factors held fixed.。

计量经济学第四章练习题及参考解答

第四章练习题及参考解答4.1 假设在模型i i i i u X X Y +++=33221βββ中，32X X 与之间的相关系数为零，于是有人建议你进行如下回归：ii i i i i u X Y u X Y 23311221++=++=γγαα(1)是否存在3322ˆˆˆˆβγβα==且？为什么？ (2)111ˆˆˆβαγ会等于或或两者的某个线性组合吗？ (3)是否有()()()()3322ˆvar ˆvar ˆvar ˆvar γβαβ==且？练习题4.1参考解答：(1) 存在3322ˆˆˆˆβγβα==且。

因为()()()()()()()23223223232322ˆ∑∑∑∑∑∑∑--=iiiii iii iii x x x x x x x y x x y β当32X X 与之间的相关系数为零时，离差形式的032=∑i ix x有()()()()222223222322ˆˆαβ===∑∑∑∑∑∑iiiiiiii xx y x x x x y 同理有：33ˆˆβγ= (2) 111ˆˆˆβαγ会等于或的某个线性组合因为 12233ˆˆˆY X X βββ=--，且122ˆˆY X αα=-，133ˆˆY X γγ=- 由于3322ˆˆˆˆβγβα==且，则 11222222ˆˆˆˆˆY Y X Y X X αααββ-=-=-= 11333333ˆˆˆˆˆY Y X Y X X γγγββ-=-=-= 则 1112233231123ˆˆˆˆˆˆˆY Y Y X X Y X X Y X X αγβββαγ--=--=--=+- (3) 存在()()()()3322ˆvar ˆvar ˆvar ˆvar γβαβ==且。

因为()()∑-=22322221ˆvar r x iσβ当023=r 时，()()()22222232222ˆvar 1ˆvar ασσβ==-=∑∑iixr x 同理，有()()33ˆvar ˆvar γβ=4.2在决定一个回归模型的“最优”解释变量集时人们常用逐步回归的方法。

1、下载文档前请自行甄别文档内容的完整性，平台不提供额外的编辑、内容补充、找答案等附加服务。
2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
3、如文档侵犯您的权益，请联系客服反馈,我们会尽快为您处理(人工客服工作时间：9:00-18:30)。

4.3（1）由题知，对数回归模型为：123ln ln ln t t t i Y G D P C PI u βββ=+++ 用最小二乘法对参数进行估计得：ˆl n 3.6491.796l n 1.208l nt tt Y G D P C P I =-+- （0.322）（0.181）（0.354）t=-11.32129 9.931363 -3.41496120.990R = 20.988R = S.E.=0.112388 F=770.602（2）存在多重共线性。

居民消费价格指数的回归系数的符号不能进行合理的经济意义解释，且其简单相关系数为0.985811，说明lnGDP 和lnCPI 存在正相关的关系。

（3）根据题目要求进行如下回归： ○1模型为：121ln ln t t i Y A A G D P v =++ 用最小二乘法对参数进行估计得： l n 3.7451.187l nt t Y G D P =-+ （0.410）（0.039） t= -9.143326 30.65940 20.982R = 20.981R = S.E.=0.143363 F=939.999 ○2模型为：122ln ln t t i Y B B C PI v =++用最小二乘法对参数进行估计得： l n 3.392.254l n t t Y CPI =-+（0.834）（0.154） t= -4.064199 14.62649 20.926R = 20.922R = S.E.=0.291842 F=213.934○3模型为：122ln ln tt i Y B B C PI v =++用最小二乘法对参数进行估计得：l n 0.1441.927l n t t GDP CPI =+ （0.431）（0.080）t= 0.334092 24.2143920.972R = 20.970R = S.E.=0.150715 F=586.337单方程拟合效果都很好，回归系数显著，判定系数较高，GDP 和CPI 对进口的显著的单一影响，在这两个变量同时引入模型引起了多重共线性。

（4）如果仅仅是作预测，可以不在意这种多重共线性，但如果是进行结构分析，还是应该引起注意的。

4.5从模型拟合结果可知，样本观测个数为27，消费模型的判定系数95.02=R ，F 统计量为107.37，在0.05置信水平下查分子自由度为3，分母自由度为23的F 临界值为3.028，计算的F 值远大于临界值，表明回归方程是显著的。

模型整体拟合程度较高。

依据参数估计量及其标准误，可计算出各回归系数估计量的t 统计量值：11.009.1121.0,69.066.0452.0,10.617.0059.1,91.092.8133.83210========t t t t除1t 外，其余的j t值都很小。

工资收入X1的系数的t 检验值虽然显著，但该系数的估计值过大，该值为工资收入对消费边际效应，因为它为1.059，意味着工资收入每增加一美元，消费支出的增长平均将超过一美元，这与经济理论和常识不符。

另外，理论上非工资—非农业收入与农业收入也是消费行为的重要解释变量，但两者的t 检验都没有通过。

这些迹象表明，模型中存在严重的多重共线性，不同收入部分之间的相互关系，掩盖了各个部分对解释消费行为的单独影响。

4.6（1）建立对数回归模型为： 55712132434668ln ln ln ln ln ln ln t t t t t t t LnY X X X X X X Xββββββββ=+++++++。

用最小二乘法进行估计得：1234572.6876387.02511ln 4.87302ln 1.3893ln 0.0477ln 0.0388ln 0.5187l 0.4430ln t t t t t t tLn Y X X X X X X ∧=+--+--+ （4.688794）（5.406211）（5.01120）（0.744155）（0.188431）（0.185312）（0.268625）T= 0.573205 1.298601 -0.972426 -1.866945 0.253146 -0.2.9639 -1.930983 （0.775921） 0.570910（2）从经济意义上来看，各个解释变量之间存在着较明显的共同变化的趋势，2X （GDP ）为3X 、4X 、5X 等加总之和，则从这可以看出解释变量之间可能存在多重共线性。

从模型估计结果可以看出，F 统计量显著而各系数的t 值均不显著，且2356ln ,ln ,ln ,ln t t t t X X X X 的参数与预期相反，表明各解释变量之间存在多重共线性。

同时从计算可以得知，部分解释变量之间相关系数较高。

LNX1 LNX2 LNX3 LNX4 LNX5 LNX6 LNX7 LNX1 1.000000 0.999988 0.999557 0.996886 0.557402 0.995035 -0.173109LNX2 0.999988 1.000000 0.999643 0.997018 0.556012 0.994729 -0.174187LNX3 0.999557 0.999643 1.000000 0.998062 0.543125 0.992628 -0.188286LNX4 0.996886 0.997018 0.998062 1.000000 0.522877 0.986734 -0.206870LNX5 0.557402 0.556012 0.543125 0.522877 1.000000 0.588443 -0.137985LNX6 0.995035 0.994729 0.992628 0.986734 0.588443 1.000000 -0.134513LNX7 -0.173109-0.174187-0.188286-0.206870-0.137985-0.1345131.00000（3）采用逐步回归法：1234567,ln ,ln ,ln ,ln ,ln ,ln t t t t t t t tLnY X X X X X X X 用分别对ln 作一元回归，1ln t X 2ln tX3ln tX4ln tX5ln t X 6ln tX7ln tXi β 0.235851 0.233695 0.218825 0.2017900.214956 0.302941 3.878956t19.76948 19.63261 18.47680 17.98623 18.40287 16.74191 2.6835712R0.960748 0.960143 0.955231 0.9528720.954887 0.945999 0.3103912R0.958295 0.957652 0.952433 0.949927 0.952068 0.942624 0.267290 由此可知，t LnY 对1lnt X 回归对应的2R 最大，以1lnt X 为基础，顺次加入其他变量逐步回归，结果如下：1ln t X2ln tX 3ln t X4ln tX5ln t X 6ln t X7ln t X 2R 1ln tX ，2ln tX3.258502-2.9959960.959742（1.352900）（-1.254992）1ln tX ，3ln tX 0.883148 -0.6025670.962186（2.334781）（-1.712019）1ln t X ，4ln tX 0.273414 -0.0323710.955687（1.754955）（-0.241854）1ln tX ，5ln tX 0.1613190.062185 0.956811 （ 1.660515）（0.671050）1ln t X ，6ln tX0.296971-0.079518 0.956262 （ 2.447204）（-0.506227） 1ln tX ，7ln tX0.251591-0.720790 0.962812 （17.32755）（-1.715632）由上面可知，加入各变量后的t 统计量均小于0.025(183) 2.131t -=，参数不显著，且2lnt X 、3ln t X 、4ln tX 、5lnt X 、 6ln t X 、7lntX 的参数符号与预期相反，这说明模型加入新的解释变量都会引起多重共线性。

剔除后的回归模型为1ˆ9.1630510.235851ln t tLnY X =+（0.125193）（0.011918） T= 73.19140 19.789482R =0.9582952R=0.960748 F =391.6234 DW=32.78552这说明在其他因素不变的情况下，当国民收入每上升1%时，能源消费就平均增加0.23585%。

4.7 设定理论模型为13456iiiiiiiCSNZGZTPOP CUMSZMββββββμ=++++++假定模型满足古典假定，用最小二乘法进行估计，用样本估计参数：11793.34468 1.5350896120.8987877567 1.5270891250.151********.03683557C S N Z G Z JZZ TP SZM ∧=--+-+-（3191.096）（0.129778）（0.245466）（1.206242）（0.033759）（0.105329）T= -3.695704 -11.82861 3.661558 -1.265989 4.477646 0.963783 （0.018460） -1.995382从主要指标分析可见，可决系数为0.995，修正的可决系数为0.993，模型拟和很好，F 统计量为632.10，模型拟和很好，回归方程整体上显著。

T 检验的结果表明，除了农业增加值、工业增加值和总人口外，其他因素对财政收入的影响都不显著，且农业增加值和建筑业增加值的回归系数还是负数，这说明很可能存在严重的多重共线性。

根据样本数据得到各解释变量的样本相关系数矩阵如下：加值、最终消费之间，相关系数都在0.9以上。

这表明模型存在着多重共线性。

采用逐步回归法解决多重共线性问题，分别做CS 对NZ 、GZ 、JZZ 、TPOP 、的系数为负，与预期估计违背。

因此这些变量都会引起严重的多重共线性。

修正的回归结果为：93.081094370.3486152401CS GZ ∧=+（412.5593）（0.017744）T= 0.225619 19.646792R=0.941463 2R =0.939024 F=385.9965这说明在其他因素不变的情况下，工业增加值每增加1亿元，财政收入平均增加0.348615亿元。