我是R新手, 第一次使用这个语言. 使用STEP function 寻找最小的BIC, 在建立模型时
我新增了一些 interaction term 到模型中,
context1$abilsq <- (context1$abil)^2
context1$educsq <- (context1$educ)^2
context1$expersq <- (context1$exper)^2
context1$abileduc <- context1$abil*context1$educ
context1$abilexper<- context1$abil*context1$exper
context1$educexper<- context1$educ*context1$exper
model2A <- lm(log(wage)~abil+educ+exper+abilsq+educsq
+expersq+abileduc+abilexper+educexper
, data = context1)
model2B <- lm(log(wage)~abil+educ+exper+abilsq+educsq
+expersq+abil*educ+abil*exper+educ*exper
, data = context1)
两个模型主要的差别在于 interaction term 有无使用*
model2minBIC <- step(model2A, direction = "backward", k = log(nrow(context1)))
model2minBIC <- step(model2B, direction = "backward", k = log(nrow(context1)))
执行结果似乎 model2B 可以跑出比较小的值, 且两者跑出来的结果也不一样.
下面是model2B的结果.
Step: AIC=-1545.67
log(wage) ~ exper + abileduc + educexper
Df Sum of Sq RSS AIC
<none> 342.06 -1545.7
- abileduc 1 13.185 355.25 -1506.3
- exper 1 17.853 359.91 -1490.2
- educexper 1 27.377 369.44 -1458.1
下面是model2A的结果
Step: AIC=-1539.35
log(wage) ~ abil + educ + exper + educ:exper
Df Sum of Sq RSS AIC
<none> 341.84 -1539.3
- educ:exper 1 2.8843 344.72 -1536.1
- abil 1 10.3991 352.24 -1509.6
想问是什么原因造成两个跑出来的结果会不一样... 谢谢!
[关键字]:
BIC, step