公卫百科  > 所属分类  >  统计   
[0] 评论[0] 编辑

Dummy variable

In regression analysis, a dummy variable (also known as indicator variable or just dummy) is one that takes the values 0 or 1 to indicate the absence or presence of some categorical effect that may be expected to shift the outcome. For example, in econometric time series analysis, dummy variables may be used to indicate the occurrence of wars, or major strikes. It could thus be thought of as a truth value represented as a numerical value 0 or 1 (as is sometimes done in computer programming).

公卫家园



Use of dummy variables usually increases model fit (coefficient of determination), but at a cost of fewer degrees of freedom and loss of generality of the model. Too many dummy variables result in a model that does not provide any general conclusions. 公卫考场

Dummy variables may be extended to more complex cases. For example, seasonal effects may be captured by creating dummy variables for each of the seasons. In panel data fixed effects estimator dummies are created for each of the units in cross-sectional data (e.g. firms or countries) or periods in a pooled time-series. However in such regressions either the constant term has to be removed, or one of the dummies.
公卫人


When there are dummies in all observations, the constant term has to be excluded. If a constant term is included in the regression, it is important to exclude one of the dummy variables from the regression, making this the base category against which the others are assessed. If all the dummy variables are included, their sum is equal to 1 (which stands for the variable X0 to the constant term B0), resulting in perfect multicollinearity. This is referred to as the dummy variable trap. 公卫考场

附件列表


0

词条内容仅供参考,如果您需要解决具体问题
(尤其在法律、医学等领域),建议您咨询相关领域专业人士。

如果您认为本词条还有待完善,请 编辑

上一篇 哑变量    下一篇 彭良斌

标签

暂无标签

同义词

暂无同义词