Comparación entre árboles de regresión CART y regresión lineal
Comparison between CART regression trees and linear regression
Additional Files
Abstract (en)
Linear regression is the most widely used method in statistics to predict values of continuous variables due to its easy interpretation, but in many situations the suppositions to apply the model are not met and some users tend to force them leading them to erroneous conclusions. CART regression trees is a regression alternative that does not require suppositions on the data to be analyzed and is a method of easy interpretation of results. This work compares predictive levels of linear regression with CART through simulation. In general, it was found that when the correct linear regression model is adjusted to the data, the prediction error of linear regression is always lower than that of CART. It was also found that when linear regression model is erroneously adjusted to the data, the prediction error of CART is lower than that of linear regression only when it has a sufficiently large amount of data.Abstract (es)
References
Ankarali, H., Canan, A., Akkus, Z., Bugdayci, R. & Ali Sungur, M. (2007), ‘Comparison of logistic regression model and classification tree: An application to postpartum depression data’, Expert Systems with Applications 32, 987–994.
Breiman, L., Friedman, J., Olshen, R. & Stone, C. (1984), Classification And Regression Trees, CHAPMAN & HALL/CRC, Boca Raton.
Izenman, A. (2008), Modern Multivariate Statistical Techniques, Springer, New York.
Tamminen, S., Laurinen, P. & Roning, J. (1999), ‘Comparing regression trees with neural networks in aerobic fitness approximation’.
Zhang, H. & Singer, B. (2010), Recursive Partitioning and Applications, Springer, New York.
How to Cite
License
The authors maintain the rights to the articles and therefore they are free to share, copy, distribute, execute and publicly communicate the work under the following conditions:
Recognize the credits of the work in the manner specified by the author or licensor (but not in a way that suggests that, you have their support or that they support your use of their work).
Comunicaciones en Estadística is licensed under Creative Commons Atribución-NoComercial-CompartirIgual 4.0 Internacional (CC BY-NC-SA 4.0)
Universidad Santo Tomás preserves the patrimonial rights (copyright) of the published works, and favors and allows the reuse of them under the aforementioned license.