J. Abellán
Classification with decision trees from a nonparametric predictive inference perspective
Abellán, J.; Baker, R.M.; Coolen, F.P.A.; Crossman, R.J.; Masegosa, A.R.
Authors
Abstract
An application of nonparametric predictive inference for multinomial data (NPI) to classification tasks is presented. This model is applied to an established procedure for building classification trees using imprecise probabilities and uncertainty measures, thus far used only with the imprecise Dirichlet model (IDM), that is defined through the use of a parameter expressing previous knowledge. The accuracy of that procedure of classification has a significant dependence on the value of the parameter used when the IDM is applied. A detailed study involving 40 data sets shows that the procedure using the NPI model (which has no parameter dependence) obtains a better trade-off between accuracy and size of tree than does the procedure when the IDM is used, whatever the choice of parameter. In a bias-variance study of the errors, it is proved that the procedure with the NPI model has a lower variance than the one with the IDM, implying a lower level of over-fitting.
Citation
Abellán, J., Baker, R., Coolen, F., Crossman, R., & Masegosa, A. (2014). Classification with decision trees from a nonparametric predictive inference perspective. Computational Statistics & Data Analysis, 71, 789-802. https://doi.org/10.1016/j.csda.2013.02.009
Journal Article Type | Article |
---|---|
Publication Date | Mar 1, 2014 |
Deposit Date | Sep 13, 2013 |
Publicly Available Date | Nov 28, 2014 |
Journal | Computational Statistics & Data Analysis |
Print ISSN | 0167-9473 |
Electronic ISSN | 1872-7352 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 71 |
Pages | 789-802 |
DOI | https://doi.org/10.1016/j.csda.2013.02.009 |
Keywords | Imprecise probabilities, Imprecise Dirichlet model, Nonparametric predictive inference model, Uncertainty measures, Supervised classification, Decision trees. |
Public URL | https://durham-repository.worktribe.com/output/1477979 |
Files
Accepted Journal Article
(389 Kb)
PDF
Copyright Statement
NOTICE: this is the author’s version of a work that was accepted for publication in Computational Statistics & Data Analysis. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Computational Statistics & Data Analysis, 71, March 2014, 10.1016/j.csda.2013.02.009.
You might also like
Smoothed bootstrap methods for bivariate data
(2023)
Journal Article
Discussion of signature‐based models of preventive maintenance
(2022)
Journal Article
A Cost-Sensitive Imprecise Credal Decision Tree based on Nonparametric Predictive Inference
(2022)
Journal Article
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search