eSPA plus : Scalable entropy-optimal machine learning classification for small data problems

Vecchi, Edoardo; Pospíšil, Lukáš; Albrecht, Steffen; O'Kane, Terence J.; Horenko, Illia

dc.contributor.author	Vecchi, Edoardo
dc.contributor.author	Pospíšil, Lukáš
dc.contributor.author	Albrecht, Steffen
dc.contributor.author	O'Kane, Terence J.
dc.contributor.author	Horenko, Illia
dc.date.accessioned	2022-06-29T08:25:59Z
dc.date.available	2022-06-29T08:25:59Z
dc.date.issued	2022
dc.identifier.citation	Neural Computation. 2022, vol. 34, issue 5, p. 1220-1255.	cs
dc.identifier.issn	0899-7667
dc.identifier.issn	1530-888X
dc.identifier.uri	http://hdl.handle.net/10084/146329
dc.description.abstract	Classification problems in the small data regime (with small data statistic T and relatively large feature space dimension D) impose challenges for the common machine learning (ML) and deep learning (DL) tools. The standard learning methods from these areas tend to show a lack of robustness when applied to data sets with significantly fewer data points than dimensions and quickly reach the overfitting bound, thus leading to poor performance beyond the training set. To tackle this issue, we propose eSPA+, a significant extension of the recently formulated entropy-optimal scalable probabilistic approximation algorithm (eSPA). Specifically, we propose to change the order of the optimization steps and replace the most computationally expensive subproblem of eSPA with its closed-form solution. We prove that with these two enhancements, eSPA+ moves from the polynomial to the linear class of complexity scaling algorithms. On several small data learning benchmarks, we show that the eSPA+ algorithm achieves a many-fold speed-up with respect to eSPA and even better performance results when compared to a wide array of ML and DL tools. In particular, we benchmark eSPA+ against the standard eSPA and the main classes of common learning algorithms in the small data regime: various forms of support vector machines, random forests, and long short-term memory algorithms. In all the considered applications, the common learning methods and eSPA are markedly outperformed by eSPA+, which achieves significantly higher prediction accuracy with an orders-of-magnitude lower computational cost.	cs
dc.language.iso	en	cs
dc.publisher	MIT Press	cs
dc.relation.ispartofseries	Neural Computation	cs
dc.relation.uri	https://doi.org/10.1162/neco_a_01490	cs
dc.rights	© 2022 Massachusetts Institute of Technology	cs
dc.title	eSPA plus : Scalable entropy-optimal machine learning classification for small data problems	cs
dc.type	article	cs
dc.identifier.doi	10.1162/neco_a_01490
dc.type.status	Peer-reviewed	cs
dc.description.source	Web of Science	cs
dc.description.volume	34	cs
dc.description.issue	5	cs
dc.description.lastpage	1255	cs
dc.description.firstpage	1220	cs
dc.identifier.wos	000785003800007

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Články z časopisů s impakt faktorem / Articles from Impact Factor Journals [6377]
Články z časopisů (od roku 2008), které v době vydání článku měly impakt faktor.
Publikační činnost Katedry matematiky / Publications of Department of Mathematics (230) [36]
Kolekce obsahuje bibliografické záznamy publikační činnosti (článků) akademických pracovníků Katedry matematiky (230) v časopisech registrovaných ve Web of Science od roku 2003 po současnost.
Publikační činnost VŠB-TUO ve Web of Science / Publications of VŠB-TUO in Web of Science [7798]
Kolekce obsahuje bibliografické záznamy článků akademických pracovníků VŠB-TUO publikovaných v časopisech indexovaných ve Web of Science od roku 1990 po současnost.

Show simple item record