Show simple item record

dc.contributor.authorFrolov, Alexander A.
dc.contributor.authorHúsek, Dušan
dc.contributor.authorPolyakov, Pavel Y.
dc.contributor.authorŘezanková, Hana
dc.date.accessioned2013-03-11T15:38:42Z
dc.date.available2013-03-11T15:38:42Z
dc.date.issued2012
dc.identifier.citationNeural Network World. 2012, vol. 22, issue 6, p. 565-582.cs
dc.identifier.issn1210-0552
dc.identifier.urihttp://hdl.handle.net/10084/96206
dc.description.abstractStudied are differences of two approaches targeted to reveal latent variables in binary data. These approaches assume that the observed high dimensional data are driven by a small number of hidden binary sources combined due to Boolean superposition. The first approach is the Boolean matrix factorization (BMF) and the second one is the Boolean factor analysis (BFA). The two BMF methods are used for comparison. First is the M8 method from the BMDP statistical software package and the second one is the method suggested by Belohlavek & Vychodil. These two are compared to BFA, especially with the Expectation-maximization Boolean Factor Analysis we had developed earlier has, however, been extended with a binarization step developed here. The well-known bars problem and the mushroom dataset are used for revealing the methods' peculiarities. In particular, the reconstruction ability of the computed factors and the information gain as the measure of dimension reduction was under scrutiny. It was shown that BFA slightly loses to BMF in performance when noise-free signals are analyzed. Conversely, BMF loses considerably to BFA when input signals are noisy.cs
dc.language.isoencs
dc.publisherAkademie věd České republiky, Ústav informatiky a České vysoké učení technické v Praze, Fakulta dopravnícs
dc.relation.ispartofseriesNeural Network Worldcs
dc.subjectdimension reductioncs
dc.subjectstatisticscs
dc.subjectdata miningcs
dc.subjectBoolean factor analysiscs
dc.subjectBoolean matrix factorizationcs
dc.subjectinformation gaincs
dc.subjectlikelihood-maximizationcs
dc.subjectbars problemcs
dc.titleA comparative study of two methodologies for binary datasets analysiscs
dc.typearticlecs
dc.identifier.locationNení ve fondu ÚKcs
dc.type.statusPeer-reviewedcs
dc.description.sourceWeb of Sciencecs
dc.description.volume22cs
dc.description.issue6cs
dc.description.lastpage582cs
dc.description.firstpage565cs
dc.identifier.wos000314321300006


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record