An efficient and scalable approach for mining subgraphs in a single large graph

dc.contributor.authorNguyen, Lam B. Q.
dc.contributor.authorNguyen, Loan T. T.
dc.contributor.authorVo, Bay
dc.contributor.authorZelinka, Ivan
dc.contributor.authorLin, Jerry Chun-Wei
dc.contributor.authorYun, Unil
dc.contributor.authorNguyen, Hung Son
dc.date.accessioned2022-06-27T09:19:13Z
dc.date.available2022-06-27T09:19:13Z
dc.date.issued2022
dc.description.abstractIn many recent applications, a graph is used to simulate many complex systems, such as social networks, traffic models or bioinformatics, and the underlying graphs for these systems are very large. Algorithms for mining all frequent subgraphs from a single large graph have attracted much attention and been studied in more detail lately. Mining frequent subgraphs is important, and defined as finding all subgraphs whose occurrences in a dataset are greater than or equal to a given frequency threshold. Among frequent subgraph algorithms, GraMi is considered as the state-of-the-art approach. However, GraMi has a huge search space, and therefore still needs a lot of time and memory to process a large graph. In this paper, we propose two effective strategies to balance and reduce the search space of GraMi, which can decrease the number of candidate subgraphs generated, with early pruning of a large portion of the domain for each candidate. Our experiments were performed on four real datasets and the results show that the performance of our balancing GraMi is better than those of the original algorithm GraMi and the optimized version SoGraMi.cs
dc.description.sourceWeb of Sciencecs
dc.identifier.citationApplied Intelligence. 2022.cs
dc.identifier.doi10.1007/s10489-022-03164-5
dc.identifier.issn0924-669X
dc.identifier.issn1573-7497
dc.identifier.urihttp://hdl.handle.net/10084/146318
dc.identifier.wos000778916300004
dc.language.isoencs
dc.publisherSpringer Naturecs
dc.relation.ispartofseriesApplied Intelligencecs
dc.relation.urihttps://doi.org/10.1007/s10489-022-03164-5cs
dc.rightsCopyright © 2022, The Author(s), under exclusive licence to Springer Science Business Media, LLC, part of Springer Naturecs
dc.subjectreducing search spacecs
dc.subjectsubgraph miningcs
dc.subjectdata miningcs
dc.subjectearly pruningcs
dc.titleAn efficient and scalable approach for mining subgraphs in a single large graphcs
dc.typearticlecs
dc.type.statusPeer-reviewedcs

Files

License bundle

Now showing 1 - 1 out of 1 results
Loading...
Thumbnail Image
Name:
license.txt
Size:
718 B
Format:
Item-specific license agreed upon to submission
Description: