Show simple item record

dc.contributor.author  Kong, Lingping
dc.contributor.author  Ojha, Varun
dc.contributor.author  Gao, Ruobin
dc.contributor.author  Suganthan, Ponnuthurai Nagaratnam
dc.contributor.author  Snášel, Václav
dc.date.accessioned  2024-01-29T11:15:04Z
dc.date.available  2024-01-29T11:15:04Z
dc.date.issued  2023
dc.identifier.citation  Information Sciences. 2023, vol. 642, art. no. 119108.  cs
dc.identifier.issn  0020-0255
dc.identifier.issn  1872-6291
dc.identifier.uri  http://hdl.handle.net/10084/151976
dc.description.abstract  Transformer architectures have been applied to graph-structured data such as protein structures and shopper lists, and they perform accurately on graph/node classification and prediction tasks. Researchers have proved that the attention matrix in Transformers has low-rank properties and that self-attention plays a scoring role in the Transformers' aggregation function. However, self-attention alone cannot resolve issues such as heterophily and over-smoothing. The low-rank properties and these limitations of Transformers inspire this work to propose a Global Representation (GR) based attention mechanism to alleviate the two issues of heterophily and over-smoothing. First, the GR-based model integrates geometric information about the nodes of interest that conveys the structural properties of the graph. Unlike a typical Transformer, where a node's features form the Key, we propose to use the GR to construct the Key, which captures the relation between the nodes and the structural representation of the graph. Next, we present various compositions of the GR emanating from nodes of interest and their α-hop neighbors. Then, we explore this attention property through extensive experiments to assess the performance and possible directions of improvement for future work. Additionally, we provide a mathematical proof showing the efficiency of the feature update in our proposed method. Finally, we verify and validate the performance of the model on eight benchmark datasets, which show the effectiveness of the proposed method.  cs
dc.language.iso  en  cs
dc.publisher  Elsevier  cs
dc.relation.ispartofseries  Information Sciences  cs
dc.relation.uri  https://doi.org/10.1016/j.ins.2023.119108  cs
dc.rights  © 2023 The Authors. Published by Elsevier Inc.  cs
dc.rights.uri  http://creativecommons.org/licenses/by/4.0/  cs
dc.subject  graph transformer  cs
dc.subject  graph representation  cs
dc.subject  low-rank attention  cs
dc.subject  global representation vector  cs
dc.title  Low-rank and global-representation-key-based attention for graph transformer  cs
dc.type  article  cs
dc.identifier.doi  10.1016/j.ins.2023.119108
dc.rights.access  openAccess  cs
dc.type.version  publishedVersion  cs
dc.type.status  Peer-reviewed  cs
dc.description.source  Web of Science  cs
dc.description.volume  642  cs
dc.description.firstpage  art. no. 119108  cs
dc.identifier.wos  000998393300001
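
As a complement to the abstract above, the following is a minimal, self-contained sketch of the core idea it describes: an attention layer whose Key is built from a global representation (GR) of each node's α-hop neighbourhood rather than from raw node features. The class name, the single-head linear projections, and the mean-style α-hop propagation used to form the GR are illustrative assumptions based only on the abstract; the paper's exact GR compositions and update rule are not reproduced here.

```python
import torch
import torch.nn as nn


class GRKeyAttention(nn.Module):
    """Sketch of a global-representation (GR) keyed attention layer.

    Assumed formulation (not the paper's exact one): Queries and Values come
    from node features, while the Key is built from a GR that summarizes each
    node's alpha-hop neighbourhood structure.
    """

    def __init__(self, dim: int):
        super().__init__()
        self.q_proj = nn.Linear(dim, dim)
        self.k_proj = nn.Linear(dim, dim)  # applied to the GR, not raw node features
        self.v_proj = nn.Linear(dim, dim)
        self.scale = dim ** -0.5

    def forward(self, x: torch.Tensor, adj: torch.Tensor, alpha: int = 2) -> torch.Tensor:
        # x:   (N, dim) node features
        # adj: (N, N) row-normalized adjacency matrix
        # GR: node features propagated over alpha hops (one simple choice;
        # the paper explores several GR compositions).
        gr = x
        for _ in range(alpha):
            gr = adj @ gr
        q = self.q_proj(x)   # queries from node features
        k = self.k_proj(gr)  # keys from the global representation
        v = self.v_proj(x)   # values from node features
        attn = torch.softmax((q @ k.transpose(0, 1)) * self.scale, dim=-1)
        return attn @ v


# Toy usage: 5 nodes with 8-dimensional features on a small random graph.
if __name__ == "__main__":
    n, dim = 5, 8
    x = torch.randn(n, dim)
    a = (torch.rand(n, n) > 0.5).float()
    a = a / a.sum(dim=1, keepdim=True).clamp(min=1)  # row-normalize
    layer = GRKeyAttention(dim)
    print(layer(x, a).shape)  # torch.Size([5, 8])
```

The toy usage only checks output shapes; in practice the adjacency would come from the benchmark graph and the layer would sit inside a full Transformer block with normalization and feed-forward sublayers.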

