Evaluation of performance enhancement in Ethereum fraud detection using oversampling techniques
| dc.contributor.author | Ravindranath, Vaishali | |
| dc.contributor.author | Nallakaruppan, M. K. | |
| dc.contributor.author | Shri, M. Lawanya | |
| dc.contributor.author | Balusamy, Balamurugan | |
| dc.contributor.author | Bhattacharyya, Siddhartha | |
| dc.date.accessioned | 2025-02-19T16:28:59Z | |
| dc.date.available | 2025-02-19T16:28:59Z | |
| dc.date.issued | 2024 | |
| dc.description.abstract | With the growing popularity of cryptocurrencies and their decentralized nature, the risk of fraudulent activities within these ecosystems has become a pressing concern. This research paper focuses on Ethereum fraud detection using a dataset specifically curated for this purpose. The methodology encompasses essential steps, including data cleaning, correlation analysis, data splitting, and exploratory data analysis to understand the data characteristics. Subsequently, self -optimized machine learning models are trained with the Pycaret library while addressing the class imbalance using SMOTENC (Synthetic Minority oversampling Technique for Nominal and Continuous Data), ADA-SYN (Adaptive Synthetic Algorithm), and K -Means -SMOTE techniques. The performance of the various models is evaluated on test and validation datasets using metrics such as accuracy, precision, recall, and AUC (Area Under Curve). The study reveals that the ensemble models, particularly CATBoost (Categorical Boost) and LGBM (Light Gradient Boost Method), show exceptional efficiency, with accuracy ranging from 97% to 98.42% after oversampling. Moreover, these models exhibit higher F1 scores and AUC values, indicating their potential to detect fraud effectively. The validation metrics also lie in the same range, demonstrating that the models do not suffer from over -fitting. The experiment demonstrates the promise of ensemble models in Ethereum fraud detection, paving the way for deploying robust fraud detection systems in crypto-currency ecosystems. The results show that the K -Means SMOTE oversampling technique has the highest classification accuracy levels of 98.42% with an AUC of 99.82%. | cs |
| dc.description.firstpage | art. no. 111698 | cs |
| dc.description.source | Web of Science | cs |
| dc.description.volume | 161 | cs |
| dc.identifier.citation | Applied Soft Computing. 2024, vol. 161, art. no. 111698. | cs |
| dc.identifier.doi | 10.1016/j.asoc.2024.111698 | |
| dc.identifier.issn | 1568-4946 | |
| dc.identifier.issn | 1872-9681 | |
| dc.identifier.uri | http://hdl.handle.net/10084/155760 | |
| dc.identifier.wos | 001242793400001 | |
| dc.language.iso | en | cs |
| dc.publisher | Elsevier | cs |
| dc.relation.ispartofseries | Applied Soft Computing | cs |
| dc.relation.uri | https://doi.org/10.1016/j.asoc.2024.111698 | cs |
| dc.rights | © 2024 Elsevier B.V. All rights reserved. | cs |
| dc.subject | SMOTENC | cs |
| dc.subject | ADASYN | cs |
| dc.subject | K-Means SMOTE | cs |
| dc.subject | LGBM | cs |
| dc.title | Evaluation of performance enhancement in Ethereum fraud detection using oversampling techniques | cs |
| dc.type | article | cs |
| dc.type.status | Peer-reviewed | cs |
Files
License bundle
1 - 1 out of 1 results
Loading...
- Name:
- license.txt
- Size:
- 718 B
- Format:
- Item-specific license agreed upon to submission
- Description: