Evaluation of performance enhancement in Ethereum fraud detection using oversampling techniques

dc.contributor.authorRavindranath, Vaishali
dc.contributor.authorNallakaruppan, M. K.
dc.contributor.authorShri, M. Lawanya
dc.contributor.authorBalusamy, Balamurugan
dc.contributor.authorBhattacharyya, Siddhartha
dc.date.accessioned2025-02-19T16:28:59Z
dc.date.available2025-02-19T16:28:59Z
dc.date.issued2024
dc.description.abstractWith the growing popularity of cryptocurrencies and their decentralized nature, the risk of fraudulent activities within these ecosystems has become a pressing concern. This research paper focuses on Ethereum fraud detection using a dataset specifically curated for this purpose. The methodology encompasses essential steps, including data cleaning, correlation analysis, data splitting, and exploratory data analysis to understand the data characteristics. Subsequently, self -optimized machine learning models are trained with the Pycaret library while addressing the class imbalance using SMOTENC (Synthetic Minority oversampling Technique for Nominal and Continuous Data), ADA-SYN (Adaptive Synthetic Algorithm), and K -Means -SMOTE techniques. The performance of the various models is evaluated on test and validation datasets using metrics such as accuracy, precision, recall, and AUC (Area Under Curve). The study reveals that the ensemble models, particularly CATBoost (Categorical Boost) and LGBM (Light Gradient Boost Method), show exceptional efficiency, with accuracy ranging from 97% to 98.42% after oversampling. Moreover, these models exhibit higher F1 scores and AUC values, indicating their potential to detect fraud effectively. The validation metrics also lie in the same range, demonstrating that the models do not suffer from over -fitting. The experiment demonstrates the promise of ensemble models in Ethereum fraud detection, paving the way for deploying robust fraud detection systems in crypto-currency ecosystems. The results show that the K -Means SMOTE oversampling technique has the highest classification accuracy levels of 98.42% with an AUC of 99.82%.cs
dc.description.firstpageart. no. 111698cs
dc.description.sourceWeb of Sciencecs
dc.description.volume161cs
dc.identifier.citationApplied Soft Computing. 2024, vol. 161, art. no. 111698.cs
dc.identifier.doi10.1016/j.asoc.2024.111698
dc.identifier.issn1568-4946
dc.identifier.issn1872-9681
dc.identifier.urihttp://hdl.handle.net/10084/155760
dc.identifier.wos001242793400001
dc.language.isoencs
dc.publisherElseviercs
dc.relation.ispartofseriesApplied Soft Computingcs
dc.relation.urihttps://doi.org/10.1016/j.asoc.2024.111698cs
dc.rights© 2024 Elsevier B.V. All rights reserved.cs
dc.subjectSMOTENCcs
dc.subjectADASYNcs
dc.subjectK-Means SMOTEcs
dc.subjectLGBMcs
dc.titleEvaluation of performance enhancement in Ethereum fraud detection using oversampling techniquescs
dc.typearticlecs
dc.type.statusPeer-reviewedcs

Files

License bundle

Now showing 1 - 1 out of 1 results
Loading...
Thumbnail Image
Name:
license.txt
Size:
718 B
Format:
Item-specific license agreed upon to submission
Description: