基于可解释不平衡数据分类方法的加密货币交易欺诈检测
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP 183

基金项目:

教育部人文社会科学研究项目(23YJCZH281);国家自然科学基金资助项目(72301136);信息网络安全公安部重点实验室开放课题资助项目(C23600)


Cryptocurrency transaction fraud detection based on imbalanced data classification method with interpretable analysis
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    鉴于正常和欺诈交易数据在加密货币交易样本中分布极端不平衡,以及交易数据的高维特征和非线性关系,提出了一种用于加密货币交易欺诈检测的可解释不平衡数据分类方法。首先,使用synthetic minority oversampling technique(SMOTE)过采样和对比学习的数据增强策略来平衡数据;接着,引入基于transformer的深度学习模型,学习样本相关性,并通过基于对比损失的预训练和基于贝叶斯优化的微调策略来优化模型,更好地区分正常和欺诈交易的特征分布,提取与欺诈相关的高阶、高维特征;最后,设计基于shapley additive explanations(SHAP)的解释器,结合注意力分数对模型预测进行解释,揭示不同交易特征在欺诈检测中的作用。对比实验结果表明,该模型在召回率方面表现出色,能全面识别加密货币交易中的欺诈活动。同时,在F1值上达到最佳,很好地平衡了准确率和召回率。消融实验验证了所提出的数据平衡和预训练?微调策略的必要性,说明其能有效处理不平衡数据的分类问题。研究不仅丰富了金融欺诈检测的研究体系,还增强了加密货币的交易安全性,促进市场健康发展,维护经济稳定与社会安全。

    Abstract:

    Given the extreme imbalance in the distribution of normal and fraudulent transactions data in cryptocurrency transaction samples, as well as the complex high-dimensional features and nonlinear relationships in transaction data, an imbalanced data classification method with interpretable analysis for cryptocurrency transaction fraud detection was proposed. Firstly, synthetic minority oversampling technique(SMOTE) and contrastive learning-based data enhancement strategies were used to balance the data. Secondly, a deep learning model based on transformer was introduced to learn sample relevance. Pre-training based on contrastive loss and fine-tuning strategies based on Bayesian were used to optimize the model. It distinguished the distribution of features between normal and fraudulent transactions bertter, extracted higher-order, high-dimensional features related to fraud. Finally, the interpreter was designed based on shapley additive explanations(SHAP), and combined with attention scores for model predictions interpretaion to reveal the role of different transaction features in fraud detection. Comparative results show that the model performs well in recall and can fully identify fraudulent activities in cryptocurrency transactions. Meanwhile, it achieves an excellent F1 value, striking a balance between accuracy and recall. Ablation experiments verify the necessity of the proposed data balancing and pre-training-fine-tuning strategies, show that it can effectively deal with the classification problem of unbalanced data. The research not only enriches financial fraud detection but also enhances cryptocurrency transaction security, promotes market development, and contributes to economic stability and social security.

    参考文献
    相似文献
    引证文献
引用本文

尹裴,蒋文龙,徐岩.基于可解释不平衡数据分类方法的加密货币交易欺诈检测[J].上海理工大学学报,2025,47(4):461-470.

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2024-06-26
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2025-09-29
  • 出版日期:
文章二维码