Accelerating Multilingual Cryptocurrency Forensics: An NLP-Driven Approach for Efficient Mnemonic Identification
The increasing use of cryptocurrencies in criminal activities presents significant challenges to society and the judicial system, particularly in tracking and seizing illicit digital assets. Among all relevant digital evidence, mnemonic phrases, which are critical for accessing cryptocurrency wallet...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2025-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10838568/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832592906079174656 |
---|---|
author | Hsin-Hsiung Kao |
author_facet | Hsin-Hsiung Kao |
author_sort | Hsin-Hsiung Kao |
collection | DOAJ |
description | The increasing use of cryptocurrencies in criminal activities presents significant challenges to society and the judicial system, particularly in tracking and seizing illicit digital assets. Among all relevant digital evidence, mnemonic phrases, which are critical for accessing cryptocurrency wallets, are crucial digital evidence for confiscating criminal proceeds and conducting investigations. However, traditional digital forensics tools, such as the Mnemonic Library Matching Method, lack flexibility and efficiency when handling cryptocurrency-related data. This study introduces an innovative Natural Language Processing (NLP) and deep learning approach for rapid mnemonic identification across 11 languages, including English, Spanish, and Japanese. We trained and compared four NLP deep learning models: RNN, LSTM, BiLSTM, and TextCNN, on a large-scale, real-world dataset. Our analysis reveals that the Text Convolutional Neural Network (TextCNN) model exhibits superior performance, achieving a 99.9993% accuracy rate, nearly matching the 100% accuracy of the Mnemonic Library Matching Method. Crucially, our TextCNN-driven approach processes data 40.47 times faster than the traditional method, significantly enhancing efficiency in time-sensitive forensic environments. This NLP-driven method not only maintains high accuracy while dramatically reducing processing time but also offers greater adaptability for diverse forensic needs compared to traditional techniques. By enabling more effective tracking and seizure of criminal assets, this approach aims to address the broader societal and judicial challenges posed by cryptocurrency-related criminal activities. Our research showcases the potential of NLP and deep learning in digital forensics, providing law enforcement with advanced tools for investigating cryptocurrency-related crimes and curbing the misuse of cryptocurrencies in illicit activities. |
format | Article |
id | doaj-art-0d9688ef6e7b4447b9c6f520fb8f4a9f |
institution | Kabale University |
issn | 2169-3536 |
language | English |
publishDate | 2025-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj-art-0d9688ef6e7b4447b9c6f520fb8f4a9f2025-01-21T00:00:53ZengIEEEIEEE Access2169-35362025-01-0113105131052610.1109/ACCESS.2025.352882910838568Accelerating Multilingual Cryptocurrency Forensics: An NLP-Driven Approach for Efficient Mnemonic IdentificationHsin-Hsiung Kao0https://orcid.org/0009-0001-9895-7171Department of Information Management, Central Police University, Taoyuan, TaiwanThe increasing use of cryptocurrencies in criminal activities presents significant challenges to society and the judicial system, particularly in tracking and seizing illicit digital assets. Among all relevant digital evidence, mnemonic phrases, which are critical for accessing cryptocurrency wallets, are crucial digital evidence for confiscating criminal proceeds and conducting investigations. However, traditional digital forensics tools, such as the Mnemonic Library Matching Method, lack flexibility and efficiency when handling cryptocurrency-related data. This study introduces an innovative Natural Language Processing (NLP) and deep learning approach for rapid mnemonic identification across 11 languages, including English, Spanish, and Japanese. We trained and compared four NLP deep learning models: RNN, LSTM, BiLSTM, and TextCNN, on a large-scale, real-world dataset. Our analysis reveals that the Text Convolutional Neural Network (TextCNN) model exhibits superior performance, achieving a 99.9993% accuracy rate, nearly matching the 100% accuracy of the Mnemonic Library Matching Method. Crucially, our TextCNN-driven approach processes data 40.47 times faster than the traditional method, significantly enhancing efficiency in time-sensitive forensic environments. This NLP-driven method not only maintains high accuracy while dramatically reducing processing time but also offers greater adaptability for diverse forensic needs compared to traditional techniques. By enabling more effective tracking and seizure of criminal assets, this approach aims to address the broader societal and judicial challenges posed by cryptocurrency-related criminal activities. Our research showcases the potential of NLP and deep learning in digital forensics, providing law enforcement with advanced tools for investigating cryptocurrency-related crimes and curbing the misuse of cryptocurrencies in illicit activities.https://ieeexplore.ieee.org/document/10838568/Cryptocurrency crimedeep learningdigital forensicsmnemonic identificationnatural language processing |
spellingShingle | Hsin-Hsiung Kao Accelerating Multilingual Cryptocurrency Forensics: An NLP-Driven Approach for Efficient Mnemonic Identification IEEE Access Cryptocurrency crime deep learning digital forensics mnemonic identification natural language processing |
title | Accelerating Multilingual Cryptocurrency Forensics: An NLP-Driven Approach for Efficient Mnemonic Identification |
title_full | Accelerating Multilingual Cryptocurrency Forensics: An NLP-Driven Approach for Efficient Mnemonic Identification |
title_fullStr | Accelerating Multilingual Cryptocurrency Forensics: An NLP-Driven Approach for Efficient Mnemonic Identification |
title_full_unstemmed | Accelerating Multilingual Cryptocurrency Forensics: An NLP-Driven Approach for Efficient Mnemonic Identification |
title_short | Accelerating Multilingual Cryptocurrency Forensics: An NLP-Driven Approach for Efficient Mnemonic Identification |
title_sort | accelerating multilingual cryptocurrency forensics an nlp driven approach for efficient mnemonic identification |
topic | Cryptocurrency crime deep learning digital forensics mnemonic identification natural language processing |
url | https://ieeexplore.ieee.org/document/10838568/ |
work_keys_str_mv | AT hsinhsiungkao acceleratingmultilingualcryptocurrencyforensicsannlpdrivenapproachforefficientmnemonicidentification |