A Bitcoin Address Behavior Dataset for Pattern Analysis

Bitcoin has emerged as a transformative force in the digital economy, offering decentralized financial transactions with unparalleled transparency. However, its pseudonymous nature also opens the door to diverse usage patterns—ranging from legitimate financial services to illicit activities. Understanding these behaviors requires robust data and analytical frameworks. This article explores the research presented in "BABD: A Bitcoin Address Behavior Dataset for Pattern Analysis", which introduces a comprehensive dataset designed to classify Bitcoin address behaviors using advanced graph-based features and machine learning models.

The study addresses a critical gap in blockchain analytics: the lack of a standardized, well-labeled dataset that captures the complexity of real-world Bitcoin transactions. By constructing a heterogeneous transaction graph and extracting both statistical and structural features, the authors provide a powerful tool for identifying behavioral patterns across 13 distinct address types.

The Need for Comprehensive Bitcoin Behavior Analysis

As Bitcoin adoption grows, so does its misuse in cybercrime, money laundering, and fraud. Traditional analysis methods often rely on simplified models that fail to capture the full network dynamics. Most existing datasets are limited in scope—either focusing on a small number of address types or lacking systematic feature engineering.

👉 Discover how advanced blockchain analytics can uncover hidden transaction patterns.

This paper responds to these limitations by introducing BABD (Bitcoin Address Behavior Dataset), a richly annotated dataset built on over 713 million transaction edges spanning nearly two years of blockchain data. The dataset supports multi-class classification of Bitcoin addresses into 13 categories, including ransomware, darknet markets, gambling, exchanges, mining pools, and personal wallets.

Core Components of the BABD Framework

The BABD framework is structured around two primary feature extraction methodologies: Statistical Indicators (SI) and Local Structural Indicators (LSI). These components work together to create a high-dimensional representation of each Bitcoin address's behavior within the transaction network.

Statistical Indicators (SI)

These features capture quantitative aspects of an address’s transaction history:

Pure Amount Indicators (PAI): Reflect token volume dynamics such as total inflow, outflow, balance changes, and transaction frequency.
Purity Degree Indicators (PDI): Measure the homogeneity of transaction relationships—e.g., whether an address interacts mostly with one type of entity or many.
Pure Time Indicators (PTI): Capture temporal patterns like average time between transactions, peak activity hours, and dormancy periods.
Composite Indicators (CI): Combine PAI, PDI, and PTI into higher-level metrics for enhanced predictive power.

Together, these indicators form a detailed behavioral fingerprint based on economic activity, interaction consistency, and timing regularity.

Local Structural Indicators (LSI)

Recognizing that global graph embedding is computationally prohibitive at scale, the authors propose generating k-hop subgraphs around each labeled address. After testing various values, they determine that k = 4 offers optimal balance between structural richness and computational feasibility.

From these subgraphs, the following network topology features are extracted:

Degree Correlation
Betweenness Centrality
Average Path Length
Graph Diameter
Closeness Centrality
PageRank Score
Subgraph Density

These structural metrics reveal how centrally or peripherally an address operates within its local network—a key signal for distinguishing between centralized services (like exchanges) and decentralized actors (like individual users).

Data Collection and Labeling Strategy

The dataset is built from blockchain data collected via public APIs, covering blocks 585,000 to 685,000 (July 2019 – May 2021). This period includes over 516 million nodes (addresses and transactions) and more than 713 million edges (transaction flows).

Address labeling combines automated sources (e.g., WalletExplorer) with manual curation to ensure accuracy. Addresses are categorized into:

Ransomware
Cybersecurity Services
Darknet Markets
Centralized Exchanges
P2P Financial Infrastructure
P2P Financial Services
Gambling
Government Blacklisted Addresses
Money Laundering
Ponzi Schemes
Mining Pools
Cryptocurrency Tumblers
Personal Wallets

To manage label reliability, addresses are classified as either Strong Addresses (SA)—those with high-confidence labels—or Weak Addresses (WA)—used cautiously in training.

Experimental Setup and Results

The researchers use Graph-tool for graph construction and feature extraction, validating algorithms with NetworkX. Machine learning models—including Random Forest, XGBoost, SVM, Logistic Regression, and Gradient Boosting—are implemented via scikit-learn.

After parallelizing computation for efficiency, the combined SI+LSI model achieves impressive performance:

Accuracy: 93.24% – 96.71%
Precision: 92.80% – 96.46%
Recall: 93.24% – 96.71%
F1-Score: 92.97% – 96.57%

These results demonstrate that integrating both transaction-level statistics and local graph topology significantly improves classification accuracy compared to using either alone.

👉 Learn how cutting-edge machine learning enhances cryptocurrency forensics.

Key Contributions and Research Implications

This work makes several impactful contributions to blockchain analytics:

Heterogeneous Graph Construction: Unlike simplified models, the proposed structure preserves both address and transaction nodes, reducing information loss.
Systematic Feature Taxonomy: The clear categorization of indicators improves reproducibility and enables future extensions.
Publicly Available Dataset: By releasing BABD under an open license, the authors enable further research in crypto forensics and anomaly detection.
Scalable Subgraph Approach: The k-hop method provides a practical solution for extracting topological features without requiring full-graph processing.

Frequently Asked Questions

Q: What makes BABD different from other Bitcoin datasets?
A: BABD stands out due to its combination of 13 labeled address types, dual-feature extraction (statistical + structural), and use of k-hop subgraphs for scalable analysis.

Q: How reliable are the address labels in the dataset?
A: Labels are derived from trusted sources like WalletExplorer and supplemented with manual verification. Strong/Weak labeling ensures transparency about confidence levels.

Q: Can this model detect new or unknown malicious behaviors?
A: While trained for multi-class classification, the framework could be extended to anomaly detection by identifying deviations from known behavioral profiles.

Q: Is the dataset publicly available?
A: Yes, BABD is accessible via the original arXiv paper and shared under academic reuse guidelines.

Q: Why is k=4 chosen for subgraph generation?
A: Empirical testing showed k=4 captures sufficient structural context while remaining computationally feasible on standard hardware.

Q: How can this research be applied in real-world scenarios?
A: Financial regulators, exchange compliance teams, and blockchain forensics firms can use similar models to flag suspicious accounts or improve KYC/AML systems.

Final Thoughts

"A Bitcoin Address Behavior Dataset for Pattern Analysis" offers a methodologically sound and practically valuable contribution to cryptocurrency research. It bridges engineering rigor with real-world applicability, demonstrating how machine learning can decode complex transaction behaviors at scale.

For researchers and practitioners alike, BABD sets a new benchmark in labeled blockchain datasets—emphasizing not just volume, but depth of insight through intelligent feature design.

👉 Explore how next-generation analytics are shaping the future of digital finance.