-
Notifications
You must be signed in to change notification settings - Fork 36
Open
Description
Hi,
While performing Exploratory Data Analysis (EDA) on the txs_features dataset (downloaded from this repository / Google Drive), I found that several columns contain missing values.
I would like to confirm:
- Is this expected behaviour in the original dataset?
- Or could this be caused by incorrect preprocessing on my side?
Details:
- Dataset source: GitHub repository / Drive version
- File name: txs_features.csv
Observation:
- Columns with missing values:
- in_txs_degree
- out_txs_degree
- total_BTC
- fees
- size
- num_input_addresses
- num_output_addresses
- in_BTC_min
- in_BTC_max
- in_BTC_mean
- in_BTC_median
- in_BTC_total
- out_BTC_min
- out_BTC_max
- out_BTC_mean
- out_BTC_median
- out_BTC_total
- Number of affected rows: 965
Additionally, I have attached an Excel file listing the transactions with missing values for reference.
rows_with_missing_in_txs_features.csv
Could you kindly confirm whether:
- These missing values are part of the dataset design?
- Or if there is a preprocessing step I may have missed?
Thank you.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels