Skip to content

Missing Values in txs_features Dataset #9

@sharul-ayub

Description

@sharul-ayub

Hi,

While performing Exploratory Data Analysis (EDA) on the txs_features dataset (downloaded from this repository / Google Drive), I found that several columns contain missing values.

I would like to confirm:

  • Is this expected behaviour in the original dataset?
  • Or could this be caused by incorrect preprocessing on my side?

Details:

  • Dataset source: GitHub repository / Drive version
  • File name: txs_features.csv

Observation:

  • Columns with missing values:
  1. in_txs_degree
  2. out_txs_degree
  3. total_BTC
  4. fees
  5. size
  6. num_input_addresses
  7. num_output_addresses
  8. in_BTC_min
  9. in_BTC_max
  10. in_BTC_mean
  11. in_BTC_median
  12. in_BTC_total
  13. out_BTC_min
  14. out_BTC_max
  15. out_BTC_mean
  16. out_BTC_median
  17. out_BTC_total
  • Number of affected rows: 965

Additionally, I have attached an Excel file listing the transactions with missing values for reference.

rows_with_missing_in_txs_features.csv

Could you kindly confirm whether:

  1. These missing values are part of the dataset design?
  2. Or if there is a preprocessing step I may have missed?

Thank you.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions