Reading PyMigTax enriched YAML files #8

@lasendo

I have downloaded the data mentioned in the paper "Characterizing Python Library Migrations", where you analyze an expanded migration dataset and create a taxonomy based on it. https://figshare.com/articles/dataset/Characterizing_Python_Library_Migrations_-_artifacts/24216858/2?file=44291174

There is a folder full of enriched YAML files that include the taxonomy features (cardinality and properties) of the code changes.

I understand these constitute the PyMigBench2.0 data you mention. How are these YAMLs supposed to be explored?

When I use the PyMigBench database instance, there are errors because the "file:" entry is "files:" in the new YAMLs.

I tried to go the easy way and changed all "files" to "file" in the YAMLs, but then there were further errors.
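Rather than renaming keys blindly, one way to see where the two schemas differ is to count which key paths appear across the files in each folder. The following is only a minimal sketch: it assumes PyYAML is installed and that every file parses to a mapping, and `key_paths` and `schema_census` are hypothetical helper names, not part of PyMigBench.

```python
from collections import Counter
from pathlib import Path


def key_paths(node, prefix=""):
    """Yield a dotted path for every mapping key in a parsed YAML document."""
    if isinstance(node, dict):
        for key, value in node.items():
            path = f"{prefix}.{key}" if prefix else str(key)
            yield path
            yield from key_paths(value, path)
    elif isinstance(node, list):
        for item in node:
            # Items of a list share the parent's path, marked with "[]".
            yield from key_paths(item, f"{prefix}[]")


def schema_census(folder):
    """Count, per key path, how many YAML files in `folder` contain it."""
    import yaml  # PyYAML, a third-party dependency (assumed to be installed)

    census = Counter()
    for yaml_file in sorted(Path(folder).glob("*.yaml")):
        with open(yaml_file, encoding="utf-8") as handle:
            document = yaml.safe_load(handle)
        # A set, so each path counts at most once per file.
        census.update(set(key_paths(document)))
    return census
```

Running `schema_census` on each of the two folders and comparing the resulting keys would list the paths that exist in only one schema, which may explain the further errors after the rename.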

I have not seen any Database class in the above folder, so I'm wondering how you worked with those enriched migration files.

Thank you!

--- Edit:

I see that the taxonomy features (cardinality and properties) are already included in the data/migration YAML files. However, I noticed that at least one YAML file has one more code change in PyMigTax (the /migrations folder shared above) than in PyMigBench (data/migrations in this repo).

"aiohttp__httpx__itzkvn@python-http-monitoring__790a483.yaml" vs. "aiohttp__httpx__itzkvn@python-http-monitoring__790a4830.yaml":

Image

Which is the complete dataset to consider?
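To check whether this is the only file that differs, the code-change counts of the two folders could be compared with a script along these lines. It is a rough sketch under several assumptions: PyYAML is installed, file names end in `__<commit hash>` as in the two names above (the hash is cut to 7 characters so that both variants match), and the name of the key that holds the code changes is passed in by the caller, since it may differ between the two schemas. All function names are hypothetical.

```python
from pathlib import Path


def migration_id(stem, hash_length=7):
    """Shorten the trailing commit hash so both naming variants map to one id."""
    head, sep, commit = stem.rpartition("__")
    if not sep:
        return stem
    return f"{head}{sep}{commit[:hash_length]}"


def count_items(node, key):
    """Count list items stored under every occurrence of `key` in a document."""
    total = 0
    if isinstance(node, dict):
        for name, value in node.items():
            if name == key and isinstance(value, list):
                total += len(value)
            total += count_items(value, key)
    elif isinstance(node, list):
        for item in node:
            total += count_items(item, key)
    return total


def load_counts(folder, key):
    """Map each migration id in `folder` to its item count under `key`."""
    import yaml  # PyYAML, a third-party dependency (assumed to be installed)

    counts = {}
    for yaml_file in sorted(Path(folder).glob("*.yaml")):
        with open(yaml_file, encoding="utf-8") as handle:
            document = yaml.safe_load(handle)
        counts[migration_id(yaml_file.stem)] = count_items(document, key)
    return counts


def diff_counts(left, right):
    """Return {id: (left_count, right_count)} for ids where the counts differ."""
    ids = sorted(set(left) | set(right))
    return {i: (left.get(i), right.get(i)) for i in ids if left.get(i) != right.get(i)}
```

Calling `diff_counts(load_counts(folder_a, key), load_counts(folder_b, key))` returns an entry for every migration whose count differs, with `None` on the side where the file is missing.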

Thank you again!
