Skip to content
This repository was archived by the owner on Jan 28, 2025. It is now read-only.
This repository was archived by the owner on Jan 28, 2025. It is now read-only.

JSON data of "coding" nested field is exported as a string instead of the proper JSON structure in parquet files #321

@yuexintonychen

Description

@yuexintonychen

Hi,

I observed some inconsistencies between the live data in AHDS FHIR services and the data in parquet files exported by the corresponding FHIR-to-Synapse Sync Agent, specifically on the value of "coding" nested field. The fhir data has the proper JSON structure but in parquet files the same data is written in string formats.

Some examples I found across different FHIR resource types:

  • patient: identifier[].type.coding
    • fhir data: [{"system":"http://terminology.hl7.org/CodeSystem/v2-0203","code":"MR","display":"Medical Record Number"}]
    • parquet data: "[{\"system\":\"http://terminology.hl7.org/CodeSystem/v2-0203\",\"code\":\"MR\",\"display\":\"Medical Record Number\"}]"
  • observation: component[].code.coding
    • fhir data: [{"code":"age","display":"Age"}]
    • parquet data: "[{\"code\":\"age\",\"display\":\"Age\"}]"
  • observation: component[].value.codeableConcept.coding
    • fhir data: [{"code":"date","display":"Exact Date"}]
    • parquet data: "[{\"code\":\"date\",\"display\":\"Exact Date\"}]"

There seems to be a pattern where all values of "coding" field are stored in string format instead of JSON structure.

Could anyone please check if you have seen the same issue?

Best regards,
Tony

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions