Saving dtypes as metadata for roundtrip consistency by yfukai · Pull Request #262 · royerlab/tracksdata

yfukai · 2026-02-17T06:39:50Z

This is a follow-up on the dtype, assuming #260 has been merged.
In this PR, the graphs store the dtypes of the fields as metadata, enabling robust recovery of the original dtypes during round-trip from_other conversions.

yfukai · 2026-02-17T06:45:35Z

Notes:

I should also store the default values. Serializing the polars dataframes is useful to store the values?
The roles of AttrSchema overlap with the current metadata. This could be SQLGraph-specific machinery.

yfukai · 2026-02-18T02:20:13Z

Both tasks are done. I'll review the code for SQLgraph and mark this open for review.

codecov-commenter · 2026-02-18T02:21:16Z

Codecov Report

❌ Patch coverage is 86.20690% with 24 lines in your changes missing coverage. Please review.
✅ Project coverage is 87.76%. Comparing base (e4bb183) to head (f99e20f).

Files with missing lines	Patch %	Lines
src/tracksdata/utils/_dtypes.py	71.83%	11 Missing and 9 partials ⚠️
src/tracksdata/graph/_sql_graph.py	95.50%	3 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #262      +/-   ##
==========================================
- Coverage   87.92%   87.76%   -0.16%     
==========================================
  Files          56       56              
  Lines        4471     4610     +139     
  Branches      789      810      +21     
==========================================
+ Hits         3931     4046     +115     
- Misses        344      360      +16     
- Partials      196      204       +8

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

JoOkuma · 2026-02-27T00:09:35Z

@yfukai sorry for the delay.
I'll review this once #260 is merged. Everything looks good so far.

JoOkuma

Great PR @yfukai,

I appreciate all the tests you're adding.
I left a few minor comments.

JoOkuma · 2026-02-27T16:02:01Z

src/tracksdata/graph/_sql_graph.py

+        ordered_keys = [key for key in preferred_order if key in schemas]
+        ordered_keys.extend(key for key in table_class.__table__.columns.keys() if key not in ordered_keys)
+        ordered_keys.extend(key for key in schemas if key not in ordered_keys)
+        return {key: schemas[key] for key in ordered_keys}


What do you think of this implementation? This way, we can avoid checking if it's in ordered_keys, which is a list, and it takes a bit longer to check if the column is there.

Suggested change

ordered_keys = [key for key in preferred_order if key in schemas]

ordered_keys.extend(key for key in table_class.__table__.columns.keys() if key not in ordered_keys)

ordered_keys.extend(key for key in schemas if key not in ordered_keys)

return {key: schemas[key] for key in ordered_keys}

result = {}

# return dictionary in preferred order

for source in (

preferred_order,

table_class.__table__.columns.keys(),

schemas,

):

for key in source:

if key in schemas:

result.setdefault(key, schemas[key])

return result

JoOkuma · 2026-02-27T16:05:31Z

src/tracksdata/graph/_sql_graph.py

+        else:
+            nodes_df = nodes_df.select([pl.col(c) for c in self._node_attr_schemas() if c in nodes_df.columns])


@yfukai , it's not clear to me why this is necessary.
Is it because it might have private attributes?
Could you include a comment on this and the equivalent edge_attr_code?

JoOkuma · 2026-02-27T16:11:22Z

src/tracksdata/graph/_sql_graph.py

+        edge_schemas = self.__edge_attr_schemas
        # Process arguments and create validated schema
-        schema = process_attr_key_args(key_or_schema, dtype, default_value, self.__edge_attr_schemas)
-
-        # Store schema
-        self.__edge_attr_schemas[schema.key] = schema
+        schema = process_attr_key_args(key_or_schema, dtype, default_value, edge_schemas)

        # Add column to database
        self._add_new_column(self.Edge, schema)
+        edge_schemas[schema.key] = schema
+        self.__edge_attr_schemas = edge_schemas

    def remove_edge_attr_key(self, key: str) -> None:
        if key not in self.edge_attr_keys():
            raise ValueError(f"Edge attribute key {key} does not exist")

+        edge_schemas = self.__edge_attr_schemas
        self._drop_column(self.Edge, key)
-        self.__edge_attr_schemas.pop(key, None)
+        edge_schemas.pop(key, None)
+        self.__edge_attr_schemas = edge_schemas


I was a bit lost at first here, but this seems required because self.__edge_attr_schemas has a setter and getter.

Do you think it would be worth being more explicit than my original implementation with @property and .setter? We could address this in another PR.

yfukai added 5 commits February 17, 2026 13:26

added private metadata machinery

29177f3

before adding private

d8292f1

added private metadata view

cff5898

renamed func

68b01d4

implementation of saving and loading dtypes as metadata

1ae2426

yfukai marked this pull request as draft February 17, 2026 06:42

yfukai added 4 commits February 17, 2026 15:48

lint

c50a07b

restricted dtype metadata to sqlgraph

e9bf28f

udpated serialization strategies

9aa9c3a

solved failing tests

7e61ac3

yfukai added 3 commits February 18, 2026 11:45

added test for shape-less pl.Array (xfail)

e5968bf

working

b4acde3

simplified code

cc55976

yfukai marked this pull request as ready for review February 19, 2026 01:48

yfukai mentioned this pull request Feb 19, 2026

Default value of 'pl.Array' typed attribute fails in graph.node_attrs() #251

Open

This was referenced Feb 26, 2026

Fixing regions props node attribute init #259

Open

Struct attrributes #268

Draft

yfukai mentioned this pull request Feb 27, 2026

Dict-like metadata interface & private metadata #260

Merged

Merge branch 'main' into from_other_roundtrip

f99e20f

JoOkuma reviewed Feb 27, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Saving dtypes as metadata for roundtrip consistency#262

Saving dtypes as metadata for roundtrip consistency#262
yfukai wants to merge 13 commits intoroyerlab:mainfrom
yfukai:from_other_roundtrip

yfukai commented Feb 17, 2026

Uh oh!

yfukai commented Feb 17, 2026 •

edited

Loading

Uh oh!

yfukai commented Feb 18, 2026 •

edited

Loading

Uh oh!

codecov-commenter commented Feb 18, 2026 •

edited

Loading

Uh oh!

JoOkuma commented Feb 27, 2026

Uh oh!

JoOkuma left a comment

Uh oh!

JoOkuma Feb 27, 2026

Uh oh!

JoOkuma Feb 27, 2026

Uh oh!

JoOkuma Feb 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

-        ordered_keys = [key for key in preferred_order if key in schemas]
-        ordered_keys.extend(key for key in table_class.__table__.columns.keys() if key not in ordered_keys)
-        ordered_keys.extend(key for key in schemas if key not in ordered_keys)
-        return {key: schemas[key] for key in ordered_keys}
+        result = {}
+        # return dictionary in preferred order
+        for source in (
+            preferred_order,
+            table_class.__table__.columns.keys(),
+            schemas,
+        ):
+            for key in source:
+                if key in schemas:
+                    result.setdefault(key, schemas[key])
+        return result

		else:
		nodes_df = nodes_df.select([pl.col(c) for c in self._node_attr_schemas() if c in nodes_df.columns])

Conversation

yfukai commented Feb 17, 2026

Uh oh!

yfukai commented Feb 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yfukai commented Feb 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov-commenter commented Feb 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

JoOkuma commented Feb 27, 2026

Uh oh!

JoOkuma left a comment

Choose a reason for hiding this comment

Uh oh!

JoOkuma Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

JoOkuma Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

JoOkuma Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

yfukai commented Feb 17, 2026 •

edited

Loading

yfukai commented Feb 18, 2026 •

edited

Loading

codecov-commenter commented Feb 18, 2026 •

edited

Loading