[PECOBLR-666] Added support for Variant datatype in SQLAlchemy #42
msrathore-db merged 5 commits into main
Conversation
jprakash-db
left a comment
Can you add the variant type to the sqlalchemy_example.py file, so that users have an example?
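For reference, a usage example in sqlalchemy_example.py could look roughly like the sketch below. This is only a sketch: the DatabricksVariant import path and the table/column names are assumptions based on this PR, not the shipped example.

    from sqlalchemy import Column, Integer, MetaData, Table

    from databricks.sqlalchemy import DatabricksVariant  # assumed import path

    metadata = MetaData()

    events = Table(
        "events",
        metadata,
        Column("id", Integer, primary_key=True),
        # VARIANT column that accepts arbitrary JSON-like Python values (dicts, lists, scalars)
        Column("payload", DatabricksVariant),
    )

    # Insert a nested dict; the type serializes it to JSON and wraps it in PARSE_JSON()
    insert_stmt = events.insert().values(id=1, payload={"user": "alice", "tags": ["a", "b"]})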
        return tuple(value)
    elif isinstance(value, dict):
-       return tuple(value.items())
+       return tuple(sorted(value.items()))
Is the sorting needed? The response from the server is in the same order that we insert, right?
This is needed because parse_json changes the order of the inserted keys, so we need to sort before verifying.
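To illustrate why the sorting matters, here is a minimal sketch with illustrative values: PARSE_JSON may reorder object keys, so an order-sensitive comparison of dict items can fail even when the data round-trips correctly.

    inserted = {"b": 2, "a": 1}
    returned = {"a": 1, "b": 2}  # the server may hand the keys back in a different order

    # An order-sensitive comparison of items() fails here:
    assert tuple(inserted.items()) != tuple(returned.items())

    # Sorting the items first makes the comparison robust to key reordering:
    assert tuple(sorted(inserted.items())) == tuple(sorted(returned.items()))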
for key in ['variant_simple_col', 'variant_nested_col', 'variant_array_col', 'variant_mixed_col']:
    if compare[key] is not None:
        compare[key] = json.loads(compare[key])
Is this part even needed?
Yes. The server returns the VARIANT value as a string, so it's better to convert it from JSON and then check whether the output matches.
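A small sketch of the round-trip check being described, with illustrative values: the VARIANT column comes back as a JSON string, so it has to be parsed before comparing it with the Python object that was inserted.

    import json

    inserted = {"name": "alice", "scores": [1, 2, 3]}
    returned = '{"name":"alice","scores":[1,2,3]}'  # VARIANT is returned as a JSON string

    # Comparing the raw string to the dict would never match, so parse it first.
    assert json.loads(returned) == inserted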
def literal_processor(self, dialect):
    """Process literal values for SQL generation.

    For VARIANT columns, use PARSE_JSON() to properly insert data.
    """
    def process(value):
        if value is None:
            return "NULL"
        try:
            json_str = self.pe.escape_string(json.dumps(value, ensure_ascii=False, separators=(',', ':')))
        except (TypeError, ValueError) as e:
            raise ValueError(f"Cannot serialize value {value} to JSON: {e}")
        return f"PARSE_JSON('{json_str}')"

    return process
When you have a bind processor, why do you need a literal processor? Both seem to be doing the same thing.
The literal processor is called when the literal_binds parameter is set to True for a SQLAlchemy query. We can remove this section since the default value for that parameter is False.
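For context, a minimal sketch of the two compilation paths; the table definition and the DatabricksVariant import path are illustrative assumptions.

    from sqlalchemy import Column, Integer, MetaData, Table, insert

    from databricks.sqlalchemy import DatabricksVariant  # assumed import path

    metadata = MetaData()
    t = Table("t", metadata, Column("id", Integer), Column("payload", DatabricksVariant))

    stmt = insert(t).values(id=1, payload={"a": 1})

    # Default compilation keeps bind placeholders, so bind_processor handles the value at execution time.
    print(stmt.compile())

    # Only with literal_binds=True are values rendered inline in the SQL string,
    # which is the case where literal_processor is invoked.
    print(stmt.compile(compile_kwargs={"literal_binds": True}))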
dtype_mapping = {
    "variant_simple_col": DatabricksVariant,
    "variant_nested_col": DatabricksVariant,
    "variant_array_col": DatabricksVariant,
    "variant_mixed_col": DatabricksVariant
What if there are other types apart from variant, such as int or array, etc.? Does this dtype mapping need to be provided only for the variant columns or for all columns?
It's better to provide the entire mapping. If we do not provide this mapping, the data is stored as a string for complex types. However, for general types like int, float, etc., we do not need to map them explicitly.
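A minimal sketch of what such a call could look like with pandas.to_sql(); the connection URL, table name, and the DatabricksVariant import path are placeholders/assumptions.

    import pandas as pd
    from sqlalchemy import create_engine

    from databricks.sqlalchemy import DatabricksVariant  # assumed import path

    engine = create_engine("databricks://token:<token>@<host>?http_path=<path>")  # fill in workspace details

    df = pd.DataFrame({
        "id": [1, 2],
        "variant_simple_col": [{"a": 1}, {"b": 2}],
    })

    # Only the variant columns need an explicit dtype entry; plain types like int are inferred.
    df.to_sql(
        "variant_table",
        engine,
        if_exists="append",
        index=False,
        dtype={"variant_simple_col": DatabricksVariant},
    )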
merged changes from main
…essor for variant
jprakash-db
left a comment
LGTM. Thanks for making the changes
Description
Added support for the Variant type. Users can now insert into and read from Variant type columns.
Since pandas is one of the most important use cases for SQLAlchemy, additional tests have been added for pandas-specific scenarios. Users need to explicitly provide the dtype parameter to pandas.to_sql() to define which columns are of Variant type.
Testing
Added unit and E2E tests for the DatabricksVariant type.
Related tickets and documents
PECOBLR-666