Split mapping.csv into pages to bypass 10 MB workspace upload limit #4856

Open
pritishpai wants to merge 3 commits into main from fix/mapping-file-size-limit

@pritishpai

Contributor

Fixes #4675

Customers with ~100K+ tables in an external HMS hit a BadRequest: File size imported is (16413218 bytes), exceeded max size (10485760 bytes) error when running create-table-mapping.
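As a rough sanity check on the sizes involved, the byte count in the error gives an approximate per-row size. The sketch below assumes the failing 16,413,218-byte file held exactly 100,000 rows (the description only says ~100K), so every derived number is an estimate rather than a measured value.

```python
# Numbers taken from the error message and the PR description.
FAILED_UPLOAD_BYTES = 16_413_218   # size of the rejected mapping.csv
MAX_UPLOAD_BYTES = 10_485_760      # workspace import limit (10 MiB)
PAGE_ROWS = 50_000                 # page size chosen in this PR

# Assumption: the rejected file held exactly 100,000 rows ("~100K" in the text).
ASSUMED_ROWS = 100_000

# Integer arithmetic keeps the estimates exact and reproducible.
estimated_page_bytes = FAILED_UPLOAD_BYTES * PAGE_ROWS // ASSUMED_ROWS
max_rows_per_file = MAX_UPLOAD_BYTES * ASSUMED_ROWS // FAILED_UPLOAD_BYTES

print(f"estimated bytes per {PAGE_ROWS:,}-row page: {estimated_page_bytes:,}")
print(f"estimated rows that fit under the limit: {max_rows_per_file:,}")
```

Under that assumption a 50,000-row page stays below the limit with some headroom, which matters because row width depends on the catalog, schema, and table name lengths in a given workspace.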

  • The Databricks Workspace import API has a hard 10 MB file size limit. With ~100K tables, mapping.csv grows to ~16 MB, exceeding this regardless of query_statement_disposition.
  • save() now chunks rules into pages of 50,000 rows each (~8 MB/file), writing mapping.csv for the first page and mapping-1.csv, mapping-2.csv, … for overflow.
  • load() reads mapping.csv then follows mapping-1.csv, mapping-2.csv, … until NotFound, assembling the full rule list transparently to all callers.
  • Backward compatible — existing single-file mapping.csv installations load unchanged.
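The paging scheme described above can be sketched roughly as follows. This is not the code in this PR: `InMemoryStore`, the local `NotFound` class, the column names in `FIELDS`, and the free functions `save`, `load`, and `page_filename` are hypothetical stand-ins used only to illustrate the naming and the read-until-missing loop.

```python
import csv
import io

PAGE_SIZE = 50_000
# Hypothetical column names; the real mapping.csv has its own schema.
FIELDS = ["src_schema", "src_table", "dst_table"]


class NotFound(Exception):
    """Stand-in for the 'file does not exist' error raised by the workspace."""


class InMemoryStore:
    """Hypothetical replacement for the workspace upload/download calls."""

    def __init__(self):
        self._files = {}

    def upload(self, name, text):
        self._files[name] = text

    def download(self, name):
        if name not in self._files:
            raise NotFound(name)
        return self._files[name]

    def names(self):
        return sorted(self._files)


def page_filename(page):
    # Page 0 keeps the legacy name so existing installations still load.
    return "mapping.csv" if page == 0 else f"mapping-{page}.csv"


def save(store, rules, page_size=PAGE_SIZE):
    # Slice the rules into fixed-size pages; always write at least mapping.csv.
    pages = [rules[i:i + page_size] for i in range(0, len(rules), page_size)] or [[]]
    for page, chunk in enumerate(pages):
        buffer = io.StringIO(newline="")
        writer = csv.DictWriter(buffer, fieldnames=FIELDS)
        writer.writeheader()
        writer.writerows(chunk)
        store.upload(page_filename(page), buffer.getvalue())


def _read_page(store, name):
    text = store.download(name)  # raises NotFound when the page is absent
    return list(csv.DictReader(io.StringIO(text, newline="")))


def load(store):
    # The first page must exist; overflow pages are read until one is missing.
    rules = _read_page(store, page_filename(0))
    page = 1
    while True:
        try:
            rules.extend(_read_page(store, page_filename(page)))
        except NotFound:
            return rules
        page += 1
```

A sketch like this does not clean up overflow pages left behind by an earlier, larger save, so a later save with fewer pages would leave stale `mapping-N.csv` files that `load` would still pick up; the description does not say how the PR treats that case.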

Test plan

  • Unit test: save() with > 50,000 rules writes both mapping.csv and mapping-1.csv with correct row counts
  • Unit test: load() with multiple pages assembles all rules in order
  • Unit test: load() with a single-page legacy mapping.csv still returns all rules (backward compat)
  • Manual: reproduce with a workspace that has 100K+ HMS tables and confirm no BadRequest on create-table-mapping

@pritishpai requested a review from a team as a code owner, June 4, 2026 20:04
@pritishpai force-pushed the fix/mapping-file-size-limit branch from e5c4317 to edd3071, June 4, 2026 20:06
@pritishpai requested a review from FastLee, June 4, 2026 20:12
@github-actions

github-actions Bot commented Jun 4, 2026

❌ 41/44 passed, 2 flaky, 3 failed, 5 skipped, 7h15m55s total

❌ test_hiveserde_table_ctas_migration_job: TimeoutError: Timed out after 0:05:00 (6m39.084s)
TimeoutError: Timed out after 0:05:00
[gw9] linux -- Python 3.10.20 /home/runner/work/ucx/ucx/.venv/bin/python
20:38 DEBUG [databricks.labs.ucx.install] Cannot find previous installation: Path (/Users/0a330eb5-dd51-4d97-b6e4-c474356b1d5d/.JsmO/config.yml) doesn't exist.
20:38 INFO [databricks.labs.ucx.install] Please answer a couple of questions to configure Unity Catalog migration
20:38 INFO [databricks.labs.ucx.installer.hms_lineage] HMS lineage init script already exists and enabled
20:38 INFO [databricks.labs.ucx.install] Fetching installations...
20:38 DEBUG [databricks.labs.ucx.install] Cannot find previous installation: Path (/Users/0a330eb5-dd51-4d97-b6e4-c474356b1d5d/.JsmO/config.yml) doesn't exist.
20:38 INFO [databricks.labs.ucx.install] Please answer a couple of questions to configure Unity Catalog migration
20:38 INFO [databricks.labs.ucx.installer.hms_lineage] HMS lineage init script already exists and enabled
20:38 INFO [databricks.labs.ucx.install] Fetching installations...
20:43 DEBUG [databricks.labs.ucx.install] Cannot find previous installation: Path (/Users/0a330eb5-dd51-4d97-b6e4-c474356b1d5d/.JsmO/config.yml) doesn't exist.
20:43 INFO [databricks.labs.ucx.install] Please answer a couple of questions to configure Unity Catalog migration
20:43 INFO [databricks.labs.ucx.installer.hms_lineage] HMS lineage init script already exists and enabled
20:43 INFO [databricks.labs.ucx.install] Fetching installations...
20:44 WARNING [databricks.labs.ucx.install] Existing installation at /Users/0a330eb5-dd51-4d97-b6e4-c474356b1d5d/.JsmO is corrupted. Skipping...
20:44 INFO [databricks.labs.ucx.installer.policy] Creating UCX cluster policy.
20:44 DEBUG [tests.integration.conftest] Waiting for clusters to start...
20:44 DEBUG [tests.integration.conftest] Waiting for clusters to start...
20:44 INFO [databricks.labs.ucx.install] Deleting UCX v0.60.2+1320260604204449 from https://DATABRICKS_HOST
20:44 INFO [databricks.labs.ucx.install] Deleting inventory database dummy_ssvjxxuaa
20:44 INFO [databricks.labs.ucx.install] Deleting cluster policy
20:44 INFO [databricks.labs.ucx.install] Deleting secret scope
20:44 INFO [databricks.labs.ucx.install] UnInstalling UCX complete
[gw9] linux -- Python 3.10.20 /home/runner/work/ucx/ucx/.venv/bin/python
❌ test_table_migration_job_refreshes_migration_status[regular-migrate-tables]: AssertionError: Workflow failed: migrate-tables (31m0.509s)
AssertionError: Workflow failed: migrate-tables
assert False
[gw3] linux -- Python 3.10.20 /home/runner/work/ucx/ucx/.venv/bin/python
20:28 DEBUG [databricks.labs.ucx.install] Cannot find previous installation: Path (/Users/0a330eb5-dd51-4d97-b6e4-c474356b1d5d/.O4YO/config.yml) doesn't exist.
20:28 INFO [databricks.labs.ucx.install] Please answer a couple of questions to configure Unity Catalog migration
20:28 INFO [databricks.labs.ucx.installer.hms_lineage] HMS lineage init script already exists and enabled
20:28 INFO [databricks.labs.ucx.install] Fetching installations...
20:28 WARNING [databricks.labs.ucx.install] Existing installation at /Users/0a330eb5-dd51-4d97-b6e4-c474356b1d5d/.O4YO is corrupted. Skipping...
20:28 INFO [databricks.labs.ucx.installer.policy] Creating UCX cluster policy.
20:28 DEBUG [tests.integration.conftest] Waiting for clusters to start...
20:28 DEBUG [tests.integration.conftest] Waiting for clusters to start...
20:28 INFO [databricks.labs.ucx.install] Installing UCX v0.60.2+1320260604202857
20:28 INFO [databricks.labs.ucx.install] Creating ucx schemas...
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=migrate-groups-legacy
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=migrate-tables-in-mounts-experimental
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=migration-progress-experimental
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=migrate-external-tables-ctas
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=scan-tables-in-mounts-experimental
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=convert-wasbs-to-adls-gen2-experimental
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=remove-workspace-local-backup-groups
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=migrate-groups
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=failing
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=migrate-tables
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=assessment
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=assess-workflows
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=migrate-external-hiveserde-tables-in-place-experimental
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=migrate-data-reconciliation
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=validate-groups-permissions
20:29 INFO [databricks.labs.ucx.install] Creating dashboards...
20:29 DEBUG [databricks.labs.ucx.install] Reading step folder /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/migration...
20:29 DEBUG [databricks.labs.ucx.install] Reading step folder /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/progress...
20:29 DEBUG [databricks.labs.ucx.install] Reading step folder /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/views...
20:29 DEBUG [databricks.labs.ucx.install] Reading step folder /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/assessment...
20:29 INFO [databricks.labs.ucx.install] Creating dashboard in /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/migration/groups...
20:29 INFO [databricks.labs.ucx.install] Creating dashboard in /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/migration/main...
20:29 INFO [databricks.labs.ucx.install] Creating dashboard in /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/progress/main...
20:29 INFO [databricks.labs.ucx.install] Creating dashboard in /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/assessment/CLOUD_ENV...
20:29 INFO [databricks.labs.ucx.install] Creating dashboard in /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/assessment/interactive...
20:29 INFO [databricks.labs.ucx.install] Creating dashboard in /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/assessment/estimates...
20:29 INFO [databricks.labs.ucx.install] Creating dashboard in /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/assessment/main...
20:29 INFO [databricks.labs.ucx.installer.mixins] Fetching warehouse_id from a config
20:29 INFO [databricks.labs.ucx.installer.mixins] Fetching warehouse_id from a config
20:29 INFO [databricks.labs.ucx.installer.mixins] Fetching warehouse_id from a config
20:29 INFO [databricks.labs.ucx.installer.mixins] Fetching warehouse_id from a config
20:29 INFO [databricks.labs.ucx.installer.mixins] Fetching warehouse_id from a config
20:29 INFO [databricks.labs.ucx.installer.mixins] Fetching warehouse_id from a config
20:29 INFO [databricks.labs.ucx.installer.mixins] Fetching warehouse_id from a config
20:29 INFO [databricks.labs.ucx.install] Installation completed successfully! Please refer to the https://DATABRICKS_HOST/#workspace/Users/0a330eb5-dd51-4d97-b6e4-c474356b1d5d/.O4YO/README for the next steps.
20:29 INFO [databricks.labs.ucx.progress.install] Installation completed successfully!
20:29 DEBUG [databricks.labs.ucx.installer.workflows] starting assessment job: https://DATABRICKS_HOST#job/153464144646318
20:29 INFO [databricks.labs.ucx.installer.workflows] Named parameters for assessment job: None
20:29 INFO [databricks.labs.ucx.installer.workflows] Started assessment job: https://DATABRICKS_HOST#job/153464144646318/runs/359899143931039
20:29 DEBUG [databricks.labs.ucx.installer.workflows] Validating assessment workflow: https://DATABRICKS_HOST#job/153464144646318
20:29 INFO [databricks.labs.ucx.installer.workflows] Identified a run in progress waiting for run completion
20:38 DEBUG [databricks.labs.ucx.installer.workflows] starting migrate-tables job: https://DATABRICKS_HOST#job/302484507554947
20:38 INFO [databricks.labs.ucx.installer.workflows] Named parameters for migrate-tables job: None
20:38 INFO [databricks.labs.ucx.installer.workflows] Started migrate-tables job: https://DATABRICKS_HOST#job/302484507554947/runs/707327961205719
20:38 DEBUG [databricks.labs.ucx.installer.workflows] Validating migrate-tables workflow: https://DATABRICKS_HOST#job/302484507554947
20:38 INFO [databricks.labs.ucx.installer.workflows] Identified a run in progress waiting for run completion
20:58 INFO [databricks.labs.ucx.install] Deleting UCX v0.60.2+1320260604202857 from https://DATABRICKS_HOST
20:59 INFO [databricks.labs.ucx.install] Deleting inventory database dummy_syukgnwk1
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=262548447253466, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=872287403214956, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=956950764306197, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=188244233062995, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=328056153795749, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=590921598797592, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=194424425647814, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=6717607574914, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=854922465310464, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=302484507554947, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=153464144646318, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=451021897715081, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=657006823251053, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=145691085567852, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=219542001364644, as it is no longer needed
20:59 INFO [databricks.labs.ucx.install] Deleting cluster policy
20:59 INFO [databricks.labs.ucx.install] Deleting secret scope
20:59 INFO [databricks.labs.ucx.install] UnInstalling UCX complete
[gw3] linux -- Python 3.10.20 /home/runner/work/ucx/ucx/.venv/bin/python
❌ test_table_migration_job_refreshes_migration_status[hiveserde-migrate-external-hiveserde-tables-in-place-experimental]: AssertionError: Workflow failed: migrate-external-hiveserde-tables-in-place-experimental (30m55.07s)
AssertionError: Workflow failed: migrate-external-hiveserde-tables-in-place-experimental
assert False
[gw2] linux -- Python 3.10.20 /home/runner/work/ucx/ucx/.venv/bin/python
20:28 DEBUG [databricks.labs.ucx.install] Cannot find previous installation: Path (/Users/0a330eb5-dd51-4d97-b6e4-c474356b1d5d/.x7Cy/config.yml) doesn't exist.
20:28 INFO [databricks.labs.ucx.install] Please answer a couple of questions to configure Unity Catalog migration
20:28 INFO [databricks.labs.ucx.installer.hms_lineage] HMS lineage init script already exists and enabled
20:28 INFO [databricks.labs.ucx.install] Fetching installations...
20:28 WARNING [databricks.labs.ucx.install] Existing installation at /Users/0a330eb5-dd51-4d97-b6e4-c474356b1d5d/.x7Cy is corrupted. Skipping...
20:28 INFO [databricks.labs.ucx.installer.policy] Creating UCX cluster policy.
20:28 DEBUG [tests.integration.conftest] Waiting for clusters to start...
20:28 DEBUG [tests.integration.conftest] Waiting for clusters to start...
20:29 INFO [databricks.labs.ucx.install] Installing UCX v0.60.2+1320260604202900
20:29 INFO [databricks.labs.ucx.install] Creating ucx schemas...
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=convert-wasbs-to-adls-gen2-experimental
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=migration-progress-experimental
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=migrate-groups
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=failing
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=validate-groups-permissions
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=remove-workspace-local-backup-groups
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=migrate-tables-in-mounts-experimental
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=assessment
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=scan-tables-in-mounts-experimental
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=migrate-groups-legacy
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=migrate-data-reconciliation
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=migrate-external-hiveserde-tables-in-place-experimental
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=migrate-external-tables-ctas
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=migrate-tables
20:29 INFO [databricks.labs.ucx.installer.workflows] Creating new job configuration for step=assess-workflows
20:29 INFO [databricks.labs.ucx.install] Creating dashboards...
20:29 DEBUG [databricks.labs.ucx.install] Reading step folder /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/migration...
20:29 DEBUG [databricks.labs.ucx.install] Reading step folder /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/progress...
20:29 DEBUG [databricks.labs.ucx.install] Reading step folder /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/views...
20:29 DEBUG [databricks.labs.ucx.install] Reading step folder /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/assessment...
20:29 INFO [databricks.labs.ucx.install] Creating dashboard in /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/migration/groups...
20:29 INFO [databricks.labs.ucx.install] Creating dashboard in /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/migration/main...
20:29 INFO [databricks.labs.ucx.install] Creating dashboard in /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/progress/main...
20:29 INFO [databricks.labs.ucx.install] Creating dashboard in /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/assessment/CLOUD_ENV...
20:29 INFO [databricks.labs.ucx.install] Creating dashboard in /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/assessment/interactive...
20:29 INFO [databricks.labs.ucx.install] Creating dashboard in /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/assessment/estimates...
20:29 INFO [databricks.labs.ucx.install] Creating dashboard in /home/runner/work/ucx/ucx/src/databricks/labs/ucx/queries/assessment/main...
20:29 INFO [databricks.labs.ucx.installer.mixins] Fetching warehouse_id from a config
20:29 INFO [databricks.labs.ucx.installer.mixins] Fetching warehouse_id from a config
20:29 INFO [databricks.labs.ucx.installer.mixins] Fetching warehouse_id from a config
20:29 INFO [databricks.labs.ucx.installer.mixins] Fetching warehouse_id from a config
20:29 INFO [databricks.labs.ucx.installer.mixins] Fetching warehouse_id from a config
20:29 INFO [databricks.labs.ucx.installer.mixins] Fetching warehouse_id from a config
20:29 INFO [databricks.labs.ucx.installer.mixins] Fetching warehouse_id from a config
20:29 INFO [databricks.labs.ucx.install] Installation completed successfully! Please refer to the https://DATABRICKS_HOST/#workspace/Users/0a330eb5-dd51-4d97-b6e4-c474356b1d5d/.x7Cy/README for the next steps.
20:29 INFO [databricks.labs.ucx.progress.install] Installation completed successfully!
20:29 DEBUG [databricks.labs.ucx.installer.workflows] starting assessment job: https://DATABRICKS_HOST#job/240153047642597
20:29 INFO [databricks.labs.ucx.installer.workflows] Named parameters for assessment job: None
20:29 INFO [databricks.labs.ucx.installer.workflows] Started assessment job: https://DATABRICKS_HOST#job/240153047642597/runs/403639978099913
20:29 DEBUG [databricks.labs.ucx.installer.workflows] Validating assessment workflow: https://DATABRICKS_HOST#job/240153047642597
20:29 INFO [databricks.labs.ucx.installer.workflows] Identified a run in progress waiting for run completion
20:39 DEBUG [databricks.labs.ucx.installer.workflows] starting migrate-external-hiveserde-tables-in-place-experimental job: https://DATABRICKS_HOST#job/630100195718718
20:39 INFO [databricks.labs.ucx.installer.workflows] Named parameters for migrate-external-hiveserde-tables-in-place-experimental job: None
20:39 INFO [databricks.labs.ucx.installer.workflows] Started migrate-external-hiveserde-tables-in-place-experimental job: https://DATABRICKS_HOST#job/630100195718718/runs/904999858356260
20:39 DEBUG [databricks.labs.ucx.installer.workflows] Validating migrate-external-hiveserde-tables-in-place-experimental workflow: https://DATABRICKS_HOST#job/630100195718718
20:39 INFO [databricks.labs.ucx.installer.workflows] Identified a run in progress waiting for run completion
20:59 INFO [databricks.labs.ucx.install] Deleting UCX v0.60.2+1320260604202900 from https://DATABRICKS_HOST
20:59 INFO [databricks.labs.ucx.install] Deleting inventory database dummy_sxaxidlsu
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=241131687657895, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=399023822707831, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=649044904221375, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=753158298866629, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=647737237758047, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=763755260021155, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=830552053023745, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=240153047642597, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=638331350491022, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=1085871013854748, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=1097852224898479, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=630100195718718, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=48212029102328, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=1055828451522223, as it is no longer needed
20:59 INFO [databricks.labs.ucx.installer.workflows] Removing job_id=749941883360760, as it is no longer needed
20:59 INFO [databricks.labs.ucx.install] Deleting cluster policy
20:59 INFO [databricks.labs.ucx.install] Deleting secret scope
20:59 INFO [databricks.labs.ucx.install] UnInstalling UCX complete
[gw2] linux -- Python 3.10.20 /home/runner/work/ucx/ucx/.venv/bin/python

Flaky tests:

  • 🤪 test_migrate_external_tables_with_principal_acl_CLOUD_ENV (24m35.927s)
  • 🤪 test_migrate_managed_tables_with_acl (25m57.343s)

Running from acceptance #9022

Development

Successfully merging this pull request may close these issues.

create-table-mapping issue if the mapping size exceeds 10Mbs
