Optimisation docs #387

Open
siddharth-chevella wants to merge 5 commits into datazip-inc:master from siddharth-chevella:optimisation-docs

Conversation

@siddharth-chevella
Contributor

  1. Updated sidebar
  2. Added optimisation docs


1. **Lite** – Performs a lightweight optimization by converting **equality delete files** into **positional delete files**. Some query engines do not support reading equality delete files, so this conversion ensures better compatibility across query engines without rewriting the underlying data files.

2. **Medium** – Performs partial compaction by merging smaller **data files** and **delete files** into medium-sized files closer to the configured **target file size**. This helps reduce file fragmentation, lowers metadata overhead, and improves query performance while avoiding a full rewrite of the table.
Collaborator

Also mention that files larger than 16 MB are compacted to align more closely with the configured target file size.
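
For readers unfamiliar with the **Lite** conversion in the quoted docs, here is a minimal sketch of the idea, not the project's actual implementation. It models data files as in-memory lists of rows and equality deletes as column/value predicates; the function name `equality_to_positional`, the file paths, and the row layout are all hypothetical. It also ignores sequence numbers, which a real table format uses to decide which data files a delete applies to.

```python
from typing import Dict, List, Tuple

Row = Dict[str, object]


def equality_to_positional(
    data_files: Dict[str, List[Row]],
    equality_deletes: List[Row],
) -> List[Tuple[str, int]]:
    """Resolve equality deletes into (file_path, row_position) pairs.

    An equality delete says "drop every row where these columns have these
    values". A positional delete says "drop row N of file F". Converting
    between them means scanning the data once and recording where each
    matching row lives, so readers no longer need to evaluate predicates.
    """
    positional: List[Tuple[str, int]] = []
    for path, rows in data_files.items():
        for pos, row in enumerate(rows):
            # A row is deleted if it matches all columns of any equality delete.
            matched = any(
                all(row.get(col) == val for col, val in delete.items())
                for delete in equality_deletes
            )
            if matched:
                positional.append((path, pos))
    return positional


# Hypothetical table contents, for illustration only.
data_files = {
    "data/a.parquet": [{"id": 1, "name": "x"}, {"id": 2, "name": "y"}],
    "data/b.parquet": [{"id": 2, "name": "z"}, {"id": 3, "name": "w"}],
}
equality_deletes = [{"id": 2}]

positional_deletes = equality_to_positional(data_files, equality_deletes)
print(positional_deletes)
# [('data/a.parquet', 1), ('data/b.parquet', 0)]
```

Note that the data files themselves are only read, never rewritten, which matches the "without rewriting the underlying data files" wording in the Lite description.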

2 participants