A tool facilitating matching columns across tabular datasets. It also serves as an experiment suite for state-of-the-art schema matching methods.
WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing, schema matching, identity resolution, data fusion, and result evaluation.
Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup
A .NET class library that allows you to import data from different sources into a unified destination
A Python tool that uses XGBoost and sentence-transformers to perform schema matching on tables.
AgreementMaker Ontology Matching System
Match schema attributes of relational databases by value similarity. As a study assignment, this isn't well documented, but you can contact me with questions, and I may add docs if I sense enough interest.
Functional and structural analysis of tables in research papers (Table disentangling)
[VLDB '25] Magneto combines small and large language models to provide cost-effective schema matching.
Knowledge Graph-based Retrieval-Augmented Generation for Schema Matching
🌮 Table-based KB Completer
The PyDI framework provides methods for end-to-end data integration. The framework covers all steps of the integration process, including schema matching, data translation, entity matching, and data fusion. The framework offers traditional string-based methods as well as modern LLM- and embedding-based techniques for these tasks.
Valentine scalable deployment for VLDB demo
Master thesis - reproducing state-of-the-art schema matching algorithms
CLI tool for inserting SELECT query results into ClickHouse with automatic schema matching and type-safe casting. Ideal for ETL pipelines and SQL-driven data flows.
Data Integration - Schema Matching & Mapping
Python client for the Serene Data Integration software
[Information System] SMUTF: Schema Matching Using Generative Tags and Hybrid Features
Master thesis: Holistic Schema Matching at Scale
Deterministic key and join discovery for structured datasets