

Determines if the values of the specified entity property in two or more records are exactly the same. The comparison can be one of the following types: Each rule compares the values of a single property in the candidate records. You can create one or more rules to determine if two or more records match. We advise them to test with a small sample number of documents (ie. What a successful match means to them, so they can identify whether their matchesĪre being made for the right reasons or not.

We advise that when people are testing configs, they should have at least an idea of Other), with a limit of displaying 100 matches With each other and the database, test on All Data (everything in database with each Test scenarios: test on a list of URIs with each other only, test on a list of URIs Merge collections are sets of records that have the same collection tags.ĭata Hub provides smart mastering hardening.Merge strategies are sets of merge options that you can name and reuse.Merge options define how the properties of the candidate records are combined.The original records stay in the database and are tagged as archived. A new record is created with the combined contents of the original records, according to the merging rules you create. Merging rules define how two or more records in the data would be merged together. Merging handles the candidates accordingly, based on thresholds.Match thresholds define the limits and the actions to take when those limits are exceeded.Match options define how the records are compared.
#DATA MERGE DEFINITION HOW TO#
Matching rules define the properties to compare, how to compare them, and what thresholds must be exceeded before taking action. Matching determines if two records are candidates for merging, based on the degree of similarity between them and the weight of the comparison.In MarkLogic Data Hub, the mastering step involves two processes with their associated sets of rules: Smart Mastering is a MarkLogic technology that searches for records in your data that refer to the same entity based on rules you define and then merges them depending on thresholds you specify. Overview of mastering (matching and merging) in Data Hub. Set Up Gradle for Different Environments.QuickStart Format: Flow Config Structure.

Hub Central Format: Flow and Step Config Structures.Import/Migrate/Convert Existing Artifacts.Consulting Services Expert implementation.Multi-Model Database Documents, graphs, relational data.Data Hub Service Fully managed cloud data hub.MarkLogic Data Hub Platform → Learn how MarkLogic simplifies data integration.
