Web Based Training

InfoSphere BigMatch for Hadoop v11.4 - SPVC - 2Z850G

Contains: PDF course guide, as well as a lab environment where students can work through demonstrations and exercises at their own pace.

Grundlagen Englisch
quofox Qualitätssiegel

747,74 €

inkl. 19% MwSt

628,35 € netto

In Warenkorb legen

Beschreibung

The IBM InfoSphere Big Match on Hadoop course will introduce students to the Probabilistic Matching Engine (PME) and how it can be used to resolve and discover entities across multiple data sets in Hadoop.
Students will learn the basics of a PME algorithm including data model configuration, standardization, comparison and bucketing functions, weight generation, and threshold.
During the exercises, the student will work on a large use case, where they will apply their knowledge of Big Match to discover relationships be two data sets that can be used to understand the full view of the member data.

If you are enrolling in a Self Paced Virtual Classroom or Web Based Training course, before you enroll, please review the Self-Paced Virtual Classes and Web-Based Training Classes on our Terms and Conditions page, as well as the system requirements, to ensure that your system meets the minimum requirements for this course. http://www.ibm.com/training/terms

1. Introduction to Big Match for Apache Hadoop
What is Big Match
How Big Match Works
Big Match Components
Big Match Architecture
2. Big Match Data Model Definition
Members
Attribute Types
Member Attributes
Sources
Information Sources
3. PME Algorithm
Standardization
Bucketing
Comparison Functions
4. Bucket Analysis
Bucket Optimization
Bucket Concerns
5. Weights
String Weights
Numeric Weights
Multi-dimensional Weights
Troubleshooting Weights
6. HBase Tables
HBase concepts
Big Match commands
Big Match Tables (.pmebktidx, .pmemdmidx, .pmeentidx)
Best Practices
7. BigMatch Applications
PME Derive
PME Compare
PME Link
PME Analysis

Ziel


    Understand the capabilities of the Probabilistic Matching Engine
    Understand how the Probabilistic Matching engine is used with Big Insights to solve certain use cases.
    Understand the technical framework of the Big Match solution and how member data is derived, bucketed and compared to produce a complete entity from multiple data sets.
    Create a project and data model using the Big Match Console
    Configure the HBase tables that will be used in a Big Match solution
    Configure an algorithm using he Big Match console that includes Standardization, Comparison and Bucketing functions.
    Set up Strings for Anonymous value, Equivalency values, Frequency values, and character maps using the Big Match console
    Set up and run the Weight Generation process
    Evaluate and set thresholds for the algorithm
    Deploy a new algorithm to Big Match
    Evaluate Entity results and reconfigure algorithm based on evaluation. E.g. Large Buckets, Large Entities, Member not belonging to any buckets, etc

Anforderungen an die Teilnehmer

This course has no pre-requisites.

Teilnehmer

The course is designed for a technical audience that will be setting up a custom algorithm for the Probabilistic Matching Engine to use Big Match on Apache Hadoop to compare, match and/or search member records across multiple data sets.


Das könnte Dich auch interessieren

Problem melden


Danke

für Ihr Interesse an diesem Produkt.

Wir setzen uns umgehend mit dem Anbieter in Verbindung und werden in Kürze neue Termine anbieten.
Spätestens in 14 Tagen werden wir Sie über die neuen Termine informieren.

Solange finden Sie das Produkt in Ihrer Wunschliste wieder.