ETL_project_best_practice1.ppt

Transcription:

ETL
Data Warehouse; ETL tool/system: ETL, ETL Process
Data Warehouse Platform, Database, Access Method, Data Source, Operational Data, Near Real-Time, Data Modeling, Refresh/Replication, Metadata, Bulk Load, Near Real-Time, User, Data Mart, Enterprise Warehouse

ETL ETL SoR(System of Record)Table/File, Column/? Target Column?,? Transformation? Process Flow Cleansing, Transformation? Metadata? Load/Refresh??? Transformation???

ETL Process: ETL Process Flow

ETL Process: Metadata, Scheduling, DSA, SOR, Parsing, EDW, Clickstream Data

ETL Process: System of Record (SoR)
Data source: SoR, DSA, SOR; change data capture; time stamp
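
The slide pairs change data capture with a time stamp when moving data from the SoR toward the DSA. A minimal sketch of timestamp-based change capture follows. It assumes every source row carries a hypothetical `updated_at` column and that the ETL job remembers the time of its previous extract. Neither name comes from the slides.

```python
from datetime import datetime

def extract_changed_rows(source_rows, last_extract_time):
    """Return only the rows modified after the previous extract."""
    return [row for row in source_rows if row["updated_at"] > last_extract_time]

def next_watermark(changed_rows, last_extract_time):
    """Advance the watermark to the newest timestamp seen, or keep the old one."""
    if not changed_rows:
        return last_extract_time
    return max(row["updated_at"] for row in changed_rows)

# Hypothetical SoR rows, for illustration only.
sor_rows = [
    {"id": 1, "updated_at": datetime(2004, 1, 1, 9, 0)},
    {"id": 2, "updated_at": datetime(2004, 1, 2, 9, 0)},
    {"id": 3, "updated_at": datetime(2004, 1, 3, 9, 0)},
]
last_extract = datetime(2004, 1, 1, 12, 0)

changed = extract_changed_rows(sor_rows, last_extract)
watermark = next_watermark(changed, last_extract)
```

Storing the watermark only after the load succeeds is one way to make a failed run safe to repeat.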

ETL Process data data data DSA SOR

ETL Process source source DSA SOR DSA match-merge/purge rule row
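
The slide names a match-merge/purge rule applied to rows staged in the DSA. The rule itself did not survive in the transcript, so the following is only an illustrative sketch. It assumes rows are matched on a normalized name and phone number (hypothetical columns), merged by filling the survivor's empty fields, and that the duplicate is then purged.

```python
def match_key(row):
    """Normalize the hypothetical matching columns so near-duplicates collide."""
    return (row["name"].strip().lower(), row["phone"].replace("-", ""))

def match_merge_purge(rows):
    """Keep one survivor per match key; fill its empty fields from duplicates."""
    survivors = {}
    for row in rows:
        key = match_key(row)
        if key not in survivors:
            survivors[key] = dict(row)
            continue
        survivor = survivors[key]
        for column, value in row.items():
            if survivor.get(column) in (None, ""):
                survivor[column] = value
    return list(survivors.values())

# Hypothetical staged rows from two sources.
staged = [
    {"name": "Kim ", "phone": "010-1234-5678", "email": None},
    {"name": "kim", "phone": "01012345678", "email": "kim@example.com"},
    {"name": "Lee", "phone": "010-0000-0000", "email": "lee@example.com"},
]
merged = match_merge_purge(staged)
```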

ETL Process: ETL
Data Demographics (row count, distinct value count, value variance)
Domain Violation
Aggregation
Outer Join Analysis
Cartesian Product Analysis
Business Rule Validation Report
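
Two of the checks listed on this slide, data demographics and domain violation, can be sketched in a few lines. This is a minimal illustration on in-memory rows, not the tool the slides describe. The column names and the allowed domain are hypothetical, and population variance is used as one possible reading of "value variance".

```python
from statistics import pvariance

def column_demographics(rows, column):
    """Row count, distinct value count, and population variance of one column."""
    values = [row[column] for row in rows if row[column] is not None]
    return {
        "row_count": len(rows),
        "distinct_count": len(set(values)),
        "variance": pvariance(values) if values else None,
    }

def domain_violations(rows, column, allowed_values):
    """Rows whose value falls outside the allowed domain."""
    return [row for row in rows if row[column] not in allowed_values]

# Hypothetical loaded rows.
loaded = [
    {"amount": 10, "status": "A"},
    {"amount": 20, "status": "B"},
    {"amount": 20, "status": "X"},
    {"amount": 30, "status": "A"},
]
profile = column_demographics(loaded, "amount")
violations = domain_violations(loaded, "status", {"A", "B"})
```

Comparing such a profile between source and target after each load is a cheap way to notice dropped or duplicated rows.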

ETL Process: Load
Load Operation: Incremental Update, Time stamp, CDC (Change Data Capture), Frequent Load (Store & Forward), Bulk Load, Near Real-Time, EAI, Peer-to-Peer, Log Audit File, Before Image, After Image
(Refresh): Insert, Mass Insert, Update, Delete, Mass Delete
DB (Utility): Check Data, Load, Reorg, Recover
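
The slide mentions a before image and an after image alongside insert, update, and delete refresh operations. One way to connect them is to compare the two images by key and derive the operations to apply. The sketch below assumes each image is a dictionary keyed by a hypothetical primary key. It is an illustration, not the load utility the slides refer to.

```python
def diff_images(before, after):
    """Classify keyed rows into inserts, updates, and deletes."""
    inserts = {key: row for key, row in after.items() if key not in before}
    deletes = {key: row for key, row in before.items() if key not in after}
    updates = {
        key: row
        for key, row in after.items()
        if key in before and before[key] != row
    }
    return inserts, updates, deletes

# Hypothetical images of the same table at two points in time.
before_image = {1: {"name": "A"}, 2: {"name": "B"}, 3: {"name": "C"}}
after_image = {1: {"name": "A"}, 2: {"name": "B2"}, 4: {"name": "D"}}

inserts, updates, deletes = diff_images(before_image, after_image)
```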

ETL Process Metadata Metadata ETL metadata metadata

ETL Process: Metadata
Meta Data ROI Curve: Data Definition Reporting, Data Quality Tracking, Meta Data Driven Business User Interface, Decision Support Impact Analysis, Enterprise-Wide Impact Analysis, Meta Data Controlled System

ETL Process: Metadata
Metadata Exchange Standard Model (Meta Data Council): TOOL A, TOOL B, TOOL C, TOOL D, TOOL E, each with a TOOL PROFILE; USER CONFIGURATION; STANDARD ACCESS FRAMEWORK; STANDARD API; STANDARD METADATA MODEL

ETL Process: Metadata
Metadata Model

Data Quality 1. Data Quality Data Quality Management Data Quality Data Quality DW, BI,, (correction, cleansing) (preventing). Data Quality Management 4 Data Quality : Data Quality : / Data Quality : data cleansing, data integration, data enrichment Data Quality : application Data Quality Report Enterprise Data Quality Management Data Quality Tracking Data Quality Data Quality Data Quality Data Quality Data Quality

Data Quality Data Quality(DQ) DQ Process DQ Process Owner Sponsor ( ) DQ DQ DQ DQ - Diagram DQ Test Test DQ (/) DQ Process /

ETL - Phase / Task ETL, Master Plan ETL Source Data Target Data ETL Review Master Plan Source Target System Interface Quality Data Data Data

ETL - Phase ETL Process ETL Task Data Mapping Transformation ETL Program Plan Set up Module Test Test Module Test Data Transformation Mapping Plan Program Source Load Script Test

Data Sampling Prototype Test Prototype Test Prototype Test ETL Plan Prototyping Plan Test Load Plan ETL - Phase Task

ETL -, ETL Target, Source Data, ETL Program Sampling ETL. Data, System, Data Prototyping Prototyping,.,.. Data Mapping,,. Data Plan. Transformation ETL Prototyping.... Module., Plan Data, ETL Program.,.... Highlight Target Table Target Column Target Data Target DBMS Source Table Source Column Source Data Source DBMS Key Man, Source Target Target, ETL, Source 3 Interface Source Data Target Data Target Data Source Data Mapping Transformation Data Data Load ETL Process Flow Data Program Program Plan ETL ( ) Plan Plan ETL Set Up Data Sampling Prototype Test ETL Program Prototype Test Data Sampling Program ETL Data ETL Test Test Prototype Test Prototype Test ETL

ETL Best Practice -
SoR (system of record); BQ (business question) / DS (data source) matrix; DT (data transformation) rule; NFD (not found data)
DQ (data quality): DQ data profiling on data (sample); DC (data cleansing)
DSI (data source gap & issue); ETL review & BQ review
Diagram labels: DS, DS System, BQ/DS Matrix, NFD, DQ, Data Quality, Data Cleansing, DS Gap & Issue, DS Issue

ETL Best Practice -
DE (data extract); DT (data transformation) rule; DC (data cleansing) rule
ETL process; ETL program; program DFD (data flow diagram)
ETL metadata, data model
Diagram labels: ETL Mapping & Cleansing, Data Mapping, Data Mapping Matrix, DE, DT, Data Cleansing, DC, Process & Program, ETL Process, ETL Process Flow Diagram, Program, Program DFD, ETL Plan, ETL Metadata, Metadata Model

ETL Best Practice -
ETL program; ETL (prototyping); ETL tuning
Diagram labels: ETL Program, Program DFD, ETL Plan, Program Source/Script, ETL Prototyping, ETL Tuning, ETL Process Tuning, ETL Program Tuning

ETL ETL : : & error Metadata Repository / ETL ETL Manager Manager ETL data ETL PM ETL DSA SOR / / / ETL ETL Admin Admin PM ETL ETL & error / & error ETL & error / & DSA

ETL Scheduler Process USER : JOB : :,, loading Rule JOB : JOB Message : ETL LOG : ETL Logging : Scheduling : : JOB,, : JOB log. : JOB Process Manager Process Manager Process Manager Process Manager fork() Init OK Start ETL Process Status Check ETL Status Response Process, Terminate code ETL, Process Kill Kill OK ETL Process JOB, Kill exit
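
The scheduler slide shows a Process Manager that calls fork(), starts an ETL process, checks its status, and kills it on request. The sketch below mirrors that sequence with Python's `os.fork`, `os.waitpid`, and `os.kill` on a POSIX system. The function names and the stand-in job are hypothetical, and the message exchange shown on the slide (Init OK, Status Response, Kill OK) is left out.

```python
import os
import signal
import time

def start_job(job):
    """Fork a child process that runs job(); return the child's PID."""
    pid = os.fork()
    if pid == 0:  # child: run the job, then exit without returning
        signal.signal(signal.SIGTERM, signal.SIG_DFL)
        code = 1
        try:
            job()
            code = 0
        finally:
            os._exit(code)
    return pid

def is_running(pid):
    """Status check: True while the child has not terminated."""
    done_pid, _ = os.waitpid(pid, os.WNOHANG)
    return done_pid == 0

def kill_job(pid):
    """Send SIGTERM, reap the child, and return the signal that ended it."""
    os.kill(pid, signal.SIGTERM)
    _, status = os.waitpid(pid, 0)
    return os.WTERMSIG(status) if os.WIFSIGNALED(status) else None

# Stand-in for a long-running ETL job.
pid = start_job(lambda: time.sleep(30))
running_before_kill = is_running(pid)
end_signal = kill_job(pid)
```

Note that `is_running` reaps a child that has already exited, so a caller should stop polling once it returns False.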

Metadata Data Model(sample) ETL

ETL, 2 / R&D Data DW Data Data Data,,,? ETL / Data Data Data Product Data Data Customer Data Data Sales Data Data Market Data Data G/L Data Revenue Data External Data R&D Operational System Business Intelligence

ETL