
曙海教學(xué)優(yōu)勢
面向企事業(yè)單位的項(xiàng)目實(shí)際需要,本課程以項(xiàng)目實(shí)現(xiàn)為導(dǎo)向,秉承二十一年教學(xué)品質(zhì),授課老師將會與您分享設(shè)計(jì)的全流程以及工具的綜合使用技巧以及經(jīng)驗(yàn)。您可以定制課程,線上/線下/上門都可以,報(bào)名熱線:4008699035。
二十多年來,曙海培訓(xùn)的課程培養(yǎng)了大批受歡迎的工程師。曙海的課程在業(yè)內(nèi)廣受好評。大批企事業(yè)單位和曙海
建立了良好的合作關(guān)系,合作企業(yè)30萬+。
?培訓(xùn)對象:需要使用Hadoop來進(jìn)行數(shù)據(jù)分析的數(shù)據(jù)分析員,商業(yè)分析
教學(xué)大綱:
Hadoop基礎(chǔ)
Pig基礎(chǔ)
使用Pig進(jìn)行簡單數(shù)據(jù)分析
使用Pig處理復(fù)雜數(shù)據(jù)
使用Pig分析處理多數(shù)據(jù)集
Pig排錯和優(yōu)化
Hive與Impala基礎(chǔ)
使用Hive與Impala進(jìn)行數(shù)據(jù)分析
數(shù)據(jù)管理
數(shù)據(jù)存儲與性能
使用Hive與Impala進(jìn)行數(shù)據(jù)分析
Impala如何執(zhí)行查詢/擴(kuò)展及改善性能
使用Hive分析處理文本數(shù)據(jù)
Hive優(yōu)化
擴(kuò)展Hive
如何選取數(shù)據(jù)分析工具
?
課程大綱:
Hadoop?Fundamentals?
?
??????Hadoop?Overview?
?
??????Data?Storage:?HDFS?
?
??????Distributed?Data?Processing:?YARN,?MapReduce,?and?Spark?
?
??????Data?Processing?and?Analysis:?Pig,?Hive,?and?Impala?
?
??????Data?Integration:?Sqoop?
?
??????Other?Hadoop?Data?Tools?
?
??????Exercise?Scenarios?Explanation?
?
?
?
Introduction?to?Pig?
?
??????What?Is?Pig??
?
??????Pig’s?Features?
?
??????Pig?Use?Cases?
?
??????Interacting?with?Pig?
?
Basic?Data?Analysis?with?Pig?
?
??????Pig?Latin?Syntax?
?
??????Loading?Data?
?
??????Simple?Data?Types?
?
??????Field?Definitions?
?
??????Data?Output?
?
??????Viewing?the?Schema?
?
??????Filtering?and?Sorting?Data?
?
??????Commonly-Used?Functions?
?
Processing?Complex?Data?with?Pig?
?
??????S?torage?Formats?
?
??????Complex/Nested?Data?Types?
?
??????G?rouping?
?
??????Built-In?Functions?for?Complex?Data?
?
??????Iterating?Grouped?Data?
?
Multi-Dataset?Operations?with?Pig?
?
??????Techniques?for?Combining?Data?Sets?
?
??????Joining?Data?Sets?in?Pig?
?
??????Set?Operations?
?
??????Splitting?Data?Sets?
?
Pig?Troubleshooting?and?Optimization?
?
??????Troubleshooting?Pig?
?
??????Logging?
?
??????Using?Hadoop’s?Web?UI?
?
??????Data?Sampling?and?Debugging?
?
??????Performance?Overview?