MapReduce Operations on Hadoop
• Calculate the average salary of every department
  – Input from HDFS: {name: (org., salary)}; desired output: {org.: avg. salary}
  – Map tasks read the input records and emit (org., salary) pairs
  – Shuffle the data using org. as the Partition Key (PK), so that records of "org-1" and records of "org-2" are routed to different reduce tasks
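The map side of this example can be sketched as follows. This is a minimal illustration, not Hadoop code; the comma-separated record format and the function name `map_record` are assumptions for the sketch.

```python
# Sketch of the map side: each input record carries (name, org, salary),
# and the mapper emits (org, salary) so org can serve as the Partition Key.
def map_record(line):
    name, org, salary = line.strip().split(",")
    return (org, float(salary))

records = [
    "alice,org-1,1000",
    "bob,org-1,3000",
    "carol,org-2,2000",
]
map_output = [map_record(r) for r in records]
# map_output == [("org-1", 1000.0), ("org-1", 3000.0), ("org-2", 2000.0)]
```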
MapReduce Operations on Hadoop
• Calculate the average salary of every department
  – After the shuffle, one Reduce (Avg.) task calculates the average salary for "org-1" and another calculates the average salary for "org-2"
  – The results {org.: avg. salary} are written back to HDFS
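A single Reduce (Avg.) task from the slide can be sketched like this (an illustrative helper, assuming the reducer receives all salaries shuffled to it for one org):

```python
# Sketch of one Reduce (Avg.) task: given an org and all salaries that
# were shuffled to this task for that org, emit (org, average salary).
def reduce_avg(org, salaries):
    return (org, sum(salaries) / len(salaries))

print(reduce_avg("org-1", [1000.0, 3000.0]))  # -> ('org-1', 2000.0)
```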
Key/Value Pairs in MapReduce
• A simple but effective programming model designed to process huge volumes of data concurrently on a cluster
• Map: (k1, v1) → (k2, v2)
  – e.g. (name, org & salary) → (org, salary)
• Reduce: (k2, v2) → (k3, v3)
  – e.g. (org, salary) → (org, avg. salary)
• Shuffle: Partition Key (it may be the same as k2, or not)
  – The Partition Key determines how a key/value pair in the map output is transferred to a reduce task
  – e.g. the org. name is used to partition the map output files accordingly
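The whole (k1, v1) → (k2, v2) → (k3, v3) flow above can be simulated in memory. This is a toy sketch of the programming model, not Hadoop itself; the helper name `run_mapreduce` and the sample data are illustrative, and the Partition Key is taken to be k2.

```python
from collections import defaultdict

# Toy in-memory simulation of Map -> Shuffle -> Reduce.
def run_mapreduce(records, map_fn, reduce_fn):
    # Map: (k1, v1) -> (k2, v2)
    intermediate = [map_fn(k1, v1) for k1, v1 in records]
    # Shuffle: group map output values by the Partition Key (here k2 itself)
    groups = defaultdict(list)
    for k2, v2 in intermediate:
        groups[k2].append(v2)
    # Reduce: (k2, [v2, ...]) -> (k3, v3)
    return dict(reduce_fn(k2, vs) for k2, vs in groups.items())

employees = [("alice", ("org-1", 1000.0)),
             ("bob",   ("org-1", 3000.0)),
             ("carol", ("org-2", 2000.0))]

result = run_mapreduce(
    employees,
    map_fn=lambda name, ov: (ov[0], ov[1]),           # (name, (org, salary)) -> (org, salary)
    reduce_fn=lambda org, s: (org, sum(s) / len(s)),  # (org, salaries) -> (org, avg. salary)
)
# result == {"org-1": 2000.0, "org-2": 2000.0}
```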
MR (Hadoop) Job Execution Patterns
• The execution of a MR job involves 6 steps
  – 1: Job submission — the MR program (job) is submitted to the master node
  – 2: Assign tasks — the master node does control-level work (e.g. job scheduling and task assignment), dispatching Map tasks and Reduce tasks to worker nodes
• Worker nodes do the data processing work specified by the Map or Reduce function
• Data is stored in a Distributed File System (e.g. Hadoop Distributed File System)
MR (Hadoop) Job Execution Patterns
• The execution of a MR job involves 6 steps (continued)
  – 3: Map phase — concurrent Map tasks run on the worker nodes and produce map output
  – 4: Shuffle phase — map output is shuffled to different reduce tasks based on Partition Keys (PKs), which are usually the map output keys
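The shuffle routing in step 4 can be sketched as follows. Hadoop's default `HashPartitioner` assigns a map output record to reducer `hashCode(key) mod numReduceTasks`; here Python's built-in `hash()` stands in for Java's `hashCode()`, so this is an illustration of the idea rather than the exact Hadoop computation.

```python
# Sketch of shuffle routing: all records sharing a Partition Key land on
# the same reduce task, because the partition index depends only on the key.
def partition(key, num_reduce_tasks):
    return hash(key) % num_reduce_tasks

num_reduce_tasks = 2
# Records of "org-1" always map to one reducer, records of "org-2" to another
# (possibly the same one if the hashes collide modulo num_reduce_tasks).
routes = {org: partition(org, num_reduce_tasks) for org in ["org-1", "org-2"]}
```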