Building Big Data Processing Systems based on Scale-Out Computing Models
Small Data: Locality of References
• Principle of Locality
– A small set of data is accessed frequently, both temporally and spatially
– Keeping that set close to the processing unit is critical for performance
– One of the few general principles/laws in computer science
• Where can we get locality?
– Everywhere in computing: architecture, software systems, applications
• Foundations of exploiting locality
– Locality-aware architecture
– Locality-aware systems
– Locality prediction from access patterns
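To make the principle concrete, here is a minimal sketch (not from the lecture) that models a tiny direct-mapped cache and counts hits for two access patterns; the `BLOCK` and `LINES` sizes are arbitrary assumptions. A sequential scan enjoys strong spatial locality, while a large-stride scan keeps evicting its own cache line:

```python
# Toy direct-mapped cache model (illustrative; sizes are assumptions).
BLOCK = 8   # addresses per cache line
LINES = 4   # number of cache lines

def cache_hits(addresses):
    """Return how many of the accesses hit the toy cache."""
    cache = [None] * LINES
    hits = 0
    for addr in addresses:
        block = addr // BLOCK     # which memory block this address is in
        line = block % LINES      # which cache line that block maps to
        if cache[line] == block:
            hits += 1             # locality pays off: the block is still cached
        else:
            cache[line] = block   # miss: evict and load the block
    return hits

N = 256
sequential = list(range(N))                  # strong spatial locality
strided = [(i * 64) % N for i in range(N)]   # jumps of 64: no reuse before eviction

print(cache_hits(sequential), cache_hits(strided))  # 224 0
```

Each 8-address block in the sequential scan costs one miss and then yields seven hits, while every strided access maps to the same cache line as a different block, so locality (not raw access count) determines performance.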
Conventional Databases: Move data to compute
• Centralized control to achieve ACID
– Atomicity: if one part of the transaction fails, the entire transaction fails
– Consistency: every transaction moves the database from one valid state to another
– Isolation: concurrent transactions execute as if each ran alone, without sharing intermediate state
– Durability: once a transaction is committed, its results are permanently stored
• A centralized approach (or a scale-up)
– Scale-out: throughput increases as the number of nodes increases
• A vendor-controlled technical/business model
– Expensive (designed for banks and other high-profit organizations)
– ACID may not be required for massive data processing
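Atomicity and durability can be sketched with Python's built-in `sqlite3` module (the `accounts` table and the mid-transfer crash are made up for illustration). A failure anywhere in the transaction rolls back every statement in it:

```python
# Minimal atomicity sketch using sqlite3 (table/data are illustrative).
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE accounts (name TEXT PRIMARY KEY, balance INT)")
conn.execute("INSERT INTO accounts VALUES ('alice', 100), ('bob', 0)")
conn.commit()

try:
    with conn:  # one transaction: commits on success, rolls back on error
        conn.execute(
            "UPDATE accounts SET balance = balance - 50 WHERE name = 'alice'")
        raise RuntimeError("crash mid-transfer")  # simulate a failure
        conn.execute(
            "UPDATE accounts SET balance = balance + 50 WHERE name = 'bob'")
except RuntimeError:
    pass  # the partial debit was rolled back with the rest

balance = conn.execute(
    "SELECT balance FROM accounts WHERE name = 'alice'").fetchone()[0]
print(balance)  # 100: no money was lost despite the crash
```

The `with conn:` block uses the connection's context-manager semantics: commit if the body succeeds, roll back if it raises, which is exactly the "entire transaction fails" behavior atomicity requires.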
How to handle increasingly large volumes of data?
• A new paradigm (from the Ivy League to the Land Grant model)
– 150+ years ago, Europe had completed the industrial revolution
– But the US was a backward agricultural country
– Higher education is the foundation of becoming a strong industrial country
• Extending the Ivy Leagues to massively accept students? Impossible!
• A new higher education model? It had to be!
• Land grant university model: low cost and scalable
– Lincoln signed the "Land Grant University Bill" in 1862
– To give federal land to many States to build public universities
– The mission: build low-cost universities open to the masses
• The success of land grant universities
– Although the model is low-cost and less selective in admissions, the excellence of education remains
– Many world-class universities were born and established by this model: Cornell, MIT, Ohio State, Purdue, UC Berkeley, UIUC, Wisconsin, ...
Major Differences in New Infrastructure
• Shared with conventional databases
– SQL continues
– The enterprise data warehouse (EDW) framework continues
– Other commonly used APIs and standards (e.g., JDBC, ODBC)
• Major differences
– A scale-out computing model (e.g., MapReduce)
– Based on commodity computing and storage systems
– Scaled-up software efforts and advanced hardware acceleration are additional efforts
– Affordability is a requirement
– Community-driven open source software
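The scale-out model named above can be sketched as a single-machine word count that follows the three MapReduce phases; in a real system, each phase runs in parallel across many commodity nodes, with the framework handling the shuffle. The documents and helper names here are illustrative:

```python
# Minimal single-machine sketch of the MapReduce model (map, shuffle, reduce).
from collections import defaultdict

def map_phase(doc):
    """Mapper: emit a (word, 1) pair for each word in a document."""
    return [(word, 1) for word in doc.split()]

def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(groups):
    """Reducer: sum the counts emitted for each word."""
    return {word: sum(counts) for word, counts in groups.items()}

docs = ["big data big systems", "data systems scale out"]
pairs = [kv for doc in docs for kv in map_phase(doc)]
result = reduce_phase(shuffle(pairs))
print(result["big"], result["data"])  # 2 2
```

Because mappers and reducers are pure per-key functions, throughput grows by adding nodes rather than by buying a bigger machine, which is the scale-out property the slide contrasts with scale-up databases.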