Computation and Storage in the Cloud: Understanding the Trade-Offs

Ebook, 228 pages, 2 hours

Rating: 5.0 (2 reviews)

By Dong Yuan, Yun Yang and Jinjun Chen

About this ebook

Computation and Storage in the Cloud is the first comprehensive and systematic work investigating the trade-off between computation and storage in the cloud in order to reduce overall application cost. Scientific applications are usually computation- and data-intensive: complex computation tasks take a long time to execute, and the generated datasets are often terabytes or petabytes in size. Storing valuable generated datasets saves the cost of regenerating them when they are reused, as well as the waiting time that regeneration causes. However, the sheer size of scientific datasets makes storing them a major challenge. By proposing innovative concepts, theorems and algorithms, this book aims to help cloud users and service providers substantially reduce the cost of running computation- and data-intensive scientific applications in the cloud.

Covers cost models and benchmarking that explain the necessary tradeoffs for both cloud providers and users
Describes several novel strategies for storing application datasets in the cloud
Includes real-world case studies of scientific research applications

Language: English

Publisher: Elsevier Science

Release date: Dec 31, 2012

ISBN: 9780124078796

Author

Dong Yuan

Dong Yuan is currently a research fellow in the School of Software and Electrical Engineering at Swinburne University of Technology, Melbourne, Australia. His research interests include data management in parallel and distributed systems, scheduling and resource management, and grid and cloud computing.

Reviews for Computation and Storage in the Cloud

Rating: 5 out of 5 stars (2 ratings, 0 reviews)

Book preview

Computation and Storage in the Cloud - Dong Yuan

Preface

Scientific research increasingly relies on information technology: communities of researchers use large-scale, high-performance computing systems (e.g. clusters, grids and supercomputers) to run their applications. Scientific applications are usually computation and data-intensive, where complex computation tasks take a long time to execute and the generated data sets are often terabytes or petabytes in size. Storing valuable generated application data sets saves the cost of regenerating them when they are reused, not to mention the waiting time caused by regeneration. However, the large size of scientific data sets makes their storage a big challenge.

In recent years, cloud computing has emerged as the latest distributed computing paradigm, providing redundant, inexpensive and scalable resources on demand to meet system requirements. It offers researchers a new way to deploy computation and data-intensive applications (e.g. scientific applications) without any infrastructure investment. Large generated application data sets can be flexibly stored or deleted (and regenerated whenever needed) in the cloud, since, theoretically, unlimited storage and computation resources can be obtained from commercial cloud service providers.

With the pay-as-you-go model, the total application cost for generated data sets in the cloud depends chiefly on the method used for storing them. For example, storing all the generated application data sets in the cloud may result in a high storage cost, since some data sets may be large in size but seldom used; but if we delete all the generated data sets and regenerate them every time they are needed, the computation cost may also be very high. Hence, there is a trade-off between computation and storage in the cloud. To reduce the overall application cost, a good strategy is to find a balance: selectively store some popular data sets and regenerate the rest when needed. This book focuses on the cost-effective storage of data sets for scientific applications in the cloud, which is currently a leading-edge and challenging topic. By investigating the niche issue of the computation and storage trade-off, we (1) propose a new cost model for data set storage in the cloud; (2) develop novel benchmarking approaches to find the minimum cost of storing the application data; and (3) design innovative runtime storage strategies to store the application data in the cloud.
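The trade-off described above can be illustrated for a single data set with a minimal Python sketch. It is not taken from the book: the function names, the per-month cost rates and all prices are hypothetical, chosen only to show how storage cost and regeneration cost pull in opposite directions as usage frequency changes.

```python
def monthly_cost_if_stored(size_gb, storage_price_per_gb_month):
    """Cost per month of keeping the data set in cloud storage."""
    return size_gb * storage_price_per_gb_month


def monthly_cost_if_deleted(regen_hours, compute_price_per_hour, uses_per_month):
    """Cost per month of regenerating the data set each time it is needed."""
    return regen_hours * compute_price_per_hour * uses_per_month


def cheaper_option(size_gb, storage_price_per_gb_month,
                   regen_hours, compute_price_per_hour, uses_per_month):
    """Return 'store' or 'delete', whichever has the lower monthly cost."""
    store = monthly_cost_if_stored(size_gb, storage_price_per_gb_month)
    delete = monthly_cost_if_deleted(regen_hours, compute_price_per_hour,
                                     uses_per_month)
    return "store" if store <= delete else "delete"


# Hypothetical figures: an 80 GB data set that takes 4 hours to regenerate.
rarely_used = cheaper_option(80, 0.125, 4, 0.5, uses_per_month=2)
popular = cheaper_option(80, 0.125, 4, 0.5, uses_per_month=10)
print(rarely_used, popular)
```

Under these assumed prices, the seldom-used data set is cheaper to delete and regenerate, while the popular one is cheaper to keep stored.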

We start by introducing a motivating example from astrophysics and analysing the problems of the computation and storage trade-off in the cloud. Based on the requirements identified, we propose the novel concept of a Data Dependency Graph (DDG) and an effective cost model for data set storage in the cloud. A DDG is based on data provenance, which records the generation relationships of all the data sets. With a DDG, we know how to regenerate data sets effectively in the cloud and can further calculate their generation costs. The total application cost for the generated data sets includes both their generation cost and their storage cost.
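As a rough illustration of how a DDG supports calculating generation costs, the sketch below represents the graph as a plain dictionary of predecessors. This is an assumed reading of the paragraph above, not the book's formal definition: the function name, the dictionary layout and the cost figures are hypothetical, and the original input data is assumed to be always available. The idea is that regenerating a data set also requires regenerating every deleted predecessor back to the nearest stored ones.

```python
def generation_cost(target, predecessors, compute_cost, stored):
    """Cost of regenerating `target` from its nearest stored predecessors.

    predecessors: dict mapping each data set to the data sets it is derived from
    compute_cost: dict mapping each data set to the cost of its own computation
    stored:       set of data sets currently kept in storage
    """
    needed = set()          # data sets that must be recomputed
    stack = [target]
    while stack:
        d = stack.pop()
        if d in needed:
            continue        # shared ancestors are counted only once
        needed.add(d)
        for p in predecessors.get(d, []):
            if p not in stored:
                stack.append(p)   # a deleted predecessor must be regenerated too
    return sum(compute_cost[d] for d in needed)


# Hypothetical linear DDG: d1 -> d2 -> d3
predecessors = {"d1": [], "d2": ["d1"], "d3": ["d2"]}
compute_cost = {"d1": 4, "d2": 2, "d3": 3}

print(generation_cost("d3", predecessors, compute_cost, stored={"d2"}))
print(generation_cost("d3", predecessors, compute_cost, stored={"d1"}))
print(generation_cost("d3", predecessors, compute_cost, stored=set()))
```

The further back the nearest stored predecessor lies, the more computation is needed to regenerate `d3`.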

Based on the cost model, we develop novel algorithms that calculate the minimum cost of storing data sets in the cloud, i.e. the best trade-off between computation and storage. This minimum cost is a benchmark for evaluating the cost-effectiveness of different storage strategies in the cloud. For different situations, we develop different benchmarking approaches with polynomial time complexity for a seemingly NP-hard problem: (1) the static on-demand approach is for situations in which only occasional benchmarking is requested; and (2) the dynamic on-the-fly approach is suitable for situations in which more frequent benchmarking is requested at runtime.
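To make the idea of a minimum cost benchmark concrete, the following sketch exhaustively tries every store/delete combination for a small linear DDG and returns the cheapest. This brute-force search is exponential in the number of data sets and is explicitly not the authors' polynomial-time algorithm, which the preview does not describe. The cost-rate formula (storage rate for stored data sets, regeneration cost times usage rate for deleted ones) and all numbers are assumptions for illustration.

```python
from itertools import product


def total_cost_rate(stored_flags, compute_cost, storage_rate, usage_rate):
    """Total cost per time unit of one storage plan for a linear DDG."""
    total = 0.0
    regen = 0.0  # computation needed since the nearest stored predecessor
    for stored, x, y, v in zip(stored_flags, compute_cost,
                               storage_rate, usage_rate):
        if stored:
            total += y          # pay for storage
            regen = 0.0         # successors can start from this data set
        else:
            regen += x          # must recompute this data set as well
            total += regen * v  # pay for regeneration at each use
    return total


def minimum_cost_benchmark(compute_cost, storage_rate, usage_rate):
    """Exhaustive search over all 2^n plans (illustration only)."""
    plans = product([False, True], repeat=len(compute_cost))
    best = min(plans, key=lambda f: total_cost_rate(
        f, compute_cost, storage_rate, usage_rate))
    return best, total_cost_rate(best, compute_cost, storage_rate, usage_rate)


# Hypothetical chain d1 -> d2 -> d3
compute_cost = [4, 2, 8]        # cost to compute each data set from its predecessor
storage_rate = [1, 6, 2]        # storage cost per time unit
usage_rate = [0.5, 0.5, 0.5]    # uses per time unit

best_plan, best_cost = minimum_cost_benchmark(compute_cost, storage_rate, usage_rate)
print(best_plan, best_cost)
```

With these assumed figures the cheapest plan stores `d1` and `d3` and deletes `d2`, which is expensive to store but cheap to regenerate from `d1`. Any other storage strategy can then be judged by how close its cost rate comes to this benchmark.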

We develop novel cost-effective storage strategies for users to apply at runtime in the cloud. These differ from the minimum cost benchmarking approaches. In addition, users may sometimes have preferences regarding the storage of particular data sets for reasons other than cost, e.g. guaranteeing immediate access to certain data sets. Hence, users’ preferences should also be considered in a storage strategy. Based on these considerations, we develop two cost-effective storage strategies for different situations: (1) the cost-rate-based strategy is highly efficient with fairly reasonable cost-effectiveness; and (2) the local-optimisation-based strategy is highly cost-effective with very reasonable time complexity.

To the best of our knowledge, this book is the first comprehensive and systematic work investigating the issue of the computation and storage trade-off in the cloud in order to reduce the overall application cost. By proposing innovative concepts, theorems and algorithms, the major contribution of this book is to help bring down dramatically the cost, for both cloud users and service providers, of running computation and data-intensive scientific applications in the cloud.

1 Introduction

This book investigates the trade-off between computation and storage in the cloud. This is a new and significant issue for deploying applications with the pay-as-you-go model in the cloud, especially computation and data-intensive scientific applications. The novel research reported in this book helps both cloud service providers and users to reduce the cost of storing large generated application data sets in the cloud. A suite consisting of a novel cost model, benchmarking approaches and storage strategies is designed and developed with the support of new concepts, solid theorems and innovative algorithms. Experimental evaluations and a case study demonstrate that our work helps bring the cost down dramatically for running computation and data-intensive scientific applications in the
