Information Storage and Management: Storing, Managing, and Protecting Digital Information in Classic, Virtualized, and Cloud Environments
Ebook, 751 pages, 7 hours

About this ebook

The new edition of a bestseller, now revised and updated throughout!

This new edition of the unparalleled bestseller serves as a complete training course all in one, and as the world's largest data storage company, EMC is the ideal author for such a critical resource. The book covers the components of a storage system and the different storage system models, while also offering essential new material that explores advances in existing technologies and the emergence of the cloud, along with vital information on new technologies.

  • Features a separate section on the emerging area of cloud computing
  • Covers new technologies such as data deduplication, unified storage, continuous data protection technology, virtual provisioning, FCoE, flash drives, storage tiering, big data, and more
  • Details storage models such as Network Attached Storage (NAS), Storage Area Network (SAN), and Object-Based Storage, along with virtualization at various infrastructure components
  • Explores business continuity and security in physical and virtualized environments
  • Includes an enhanced Appendix for additional information

This authoritative guide is essential for getting up to speed on the newest advances in information storage and management.

Language: English
Publisher: Wiley
Release date: April 30, 2012
ISBN: 9781118236963


    Book preview

    Information Storage and Management - EMC Education Services

    Part I

    Storage System

    In This Section

    Chapter 1: Introduction to Information Storage

    Chapter 2: Data Center Environment

    Chapter 3: Data Protection: RAID

    Chapter 4: Intelligent Storage Systems

    Chapter 1

    Introduction to Information Storage

    Key Concepts

    Data and Information

    Structured and Unstructured Data

    Evolution of Storage Architecture

    Core Elements of a Data Center

    Virtualization and Cloud Computing

    Information is increasingly important in our daily lives. We have become information-dependent in the 21st century, living in an on-command, on-demand world, which means we need information when and where it is required. We access the Internet every day to perform searches, participate in social networking, send and receive e-mails, share pictures and videos, and use scores of other applications. Because individuals are equipped with a growing number of content-generating devices, more information is now created by individuals than by organizations (including businesses, governments, nonprofits, and so on). Information created by individuals gains value when shared with others. When created, information resides locally on devices such as cell phones, smartphones, tablets, cameras, and laptops. To be shared, this information needs to be uploaded to central data repositories (data centers) via networks. Although the majority of information is created by individuals, it is stored and managed by a relatively small number of organizations.

    The importance, dependency, and volume of information for the business world also continue to grow at astounding rates. Businesses depend on fast and reliable access to information critical to their success. Examples of business processes or systems that rely on digital information include airline reservations, telecommunications billing, Internet commerce, electronic banking, credit card transaction processing, capital/stock trading, health care claims processing, life science research, and so on. The increasing dependence of businesses on information has amplified the challenges in storing, protecting, and managing data. Legal, regulatory, and contractual obligations regarding the availability and protection of data further add to these challenges.

    Organizations usually maintain one or more data centers to store and manage information. A data center is a facility that contains information storage and other physical information technology (IT) resources for computing, networking, and storing information. In traditional data centers, the storage resources are typically dedicated to each of the business units or applications. The proliferation of new applications and increasing data growth have resulted in islands of discrete information storage infrastructures in these data centers. This leads to complex information management and underutilization of storage resources. Virtualization optimizes resource utilization and eases resource management. Organizations incorporate virtualization in their data centers to transform them into virtualized data centers (VDCs). Cloud computing, which represents a fundamental shift in how IT is built, managed, and provided, further reduces information storage and management complexity and IT resource provisioning time. Cloud computing brings in a fully automated request-fulfillment process that enables users to rapidly obtain storage and other IT resources on demand. Through cloud computing, an organization can rapidly deploy applications where the underlying storage capability can scale up and scale down based on the business requirements.

    This chapter describes the evolution of information storage architecture from a server-centric model to an information-centric model. It also provides an overview of virtualization and cloud computing.

    1.1 Information Storage

    Organizations process data to derive the information required for their day-to-day operations. Storage is a repository that enables users to persistently store and retrieve this digital data.

    1.1.1 Data

    Data is a collection of raw facts from which conclusions might be drawn. Handwritten letters, a printed book, a family photograph, printed and duly signed copies of mortgage papers, a bank's ledgers, and an airline ticket are all examples that contain data.

    Before the advent of computers, the methods adopted for data creation and sharing were limited to fewer forms, such as paper and film. Today, the same data can be converted into more convenient forms, such as an e-mail message, an e-book, a digital image, or a digital movie. This data can be generated using a computer and stored as strings of binary numbers (0s and 1s), as shown in Figure 1.1. Data in this form is called digital data and is accessible by the user only after a computer processes it.
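    The encoding described above can be illustrated with a short sketch (not from the book): ordinary text is stored as strings of 0s and 1s, and a computer processes the bits back into human-readable form.

```python
text = "DATA"

# Encode each character as its 8-bit binary representation.
binary = " ".join(format(byte, "08b") for byte in text.encode("ascii"))
print(binary)  # 01000100 01000001 01010100 01000001

# A computer processes the bits back into the original text.
decoded = bytes(int(b, 2) for b in binary.split()).decode("ascii")
print(decoded)  # DATA
```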

    Figure 1.1 Digital data


    With the advancement of computer and communication technologies, the rate of data generation and sharing has increased exponentially. The following is a list of some of the factors that have contributed to the growth of digital data:

    Increase in data-processing capabilities: Modern computers provide a significant increase in processing and storage capabilities. This enables the conversion of various types of content and media from conventional forms to digital formats.

    Lower cost of digital storage: Technological advances and the decrease in the cost of storage devices have provided low-cost storage solutions. This cost benefit has increased the rate at which digital data is generated and stored.

    Affordable and faster communication technology: The rate of sharing digital data is now much faster than traditional approaches. A handwritten letter might take a week to reach its destination, whereas it typically takes only a few seconds for an e-mail message to reach its recipient.

    Proliferation of applications and smart devices: Smartphones, tablets, and newer digital devices, along with smart applications, have significantly contributed to the generation of digital content.

    Inexpensive and easier ways to create, collect, and store all types of data, coupled with increasing individual and business needs, have led to accelerated data growth, popularly termed data explosion. Both individuals and businesses have contributed in varied proportions to this data explosion.

    The importance and value of data vary with time. Most data holds significance for a short term but becomes less valuable over time. This governs the type of data storage solution used. Typically, recent data, which has higher usage, is stored on faster and more expensive storage. As it ages, data may be moved to slower, less expensive but reliable storage.
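    The age-based placement just described can be sketched as a simple tiering policy. The tier names and thresholds here are purely illustrative, not from the book.

```python
def select_tier(age_in_days: int) -> str:
    """Pick a storage tier based on data age (illustrative thresholds)."""
    if age_in_days <= 30:
        return "flash"    # fast, expensive: recent, high-usage data
    elif age_in_days <= 365:
        return "disk"     # mid-range capacity storage
    else:
        return "archive"  # slower, less expensive but reliable

print(select_tier(7))    # flash
print(select_tier(400))  # archive
```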

    Examples of Research and Business Data

    Following are some examples of research and business data:

    Customer data: Data related to a company's customers, such as order details, shipping addresses, and purchase history.

    Product data: Includes data related to various aspects of a product, such as inventory, description, pricing, availability, and sales.

    Medical data: Data related to the healthcare industry, such as patient history, radiological images, details of medication and other treatment, and insurance information.

    Seismic data: Seismology is the scientific study of earthquakes. It involves collecting and processing data to derive information that helps determine the location and magnitude of earthquakes.

    Businesses generate vast amounts of data and then extract meaningful information from this data to derive economic benefits. Therefore, businesses need to maintain data and ensure its availability over a longer period. Furthermore, the data can vary in criticality and might require special handling. For example, legal and regulatory requirements mandate that banks maintain account information for their customers accurately and securely. Some businesses handle data for millions of customers and ensure the security and integrity of data over a long period of time. This requires high-performance and high-capacity storage devices with enhanced security and compliance that can retain data for a long period.

    1.1.2 Types of Data

    Data can be classified as structured or unstructured (see Figure 1.2) based on how it is stored and managed. Structured data is organized in rows and columns in a rigidly defined format so that applications can retrieve and process it efficiently. Structured data is typically stored using a database management system (DBMS).

    Figure 1.2 Types of data


    Data is unstructured if its elements cannot be stored in rows and columns, which makes it difficult for applications to query and retrieve. For example, customer contacts may be stored in various forms, such as sticky notes, e-mail messages, business cards, or even digital format files such as .doc, .txt, and .pdf. Due to its unstructured nature, this data is difficult to retrieve using a traditional customer relationship management application. A vast majority of new data being created today is unstructured. The industry is challenged to develop new architectures, technologies, techniques, and skills to store, manage, analyze, and derive value from unstructured data from numerous sources.
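    A minimal sketch of the distinction drawn above: structured data fits a rigid rows-and-columns schema and can be queried directly, whereas unstructured data is opaque content that must be scanned. All names and values here are invented for illustration.

```python
import sqlite3

# Structured: a customer contact in a DBMS table, retrievable by query.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE contacts (name TEXT, email TEXT)")
db.execute("INSERT INTO contacts VALUES ('Alice', 'alice@example.com')")
row = db.execute("SELECT email FROM contacts WHERE name = 'Alice'").fetchone()
print(row[0])  # alice@example.com

# Unstructured: the same contact buried in free-form text; finding the
# e-mail address now requires scanning the content itself.
note = "Met Alice at the expo; reach her at alice@example.com next week."
found = next(word for word in note.split() if "@" in word)
print(found)  # alice@example.com
```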

    1.1.3 Big Data

    Big data is a new and evolving concept, which refers to data sets whose sizes are beyond the capability of commonly used software tools to capture, store, manage, and process within acceptable time limits. It includes both structured and unstructured data generated by a variety of sources, including business application transactions, web pages, videos, images, e-mails, social media, and so on. These data sets typically require real-time capture or updates for analysis, predictive modeling, and decision making.

    Significant opportunities exist to extract value from big data. The big data ecosystem (see Figure 1.3) consists of the following:

    1. Devices that collect data from multiple locations and also generate new data about this data (metadata).

    2. Data collectors who gather data from devices and users.

    3. Data aggregators that compile the collected data to extract meaningful information.

    4. Data users and buyers who benefit from the information collected and aggregated by others in the data value chain.
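    The four-stage value chain above can be sketched as a toy pipeline; every name and data item is invented for illustration.

```python
# 1. Devices generate data (and metadata about it).
readings = [{"device": "sensor-1", "temp": 21.5},
            {"device": "sensor-2", "temp": 23.0}]

# 2. Collectors gather data from the devices.
collected = [dict(r, collected=True) for r in readings]

# 3. Aggregators compile it into meaningful information.
average = sum(r["temp"] for r in collected) / len(collected)

# 4. Data users and buyers consume the aggregated result.
print(f"average temperature: {average}")  # average temperature: 22.25
```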

    Figure 1.3 Big data ecosystem


    Traditional IT infrastructure and data processing tools and methodologies are inadequate to handle the volume, variety, dynamism, and complexity of big data. Analyzing big data in real time requires new techniques, architectures, and tools that provide high performance, massively parallel processing (MPP) data platforms, and advanced analytics on the data sets.

    Data science is an emerging discipline that enables organizations to derive business value from big data. It represents the synthesis of several existing disciplines, such as statistics, mathematics, data visualization, and computer science, and enables data scientists to develop advanced algorithms for analyzing vast amounts of information to derive new value and support more data-driven decisions.

    Several industries and markets currently looking to employ data science techniques include medical and scientific research, health care, public administration, fraud detection, social media, banks, insurance companies, and other digital information-based entities that benefit from the analytics of big data.

    1.1.4 Information

    Data, whether structured or unstructured, does not fulfill any purpose for individuals or businesses unless it is presented in a meaningful form. Information is the intelligence and knowledge derived from data.

    Businesses analyze raw data to identify meaningful trends. On the basis of these trends, a company can plan or modify its strategy. For example, a retailer identifies customers' preferred products and brand names by analyzing their purchase patterns and maintaining an inventory of those products. Effective data analysis not only extends its benefits to existing businesses, but also creates the potential for new business opportunities by using the information in creative ways.

    1.1.5 Storage

    Data created by individuals or businesses must be stored so that it is easily accessible for further processing. In a computing environment, devices designed for storing data are termed storage devices or simply storage. The type of storage used varies based on the type of data and the rate at which it is created and used. Devices, such as a media card in a cell phone or digital camera, DVDs, CD-ROMs, and disk drives in personal computers are examples of storage devices.

    Businesses have several options available for storing data, including internal hard disks, external disk arrays, and tapes.

    1.2 Evolution of Storage Architecture

    Historically, organizations had centralized computers (mainframes) and information storage devices (tape reels and disk packs) in their data center. The evolution of open systems, their affordability, and ease of deployment made it possible for business units/departments to have their own servers and storage. In earlier implementations of open systems, the storage was typically internal to the server. These storage devices could not be shared with any other servers. This approach is referred to as server-centric storage architecture (see Figure 1.4 [a]). In this architecture, each server has a limited number of storage devices, and any administrative tasks, such as maintenance of the server or increasing storage capacity, might result in unavailability of information. The proliferation of departmental servers in an enterprise resulted in unprotected, unmanaged, fragmented islands of information and increased capital and operating expenses.

    Figure 1.4 Evolution of storage architecture


    To overcome these challenges, storage evolved from server-centric to information-centric architecture (see Figure 1.4 [b]). In this architecture, storage devices are managed centrally and independently of servers. These centrally managed storage devices are shared among multiple servers. When a new server is deployed in the environment, storage is assigned to that server from the same shared storage devices. The capacity of shared storage can be increased dynamically by adding more storage devices without impacting information availability. In this architecture, information management is easier and more cost-effective.
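    The information-centric model above can be sketched as a central pool that assigns capacity to servers on demand and grows without disturbing existing assignments. The class and its figures are hypothetical, for illustration only.

```python
class SharedStoragePool:
    """Toy model of centrally managed, shared storage."""

    def __init__(self, capacity_gb: int):
        self.capacity_gb = capacity_gb
        self.assigned = {}  # server name -> GB assigned

    def assign(self, server: str, gb: int) -> None:
        if gb > self.free_gb():
            raise ValueError("insufficient free capacity")
        self.assigned[server] = self.assigned.get(server, 0) + gb

    def add_capacity(self, gb: int) -> None:
        # Dynamic growth: existing assignments are untouched.
        self.capacity_gb += gb

    def free_gb(self) -> int:
        return self.capacity_gb - sum(self.assigned.values())

pool = SharedStoragePool(1000)
pool.assign("server-a", 400)
pool.add_capacity(500)          # expand without impacting server-a
pool.assign("server-b", 800)    # new server draws from the same pool
print(pool.free_gb())           # 300
```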

    Storage technology and architecture continue to evolve, which enables organizations to consolidate, protect, optimize, and leverage their data to achieve the highest return on information assets.

    1.3 Data Center Infrastructure

    Organizations maintain data centers to provide centralized data-processing capabilities across the enterprise. Data centers house and manage large amounts of data. The data center infrastructure includes hardware components, such as computers, storage systems, network devices, and power backups; and software components, such as applications, operating systems, and management software. It also includes environmental controls, such as air conditioning, fire suppression, and ventilation.

    Large organizations often maintain more than one data center to distribute data processing workloads and provide backup if a disaster occurs.

    1.3.1 Core Elements of a Data Center

    Five core elements are essential for the functionality of a data center:

    Application: A computer program that provides the logic for computing operations

    Database management system (DBMS): Provides a structured way to store data in logically organized tables that are interrelated

    Host or compute: A computing platform (hardware, firmware, and software) that runs applications and databases

    Network: A data path that facilitates communication among various networked devices

    Storage: A device that stores data persistently for subsequent use

    These core elements are typically viewed and managed as separate entities, but all the elements must work together to address data-processing requirements.

    In this book, host, compute, and server are used interchangeably to represent the element that runs applications.

    Figure 1.5 shows an example of an online order transaction system that involves the five core elements of a data center and illustrates their functionality in a business process.

    Figure 1.5 Example of an online order transaction system


    A customer places an order through a client machine connected over a LAN/WAN to a host running an order-processing application. The client accesses the DBMS on the host through the application to provide order-related information, such as the customer name, address, payment method, products ordered, and quantity ordered.

    The DBMS uses the host operating system to write this data to the physical disks in the storage array. The storage network provides the communication link between the host and the storage array and transports the read or write requests between them. The storage array, after receiving the read or write request from the host, performs the necessary operations to store the data on physical disks.
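    The request path just described (client, application, DBMS, storage) can be sketched in a highly simplified way, with each layer faked by a function; all names and values are invented for illustration.

```python
storage_array = {}  # stands in for the physical disks


def storage_write(block_id, data):
    """Storage layer: persist data on 'disk'."""
    storage_array[block_id] = data


def dbms_insert(order):
    """DBMS layer: asks the OS/storage to persist the record."""
    storage_write(f"order-{order['id']}", order)


def application_place_order(customer, product, qty):
    """Order-processing application invoked by the client."""
    order = {"id": 1, "customer": customer, "product": product, "qty": qty}
    dbms_insert(order)
    return order["id"]


# Client request arriving over the LAN/WAN.
order_id = application_place_order("Alice", "SSD", 2)
print(storage_array[f"order-{order_id}"]["qty"])  # 2
```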

    1.3.2 Key Characteristics of a Data Center

    Uninterrupted operation of data centers is critical to the survival and success of a business. Organizations must have a reliable infrastructure that ensures that data is accessible at all times. Although the characteristics shown in Figure 1.6 are applicable to all elements of the data center infrastructure, the focus here is on storage systems. This book covers the various technologies and solutions to meet these requirements.

    Availability: A data center should ensure the availability of information when required. Unavailability of information could cost millions of dollars per hour to businesses, such as financial services, telecommunications, and e-commerce.

    Security: Data centers must establish policies, procedures, and core element integration to prevent unauthorized access to information.

    Scalability: Business growth often requires deploying more servers, new applications, and additional databases. Data center resources should scale based on requirements, without interrupting business operations.

    Performance: All the elements of the data center should provide optimal performance based on the required service levels.

    Data integrity: Data integrity refers to mechanisms, such as error correction codes or parity bits, which ensure that data is stored and retrieved exactly as it was received.

    Capacity: Data center operations require adequate resources to store and process large amounts of data efficiently. When capacity requirements increase, the data center must provide additional capacity without interrupting availability, or with minimal disruption. Capacity may be managed by reallocating existing resources or by adding new resources.

    Manageability: A data center should provide easy and integrated management of all its elements. Manageability can be achieved through automation and reduction of human (manual) intervention in common tasks.
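    The availability characteristic above is often quoted as a percentage; a small worked example shows how that percentage translates into allowed downtime per year.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600


def downtime_minutes(availability_pct: float) -> float:
    """Allowed downtime per year for a given availability percentage."""
    return MINUTES_PER_YEAR * (1 - availability_pct / 100)


print(round(downtime_minutes(99.9), 1))    # 525.6 (about 8.8 hours/year)
print(round(downtime_minutes(99.999), 1))  # 5.3   (about five minutes/year)
```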

    Figure 1.6 Key characteristics of a data center


    1.3.3 Managing a Data Center

    Managing a data center involves many tasks. The key management activities include the following:

    Monitoring: It is a continuous process of gathering information on various elements and services running in a data center. The aspects of a data center that are monitored include security, performance, availability, and capacity.

    Reporting: It is done periodically on resource performance, capacity, and utilization. Reporting tasks help to establish business justifications and chargeback of costs associated with data center operations.

    Provisioning: It is a process of providing the hardware, software, and other resources required to run a data center. Provisioning activities primarily include resource management to meet capacity, availability, performance, and security requirements.

    Virtualization and cloud computing have dramatically changed the way data center infrastructure resources are provisioned and managed. Organizations are rapidly deploying virtualization on various elements of data centers to optimize their utilization. Further, continuous cost pressure on IT and on-demand data processing requirements have resulted in the adoption of cloud computing.

    1.4 Virtualization and Cloud Computing

    Virtualization is a technique of abstracting physical resources, such as compute, storage, and network, and making them appear as logical resources. Virtualization has existed in the IT industry for several years and in different forms. Common examples of virtualization are virtual memory used on compute systems and partitioning of raw disks.

    Virtualization enables the pooling of physical resources and provides an aggregated view of the physical resource capabilities. For example, storage virtualization enables multiple pooled storage devices to appear as a single large storage entity. Similarly, by using compute virtualization, the CPU capacity of the pooled physical servers can be viewed as the aggregation of the power of all CPUs (in megahertz). Virtualization also enables centralized management of pooled resources.

    Virtual resources can be created and provisioned from the pooled physical resources. For example, a virtual disk of a given capacity can be created from a storage pool or a virtual server with specific CPU power and memory can be configured from a compute pool. These virtual resources share pooled physical resources, which improves the utilization of physical IT resources. Based on business requirements, capacity can be added to or removed from the virtual resources without any disruption to applications or users. With improved utilization of IT assets, organizations save the costs associated with procurement and management of new physical resources. Moreover, fewer physical resources means less space and energy, which leads to better economics and green computing.
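    The pooling idea above can be sketched by aggregating the CPU capacity of several physical servers and carving virtual machines out of the total; all figures and names are invented.

```python
# Three pooled physical servers, CPU capacity in MHz.
physical_servers_mhz = [2400, 2400, 3200]

# Aggregated view of the pool: 8000 MHz in total.
pool_mhz = sum(physical_servers_mhz)

# Virtual servers provisioned from the pool.
vms = {"vm-1": 1500, "vm-2": 2500}

free_mhz = pool_mhz - sum(vms.values())
print(free_mhz)  # 4000

# Capacity can be returned to the pool without disrupting other VMs.
del vms["vm-2"]
print(pool_mhz - sum(vms.values()))  # 6500
```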

    In today's fast-paced and competitive environment, organizations must be agile and flexible to meet changing market requirements. This leads to rapid expansion and upgrade of resources while meeting shrinking or stagnant IT budgets. Cloud computing addresses these challenges efficiently. Cloud computing enables individuals or businesses to use IT resources as a service over the network. It provides highly scalable and flexible computing that enables provisioning of resources on demand. Users can scale up or scale down their demand for computing resources, including storage capacity, with minimal management effort or service provider interaction. Cloud computing empowers self-service requesting through a fully automated request-fulfillment process. Cloud computing enables consumption-based metering; therefore, consumers pay only for the resources they use, such as CPU hours used, amount of data transferred, and gigabytes of data stored.
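    The consumption-based metering just mentioned can be sketched as a pay-per-use bill; the rates and usage figures below are purely illustrative.

```python
# Hypothetical per-unit rates for metered resources.
RATES = {"cpu_hours": 0.05, "gb_transferred": 0.09, "gb_stored": 0.02}


def monthly_bill(usage: dict) -> float:
    """Consumers pay only for what they actually use."""
    return sum(RATES[item] * amount for item, amount in usage.items())


usage = {"cpu_hours": 200, "gb_transferred": 50, "gb_stored": 100}
print(round(monthly_bill(usage), 2))  # 16.5
```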

    Cloud infrastructure is usually built upon virtualized data centers, which provide resource pooling and rapid provisioning of resources. Information storage in virtualized and cloud environments is detailed later in the book.

    Summary

    This chapter described the importance of data, information, and storage infrastructure. Meeting today's storage needs begins with understanding the type of data, its value, and key attributes of a data center.

    The evolution of storage architecture and the core elements of a data center covered in this chapter provided the foundation for information storage and management. The emergence of virtualization has provided the opportunity to transform classic data centers into virtualized data centers. Cloud computing is further changing the way IT resources are provisioned and consumed.

    The subsequent chapters in the book provide comprehensive details on various aspects of information storage and management in both classic and virtualized environments. It begins by describing the core elements of a data center with a focus on storage systems and RAID (covered in Chapters 2, 3, and 4). Chapters 5 through 8 detail various storage networking technologies, such as storage area network (SAN), network attached storage (NAS), and object-based and unified storage. Chapters 9 through 12 cover various business continuity solutions, such as backup and replication, along with archival technologies. Chapter 13 introduces cloud infrastructure and services. Chapters 14 and 15 describe securing and managing storage in traditional and virtualized environments.

    Exercises

    1. What is structured and unstructured data? Research the challenges of storing and managing unstructured data.

    2. Discuss the benefits of information-centric storage architecture over server-centric storage architecture.

    3. What are the attributes of big data? Research and prepare a presentation on big data analytics.

    4. Research how businesses use their information assets to derive competitive advantage and new business opportunities.

    5. Research and prepare a presentation on personal data management.

    Chapter 2

    Data Center Environment

    Key Concepts

    Application, DBMS, Host, Connectivity, and Storage

    Application Virtualization

    File System and Volume Manager

    Compute, Desktop, and Memory Virtualization

    Storage Media

    Disk Drive Components

    Zoned Bit Recording

    Logical Block Addressing

    Flash Drives

    Today, data centers are essential and integral parts of any business, whether small, medium, or large in size. The core elements of a data center are host, storage, connectivity (or network), applications, and DBMS that are managed centrally. These elements work together to process and store data. With the evolution of virtualization, data centers have also evolved from a classic data center to a virtualized data center (VDC). In a VDC, physical resources from a classic data center are pooled together and provided as virtual resources. This abstraction hides the complexity and limitations of physical resources from the user. By consolidating IT resources using virtualization, organizations can optimize their infrastructure utilization and reduce the total cost of owning an infrastructure. Moreover, in a VDC, virtual resources are created using software, which enables faster deployment compared to deploying physical resources in classic data centers. This chapter covers all the key components of a data center, including virtualization at the compute, memory, desktop, and application levels. Storage and network virtualization is discussed later in the book.

    With the increase in the criticality of information assets to businesses, storage—one of the core elements of a data center—is recognized as a distinct resource. Storage needs special focus and attention for its implementation and management. This chapter also focuses on storage subsystems and provides details on components, geometry, and performance parameters of a disk drive. The connectivity between the host and storage facilitated by various technologies is also explained.

    2.1 Application

    An application is a computer program that provides the logic for computing operations. The application sends requests to the underlying operating system to perform read/write (R/W) operations on the storage devices. Applications can be layered on the database, which in turn uses the OS services to perform R/W operations on the storage devices. Applications deployed in a data center environment are commonly categorized as business applications, infrastructure management applications, data protection applications, and security applications. Some examples of these applications are e-mail, enterprise resource planning (ERP), decision support system (DSS), resource management, backup, authentication and antivirus applications, and so on.

    The characteristics of I/Os (Input/Output) generated by the application influence the overall performance of storage system and storage solution designs. For more information on application I/O characteristics, refer to Appendix A.

    Application Virtualization

    Application virtualization breaks the dependency between the application and the underlying platform (OS and hardware). Application virtualization encapsulates the application and the required OS resources within a virtualized container. This technology provides the ability to deploy applications without making any change to the underlying OS, file system, or registry of the computing platform on which they are deployed. Because virtualized applications run in an isolated environment, the underlying OS and other applications are protected from potential corruptions. There are many scenarios in which conflicts might arise if multiple applications or multiple versions of the same application are installed on the same computing platform. Application virtualization eliminates this conflict by isolating different versions of an application and the associated OS resources.

    2.2 Database Management System (DBMS)

    A database is a structured way to store data in logically organized tables that are interrelated. A database helps to optimize the storage and retrieval of data. A DBMS controls the creation, maintenance, and use of a database. The DBMS processes an application's request for data and instructs the operating system to transfer the appropriate data from the storage.

    2.3 Host (Compute)

    Users store and retrieve data through applications. The computers on which these applications run are referred to as hosts or compute systems. Hosts can be physical or virtual machines. Compute virtualization software enables the creation of virtual machines on top of a physical compute infrastructure. Compute virtualization and virtual machines are discussed later in this chapter. Examples of physical hosts include desktop computers, servers or clusters of servers, laptops, and mobile devices. A host consists of a CPU, memory, I/O devices, and a collection of software to perform computing operations. This software includes the operating system, file system, logical volume manager, device drivers, and so on. This software can be installed as separate entities or as part of the operating system.

    The CPU consists of four components: Arithmetic Logic Unit (ALU), control unit, registers, and L1 cache. There are two types of memory on a host, Random Access Memory (RAM) and Read-Only Memory (ROM). I/O devices enable communication with a host. Examples of I/O devices are keyboard, mouse, monitor, etc.

    Software runs on a host and enables processing of input and output (I/O) data. The following section details various software components that are essential parts of a host system.

    2.3.1 Operating System

    In a traditional computing environment, an operating system controls all aspects of computing. It works between the application and the physical components of a compute system. One of the services it provides to the application is data access. The operating system also monitors and responds to user actions and the environment. It organizes and controls hardware components and manages the allocation of hardware resources. It provides basic security for the access and usage of all managed resources. An operating system also performs basic storage management tasks while managing other underlying components, such as the file system, volume manager, and device drivers.

    In a virtualized compute environment, the virtualization layer works between the operating system and the hardware resources. Here the OS might work differently based on the type of compute virtualization implemented. In a typical implementation, the OS works as a guest and performs only the activities related to application interaction. In this case, hardware management functions are handled by the virtualization layer.

    Memory Virtualization

    Memory has been, and continues to be, an expensive component of a host. It determines both the size and number of applications that can run on a host. Memory virtualization enables multiple applications and processes, whose aggregate memory requirement is greater than the available physical memory, to run on a host without impacting each other.

    Memory virtualization is an operating system feature that virtualizes the physical memory (RAM) of a host. It creates virtual memory with an address space larger than the physical memory space present in the compute system. The virtual memory encompasses the address space of the physical memory and part of the disk storage. The operating system utility that manages the virtual memory is known as the virtual memory manager (VMM). The VMM manages the virtual-to-physical memory mapping and fetches data from the disk storage when a process references a virtual address that points to data at the disk storage. The space used by the VMM on the disk is known as a swap space. A swap space (also known as page file or swap file) is a portion of the disk drive that appears to be physical memory to the operating system.

    In a virtual memory implementation, the memory of a system is divided into contiguous, fixed-size blocks called pages. A process known as paging moves inactive physical memory pages onto the swap file and brings them back to the physical memory when required. This enables efficient use of the available physical memory among different applications. The operating system typically moves the least used pages into the swap file so that enough RAM is available for processes that are more active. Access to swap file pages is slower than access to physical memory pages because swap file pages are allocated on the disk drive, which is slower than physical memory.
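    The paging behavior described above can be sketched as a toy simulator. This is an illustrative model only, not how any particular operating system is implemented; the class name, frame count, and least-recently-used (LRU) eviction policy are simplifying assumptions:

```python
from collections import OrderedDict

class PagingSimulator:
    """Toy model of demand paging: physical memory holds a fixed number
    of page frames, and the least recently used page is evicted to swap."""

    def __init__(self, num_frames):
        self.num_frames = num_frames
        self.frames = OrderedDict()  # resident pages, oldest (LRU) first
        self.swap = set()            # pages currently on the swap file
        self.page_faults = 0

    def access(self, page):
        if page in self.frames:
            self.frames.move_to_end(page)  # mark as most recently used
            return "hit"
        # Page fault: bring the page in from swap (or zero-fill it).
        self.page_faults += 1
        self.swap.discard(page)
        if len(self.frames) >= self.num_frames:
            victim, _ = self.frames.popitem(last=False)  # evict LRU page
            self.swap.add(victim)                        # write it to swap
        self.frames[page] = True
        return "fault"

sim = PagingSimulator(num_frames=3)
for p in [1, 2, 3, 1, 4, 2]:
    sim.access(p)
# Pages 1, 2, 3 fault in; 1 hits; 4 evicts page 2 (the LRU page);
# 2 faults back in from swap, evicting page 3.
```

The simulation shows why a working set larger than physical memory still runs, but at the cost of page faults and slower swap-file accesses.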

    2.3.2 Device Driver

    A device driver is special software that permits the operating system to interact with a specific device, such as a printer, a mouse, or a disk drive. A device driver enables the operating system to recognize the device and to access and control it. Device drivers are hardware-dependent and operating-system-specific.

    2.3.3 Volume Manager

    In the early days, disk drives appeared to the operating system as a number of contiguous disk blocks. The entire disk drive would be allocated to the file system or other data entity used by the operating system or application. The disadvantage was lack of flexibility. When a disk drive ran out of space, there was no easy way to extend the file system's size. Also, as the storage capacity of disk drives increased, allocating the entire disk drive to the file system often resulted in underutilization of storage capacity.

    The evolution of Logical Volume Managers (LVMs) enabled dynamic extension of file system capacity and efficient storage management. The LVM is software that runs on the compute system and manages logical and physical storage. The LVM is an intermediate layer between the file system and the physical disk. It can divide a larger-capacity disk into virtual, smaller-capacity volumes (a process called partitioning) or aggregate several smaller disks to form a larger virtual volume (a process called concatenation). These volumes are then presented to applications.

    Disk partitioning was introduced to improve the flexibility and utilization of disk drives. In partitioning, a disk drive is divided into logical containers called logical volumes (LVs) (see Figure 2.1). For example, a large physical drive can be partitioned into multiple LVs to maintain data according to the file system and application requirements. The partitions are created from groups of contiguous cylinders when the hard disk is initially set up on the host. The host's file system accesses the logical volumes without any knowledge of partitioning and physical structure of the disk.

    Figure 2.1 Disk partitioning and concatenation


    Concatenation is the process of grouping several physical drives and presenting them to the host as one big logical volume (see Figure 2.1).

    The LVM provides optimized storage access and simplifies storage resource management. It hides details about the physical disk and the location of data on the disk. It enables administrators to change the storage allocation even when the application is running.

    The basic LVM components are physical volumes, volume groups, and logical volumes. In LVM terminology, each physical disk connected to the host system is a physical volume (PV). The LVM converts the physical storage provided by the physical volumes to a logical view of storage, which is then used by the operating system and applications. A volume group is created by grouping together one or more physical volumes. A unique physical volume identifier (PVID) is assigned to each physical volume when it is initialized for use by the LVM. Physical volumes can be added or removed from a volume group dynamically. They cannot be shared between different volume groups, which means that the entire physical volume becomes part of a volume group. Each physical volume is partitioned into equal-sized data blocks called physical extents when the volume group is created.

    Logical volumes are created within a given volume group. A logical volume can be thought of as a disk partition, whereas the volume group itself can be thought of as a disk. A volume group can have a number of logical volumes. The size of a logical volume is based on a multiple of the physical extents.
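    Because a logical volume's size is a multiple of the physical extent size, a requested capacity is rounded up to the next extent boundary. A minimal sketch of this calculation, assuming a 4 MB extent size (a common default; the function names are illustrative):

```python
import math

def extents_needed(requested_mb, pe_size_mb=4):
    """Capacity is allocated in whole physical extents, so round up."""
    return math.ceil(requested_mb / pe_size_mb)

def actual_lv_size_mb(requested_mb, pe_size_mb=4):
    """The logical volume's real size is a multiple of the extent size."""
    return extents_needed(requested_mb, pe_size_mb) * pe_size_mb

# A 10 MB request with 4 MB extents needs 3 extents,
# so the logical volume actually created is 12 MB.
```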

    The logical volume appears as a physical device to the operating system. A logical volume is made up of noncontiguous physical extents and may span multiple physical volumes. A file system is created on a logical volume. These logical volumes are then assigned to the application. A logical volume can also be mirrored to provide enhanced data availability.

    2.3.4 File System

    A file is a collection of related records or data stored as a unit with a name. A file system is a hierarchical structure of files. A file system enables easy access to data files residing within a disk drive, a disk partition, or a logical volume. A file system consists of logical structures and software routines that control access to files. It provides users with the functionality to create, modify, delete, and access files. Access to files on the disks is controlled by the permissions assigned to the file by the owner, which are also maintained by the file system.

    A file system organizes data in a structured hierarchical manner via the use of directories, which are containers for storing pointers to multiple files. All file systems maintain a pointer map to the directories, subdirectories, and files that are part of the file system. Examples of common file systems are:

    File Allocation Table (FAT 32) for Microsoft Windows

    NT File System (NTFS) for Microsoft Windows

    UNIX File System (UFS) for UNIX

    Extended File System (EXT2/3) for Linux

    Apart from the files and directories, the file system also includes a number of other related records, which are collectively called the metadata. For example, the metadata in a UNIX environment consists of the superblock, the inodes, and the list of data blocks free and in use. The metadata of a file system must be consistent for the file system to be considered healthy.

    A superblock contains important information about the file system, such as the file system type, creation and modification dates, size, and layout. It also contains the count of available resources (such as the number of free blocks, inodes, and so on) and a flag indicating the mount status of the file system. An inode is associated with every file and directory and contains information such as the file length, ownership, access privileges, time of last access/modification, number of links, and the address of the data.
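    On a UNIX or Linux host, much of this inode metadata can be inspected programmatically. The following sketch, using only Python's standard library, creates a temporary file and reads back the inode attributes the paragraph above describes (the variable names are illustrative):

```python
import os
import stat
import tempfile

# Create a small file, then read back its inode metadata.
fd, path = tempfile.mkstemp()
os.write(fd, b"hello")
os.close(fd)

info = os.stat(path)
inode_number = info.st_ino     # which inode describes this file
file_length = info.st_size     # file length in bytes
link_count = info.st_nlink     # number of directory entries (hard links)
owner_uid = info.st_uid        # ownership
last_modified = info.st_mtime  # time of last modification
is_regular = stat.S_ISREG(info.st_mode)

os.unlink(path)                # releases the inode once the link count hits 0
```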

    A file system block is the smallest unit allocated for storing data. Each file system block is a contiguous area on the physical disk. The block size of a file system is fixed at the time of its creation. The file system size depends on the block size and the total number of file system blocks. A file can span multiple file system blocks because most files are larger than the predefined block size of the file system. File system blocks cease to be contiguous and become fragmented when new blocks are added or deleted. Over time, as files grow larger, the file system becomes increasingly fragmented.
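    The relationship between file size and block allocation can be made concrete with a short, hedged example: because blocks are allocated whole, the tail of a file wastes part of its last block (internal fragmentation). The 4 KB block size below is a common choice, not a universal one:

```python
import math

def blocks_for_file(file_size, block_size=4096):
    """A file occupies whole file system blocks, so the count rounds up."""
    return math.ceil(file_size / block_size)

def slack_space(file_size, block_size=4096):
    """Unused bytes in the last allocated block (internal fragmentation)."""
    return blocks_for_file(file_size, block_size) * block_size - file_size

# A 10,000-byte file on a file system with 4 KB blocks spans 3 blocks
# and leaves 2,288 bytes unused in its last block.
```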

    The following list shows the process of mapping user files to the disk storage subsystem with an LVM (see Figure 2.2):

    1. Files are created and managed by users and applications.

    2. These files reside in the file systems.

    3. The file systems are mapped to file system blocks.

    4. The file system blocks are mapped to logical extents of a logical volume.

    5. These logical extents in turn are mapped to the disk physical extents either by the operating system or by the LVM.

    6. These physical extents are mapped to the disk sectors in a storage subsystem.

    If there is no LVM, then there are no logical extents. Without LVM, file system blocks are directly mapped to disk sectors.
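    The mapping chain above can be sketched as a toy calculation. The sizes (4 KB file system blocks, 4 MB extents, 512-byte sectors) and the extent map are hypothetical examples chosen for illustration, not values from any real system:

```python
BLOCK_SIZE = 4096                # file system block size (assumed)
EXTENT_SIZE = 4 * 1024 * 1024    # logical/physical extent size (assumed)
SECTOR_SIZE = 512                # disk sector size (assumed)

def file_to_blocks(offset, length):
    """Step 3: a byte range of a file -> file system block numbers."""
    first = offset // BLOCK_SIZE
    last = (offset + length - 1) // BLOCK_SIZE
    return list(range(first, last + 1))

def block_to_logical_extent(block_no):
    """Step 4: file system block -> logical extent of the logical volume."""
    return block_no * BLOCK_SIZE // EXTENT_SIZE

def logical_to_physical_extent(le_no, extent_map):
    """Step 5: the LVM maps logical extents to (possibly noncontiguous)
    physical extents; extent_map here is an invented example."""
    return extent_map[le_no]

def physical_extent_to_sectors(pe_no):
    """Step 6: physical extent -> range of disk sector numbers."""
    start = pe_no * EXTENT_SIZE // SECTOR_SIZE
    return (start, start + EXTENT_SIZE // SECTOR_SIZE - 1)

# A 10 KB read at byte offset 0 touches blocks 0-2 of the file system.
blocks = file_to_blocks(0, 10 * 1024)
le = block_to_logical_extent(blocks[0])
pe = logical_to_physical_extent(le, extent_map={0: 7})  # LE 0 -> PE 7
sectors = physical_extent_to_sectors(pe)
```

Without an LVM, steps 4 and 5 disappear and the block number is translated directly into sector numbers.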

    Figure 2.2 Process of mapping user files to disk storage


    The file system tree starts with the root directory. The root directory has a number of subdirectories. A file system should be mounted before it can be used.

    The system utility fsck is run to check file system consistency on UNIX and Linux hosts. An example of a file system in an inconsistent state is one with outstanding changes that were not committed to the disk before the host shut down abruptly.
