Sunteți pe pagina 1din 35

Teradata Overview

Part-1

Amruta Patil

2010 Wipro Ltd - Confidential

Objective of the course


End of the session you will have gained an understanding on following Teradata overview.

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Agenda
1 What is Teradata ? 2 Teradata Architecture 3 Teradata Advantages 4 Teradata Never Have to Do 5 Summary

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

What is Teradata ?

2010 Wipro Ltd - Confidential

What is Teradata?
Teradata is a relational database management system (RDBMS) that is:
An open system, running on a UNIX MP-RAS or Windows 2000 server platform. Capable of supporting many concurrent users from various client platforms (over a TCP/IP or IBM channel connection). Built on a parallel architecture. Share Nothing architecture. Linear Scalability. Automatic, even data distribution

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Teradata Architecture

2010 Wipro Ltd - Confidential

Teradata Architecture

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Teradata Advantages

2010 Wipro Ltd - Confidential

Teradata Advantages
Large capacity database machine: Teradata handles the large data storage requirements to process the large amounts of detail data for decision support. Billions of rows Terabytes of data Performance: Early relational systems suffered severe performance limitations as table size increased. Teradata addresses the performance issues of large databases. Single data store for multiple clients: Instead of replicating a database for different hosts, with Teradata you store it once and use it for all clients. This is what's known as "a single version of the truth". Connectivity: Teradata connects easily to network-attached host systems as well as mainframe hosts.
9

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Teradata Advantages contd..

Standard access language (SQL): SQL has been adapted as the industry standard for relational databases. Manageable growth: Teradata systems are linearly expandable to allow for growth without performance drop-off. Fault tolerance: Teradata can automatically detect and recover from one or more hardware failures. Data integrity: To guarantee the integrity of the data, transactions are either completed or, if a fault occurs, rolled back.

10

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Teradata Never Have to Do

11

2010 Wipro Ltd - Confidential

Teradata Never Have to Do

Teradata DBAs are not required to do the following : Reorganize data or index space. Pre-allocate table/index space and format partitioning. While it is possible to have partitioned indexes in Teradata, they are not required. Pre-prepare data for loading (convert, sort, split, etc.). Ensure that queries run in parallel. Unload/reload data spaces due to expansion. With Teradata, the data can be redistributed on the larger configuration with no unloading and reloading required.

12

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Summary

13

2010 Wipro Ltd - Confidential

Summary
Teradata is designed and used for enterprise data warehousing. Designed and built with parallelism from day one

The secret to data warehousing is not just the ability to lay down a terabyte of data onto disk. Lots of vendors can do that. The trick is to be able to access it and use it in a meaningful way. That means lots of users doing lots of queries, some ad hoc, some planned, all getting what they need in a timely fashion.

14

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Teradata Overview
Part-2

Amruta Patil

2010 Wipro Ltd - Confidential

Objective of the course


End of the session you will have gained an understanding on following Teradata overview Part 2.

16

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Agenda
1 Describe a node 2 SMP and MMP 3 Clique 4 Major Components of Teradata system 5 Summary

17

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Describe a node.

18

2010 Wipro Ltd - Confidential

Describe a node.

19

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

SMP and MPP

20

2010 Wipro Ltd - Confidential

Symmetric Multi-Processor (SMP)

A node is made up of various hardware and software components. All applications run under UNIX , or Windows, and all Teradata software runs under PDE. All share the resources of CPU and memory on the node. Teradata software, the LAN gateway, and channel-driver software run as processes. AMPs and PEs are virtual processors (vprocs) which run under Parallel Database Extensions (PDE). AMPs are associated with virtual disks (vdisks). A single node is called a Symmetric Multi-Processor (SMP).

21

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Massively Parallel Processing

22

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Massively Parallel Processing contd


When multiple SMP nodes are connected to form a larger configuration,we refer to this as a Massively Parallel Processing (MPP) system. Multiple nodes are configured into a Massively Parallel Processing (MPP) system. A physical message-passing layer called the BYNET is used to interconnect multiple nodes. Teradata is linearly expandableas your database grows, additional nodes may be added. The BYNET can support 512 nodes.

23

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Clique

24

2010 Wipro Ltd - Confidential

Clique

25

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Clique contd
A clique is a set of Teradata nodes that share a common set of disk arrays. Cabling a subset of nodes to the same disk arrays creates a clique. A clique is a defined set of nodes with failover capability. All nodes in a clique are able to access the vdisks of all AMPs in the clique. If a node fails, its vprocs will migrate to the remaining nodes in the clique. Each node can support 128 vprocs

26

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Major components of Teradata system

27

2010 Wipro Ltd - Confidential

Major Components of a Teradata System


Parsing Engine (PE) : The Parsing Engine interprets the SQL command and converts the data record from the host into an AMP message Managing Individual sessions (up to 120) Parsing and optimizing SQL requests Dispatching the optimized plan to the AMPs Sending the answer set response back to the requesting client

28

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Major Components of a Teradata System contd


AMP : The AMP is a virtual processor (vproc) designed for and dedicated to managing a portion of the entire database. It performs all database management functions such as sorting, aggregating, and formatting data. The AMP receives data from the PE, formats rows, and distributes them to the disk storage units it controls. The AMP also retrieves the rows requested by the Parsing Engine. Storing and retrieving rows to and from the disks Lock Management Sorting rows and aggregating columns Join Processing Output conversions and formatting

29

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Major Components of a Teradata System contd


BYNET : The BYNET is the message-passing layer. It determines which AMP(s) (Access Module Processor) should receive a message. Point-to-point communications between nodes and virtual processors Merging answer sets back to the PE making Teradata parallelism possible

30

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Summary

31

2010 Wipro Ltd - Confidential

Summary
SMP and MMP. Components of Teradata System. Clique

32

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Q&A

33

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

Thank You
Amruta Patil
ADM Support Amrutha.patil@wipro.com

2010 Wipro Ltd - Confidential

References

v http://www.teradata.com/

35

2010 Wipro Ltd - Confidential 2009 Wipro Ltd - Confidential

S-ar putea să vă placă și