La présentation est en train de télécharger. S'il vous plaît, attendez

La présentation est en train de télécharger. S'il vous plaît, attendez

Parallel Data Warehouse: The Data Warehouse Consolidation Appliance

Présentations similaires


Présentation au sujet: "Parallel Data Warehouse: The Data Warehouse Consolidation Appliance"— Transcription de la présentation:

1

2 Parallel Data Warehouse: The Data Warehouse Consolidation Appliance
Lionel Pénuchot PDW Center of Excellence

3 Agenda Aperçu de SQL Server PDW SSAS + PDW: la dream team
Consolidation Les principales évolutions de l’appliance Une démo peut-être ?

4 Configuration HP Full Rack 2 Rack 3 Rack 3/4 Rack 1 1/2 Rack 1/2 Rack
PDW Backplane (6U): Redundant Infiniband Redundant Ethernet Management and control (Active) Rack Failover Node (Passive) Extension Base Unit (5U): Redundant Infiniband Redundant Ethernet Rack Failover Node (Passive) Extension Base Unit (5U): Redundant Infiniband Redundant Ethernet Rack Failover Node (Passive) Infiniband Ethernet Control Node Failover Node Infiniband Ethernet Failover Node Infiniband Ethernet Failover Node Reserved Space Reserved Space (9U) Data Integration Platform server Passive Unit (adds Failover Node) Future expansion Reserved Space Reserved Space (9U) Data Integration Platform server Passive Unit (adds Failover Node) Future expansion Reserved Space Reserved Space (8U) Data Integration Platform server Passive Unit (adds Failover Node) Future expansion JBOD 4 Compute Node 7 Compute Node 8 Scale Unit (7U): 2 HP 1U servers (16 cores/ea. Total: 32) JBOD 5U 1 TB drives User data capacity: 75 TB JBOD 8 Compute Node 15 Compute Node 16 Scale Unit (7U): 2 HP 1U servers (16 cores/ea. Total: 32) JBOD 5U 1 TB drives User data capacity: 75 TB JBOD 12 Compute Node 23 Compute Node 24 60TB (Raw) Full Rack Scale Unit (7U): 2 HP 1U servers (16 cores/ea. Total: 32) JBOD 5U 1 TB drives User data capacity: 75 TB 120.8TB (Raw) 2 Rack 181.2TB (Raw) 3 Rack JBOD 3 Compute Node 5 Compute Node 6 Scale Unit (7U): 2 HP 1U servers (16 cores/ea. Total: 32) JBOD 5U 1 TB drives User data capacity: 75 TB JBOD 7 Compute Node 13 Compute Node 14 Scale Unit (7U): 2 HP 1U servers (16 cores/ea. Total: 32) JBOD 5U 1 TB drives User data capacity: 75 TB JBOD 11 Compute Node 21 Compute Node 22 45.3TB (Raw) 3/4 Rack Scale Unit (7U): 2 HP 1U servers (16 cores/Ea. Total: 32) JBOD 5U 1 TB drives User data capacity: 75 TB JBOD 2 Compute Node 3 Compute Node 4 Scale Unit (7U): 2 HP 1U servers (16 cores/ea. Total: 32) JBOD 5U 1 TB drives User data capacity: 75 TB JBOD 6 Compute Node 11 Compute Node 12 Scale Unit (7U): 2 HP 1U servers (16 cores/ea. Total: 32) JBOD 5U 1 TB drives User data capacity: 75TB JBOD 10 Compute Node 19 Compute Node 20 30.2TB (Raw) 1/2 Rack Scale Unit (7U): 2 HP 1U servers (16 cores/Ea. Total: 32) JBOD 5U 1 TB drives User data capacity: 75 TB 90.6TB (Raw) 1 1/2 Rack JBOD 1 Compute Node 1 Compute Node 2 Base Unit (7U): 2 HP 1U servers (16 cores/ea. Total: 32) JBOD 5U 1 TB drives User data capacity: 75 TB JBOD 5 Compute Node 9 Compute Node 10 Extension Base Unit (7U): 2 HP 1U servers (16 cores/ea. Total: 32) JBOD 5U 1 TB drives User data capacity: 75 TB JBOD 9 Compute Node 17 Compute Node 18 Extension Base Unit (7U): 2 HP 1U servers (16 cores/ea. Total: 32) JBOD 5U 1 TB drives User data capacity: 75 TB 1¼ Rack 75.5TB (Raw) ¼ Rack 15.1TB (Raw)

5 Dell configuration Full Rack 2/3 Rack 1/3 Rack 67.9TB (Raw)
Infiniband Ethernet Control Node Failover Node Base Unit (6U): Redundant Infiniband Redundant Ethernet Management and Control (Active) Rack Failover Node (Passive) Reserved Use Reserved Space (6U) Passive Unit (adds Failover Node) Future expansion JBOD 5 Compute Node 8 Compute Node 9 JBOD 6 Compute Node 7 Scale Unit (10U): 3 servers in 2U enclosure (16 cores/ea. Total: 48) 2 JBOD 4U ea. 1 TB drives User data capacity: 79 TB 67.9TB (Raw) Full Rack JBOD 3 Compute Node 5 Compute Node 6 JBOD 4 Compute Node 4 Scale Unit (10U): 3 servers in 2U enclosure (16 cores/ea. Total: 48) 2 JBOD 4U ea. 1 TB drives User data capacity: 79 TB 45.3TB (Raw) 2/3 Rack JBOD 1 Compute Node 2 Compute Node 3 JBOD 2 Compute Node 1 Base Unit (10U): 3 servers in 2U enclosure (16 cores/ea. Total: 48) 2 JBOD 4U ea. 1 TB drives User data capacity: 79 TB 22.6TB (Raw) 1/3 Rack JBOD 2 Compute Node 2 Compute Node 3 JBOD 1 Compute Node 1

6 Architecture logiciel
Window Server 2012 Standard PDW engine DMS Manager SQL Server 2012 Enterprise Edition (PDW build) Shell databases Détails généraux Tous les serveurs sont sous Windows Server 2012 Standard Toutes les VMs sont sous Windows Server 2012 Standard Les tâches de type fabric et workload sont dans des VMs Fabric VM, MAD01, and CTL partagent un serveur Cela permet de réduire le surcoût spécialement pour les petites configurations L’agent PDW tourne sur tous les hosts et toutes les VMs DWConfig and Admin Console existent toujours La technologie Windows Storage Spaces gère le mirroring et les secours, ainsi cela permet de réduire les coûts en utilisant du DAS (JBODs) plutôt que du SAN Détails du moteur PDW SQL Server 2012 Enterprise Edition (PDW build) sur le control node et sur les compute nodes Détails sur le stockage Similaire à la V1 Double de datafiles par filegroup Plus de disques physiques en parallèle Détails logiciels HST01 CTL MAD FAD VMM Base unit HST02 HSA01 JBOD Compute 1 HSA02 IB and Ethernet Compute 2 Direct attached SAS Window Server 2012 Standard DMS Core SQL Server 2012 Enterprise Edition (PDW build)

7 Extensions possibles 2–56 nodes 15 TB–1.3 PB raw Up to 6 PB user data
HP Base Active Compute Incr. Spare Total Raw disk: 1TB Raw disk: 3TB Capacity Quarter-rack 1 2 N/A 4 15.1 45.3 TB Half 100% 6 30.2 90.6 TB Three-quarters 50% 8 135.9 TB Full rack 3 33% 10 60.4 181.2 TB One-&-quarter 25% 13 75.5 226.5 TB One-&-half 12 20% 15 271.8 TB Two racks 16 19 120.8 362.4 TB Two and a half 7 20 24 151 453 TB Three racks 9 28 543.6 TB Four racks 32 37 241.6 724.8 TB Five racks 5 40 46 302 906 TB Six racks 18 48 55 1087.2 TB Seven racks 21 56 17% 64 422.8 1268.4 TB 2–56 nodes 15 TB–1.3 PB raw Up to 6 PB user data DELL Base Active Compute Capacity inc. Spare Total Raw disk: 1TB Raw disk: 3TB Capacity One third-rack 1 3 N/A 5 22.65 67.95 TB Two thirds 6 100% 8 45.3 135.9 TB Full rack 2 9 50% 11 203.85 TB One & third 12 33% 15 90.6 271.8 TB One & two thirds 25% 18 113.25 339.75 TB Two racks 4 20% 21 407.7 TB Two & a third 17% 25 158.55 475.65 TB Two & two thirds 24 14% 28 181.2 543.6 TB Three racks 27 13% 31 611.55 TB Four racks 36 41 815.4 TB Five racks 10 45 51 TB Six racks 54 61 1223.1 TB 2–3 node increments for small topologies

8 SSAS + PDW: la dream team
PDW + MOLAP PDW + ROLAP PDW + TABULAR Direct Query

9 Les prérequis SSAS EnableRolapDistinctCountOnDataSource=1
ROLAPDimensionProcessingEffort>=300000 Connexion via le client natif SQL Server SNAC 11.

10 Prérequis PDW Inventaire des cubes et de leurs Data Warehouse
Migration vers PDW Répliquer les tables de dimensions Distribuer les tables de faits Distribuer les tables de dimensions changeantes Pas d’index CCI (Clustered Column Store Index)

11 PDW + MOLAP Gain processing Gain stockage
Les temps de processing peuvent être réduits à 10% du temps initial. Le processing tire antièrement partie du réseau Infiniband de l’appliance Cubes et dimensions inchangés Gain stockage Pas d’index Stockage In-memory columnstore .

12 PDW + ROLAP Gain processing Gain stockage Processing immédiat
Pas de données stockée par SSAS Pas d’index Stockage In-memory columnstore .

13 PDW + TABULAR Direct Query
Gain processing Processing immédiat Gain stockage Pas de données stockée par SSAS Pas d’index Clustered column store index. Plusieurs modes DirectQuery InMemoryWithDirectQuery DirectQueryWithinMemory

14 Exemple de consolidation 1
PDW User space SSAS ROLAP MOLAP TABULAR IB

15 Example de consolidation 2
PDW User space SSAS IB

16 Example de consolidation 3
PDW User space Sharepoint Excel Services IB

17 Les principales évolutions de l’appliance
Appliance Update Fréquence : tous les 6 mois

18 Région dédiée Hadoop SQL Server Parallel Data Warehouse
Appliance préassemblée et préconfigurée Massively Parallel Processing (MPP) jusqu’à 6 PBs In-memory columnstore : jusqu’à 100x plus rapide Région dédiée Hadoop Requêtes SQL sur données relationnelles et Hadoop Disponible auprès de HP et Dell

19 Des Terabytes aux Multi-Petabytes
Extension du stockage en petabytes Massively Parallel Processing (MPP) parallelise les requêtes Multiple noeuds avec CPU, mémoire et stockage dédiés Ajout incrémental de HW pour une augmentation quasi linéaire en volume et performance Ajout de capacité par incrément dans la même appliance Scale OUT Des Terabytes aux Multi-Petabytes

20 Différentes options de déploiement et solutions hybrides
Box Software Appliances Cloud SQL Server Parallel Data Warehouse SQL Server for data warehousing in Windows Azure VMs SQL Server Fast Track HDInsight for Windows Azure Hortonworks Data Platform

21 “Big Data” avec simplicité
Non- relational Explorer les données non-relationnelles Hadoop cluster in HDP for Windows and HDInsight Gère les données non-relationnelles 100% Apache Simplicité of Windows Hadoop sous toutes les formes : software, appliance, cloud Windows Azure Parallel Data Warehouse Hortonworks Data Platform Key goal of slide: Describe the three ways a customer can deploy Big Data from Microsoft. Slide talk track: To give customers the Microsoft stack of solutions from software (HDP for Windows, appliance (PDW v2 AU1), cloud (Windows Azure HDInsight). Windows Azure HDInsight – Microsoft’s cloud based Hadoop distribution. It is 100% Apache compatible distribution that is available on Windows Azure. SQL Server Parallel Data Warehouse – Parallel Data Warehouse will have a Hadoop region where it will have HDInsight on Windows Server integrated in the appliance. Hortonworks Data Platform For Windows – Hortonworks created HDP for Windows to have their Hadoop distribution available as a software offering available on Windows. These 3 “Big Data” offerings on software, appliance, cloud brings the simplicity of Windows to manage your non-relational data with simplicity. “Big Data” avec simplicité

22 Intégration relationel et non-relationel
Requête integrant PolyBase in SQL PDW SQL PolyBase Résultat Requête relationelle et Hadoop en parallèle Requête unique Pa besoin d’ETL d’Hadoop vers DW Requête Hadoop à l’aide de T-SQL Key goal of slide: Describe the three ways a customer can deploy Big Data from Microsoft. Slide talk track: Pioneered in the Jim Gray Systems Labs by David DeWitt, PolyBase is an integrated query processor in SQL Server 2012 Parallel Data Warehouse which represents a breakthrough innovation from traditional query processing to join structured and unstructured data from Hadoop together. Without manual intervention, PolyBase Query Processor can accept a standard SQL query and combine tables from a relational source with tables from a Hadoop source directly through external tables.  As well, PolyBase Query Processor parallelizes the ability to import/export data to and from Hadoop giving PDW speed, simplicity, and responsiveness in addressing these new types of queries. Ability to issue standard T-SQL that joins relational data with unstructured data in Hadoop PolyBase rapidly imports/exports data between Hadoop and PDW in parallel PolyBase can query data in Hadoop directly without movement (with external tables) Created in “Gray Systems Labs” by David DeWitt Requête relationelle + non relationelle Données relationelles

23 Démo

24 Questions ?

25 Merci à nos sponsors

26 Références

27


Télécharger ppt "Parallel Data Warehouse: The Data Warehouse Consolidation Appliance"

Présentations similaires


Annonces Google