Introduction to HPC with MPI for Data Science

Nonfiction, Computers, General Computing, Programming
Cover of the book Introduction to HPC with MPI for Data Science by Frank Nielsen, Springer International Publishing
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart
Author: Frank Nielsen ISBN: 9783319219035
Publisher: Springer International Publishing Publication: February 3, 2016
Imprint: Springer Language: English
Author: Frank Nielsen
ISBN: 9783319219035
Publisher: Springer International Publishing
Publication: February 3, 2016
Imprint: Springer
Language: English

This gentle introduction to High Performance Computing (HPC) for Data Science using the Message Passing Interface (MPI) standard has been designed as a first course for undergraduates on parallel programming on distributed memory models, and requires only basic programming notions.

Divided into two parts the first part covers high performance computing using C++ with the Message Passing Interface (MPI) standard followed by a second part providing high-performance data analytics on computer clusters.

In the first part, the fundamental notions of blocking versus non-blocking point-to-point communications, global communications (like broadcast or scatter) and collaborative computations (reduce), with Amdalh and Gustafson speed-up laws are described before addressing parallel sorting and parallel linear algebra on computer clusters. The common ring, torus and hypercube topologies of clusters are then explained and global communication procedures on these topologies are studied. This first part closes with the MapReduce (MR) model of computation well-suited to processing big data using the MPI framework.

In the second part, the book focuses on high-performance data analytics. Flat and hierarchical clustering algorithms are introduced for data exploration along with how to program these algorithms on computer clusters, followed by machine learning classification, and an introduction to graph analytics. This part closes with a concise introduction to data core-sets that let big data problems be amenable to tiny data problems.

Exercises are included at the end of each chapter in order for students to practice the concepts learned, and a final section contains an overall exam which allows them to evaluate how well they have assimilated the material covered in the book.

View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart

This gentle introduction to High Performance Computing (HPC) for Data Science using the Message Passing Interface (MPI) standard has been designed as a first course for undergraduates on parallel programming on distributed memory models, and requires only basic programming notions.

Divided into two parts the first part covers high performance computing using C++ with the Message Passing Interface (MPI) standard followed by a second part providing high-performance data analytics on computer clusters.

In the first part, the fundamental notions of blocking versus non-blocking point-to-point communications, global communications (like broadcast or scatter) and collaborative computations (reduce), with Amdalh and Gustafson speed-up laws are described before addressing parallel sorting and parallel linear algebra on computer clusters. The common ring, torus and hypercube topologies of clusters are then explained and global communication procedures on these topologies are studied. This first part closes with the MapReduce (MR) model of computation well-suited to processing big data using the MPI framework.

In the second part, the book focuses on high-performance data analytics. Flat and hierarchical clustering algorithms are introduced for data exploration along with how to program these algorithms on computer clusters, followed by machine learning classification, and an introduction to graph analytics. This part closes with a concise introduction to data core-sets that let big data problems be amenable to tiny data problems.

Exercises are included at the end of each chapter in order for students to practice the concepts learned, and a final section contains an overall exam which allows them to evaluate how well they have assimilated the material covered in the book.

More books from Springer International Publishing

Cover of the book Improving Service Level Engineering by Frank Nielsen
Cover of the book Vitamin D in Chronic Kidney Disease by Frank Nielsen
Cover of the book Climate Change Impacts and Adaptation Strategies for Coastal Communities by Frank Nielsen
Cover of the book Voting Power and Procedures by Frank Nielsen
Cover of the book Forbidden Football in Ceausescu’s Romania by Frank Nielsen
Cover of the book Intelligent Transportation Systems by Frank Nielsen
Cover of the book Congenital Müllerian Anomalies by Frank Nielsen
Cover of the book Visualising the Charge and Cooper-Pair Density Waves in Cuprates by Frank Nielsen
Cover of the book Computer-Assisted and Robotic Endoscopy by Frank Nielsen
Cover of the book The Governance of Private Security by Frank Nielsen
Cover of the book Performing Music History by Frank Nielsen
Cover of the book Handbook of Theory and Practice of Sustainable Development in Higher Education by Frank Nielsen
Cover of the book Computational Science and Its Applications – ICCSA 2018 by Frank Nielsen
Cover of the book Atrial Fibrillation and Percutaneous Coronary Intervention by Frank Nielsen
Cover of the book The Science of Lay Theories by Frank Nielsen
We use our own "cookies" and third party cookies to improve services and to see statistical information. By using this website, you agree to our Privacy Policy