HPC Network Interface Software/Hardware Co-Design

Posted on May 7, 2014

Date: Thursday, May 8, 2014
Time: 11:00 am – 12:00 pm
Place: Mechanical Engineering 218

Ron Brightwell
Scalable System Software Department
Sandia National Laboratories

The Portals project has entered a third decade of research and development in scalable, high-performance networking for large-scale scientific parallel computing systems. Portals has evolved from its inception as a component of early lightweight operating systems to become an important vehicle for interconnect exploration. Unlike most user-level network programming interfaces, Portals employs a building

block approach that encapsulates the semantic requirements of a broad range of upper-level protocols needed to support high-performace computing applications and services. This approach has also enabled hardware designers to focus on developing components that accelerate key functions in Portals, facilitating the software/hardware co-design process. This talk will provide an overview of the latest
version of the Portals interconnect API and describe research activities aimed at exploiting some recently added capabilities.


Ron Brightwell currently manages the Scalable System Software Department at Sandia National Laboratories. He joined Sandia in 1995 after receiving his BS in mathematics and his MS in computer science from Mississippi State University. While at Sandia, he has designed and developed software for lightweight compute node operating systems and high-performance networks on several large-scale massively parallel systems, including the Intel Paragon and TeraFLOPS, and the Cray T3 and XT series of machines. He has authored more than 75 peer-reviewed journal, conference, and workshop publications. His research interests include high-performance, scalable communication interfaces and protocols for system area networks, operating systems for massively parallel processing machines, and parallel program performance analysis libraries and tools. He is a Senior Member of the IEEE and the ACM.

Read More

Big Data Challenges in Simulation-based Science

Posted on Apr 28, 2014


Data-related challenges are quickly dominating computational and data-enabled sciences, and are limiting the potential impact of scientific applications enabled by current and emerging high-performance
distributed computing environments. These data-intensive application workflows involve dynamic coordination, interactions and data coupling between multiple application process that run at scale on different resources, and with services for monitoring, analysis and visualization and archiving. In this talk I will explore data grand challenges in simulation-based science and investigate how solutions based on data sharing abstractions, managed data pipelines, in-memory data-staging, in-situ placement and execution, and in-transit data processing can be used to address these data challenges at extreme scales.

Manish Parashar is Professor of Electrical and Computer Engineering at Rutgers University. He is the founding Director of the Rutgers Discovery Informatics Institute (RDI2) and site Co-Director of the NSF Cloud and Autonomic Computing Center (CAC). Manish received a BE degree from Bombay
University, India, and MS and Ph.D. degrees from Syracuse University. His research interests are in the broad areas of Parallel and Distributed Computing and Computational and Data-Enabled Science and Engineering. A key focus of his research is on addressing the complexity or large-scale systems and applications through programming abstractions and systems. Manish serves on the editorial boards and organizing committees of a large number of journals and international conferences and workshops, and has deployed several software systems that are widely used. He has also received a number of awards and is Fellow of AAAS, Fellow of IEEE/IEEE Computer Society and Senior Member of ACM. For more information please visit http://parashar.rutgers.edu/.

Read More