Skip to main content

PNNL

  • About
  • News & Media
  • Careers
  • Events
  • Research
    • Scientific Discovery
      • Biology
        • Chemical Biology
        • Computational Biology
        • Ecosystem Science
        • Human Health
          • Cancer Biology
          • Exposure Science & Pathogen Biology
        • Integrative Omics
          • Advanced Metabolomics
          • Chemical Biology
          • Mass Spectrometry-Based Measurement Technologies
          • Spatial and Single-Cell Proteomics
          • Structural Biology
        • Microbiome Science
          • Biofuels & Bioproducts
          • Human Microbiome
          • Soil Microbiome
          • Synthetic Biology
        • Predictive Phenomics
      • Chemistry
        • Computational Chemistry
        • Chemical Separations
        • Chemical Physics
        • Catalysis
      • Earth & Coastal Sciences
        • Global Change
        • Atmospheric Science
          • Atmospheric Aerosols
          • Human-Earth System Interactions
          • Modeling Earth Systems
        • Coastal Science
        • Ecosystem Science
        • Subsurface Science
        • Terrestrial Aquatics
      • Materials Sciences
        • Materials in Extreme Environments
        • Precision Materials by Design
        • Science of Interfaces
        • Solid Phase Processing
          • Cold Spray
          • Friction Stir Welding & Processing
          • ShAPE
      • Nuclear & Particle Physics
        • Dark Matter
        • Fusion Energy Science
        • Neutrino Physics
      • Quantum Information Sciences
    • Energy Resiliency
      • Electric Grid Modernization
        • Emergency Response
        • Grid Analytics
          • AGM Program
          • Tools and Capabilities
        • Grid Architecture
        • Grid Cybersecurity
        • Grid Energy Storage
        • Transmission
        • Distribution
      • Energy Efficiency
        • Appliance and Equipment Standards
        • Building Energy Codes
        • Building Technologies
          • Advanced Building Controls
          • Advanced Lighting
          • Building-Grid Integration
        • Building and Grid Modeling
        • Commercial Buildings
        • Federal Buildings
          • Federal Performance Optimization
          • Resilience and Security
        • Residential Buildings
          • Building America Solution Center
          • Energy Efficient Technology Integration
          • Home Energy Score
        • Energy Efficient Technology Integration
      • Energy Storage
        • Electrochemical Energy Storage
        • Flexible Loads and Generation
        • Grid Integration, Controls, and Architecture
        • Regulation, Policy, and Valuation
        • Science Supporting Energy Storage
        • Chemical Energy Storage
      • Environmental Management
        • Waste Processing
        • Radiation Measurement
        • Environmental Remediation
      • Fossil Energy
        • Subsurface Energy Systems
        • Carbon Management
          • Carbon Capture
          • Carbon Storage
          • Carbon Utilization
        • Advanced Hydrocarbon Conversion
      • Nuclear Energy
        • Fuel Cycle Research
        • Advanced Reactors
        • Reactor Operations
        • Reactor Licensing
      • Renewable Energy
        • Solar Energy
        • Wind Energy
          • Wind Resource Characterization
          • Wildlife and Wind
          • Community Values and Ocean Co-Use
          • Wind Systems Integration
          • Wind Data Management
          • Distributed Wind
        • Marine Energy
          • Environmental Monitoring for Marine Energy
          • Marine Biofouling and Corrosion
          • Marine Energy Resource Characterization
          • Testing for Marine Energy
          • The Blue Economy
        • Hydropower
          • Environmental Performance of Hydropower
          • Hydropower Cybersecurity and Digitalization
          • Hydropower and the Electric Grid
          • Materials Science for Hydropower
          • Pumped Storage Hydropower
          • Water + Hydropower Planning
        • Grid Integration of Renewable Energy
        • Geothermal Energy
      • Transportation
        • Bioenergy Technologies
          • Algal Biofuels
          • Aviation Biofuels
          • Waste-to-Energy and Products
        • Hydrogen & Fuel Cells
        • Vehicle Technologies
          • Emission Control
          • Energy-Efficient Mobility Systems
          • Lightweight Materials
          • Vehicle Electrification
          • Vehicle Grid Integration
    • National Security
      • Chemical & Biothreat Signatures
        • Contraband Detection
        • Pathogen Science & Detection
        • Explosives Detection
        • Threat-Agnostic Biodefense
      • Cybersecurity
        • Discovery and Insight
        • Proactive Defense
        • Trusted Systems
      • Nuclear Material Science
      • Nuclear Nonproliferation
        • Radiological & Nuclear Detection
        • Nuclear Forensics
        • Ultra-Sensitive Nuclear Measurements
        • Nuclear Explosion Monitoring
        • Global Nuclear & Radiological Security
      • Stakeholder Engagement
        • Disaster Recovery
        • Global Collaborations
        • Legislative and Regulatory Analysis
        • Technical Training
      • Systems Integration & Deployment
        • Additive Manufacturing
        • Deployed Technologies
        • Rapid Prototyping
        • Systems Engineering
      • Threat Analysis
        • Advanced Wireless Security
          • 5G Security
          • RF Signal Detection & Exploitation
        • Internet of Things
        • Maritime Security
        • Millimeter Wave
        • Mission Risk and Resilience
    • Data Science & Computing
      • Artificial Intelligence
      • Graph and Data Analytics
      • Software Engineering
      • Computational Mathematics & Statistics
      • Future Computing Technologies
        • Adaptive Autonomous Systems
    • Publications & Reports
    • Featured Research
  • People
    • Inventors
    • Lab Leadership
    • Lab Fellows
    • Staff Accomplishments
  • Partner with PNNL
    • Education
      • Undergraduate Students
      • Graduate Students
      • Post-graduate Students
      • University Faculty
      • University Partnerships
      • K-12 Educators and Students
      • STEM Education
        • STEM Workforce Development
        • STEM Outreach
        • Meet the Team
      • Internships
    • Community
      • Regional Impact
      • Philanthropy
      • Volunteering
    • Industry
      • Available Technologies
      • Industry
      • Industry Partnerships
      • Licensing & Technology Transfer
      • Entrepreneurial Leave
      • Visual Intellectual Property Search (VIPS)
  • Facilities & Centers
    • All Facilities
      • Atmospheric Radiation Measurement User Facility
      • Electricity Infrastructure Operations Center
      • Energy Sciences Center
      • Environmental Molecular Sciences Laboratory
      • Grid Storage Launchpad
      • Institute for Integrated Catalysis
      • Interdiction Technology and Integration Laboratory
      • PNNL Portland Research Center
      • PNNL Seattle Research Center
      • PNNL-Sequim (Marine and Coastal Research)
      • Radiochemical Processing Laboratory
      • Shallow Underground Laboratory

Advanced Memory to Support Artificial Intelligence for Science (AMAIS)

  • Key Personnel
  • Research Thrusts
    • High-Level Synthesis Thrust
    • Performance Analysis Thrust
    • ML Benchmark Suite Thrust
    • Scale Up Thrust
    • Scale Out Thrust
    • Data Privacy, Security, and Confidentiality Thrust
    • Theory Thrust
  • News
  • Publications

Breadcrumb

  1. Home
  2. Projects
  3. Advanced Memory to Support Artificial Intelligence for Science (AMAIS)

Scale Out Thrust

Related Researchers

Joseph Manzano

Joseph Manzano

Computer Scientist
Photo of Joshua Suetterlein

Joshua Suetterlein

Computer Scientist

The rise of machine learning (ML) and artificial intelligence (AI) promises the automatization of specific tasks and increased productivity that will rival other technological revolutions. However, because of the ever-growing demands of computational performance, memory/storage capacity, and energy consumption, ML/AI can be classified as a high-performance workflow. As part of this family, they face similar challenges to other sibling workflows, e.g., better utilization of resources and concurrency, network contention at specific all-to-all communication points, while introducing more nuances challenges, such as vast sets of weights and biases which might overwhelm a single computing device, a long-time, usually expensive, investment when training. Moreover, introducing extremely heterogeneous computational substrates (i.e., computing nodes composed of several CPUs plus different accelerators, all communicating through intra or inter-network fabrics) makes the efficient porting these workflows even more difficult.

The concept of disaggregated memory, when applied in the context of high-performance workflows, can bridge the gap between 'regular' and 'expert' programmers. It provides a highly productive abstraction of the distributed memory system, offering a potential solution to the challenges posed by these workflows. However, to fully exploit the performance opportunities it presents, a collaborative effort between users and the system software is crucial. This thrust is dedicated to addressing the challenges that arise from the interaction of these concurrent actors at scale. Our focus is on the connections to higher scientific programming languages (in our case OpenMP) and the meso layers of the LLVM infrastructure to target asynchronous many task runtime systems with sophisticated distributed memory models.

The compiler and runtime are integral to addressing the challenges of high-performance workflows. From the compiler's perspective, we aim to establish a connection to higher-level scientific languages, such as OpenMP, and to evolve classical compiler analysis to introduce concepts such as remote, local, and fast memory under an abstract machine view. These optimizations are guided by the capabilities of the underlying runtime system and its memory model. On the runtime front, the disaggregated memory abstraction necessitates a redesign of some essential features in the underlying layer to better exploit the different memory levels available under this paradigm. This design includes determining the most effective channels of communication between memory regions, effective collective operations, and synchronization at scale. These design decisions evolve in a lock step, iterative fashion between the two components with information from other trusts (especially the scale up one).

As part of the iterative process, we have selected mini workflows that will inform our design decisions. The workflows of interest were selected based on their memory use, either by having "interesting" access patterns or large capacity requirements, their concurrency characteristics, and their relative simplicity. The first of these mini workflows is based on gradient-boosted decision trees / random forests, in which auxiliary structures (like histograms) can use more memory for observations to create a trade-off between accuracy and memory capacity. Moreover, because the data structure depends on the input data's entropy, the access patterns between nodes can result in remote operations.

We selected a nascent proxy application derived from PanGenomics for the second mini workflow. The team uses the Giraffe framework as the base application to extract the memory and concurrency requirements. The sequence matching algorithm matches many genetic sequences against a Burrows-Wheeler transform Graph (a genome graph representing a compressed genome version). This sequence stream creates concurrency while presenting interesting memory patterns since it must follow the graph structure to make the match.

These trusts have the following sub-trusts associated with them. We are developing a characterization of both mini workflows in terms of memory behavior and how concurrency affects them. We are producing a prototype implementation of the compiler infrastructure that uses the distributed runtime as its target, generated from a restricted OpenMP code. We are redesigning the runtime in terms of disaggregated memory, focusing on synchronization at scale. We consider integrating memory-centric optimization in the compiler layers while targeting a distributed memory model.

Publications

To appear in BioSys 2024 (In Conjunction with ASPLOS 2024). April 27, 2024. Jessica Imlau Dagostini, Scott Beamer, Tyler Sorensen, (University of California Santa Cruz), Joseph Manzano, and Andres Marquez, (Pacific Northwest National Laboratory) “Developing a Proxy Application Based on a Parallel Pangenome Mapping Tool.” Extended Abstract.

PNNL

  • Get in Touch
    • Contact
    • Careers
    • Doing Business
    • Environmental Reports
    • Security & Privacy
    • Vulnerability Disclosure Policy
  • Research
    • Scientific Discovery
    • Energy Resiliency
    • National Security
Subscribe to PNNL News
Department of Energy Logo Battelle Logo
Pacific Northwest National Laboratory (PNNL) is managed and operated by Battelle for the Department of Energy
  • YouTube
  • Facebook
  • X (formerly Twitter)
  • Instagram
  • LinkedIn