Skip to main content

Publications

Discover more of our presentations and articles over the past years.

*Please note that older presentations may contain outdated information.*

Presentations from SC24, November 24

Pdf icon1
Slinky: The Missing Link Between Slurm and Kubernetes

Skyler Malinowski & Tim Wickberg, SchedMD

Pdf icon1
Slurm Community Birds-of-a-Feather

Tim Wickberg & Danny Auble, SchedMD

Presentations from Slurm User Group Meeting, September 2024

Pdf icon1
ORNL Site Report & Feature Discussion

Matt Ezell and Paul Peltz, Oak Ridge National Laboratory

Pdf icon1
Bringing in Robust, Memory-Driven Affinity to Slurm

Edgar A. León, Lawrence Livermore National Laboratory

Pdf icon1
Step Management Enhancements

Felip Moll, Oriol Vilarrubí, and Brian Christiansen, SchedMD

Pdf icon1
Site Report: Jump Trading

Matthieu Hautreux and Larry Pezzaglia, Jump Trading

Pdf icon1
The Evolution of Slurm at CSCS: From Monolithic Service to Multi-tenant vService

Gennaro Oliva, CSCS

Pdf icon1
Slinky – Slurm Operator

Skyler Malinowski, Alan Mutschelknaus, and Marlow Warnicke, SchedMD

Pdf icon1
No-Touch Administration: Managing Slurm at Scale

Dr. Urban Borštnik, ETH Zürich

Pdf icon1
TrailblazingTurtle: A Comprehensive Web Portal for Maximizing HPC Resource Utilization

Simon Guilbault, Université Laval

Pdf icon1
Field Notes 8: How to Make the Most of Slurm, and Avoid Common Issues

Alejandro Sánchez, SchedMD

Pdf icon1
Enabling Event-Driven Workflows With AWS and the Slurm API

Cory Lueninghoener (Sandia National Laboratory), Lowell Wofford (AWS)

Pdf icon1
Gaining More Control Over Node Scheduling with the Topology/Block Plugin

Vasileios Karakasis, Felix Abecassis, Craig Tierney, and Douglas Wightman, NVIDIA

Pdf icon1
Improving Job Throughput in HPC with Adaptive Time Limit Management

Thomas Jakobsche, University of Basel

Pdf icon1
Slurm on SuperMUC-NG at LRZ

Dr. Alexander Block, Leibniz Supercomputing Centre (LRZ)

Pdf icon1
Slinky – Slurm Bridge

Skyler Malinowski, Alan Mutschelknaus, and Marlow Warnicke, SchedMD

Pdf icon1
Slurm Wiki and Tools – a Niflheim site report

Dr. Ole Helm Nielsen, Technical University of Denmark (DTU)

Pdf icon1
Maximizing HPC Efficiency for Ansys Simulations: Addressing Critical IT Concerns with Slurm Resource Management and Scheduling

David Clifton and Morten Loderup, Ansys

Pdf icon1
Magic Castle: Canadian HPC as a Service

Félix-Antoine Fortin, Digital Research Alliance of Canada

Pdf icon1
Slurm 24.05, 24.11, and Beyond

Danny Auble, SchedMD

Presentations from SC23, November 23

Pdf icon1
Slurm and/or/vs Kubernetes

Tim Wickberg, SchedMD

Pdf icon1
Slurm 23.02, 23.11, and Beyond

Tim Wickberg, SchedMD

Presentations from Slurm User Group Meeting, September 2023

Pdf icon1
Keynote: Improving Quinoa Through the Development of Genetic and Genomic Resources

David Jarvis, Brigham Young University

Pdf icon1
Never Use Slurm HA Again: Solve All Your Problems with Kubernetes

Chris Samuel and Doug Jacobsen, NERSC

Pdf icon1
Containers in Slurm

Scott Hilton, SchedMD

Pdf icon1
Build a Flexible and Powerful High Performance Computing Foundation with Google Cloud

Volker Eyrich (Google) and Joshua Fryer (Recursion)

Pdf icon1
Demand Driven Cluster Elasticity

Mike Fazio, Dow

Pdf icon1
Field Notes 7 – How to Make the Most of Slurm and Avoid Common Issues

Jason Booth, SchedMD

Pdf icon1
Accelerating Genomics Research Machine Learning with Slurm

Willy Markuske, San Diego Supercomputing Center (SDSC)

Pdf icon1
Saving Power with Slurm

Ole Nielsen, Technical University of Denmark (DTU)

Pdf icon1
Running Flux in Slurm

Ryan Day, LLNL

Pdf icon1
Site Report: CINECA Experience with Slurm

Alessandro Marani, CINECA

Pdf icon1
Step Management Enhancements

Brian Christiansen, SchedMD

Pdf icon1
System and Job Scheduling Simulation for Enhancing Production HPC

Vivian Hafener, LANL

Pdf icon1
Site Update: Georgia Institute of Technology

Marian Zvada and Aaron Jezghani, Georgia Tech

Pdf icon1
Building Blocks in the Cloud: Scaling LEGO Engineering with AWS High-Performance Computing

Brian Skjerven and Matt Vaugh, AWS

Pdf icon1
Slurm 23.02, 23.11, and Beyond (Roadmap)

Tim Wickberg, SchedMD

Pdf icon1
Optimizing Diverse Workloads and Resource Usage with Slurm

Chansup Byun et al, LLSC

Pdf icon1
Slurm’s REST API

Nathan Rini, SchedMD

Presentations from Dell HPC Community, September 2023

Pdf icon1
Slurm and/or/vs Kubernetes

Tim Wickberg, SchedMD

Presentations from Cray User Group, May 2023

Pdf icon1
Slurm 23.02, 23.11, and Beyond

Tim Wickberg, SchedMD

Presentations from SC22, November 2022

Pdf icon1
Slurm ♥ Containers

Nate Rini & Tim Wickberg, SchedMD

Pdf icon1
Doing More with Slurm Advanced Capabilities

Shawn Hoopes, SchedMD

Pdf icon1
Slurm 22.05, 23.02, and Beyond

Tim Wickberg, SchedMD

Pdf icon1
Slurm and/or/vs Kubernetes

Tim Wickberg, SchedMD

Pdf icon1
Accelerating HPC and AI with Slurm and SchedMD

Nick Ihli, SchedMD

Presentations from the HPC Containers Advisory Working Group, November 2022

Pdf icon1
Slurm ♥ Containers

Nathan Rini, SchedMD

Presentations from CNCF Research End User Group, October 2022

Pdf icon1
Slurm Container Support w/video

Nathan Rini, SchedMD

Presentations from Slurm User Group Meeting, September 2022

Pdf icon1
Field Notes 6: From the Frontlines of Slurm Support w/video

Jason Booth, SchedMD

Pdf icon1
Pathfinding Into the Clouds w/video

Ole Nielsen, Technical University of Denmark (DTU)

Pdf icon1
OCI Containers with scrun w/video

Nathan Rini, SchedMD

Pdf icon1
LBNL Site Report w/video

Wei Feinstein, Lawrence Berkeley National Laboratory

Pdf icon1
Cloudy, with a Chance of Dynamic Nodes w/video

Nick Ihli, SchedMD

Pdf icon1
Burst Buffer Lua Plugin for Lustre w/video

Kota Tsuyuzaki / Rikimaru Honjo / Yusuke Kaneko / Kohei Tahara, NTT Computer and Data Science Laboratory / NTT TechnoCross Corporation

Pdf icon1
22.05, 23.02, and Beyond w/video

Tim Wickberg, SchedMD

Pdf icon1
EDA Slurm Cluster on AWS w/video

Allan Carter, AWS

Presentations from NHR Container Workshop, December 2021

Pdf icon1
Containers and Slurm

Nathan Rini, SchedMD

Presentations from SC21, November 2021

Pdf icon1
BOF: Slurm Birds of a Feather

Tim Wickberg, SchedMD

Presentations from Slurm User Group Meeting, September 2021

Pdf icon1
Field Notes 5: From the Frontlines of Slurm Support w/video

Jason Booth, SchedMD

Pdf icon1
REST API and also Containers w/video

Nathan Rini, SchedMD

Pdf icon1
burst_buffer/lua and slurmscriptd w/video

Marshall Garey, SchedMD

Pdf icon1
Slurm in the Clouds w/video

Nick Ihli, SchedMD

Pdf icon1
Slurm 21.08 and Beyond, w/video

Tim Wickberg, SchedMD

Presentations from SC20, November 2020

Pdf icon1
Slurm Birds of a Feather

Tim Wickberg, SchedMD

Presentations from Slurm User Group Meeting, September 2020

Pdf icon1
Field Notes 4: From the Frontlines of Slurm Support w/video

Jason Booth, SchedMD

Pdf icon1
Cloud and Stuff w/video

Brian Christiansen, SchedMD

Pdf icon1
REST API w/video

Nathan Rini, SchedMD

Pdf icon1
Slurm 20.11 and Beyond w/video

Tim Wickberg, SchedMD

Presentations from PEARC HPCSYSPROS Workshop, August 2020

Pdf icon1
REST API

Nathan Rini, SchedMD

Presentations from Slurm User Group Meeting, September 2019

Pdf icon1
Welcome

Danny Auble, SchedMD

Pdf icon1
Tutorial: TRES and Banking

Brian Christiansen, SchedMD

Pdf icon1
Technical: GPU Scheduling and the cons_tres plugin

Chad Vizino and Morris Jette, SchedMD

Pdf icon1
Site Report: LANL

Joseph ‘Joshi’ Fullop, LANL

Pdf icon1
Tutorial: Cgroups and pam_slurm_adopt

Marshall Garey, SchedMD

Pdf icon1
Site Report: Enabling and Scaling Diverse Work Loads Efficiently with Slurm

Chansup Byun et al., MIT Lincoln Laboratory

Pdf icon1
Priority and Fair Trees

Shawn Hoopes, SchedMD

Pdf icon1
Tutorial: Slurm: Seamless Integration with Unprivileged Containers

Luke Yeager et al., NVIDIA

Pdf icon1
Technical: REST API

Nathan Rini, SchedMD

Pdf icon1
Technical: Job Container Plugin for Managing Node Local Namespaces

Aditi Gaur, NERSC

Pdf icon1
Technical: VMs and Containers for a Slurm-Based Development Cluster

François Daikhaté, CEA

Pdf icon1
Technical: High Throughput Computing

Broderick Gardner, SchedMD

Pdf icon1
Site Report: Slurm on Sherlock

Kilian Cavalotti, Stanford Research Computing Center

Pdf icon1
Slurm + GCP

Brian Christiansen (SchedMD) and Keith Binder (Google)

Pdf icon1
Site Report: ORNL

Matt Ezell, ORNL

Pdf icon1
Technical: Monitoring Slurm with a Splunk App

Nicole Dobson, LANL

Pdf icon1
Site Report: NERSC

Chris Samuel, NERSC

Pdf icon1
Tutorial: Troubleshooting

Albert Gil and Jason Booth, SchedMD

Pdf icon1
Technical: Slurm Account Synchronization with UNIX Groups and Users

Ole Nielsen, Technical University of Denmark (DTU)

Pdf icon1
Technical: A Fully Configurable HPC Web Portal for Managing Slurm Jobs

Patrice Calegari, Atos

Pdf icon1
Technical: Slurm 19.05

Tim Wickberg, SchedMD

Pdf icon1
Technical: Slurm 20.02 and Beyond

Tim Wickberg, SchedMD

Pdf icon1
Technical: Field Notes From a MadMan

Tim Wickberg, SchedMD

Presentations from Slurm User Group Meeting, September 2018

Pdf icon1
Tutorial: Slurm Overview

Felip Moll Marquès, SchedMD

Pdf icon1
Technical: Workload Management Requirements for an Interactive Computing e-Infrastructure

Sadaf Alam (CSCS) and the ICEI team (BSC, CEA, CINECA, CSCS, Jülich)

Pdf icon1
Technical: Slurm in a Container Only World – Are We Crazy?

Paul Peltz and Lowell Wofford (LANL)

Pdf icon1
Technical: Kraken – A Stateful Approach to Cluster Management

Paul Peltz and Lowell Wofford (LANL)

Pdf icon1
Technical: A Declarative Programming Style Job Submission Filter

Douglas Jacobsen, NERSC

Pdf icon1
Technical: Generalized Hypercube (GHC) – A Topology Plugin

M. Clayer and A. Faure, Atos

Pdf icon1
Technical: Keeping Accounts Consistent Across Clusters Using LDAP and YAML

Christian Clémonçon, Ewan Roche, Ricardo Silva (EPFL)

Pdf icon1
Technical: Real-Time Job Monitoring Using an Extended slurmctld Generic Plugin – Introducing the Plugin Architecture SPACE

Mike Arnhold, Ulf Markwardt, and Danny Rotscher (Dresden)

Pdf icon1
Technical: Scheduling by Trackable Resource (cons_tres)

Morris Jette and Dominik Bartkiewicz, SchedMD

Pdf icon1
Technical: Slurm 18.08 Overview

Brian Christiansen, SchedMD

Pdf icon1
Technical: Layout for Checkpoint Restart on Specialized Blades

Bill Brophy, Martin Perry, Doug Parisek, and Steve Mehlberg (Atos)

Pdf icon1
Site Report: CEA Site Report

Regine Gaudin, CEA

Pdf icon1
Site Report: Colliding High Energy Physics with HPC, Cloud, and Parallel Filesystems

Carolina Lindqvist, Pablo Llopis, and Nils Høimyr (CERN)

Pdf icon1
Technical: Slurm Simulator Improvements and Evaluation

Marco D’Amico, Ana Jokanovic, Julita Corbalan (BSC)

Pdf icon1
Site Report: CETA-CIEMAT Site Report

Alfonso Pardo, CETA-CIEMAT

Pdf icon1
Site Report: Tuning Slurm the CSCS Way

Miguel Gila, CSCS

Pdf icon1
Technical: Workload Scheduling and Power Management

Morris Jette and Alejandro Sanchez, SchedMD

Pdf icon1
Site Report: LANL Site Report – One Year Post Migration

Joseph ‘Joshi’ Fullop, LANL

Pdf icon1
Technical: Field Notes Mark 2: Random Musings From Under a New Hat

Tim Wickberg, SchedMD

Presentations from Slurm Booth and Birds of a Feather, SC17, November 2017

Pdf icon1
Booth: Slurm Overview

Brian Christiansen, Marshall Garey, Isaac Hartung (SchedMD)

Pdf icon1
Booth: Heterogeneous Job Support

Morris Jette, Tim Wickberg (SchedMD)

Pdf icon1
Booth: From Moab to Slurm: 12 HPC Systems in 2 Months

Paul Peltz, Los Alamos National Laboratory

Pdf icon1
Booth: PMIx Multi-Cluster Operations

Ralph H. Castain

Pdf icon1
Booth: Federated Cluster Support

Brian Christiansen, SchedMD

Pdf icon1
Booth: PMIx Plugin with UCX Support

Artem Polyakov, Mellanox

Pdf icon1
BOF: Slurm Birds of a Feather

Tim Wickberg, SchedMD

Presentations from Slurm User Group Meeting, September 2017

Pdf icon1
Keynote: Supernova Cosmology & Supercomputing

Alex Kim, Lawrence Berkeley National Laboratory

Pdf icon1
Tutorial: Introduction to Slurm

Tim Wickberg, SchedMD

Pdf icon1
Technical: SLURMFS – Resource Manager File System for Slurm

Steven Senator, Los Alamos National Laboratory

Pdf icon1
Technical: Federated Cluster Support

Brian Christiansen and Danny Auble, SchedMD

Pdf icon1
Technical: Utilizing Slurm and Passive Nagios Plugins for Scalable KNL Compute Node Monitoring

Tony Quan and Basil Lalli, NERSC/LBNL

Pdf icon1
Technical: Field Notes From the Frontlines of Slurm Support

Tim Wickberg, SchedMD

Pdf icon1
Technical: Towards Modular Supercomputing with Slurm

Dorian Krause et al. JSC

Pdf icon1
Technical: Heterogeneous Job Support

Morris Jette, SchedMD

Pdf icon1
Technical: cli_filter – command line filtration, manipulation, and introspection of job submissions

Douglas Jacobsen, NERSC

Pdf icon1
Technical: Slurm – Some Slightly Unconventional Use Cases

Chris Hill (MIT), Rajul Kumar (Northeastern), Evan Weinberg and Naved Ansari (BU), Tim Donahue

Pdf icon1
Technical: Managing Diversity in Complex Workloads in a Complex Environment

Nicholas Cardo, CSCS

Pdf icon1
Technical: SELinux policy for Slurm

Gilles Wiber and Mathieu Blanc (CEA), M’hamed Bouaziz and Liana Bozga (Atos)

Pdf icon1
Site Report: From Moab to Slurm: 12 HPC Systems in 2 Months

Peltz, Fullop, Jennings, Senator, Grunau (Los Alamos National Laboratory)

Pdf icon1
Site Report: NERSC Site Report

James Botts and Douglas Jacobsen

Pdf icon1
Technical: Slurm Roadmap – 17.11, 18.08, and Beyond

Danny Auble, Morris Jette, Tim Wickberg (SchedMD)

Pdf icon1
Technical: New Statistics Using TRES

Bill Brophy, Martin Perry, Thomas Cadeau (Atos)

Pdf icon1
Technical: Enabling web-based interactive notebooks on geographically distributed HPC resources

Alexandre Beche, EPFL

Pdf icon1
Technical: Slurm Singularity Spank Plugin

Martin Perry, Steve Mehlberg, Thomas Cadeau (Atos)

Pdf icon1
Site Report: A Slurm Odyssey: Slurm at Harvard FAS Research Computing

Paul Edmond

Pdf icon1
Site Report: LLSC Adoption of Slurm for Managing Diverse Resources and Workloads

Chansup Byun et al. MIT Lincoln Laboratory

Pdf icon1
Site Report: Cyfronet Site Report – Improving Slurm Usability and Monitoring

M Pawlik, J. Budzowski, L. Flis, P Lason, M. Magrys

Pdf icon1
Technical: When You Have a Hammer, Everything Looks Like a Nail – Checkpoint / Restart in Slurm

Manuel Rodríguez-Pascual, J.A. Moríñigo, and Rafael Mayo-García, CIEMAT

Presentations from Slurm Booth and Birds of a Feather, SC16, November 2016

Pdf icon1
Booth: Process Management Interface – Exascale (PMIx)

Ralph H. Castain

Pdf icon1
Booth: Bull Slurm Related Developments, w/ Job Packs demo video

Yiannis Georgiou, Bull Atos

Pdf icon1
Booth: Transition Hangout (a.k.a. how we converted to Slurm)

Ryan Cox (BYU), Bruce Pfaff (NASA)

Pdf icon1
Booth: Expanding Serial Analysis with Slurm Arrays

Christopher Coffey, Northern Arizona University

Pdf icon1
Booth: Intel HPC Orchestrator

Tom Krueger, Intel

Pdf icon1
Booth: Slurm Overview

Moe Jette, SchedMD

Pdf icon1
BOF: Slurm State of the Union; v16.05, v17.02 and Beyond

Tim Wickberg, SchedMD

Presentations from Slurm User Group Meeting, September 2016

Pdf icon1
Keynote: Computer-aided drug design for novel anti-cancer agents

Dr. Zoe Cournia (Biomedical Research Foundation, Academy of Athens)

Pdf icon1
Technical: Overview of Slurm Version 16.05

Danny Auble (SchedMD), Yiannis Georgiou (Bull)

Pdf icon1
Technical: MCS (Multi-Category Security) Plugin

Aline Roy, CEA

Pdf icon1
Technical: Slurm Burst Buffer Integration

David Paul, NERSC

Pdf icon1
Technical: Slurm Configuration Impact on Benchmarking

José Moríñgo, Manuel Rodríguez-Pascual, and Rafael Mayo-García, CIEMAT

Pdf icon1
Technical: Real-time monitoring Slurm jobs with InfluxDB

Carlos Fenoy García

Pdf icon1
Technical: Optimising HPC Resource Allocation Through Monitoring

Alexandre Beche, EPFL

Pdf icon1
Technical: Simunix, a large scale platform simulator

David Glesser and Adrien Faure, Bull Atos

Pdf icon1
Site Report: Swiss national Supercomputer Centre (CSCS)

Nicholas Cardo

Pdf icon1
Technical: Configure a Slurm cluster with Ansible

Johan Guldmyr, CSC

Pdf icon1
Technical: Checkpoint/restart in Slurm: current status and new developments

Manuel Rodríguez-Pascual, J.A. Moríñigo, and Rafael Mayo-García, CIEMAT

Pdf icon1
Technical: Intel Knights Landing (KNL)

Morris Jette and Tim Wickberg, SchedMD

Pdf icon1
Technical: Job Packs – A New Slurm Feature for Enhanced Support of Heterogeneous Resources

Andry Razafinjatovo, Martin Perry, and Yiannis Georgiou (Bull Atos), Matthieu Hautreux (CEA)

Pdf icon1
Technical: Improving system utilization under strict power budget using the layouts

Dineshkumar Rajagopal, Yiannis Georgiou, and David Glesser, Bull Atos

Pdf icon1
Technical: High definition power and energy monitoring support

Thomas Cadeau and Yiannis Georgiou, Bull Atos

Pdf icon1
Technical: Federated Cluster Scheduling

Dominik Bartkiewicz and Brian Christiansen, SchedMD

Pdf icon1
Technical: Slurm Roadmap – SchedMD

Danny Auble, SchedMD

Pdf icon1
Technical: Slurm Roadmap – Bull

Yiannis Georgiou and Andry Razafinjatovo, Bull Atos

Pdf icon1
Site Report: Electricité de France (EDF)

Cécile Yoshikawa

Pdf icon1
Site Report: Leibniz-Rechenzentrum (LRZ)

Juan Pancorbo Armada

Pdf icon1
Site Report: NERSC Site Report – One Year of Slurm

Douglas Jacobsen

Pdf icon1
Site Report: Experience Using Slurm on ARIS HPC System

Nikos Nikoloutsakos, GRNET

Presentations from Slurm Booth and Birds of a Feather, SC15, November 2015

Pdf icon1
Booth: PMIx – Enabling Application-Driven Execution at Exascale

Ralph H. Castain

Pdf icon1
Booth: NASA NCCS Site Update

Bruce Pfaff, NASA

Pdf icon1
Booth: Brigham Young University – Site Report

Ryan Cox, BYU

Pdf icon1
Booth: Slurm Overview

Brian Christiansen and Danny Auble, SchedMD

Pdf icon1
Booth: Never Port Your Code Again – Docker Functionality with Shifter using Slurm

Shane Canon, NERSC

Pdf icon1
Booth: Slurm Burst Buffer Support

Tim Wickberg, SchedMD

Pdf icon1
Booth: Slurm Overview and Elasticsearch Plugin

Alejandro Sanchez, SchedMD

Pdf icon1
Booth: All Things TRES

Brian Christiansen, SchedMD

Pdf icon1
BOF: Slurm Version 15.08

Danny Auble, SchedMD

Pdf icon1
BOF: Improving Backfilling by using Machine Learning to Predict Running Times in Slurm

David Glesser, Bull

Presentations from Slurm User Group Meeting, September 2015

Pdf icon1
Keynote: 10-Years of Computing and Atmospheric Research at NASA: 1 day per day

Bill Putnam, NASA

Pdf icon1
Technical: Overview of Slurm Version 15.08

Morris Jette and Danny Auble (SchedMD), Yiannis Georgiou (Bull)

Pdf icon1
Technical: Trackable Resources (TRES)

Brian Christiansen and Danny Auble, SchedMD

Pdf icon1
Technical: Message Aggregation

Danny Abule (SchedMD), Yiannis Georgiou and Martin Perry (Bull)

Pdf icon1
Technical: Slurm Burst Buffer Support

Morris Jette (SchedMD), Tim Wickberg (GW)

Pdf icon1
Technical: Partition QOS

Danny Auble, SchedMD

Pdf icon1
Technical: Slurm Power Management Support

Morris Jette, SchedMD

Pdf icon1
Technical: Slurm Layouts Framework

Matthieu Hautreux, CEA

Pdf icon1
Technical: Power Adaptive Scheduling

Yiannis Georgiou and David Glesser (Bull), Matthieu Hautreux (CEA), Denis Trystram (LIG)

Pdf icon1
Technical: Never Port Your Code Again – Docker Functionality with Shifter Using Slurm

Douglas Jacobsen, James Botts, and Shane Canon, NERSC

Pdf icon1
Technical: Increasing Cluster Thoughput with Slurm and rCUDA

Federico Silla, Technical University of Valencia Spain

Pdf icon1
Technical: Running Virtual Machines in a Slurm Batch System

Ulf Markwardt, Technische Universität Dresden

Pdf icon1
Technical: Supporting SR-IOV and IVSHMEM in MVAPICH2 on Slurm

Xiaoyi Lu, Jie Zhang, et al., The Ohio State University

Pdf icon1
Technical: Heterogeneous Resources and MPMD (aka Job Pack)

Rod Schultz and Martin Perry (Atos), Matthieu Hautreaux (CEA), Yiannis Georgiou (Atos)

Pdf icon1
Technical: Towards Multi-Objective Resource Selection

Dineshkumar Rajagopal, David Glesser, Yiannis Georgiou, Bull

Pdf icon1
Technical: Enhancing Startup Performance of Parallel Applications with Slurm

Sourav Chakraborty, et al., OSU/LLNL

Pdf icon1
Technical: Adaptable Profile-Driven TestBed (“Apt”)

Brian Haymore, The University of Utah

Pdf icon1
Technical: Using and Modifying the BSC Slurm Workload Simulator

Stephen Trofinoff and Massimo Benini, CSCS

Pdf icon1
Technical: Improving Job Scheduling by Using Machine Learning

David Glesser, Yiannis Georgiou (Bull) and Denis Trystram (LIG)

Pdf icon1
Technical: Federated Cluster Scheduling

Brian Christiansen and Danny Auble, SchedMD

Pdf icon1
Technical: Native Slurm on the XC30

Douglas Jacobsen, James Botts, NERSC

Pdf icon1
Technical: Slurm Roadmap – Versions 16.05 and Beyond

Morris Jette and Danny Auble (SchedMD), Yiannis Georgiou (Bull)

Pdf icon1
Technical: Exascale Process Management Interface

Ralph Castain (Intel), Joshua Ladd, Artem Polyakov (Mellanox), David Bigagli (SchedMD), Gary Brown (Adaptive Computing)

Pdf icon1
Site Report: Brigham Young Iniversity

Ryan Cox, BYU

Pdf icon1
Site Report: University of South Florida

John DeSantis, USF

Pdf icon1
Site Report: NASA Center for Climate Simulation

Bruce Pfaff, NASA

Pdf icon1
Site Report: Jülich Supercomputing Centre

Dorian Krause, JSC

Pdf icon1
Site Report: The George Washington University

Tim Wickberg, GW

Presentations from Slurm Booth and Birds of a Feather, SC14, November 2014

Pdf icon1
Slurm Overview

Danny Auble and Brian Christiansen, SchedMD

Pdf icon1
Slurm Version 14.11

Jacob Jenson, SchedMD

Pdf icon1
Slurm Version 15.08 Roadmap

Jacob Jenson, SchedMD

Pdf icon1
Slurm on Cray Systems

David Wallace, Cray

Pdf icon1
Fair Tree: Fairshare Algorithm for Slurm

Ryan Cox and Levi Morrison (Brigham Young University)

Pdf icon1
VLSCI Site Report

Chris Samuel (VLSCI)

Presentations from Slurm User Group Meeting, September 2014

Pdf icon1
Welcoming Address

Colin McMurtie (Swiss National Supercomputing Centre, CSCS)

Pdf icon1
Overview of Slurm Versions 14.03 and 14.11

Jacob Jenson (SchedMD) and Yiannis Georgiou (Bull)

Pdf icon1
Warewulf Node Health Check

Jacqueline Scoggins and Michael Jennings (Lawrence Berkeley National Lab)

Pdf icon1
Slurm Process Isolation

Bill Brophy, Martin Perry and Yiannis Georgiou (Bull), Morris JEtte (SchedMD), Matthieu Hautreux (CEA)

Pdf icon1
Improving Message Forwarding Logic in Slurm

Rod Schultz, Martin Perry and Yiannis Georgiou (Bull), Matthieu Hautreux (CEA), Danny Auble and Morris Jette (SchedMD)

Pdf icon1
Tuning Slurm Scheduling for Optimal Responsiveness and Utilization

Morris Jette (SchedMD)

Pdf icon1
Improving HPC Applications Scheduling with Predictions Based on Automatically-Collected Historical Data

Carlos Fenoy García (Barcelona Supercomputing Centre)

Pdf icon1
OStrich: Fair Scheduler for Burst Submissions of Parallel Job

Krzysztof Rzadca (University of Warsaw) and Filip Skalski (University of Warsaw / Google)

Pdf icon1
Adaptive Resource and Job Management for Limited Power Consumption

Yiannis Georgiou and David Glesser (Bull), Matthieu Hautreux (CEA), Denis Trystram (University Grenoble-Alpes)

Pdf icon1
Introducing Energy Based Fair-Share Scheduling

Yiannis Georgiou and David Glesser (Bull), Krzysztof Rzadca (University of Warsaw),  Denis Trystram (University Grenoble-Alpes)

Pdf icon1
High Performance Data Movement Between Lustre and Enterprise Storage Systems

Aamir Rashid (Terascala)

Pdf icon1
Extending Slurm with Support for Remote GPU Virtualization

Sergio Iserte, Adrián Castelló, Rafael Mayo, Enrique S. Quintana-Ortlí, Federico Silla, Jose Duato (Universitat Jaume and Universitat Politècnica de València)

Pdf icon1
Slurm Migration Experience

Jacqueline Scoggins (Lawrence Berkeley National Lab)

Pdf icon1
Budget Checking Plugin for Slurm

Huub Stoffers (SURF sara)

Pdf icon1
Fair Tree: Fairshare Algorithm for Slurm

Ryan Cox and Levi Morrison (Brigham Young University)

Pdf icon1
Integrating Layouts Framework in Slurm

Thomas Cadeau and Yiannis Georgiou (Bull), Matthieu Hautreux (CEA)

Pdf icon1
Topology-Aware Resource Selection

Emmanuel Jeannot, Guillaume Mercier, and Adèle Villiermet (Inria)

Pdf icon1
Slurm Inter-Cluster Project

Stephen Trofinoff (CSCS)

Pdf icon1
Slurm Native Workload Management on Cray Systems

Morris Jette (SchedMD)

Pdf icon1
Slurm on Cray Systems

Jason Coverston (Cray)

Pdf icon1
Slurm Roadmap

Yiannis Georgiou (Bull), Morris Jette and Jacob Jenson (SchedMD)

Pdf icon1
Private / tmp for Each Job Using SPANK

Magnus Jonsson (Umeå Universitet)

Pdf icon1
ICM Warsaw University Site Report

Dominik Bartkiewicz and Marcin Stolarek (ICM Warsaw University)

Pdf icon1
iVEC Site Report

Andrew Elwell (iVEC)

Pdf icon1
CEA Site Report

Matthieu Hautreux (CEA)

Pdf icon1
Swiss National Supercomputing Centre Site Report

Massimo Benini (Swiss National Supercomputing Centre, CSCS)

Pdf icon1
Aalto University Site Report

Janne Blomqvist, Ivan Degtyarenko and Mikko Hakala (Aalto University)

Pdf icon1
The George Washington University Site Report

Tim Wickberg, George Washington University

Presentations from Slurm Birds of a Feather, SC13, November 2013

Pdf icon1
Slurm Workload Manager Project Report

Morris Jette and Danny Auble, SchedMD

Pdf icon1
Bull’s Slurm Roadmap

Eric Monchalin, Bull

Pdf icon1
Native Slurm on Cray XC30

David Wallace, Cray

Presentations from Slurm User Group Meeting, September 2013

Pdf icon1
Welcome

Morris Jette (SchedMD)

Pdf icon1
Keynote: Future Outlook for Advanced Computing

Dona Crawford (LLNL)

Pdf icon1
Technical: Overview of Slurm version 2.6

Morris Jette and Danny Auble (SchedMD), Yiannis Georgiou (Bull)

Pdf icon1
Tutorial: Energy Accounting and External Sensor Plugins

Yiannis Georgiou, Martin Perry, Thomas Cadeau (Bull), Danny Auble (SchedMD)

Pdf icon1
Technical: Debugging Large Machines

Matthieu Hautreux (CEA)

Pdf icon1
Technical: Creating Easy to Use HPC Portals with NICE EnginFrame and Slurm

Alberto Falzone, Paolo Maggi (Nice Software)

Pdf icon1
Tutorial: Usage of New Profiling Functionalities

Rod Schultz, Yiannis Georgiou (Bull), Danny Auble (SchedMD)

Pdf icon1
Technical: Fault Tolerant Workload Management

David Bigagli, Morris Jette (SchedMD)

Pdf icon1
Technical: Slurm Layouts Framework

Yiannis Georgiou (Bull), Matthieu Hautreux (CEA)

Pdf icon1
Technical: License Management

Bill Brophy (Bull)

Pdf icon1
Technical: Multi-Cluster Management

Juan Pancorbo Armada (IRZ)

Pdf icon1
Technical: Depth Oblivious Hierarchical Fairshare Priority Factor

Francois Daikhate, Matthieu Hautreux (CEA)

Pdf icon1
Technical: Refactoring ALPS

Dave Wallace (Cray)

Pdf icon1
Site Report: CEA

Francois Diakhate, Francis Belot, Matthieu Hautreux (CEA)

Pdf icon1
Site Report: George Washington University

Tim Wickberg (George Washington University)

Pdf icon1
Site Report: Brigham Young University

Ryan Cox (BYU)

Pdf icon1
Site Report: Technische Universitat Dresden

Dr. Ulf Markwardt (Technische Universität Dresden)

Pdf icon1
Technical: Slurm Roadmap

Morris Jette, Dany Auble (SchedMD), Yiannis Georgiou (Bull)

Presentations from Slurm Birds of a Feather, SC12, November 2012

Pdf icon1
Slurm Workload Manager Project Report

Morris Jette and Danny Auble, SchedMD

Pdf icon1
Using Slurm for Data Aware Scheduling in the Cloud

Martijn de Vries, BrightComputing

Pdf icon1
Slurm Roadmap

Eric Monchalin, Bull

Pdf icon1
MapReduce Support in Slurm: Releasing the Elephant

Ralph H. Castain, Wangda Tan, Jimmy Cao and Michael Lv, Greenplum/EMC

Pdf icon1
Slurm at Rensselaer

Tim Wickberg, Rensselaer Polytechnic Institute

Presentations from Slurm User Group Meeting, October 2012

Pdf icon1
Keynote: The OmSs Programming Model and Its Links to Resource Managers

Jesus Labarta, BSC

Pdf icon1
Slurm Status Report

Morris Jette and Danny Auble, SchedMD

Pdf icon1
Site Report: BSC/RES

Alejandro Lucero and Carles Fenoy, BSC

Pdf icon1
Site Report: CSCS

Stephen Trofinoff, CSCS

Pdf icon1
Site Report: CEA

Matthieu Hautreux (CEA)

Pdf icon1
Site Report: CETA/CIEMAT

Alfonso Pardo Diaz, CIEMAT

Pdf icon1
Porting Slurm to Bluegene/Q

Don Lipari, LLNL

Pdf icon1
Tutorial: Slurm Database Use, Accounting and Limits

Danny Auble (SchedMD)

Pdf icon1
Tutorial: The Slurm Scheduler Design

Don Lipari, LLNL

Pdf icon1
Tutorial: Cgroup Support on Slurm

Martin Perry and Yiannis Georgiou (Bull), Matthieu Hautreux (CEA)

Pdf icon1
Tutorial: Kerberos and Slurm using Auks

Matthieu Hautreux, CEA

Pdf icon1
Keynote: Challenges in Evaluating Parallel Job Schedulers

Dror Feitelson, Hebrew University

Pdf icon1
Integration of Slurm with IBM’s Parallel Environment

Morris Jette and Danny Auble, SchedMD

Pdf icon1
Slurm Bank

Jimmy Tang and Paddy Doyle, Trinity College, Dublin

Pdf icon1
Using Slurm for Data Aware Scheduling in the Cloud

Martijn de Vries, Bright Computing

Pdf icon1
Enhancing Slurm with Energy Consumption Monitoring and Control Features

Yiannis Georgiou, Bull

Pdf icon1
MapReduce Support in Slurm: Releasing the Elephant

Ralph H. Castain, et al., Greenplum/EMC

Pdf icon1
Using Slurm via Python

Mark Roberts (AWE) and Stephan Gorget (EDF)

Pdf icon1
High Throughput Computing with Slurm

Morris Jette and Danny Auble, SchedMD

Pdf icon1
Evaluating Scalability and Efficiency of the Resource and Job Management System on Large HPC Clusters

Yiannis Georgiou (Bull) and Matthieu Hautreux (CEA)

Pdf icon1
Integer Programming Based Herogeneous CPU-GPU Clusters

Seren Soner, Bogazici University

Pdf icon1
Job Resource Utilization as a Metric for Clusters Comparison and Optimization

Joseph Emeras, INRIA/LIG

Presentations from the Sixth Linux Collaboration Summit, April 2012

Pdf icon1
Resource Management with Linux Control Groups in HPC Clusters

Yiannis Georgiou, Bull

Presentations from Slurm Birds of a Feather, SC11, November 2011

Pdf icon1
Slurm Version 2.3 and Beyond

Morris Jette, SchedMD LLC

Pdf icon1
Bull’s Slurm Roadmap

Eric Monchalin, Bull

Pdf icon1
Cloud Bursting with Slurm and Bright Cluster Manager

Martijn de Vries, Bright Computing

Presentations from Slurm User Group Meeting, September 2011

Pdf icon1
Basic Configuration and Usage

Rod Schultz, Groupe Bull

Pdf icon1
Slurm: Advanced Usage

Rod Schultz, Groupe Bull

Pdf icon1
CPU Management Allocation and Binding

Martin Perry, Groupe Bull

Pdf icon1
Configuring Slurm for HA

David Egolf and Bill Brophy, Groupe Bull

Pdf icon1
Slurm Resources Isolation Through cgroups

Yiannis Georgiou (Groupe Bull), Matthieu Hautreux (CEA)

Pdf icon1
Slurm Operation on Cray XT and XE

Moe Jette, SchedMD LLC

Pdf icon1
Challenges and Opportunities for Exascale Resource Management and How Today’s Petascale Systems are Guiding the Way

William Kramer, NCSA

Pdf icon1
CEA Site Report

Matthieu Hautreux, CEA

Pdf icon1
LLNL Site Report

Don Lipari, LLNL

Pdf icon1
Slurm Version 2.3 and Beyond

Moe Jette, SchedMD LLC

Pdf icon1
Slurm Simulator

Alejandro Lucero, BSC

Pdf icon1
Proposed Design for Enhanced Enterprise-wide Scheduling

Don Lipari, LLNL

Pdf icon1
Bright Cluster Manager & Slurm

Robert Stober, Bright Computing

Pdf icon1
Job Step Management in User Space

Moe Jette, SchedMD LLC

Pdf icon1
Slurm Operation IBM BlueGene/Q

Danny Auble, SchedMD LLC

Presentations from Slurm Birds of a Feather, SC10, November 2010

Pdf icon1
Slurm Version 2.2: Features and Release Plans

Morris Jette, Danny Auble, and Donald Lipari, Lawrence Livermore National Laboratory

Presentations from Slurm User Group Meeting, October 2010

Pdf icon1
Slurm: Resource Management from the Simple to the Sophisticated

Morris Jette and Danny Auble, Lawrence Livermore National Laboratory

Pdf icon1
Slurm at CEA

Matthieu Hautreux, CEA/DAM/DIF

Pdf icon1
Slurm Support for Linux Control Groups

Martin Perry, Bull Information Systems

Pdf icon1
Slurm at BSC

Carles Fenoy and Alejandro Lucero, Barcelona Supercomputing Center

Pdf icon1
Porting Slurm to the Cray XT and XE

Neil Stringfellow and Gerrit Renker, Swiss National Supercomputer Centre

Pdf icon1
Real Scale Experimentations of Slurm Resource and Job Management System

Yiannis Georgiou, Bull Information Systems

Pdf icon1
Slurm Version 2.2: Features and Release Plans

Morris Jette and Danny Auble, Lawrence Livermore National Laboratory

Presentations from Slurm Birds of a Feather, SC09, November 2009

Pdf icon1
Slurm Community Meeting

Morris Jette, Danny Auble, and Donald Lipari, Lawrence Livermore National Laboratory

Presentations from Slurm Birds of a Feather, SC08, November 2008

Pdf icon1
High Scalability Resource Management with Slurm

Morris Jette, Lawrence Livermore National Laboratory

Pdf icon1
Slurm Status Report

Morris Jette and Danny Auble, Lawrence Livermore National Laboratory

Other Presentations

Pdf icon1
Slurm Version 1.3

Morris Jette and Danny Auble, Lawrence Livermore National Laboratory (May 2008)

Pdf icon1
Managing Clusters with Moab and Slurm

Morris Jette and Donald Lipari, Lawrence Livermore National Laboratory (May 2008)

Pdf icon1
Resource Management at LLNL, Slurm Version 1.2

Morris Jette, Danny Auble and Chris Morrone, Lawrence Livermore National Laboratory (April 2007)

Pdf icon1
Resource Management Using Slurm

Morris Jette, Lawrence Livermore National Laboratory (Tutorial, The 7th International Conference on Linux Clusters, May 2006)

Publications

Pdf icon1
Energy Accounting and Control with Slurm Resource and Job Management System

Yiannis Georgiou, et. al. (ICDCN 2014, January 2014)

Pdf icon1
Evaluating scalability and efficiency of the Resource and Job Management System on large HPC Clusters

Yiannis Georgiou (BULL S.A.S, France); Matthieu Hautreux (CEA-DAM, France) (16th Workshop on Job Scheduling Strategies for Parallel Processing, May 2012)

Pdf icon1
GreenSlot: Scheduling Energy Consumption in Green Datacenters

Inigo Goiri, et. al. (SuperComputing 2011, November 2011)

Pdf icon1
Contributions for Resource and Job Management in High Performance Computing

Yiannis Georgiou, Universite Joseph Fourier (Thesis, December 2010)

Pdf icon1
Caos NSA and Perceus: All-in-one Cluster Software Stack

Jeffrey B. Layton, Linux Magazine, 5 February 2009

Pdf icon1
Enhancing an Open Source Resource Manager with Multi-Core/Multi-threaded Support

S. M. Balle and D. Palermo, Job Scheduling Strategies for Parallel Processing, 2007

Pdf icon1
Slurm: Simple Linux Utility for Resource Management

M. Jette and M. Grondona, Proceedings of ClusterWorld Conference and Expo, San Jose, California, June 2003

Pdf icon1
Slurm: Simple Linux Utility for Resource Management

A. Yoo, M. Jette, and M. Grondona, Job Scheduling Strategies for Parallel Processing, volume 2862 of Lecture Notes in Computer Science, pages 44-60, Springer-Verlag, 2003

Interview

Pdf icon1
RCE 10: Slurm (podcast)

Brock Palen and Jeff Squyres speak with Morris Jette and Danny Auble of LLNL about Slurm

Other Resources

Pdf icon1
Learning Chef: Compute Cluter with Slurm

A Slurm Cookbook by Adam DeConinck