Cray Inc.15
What does the underlying OS really mean to you?
Sunny
To Cray, the underlying OS means the entire operating environment: the kernel, OS services, daemons, and other software that provides user services. It also includes integrated interfaces to third party components like workload managers.
Joseph B George (JBG)
Right - it's direct access to system resources, and being able to work with the OS, and sometimes enhance the OS (user mode or otherwise) allows our applications to result in better performance
Joseph B George (JBG)
And that's why we're all doing this - better performance! :)
Martijn de Vries
once containerized workload becomes more mainstream in HPC, the actual OS that is running on your nodes will become less relevant, since everything needed for your jobs to run would be in the container image.
Sunny
Yes, we see workloads requirements becoming more diverse and containers being an important part of supporting them.
Scott Suchyta (HPC)
Agree with Martijn. The workload needs to be orchestrated such that jobs will run on the right nodes at the right time -- job schedulers will be critical in the stack
Joseph B George (JBG)
Yes, the applications are evolving quickly - our communities are starting to think through workload management and container orchestration
Tom Joy
@Scott_HPC do we have a scheduler that support containers? I am not sure how its identify the containers in a node...
Joseph B George (JBG)
+1 to @Scott_HPC - great that Altair is working this
Sunny
Cray is working with Altair and others on the coming mash-up of container, WLM, and orchestration technologies that serve very broad workload requirements. We might throw provisioning in their too.
Tom Joy
@Scott_HPC that means containers has to be mentioned in resourcedef ?
Scott Suchyta (HPC)
#PBSPro 18.x release simplifies the container integration for sites. From user pov, requesting a container is an environment variable, qsub ... -v CONTAINER_IMAGE=name_of_container
Scott Suchyta (HPC)
From admin pov, you can create custom resources to target specific nodes that are eligible to execute the request container