HPC Systems Architect

Job Type: Full-time | Location: Vancouver | Department: IT | Reporting to: CTO

Reporting to the Chief Technology Officer, the HPC Systems Architect will spearhead the design, deployment, and optimization of our high-performance computing (HPC) systems with a focus on HPC clustering, Kubernetes, Slurm, and management tools. The ideal candidate will seamlessly blend traditional HPC solutions with modern container orchestration, ensuring a versatile and scalable computing environment that can be offered to our customers.

Introduction

Podtech Data Centers Inc., the employing entity, is a proud member of the IREN Group, and we are hiring!

IREN is a leading next-generation data center business powering the future with 100% renewable energy. We build, own and operate our data centers and take pride in being at the forefront of sustainable solutions for the ever-evolving applications of high-performance compute. We believe that human progress is invaluable, but it should be pursued in the right way – responsibly, sustainably and with a positive impact on the communities we operate in.

We have grown substantially since 2019, from our inception in Australia to now having several facilities across North America and being listed on NASDAQ… and we are just getting started! By joining us, you will be contributing to the future of sustainable high-performance compute and the local communities we strive to have a positive impact on.

We value diverse perspectives and believe that skills can be developed. If you’re passionate about this role, we want to hear from you, whether or not you meet every criterion. Your unique experiences might be exactly what we need!

Apply today to be considered for an exciting opportunity with us.

Responsibilities

  • Lead the deployment and maintenance of HPC clusters, ensuring they operate effectively and maximize availability.
  • Integrate and manage HPC software components such as Kubernetes, Slurm, cluster management software, and any infrastructure required to operate the HPC environment.
  • Stay abreast of advancements in HPC, Kubernetes, and associated technologies, bringing innovations into our operations and product options.
  • Collaborate with technical teams to establish and implement best practices for system maintenance and optimization.
  • Draft comprehensive documentation, including system designs, operational procedures, and best practice guidelines.
  • Facilitate the selection and integration of relevant management tools to monitor, troubleshoot, and enhance HPC operations.
  • Provide technical leadership and training to other team members, fostering an environment of continuous learning and improvement.

Desired Skills, Qualifications and Competencies include:

  • Minimum of 5 years of experience in HPC system architecture with proven expertise in designing, deploying, and managing HPC clusters.
  • Extensive knowledge of Kubernetes, with a focus on its integration within HPC environments.
  • Hands-on experience with the Slurm workload manager and its intricacies.
  • Familiarity with HPC management tools and software, ensuring efficient system monitoring and troubleshooting.
  • Proven track record of resolving complex system challenges and enhancing operational performance.
  • A Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • Relevant certifications in Kubernetes, HPC technologies, or system architecture are advantageous.
  • Understanding of cloud platforms and their integration into HPC ecosystems.
  • Deep knowledge of network and storage solutions commonly used in HPC setups.

Key Attributes

  • Analytical mindset, adept at envisioning and designing intricate systems.
  • Collaborative approach, with the ability to work effectively with diverse technical teams.
  • Excellent communication skills, translating complex technical concepts into understandable terms for varied audiences.
  • Detail-oriented focus, ensuring systems are both robust and efficient.
  • Continuous learner, keen to stay updated with rapid technological evolutions in the HPC and Kubernetes domains.

The IREN Package:

  • Salary starting from: $140,000
  • Short-term and Long-term Incentive Programs
  • Extended Health and Dental Benefits
  • 3 weeks paid vacation
  • Vancouver – Remote or hybrid work model (in office Tues-Thurs and as required)
  • Casual work attire
  • Vancouver: onsite gym
  • Relocation assistance (if necessary)

Podtech Data Centers Inc., the employing entity and proud member of the IREN Group, is an equal opportunity employer that is committed to creating an inclusive workplace. We evaluate qualified applicants without regard to race, color, religion, age, sex, sexual orientation, gender identity, genetic information, national origin, disability, veteran status, and other legally protected characteristics.

This job will remain posted until filled. While we appreciate all applications we receive, we are only able to contact candidates under consideration.
