HPC Engineer

The purpose of this role is to provide both technical and administrative support for the company's High Performance Compute (HPC) environments and end-user Linux workstations. The successful candidate will also be heavily involved in the planning, development, and optimisation of all HPC systems.

 

Key Accountabilities:

  • Ability to lead, manage, and work independently

  • Key Stakeholder Management

  • Technical lead of multiple HPC clustered environments

  • Responsible for the HPC systems infrastructure, installation and configuration

  • Development / architecture of HPC services providing continuous improvement

  • Produce and maintain documentation, procedures and knowledge articles

  • Provide out-of-hours support for HPC systems during critical outages

 

Technical Skills:

  • Configure and maintain cluster workload management systems (LSF / PBS / Slurm)

  • Linux administration and tuning (Red Hat / CentOS 7 / 8)

  • 40GbE and InfiniBand networking experience

  • NFS / SMB / S3 administration and configuration

  • Alerting and monitoring (Prometheus, Grafana / Kibana)

  • Shell scripting and debugging (Bash / Perl / Python / XML)

  • DevOps toolsets and automation

  • Creating and managing HPC Cloud Deployments (AWS / GCP / Azure)

 

Candidate Profile:

  • 3+ years' previous Linux and HPC experience (mandatory)
  • Self-motivated achiever who is able to work using their own initiative
  • Excellent communication skills and a strong team player
  • A strong sense of urgency and the ability to multi-task effectively
  • Encourage and maintain a positive team culture of good communication, customer care and continuous improvement
  • Contribute to the overall success of the IT Team by working together to improve existing processes and technical understanding
  • Understanding of the ITIL IT Service Management framework and processes

 

Benefits:

We offer a competitive package, including a generous bonus scheme, life assurance, private medical cover and 25 days' holiday.