Information Technology

Linux HPC Engineer (Remote)

Preferable Location(s): Phoenix, United States of America | Gaithersburg, United States of America | Princeton, United States of America
Work Type: Full Time


RedLine Performance Solutions (RedLine) has been in the High Performance Computing (HPC) solutions engineering services business for over 26 years and is consistently determined to keep the "bar of excellence" quite high for new hires. This enables RedLine to accomplish what other firms cannot and promotes a high level of staff retention. RedLine provides IT infrastructure management and technical support services to some of the world’s largest supercomputing sites.

The Linux/HPC Engineer will primarily work on a small team of HPC Systems Administrators responsible for the installation and operational support of an HPC cluster located in Phoenix, Arizona.  Operations run 24x7 and therefore there will be a rotational on-call requirement.  The Linux/HPC Engineer will actively participate in the evolution and maintenance of the technical infrastructure, in addition to supporting the on-site HPC environment.

The position can be remote, but will be required to support the normal business hours for the primary customer site in Phoenix, AZ. In addition to supporting the HPC cluster in Phoenix, the Engineer will also contribute to other infrastructure and customer initiatives as business needs arise. The Engineer will be required to shift priorities, support parallel efforts, and provide technical expertise across multiple projects, including deployments, upgrades, troubleshooting, and documentation. Additional assignments may include short-term tasking in adjacent programs, collaboration with cross-functional engineering teams, and participation in planned maintenance windows or special projects to meet organizational commitments.  Travel to different customer sites is expected to be a maximum of 25% of the time.

US citizenship is a mandatory requirement for this position. This full-time (W-2) position offers a full benefits package including paid time off, 401k match, and health care benefits. 

Required Skills:
  • 5 or more years of Linux systems administration, preferably in a Red Hat and/or Rocky environment
  • Strong knowledge of TCP/IP networking
  • HPC system administration experience (e.g., parallel file systems, cluster management, archival systems)
  • Strong experience in Bash, Perl, and Python scripting in a version-controlled environment using Git
  • Strong verbal and written communication skills, with the ability to coordinate between multiple team members in remote locations between several disparate projects
  • Strong organizational skills

Preferred Skills/Experience:
  • Experienced with system engineering in addition to system administration
  • Cloud administration (e.g. Azure, GCP, AWS)
  • Experience with deploying and supporting computational models and simulations in HPC infrastructure (e.g., on-premise and cloud, with containers).
  • Knowledge and understanding of application hosting, with experience using Cloud Services in a Commercial Infrastructure as a Service (IAAS) or Platform as Service (PAAS) environment.
  • Red Hat Certification (e.g., RHCSA, RHCE)
  • Server automation experience (e.g., Puppet, Foreman, Ansible)
  • Experience with job scheduling software (e.g., Slurm or Moab)
  • Experience with cluster automation tools (e.g., xCAT, HPCM, or Bright Cluster Manager)
  • Familiarity with a wide range of server and networking hardware (e.g., HPE, SuperMicro, NetGate, Juniper, etc.)
  • Applications such as Atlassian Confluence, Gitlab, or Mediawiki

To learn more about RedLine, please visit our website at www.RedLinePerf.com

Submit Your Application

You have successfully applied
  • You have errors in applying