Information Technology

Sr. HPC Cloud Developer Lead

Mountain View, California
Work Type: Full Time

RedLine Performance Solutions (RedLine) has been in the HPC solutions engineering services business for over 25 years and is consistently determined to keep the "bar of excellence" quite high for new hires. This enables RedLine to accomplish what other firms cannot and promotes a high level of staff retention. We offer services ranging from full life cycle HPC systems engineering to remote managed services to HPC program analysis. We are located in the Washington, DC area and are looking for the right candidate to join as a AWS Cloud Subject Matter Expert (SME).


We are seeking a AWS Cloud SME to support a cloud contract supporting the National Environmental Satellite, Data, and Information Service (NESDIS), a Line Office under NOAA. Working with Customers, from C-Level executives to technology leaders, there are opportunities to leverage and grow your expertise in HPC workflows optimization and HPC systems architectures on AWS. The AWS Cloud SME will provide expertise with the integration of scientific algorithms in a cloud-based high performance computing (HPC) environment.


The successful candidate will be an active leader of the HPC Cloud Team charged with working with our NASA customer’s requirements to architect an HPC cloud solution for the Supercomputing Division.  Additionally, this candidate will work directly with a small team of ~4 cloud resources to standup and support the newly designed HPC Cloud service.


This position will report directly to the Manager of the Application Performance and Productivity (APP) group and will work to design and deliver hybrid cloud solutions with a focus on high performance computing (HPC) and scientific data processing. The position responsibilities include partnering with engineering and development teams and will co-lead, with a government counterpart, in the design of hybrid cloud solutions that enable rapid adoption of new cloud services, intelligent workload distribution while leveraging existing on-prem HPC infrastructure.


US Citizenship is a mandatory requirement, as is the willingness and ability to obtain a U.S. Government security clearance.   This full-time, direct hire position offers a full benefits package including paid time off, 401k match, short and long term disability coverage and PPO health care benefits.  


This position requires someone local to the Mountain View, CA area as they will be required to work 2-3 days a week onsite at the NASA Ames facility.


Duties and Responsibilities:


  • Design and lead the architectural initiative for the NASA HPC Cloud solution
  • Build and manage a small cloud team of ~4 resources to develop and sustain the cloud solution
  • Work with a government counterpart to define cloud architectures for both hybrid and non-hybrid cloud solutions.
  • Communicate architectures and solutions using standard industry modeling and diagramming and best practice.
  • Work with a government counterpart to ensure that all hybrid and non-hybrid cloud solutions follow all security and compliance controls.
  • Plan and achieve project objectives; technically guide projects through completion and ensure all project objectives are met within target time frames.
  • Partner with development and operations teams to develop automation solutions for deployment, monitoring and securing of cloud infrastructure.
  • Advise on tooling for DevOps and provide practical guidance to improve efficiency and consistency for any solution development and deployment.


Requirements:


  • Bachelor’s degree (or equivalent experience), in Computer Science engineering, or related field
  • 10 years in-depth hands-on work experience with large-scale AWS cloud solutions
  • 5 years of HPC specific architecture consulting experience
  • Strong ability to interact with customers to understand needs, elicit requirements, and get feedback on prototype solutions
  • Experience with continuous integration & continuous deployment tools, processes and basic agile methodologies
  • Knowledge of networking, firewalls, etc.
  • Experience with how to navigate complex government security and compliance controls within a large organizational setting
  • Experience with cloud services cost modeling for a given architected solution
  • Strong analytical skills with the ability to learn new information quickly
  • Good organization skills to balance and prioritize work, and ability to multitask
  • Ability to work in a hybrid remote/onsite team environment
  • Excellent communication and people skills, time management, and organizational skills


Preferred Skills


  • Master’s degree (or equivalent experience) in computer science, science, engineering, or related field
  • 5+ years of overall experience off enterprise, full-life cycle in software/cloud architecture and IT and/or software development.
  • Proficiency with containers runtimes (Docker, Singularity, Charlie Cloud)
  • Proficiency with PBS or Slurm
  • Proficiency with git, Jira, confluence
  • Proficiency with configuration management tools such as Ansible, Chef, or Puppet
  • Familiarity with scientific and parallel computing, machine learning
  • Hands-on experience with DevOps and release management tools
  • Familiarity with object store (S3) and POSIXs file systems such as Lustre, and any potential integration of the two, e.g., S3 backed Luster (AWS FSx).



To learn more about RedLine please visit our website at www.redlineperf.com

Submit Your Application

You have successfully applied
  • You have errors in applying