High Performance Computing Team Lead
__jobinformationwidget.freetext.ExternalReference__
001807SR
- Full-time
- Boston
- Harvard Medical School
- 059
- Information Technology
- Exempt
- No
- 00 - Non Union, Exempt or Temporary
This vacancy has now expired. Please see similar roles below...
By working at Harvard University, you join a vibrant community that advances Harvard's world-changing mission in meaningful ways, inspires innovation and collaboration, and builds skills and expertise. We are dedicated to creating a diverse and welcoming environment where everyone can thrive.
Why join Harvard Medical School?
Harvard Medical School's mission is to nurture a diverse, inclusive community dedicated to alleviating suffering and improving health and well-being for all through excellence in teaching and learning, discovery and scholarship, and service and leadership.
You’ll be at the heart of biomedical discovery, education, and innovation, working alongside world-renowned faculty and a community dedicated to improving human health. This is more than a job - it’s an opportunity to shape the future of medicine.
As the High Performance Computing Team Lead, you will lead the design, engineering, operation, and lifecycle management of high-performance computing (HPC) environments that support scalable and secure research workflows across HMS. You will administer compute clusters, manage workload scheduling systems such as Slurm, support secure and policy-compliant compute platforms, and oversee user access and software environments. In this role, you will set technical direction and execute with operational oversight and collaborating across research and infrastructure teams, to establish and maintain a robust foundation for computational activities and research across the institution. You will also serve as a key contributor on HPC infrastructure initiatives spanning multiple projects. The ideal candidate will possess strong infrastructure engineering skills, automation expertise, and a commitment to reliability and performance in support of scientific computing.
Core Duties and Responsibilities:
- Oversee the design, provisioning, configuration, and decommissioning of HPC compute clusters, ensuring system performance and lifecycle sustainability.
- Engineer, administer and tune workload schedulers (e.g., Slurm) and cluster management to optimize job throughput, resource utilization, and system availability.
- Design, maintain and support secure, regulated compute environments (e.g. NIST 800-171), ensuring technical safeguards and documentation align with required frameworks necessary for enabling regulated biomedical research.
- Ensure integration and design of user accounts and identity management with institutional systems, supporting secure and streamlined access to HPC resources.
- Design and maintain customized and sustainable researcher software environments, including module systems and containerized applications within security standards.
- Lead team in the software development life cycle for operational tooling and infrastructure automation and deliver expert coding.
- Research, design, and implement technical solutions to meet infrastructure and research requirements.
- Identify opportunities to improve and simplify compute platform services and implement related enhancements.
- Contribute to the creation and maturing of operational and automation best practices, including Service Level Agreements.
- Act as a technical liaison to internal and external stakeholders and collaborators and mentor junior staff.
- Participate in off hours on-call schedule.
- Other duties as assigned.
Basic Qualifications:
- Minimum of seven years’ post-secondary education or relevant work experience.
Additional Qualifications and Skills:
- Minimum of 5 years of experience managing Linux-based HPC systems in a research or academic environment.
- Strong experience with workload schedulers (Slurm preferred), cluster provisioning, and performance tuning.
- Experience with infrastructure monitoring, configuration management tools (e.g., Ansible), and containerization tools (e.g., Singularity/Apptainer, Docker).
- Familiarity with security and compliance requirements in regulated research environments.
- Excellent troubleshooting, communication, and collaboration skills.
- Ability to work collaboratively in a team and adapt to evolving technologies and priorities.
- Excellent interpersonal skills, including the ability to build and cultivate strong relationships and work effectively with diverse groups.
- Demonstrated “can do” work ethic coupled with effective time management.
- Standard Hours/Schedule: 35 hrs. per week | Monday - Friday | 9:00 am - 5:00 pm. Occasionally required to work outside of normal business hours and may be called during off hours.
- Visa Sponsorship Information: Harvard University is unable to provide visa sponsorship for this position.
- Pre-Employment Screening: Identity, Criminal
- Other Information: Please note that we are currently conducting a majority of interviews and onboarding remotely and virtually. We appreciate your understanding.
- Staying Informed About Your Application: Due to the high volume of applications, we may not always be able to reach out right away, but you can track your status anytime through the Careers@Harvard portal.
#LI-DK1
Work Format Details
This position has been determined by school or unit leaders that the duties and responsibilities can effectively be performed fully remotely at a non-Harvard location. Employees in fully remote positions must work all scheduled hours in a Harvard registered state in compliance with the University’s Policy on Employment Outside of Massachusetts. At the discretion of the department, fully remote employees may occasionally be required on site at a Harvard location. Certain visa types and funding sources may limit work location. Individuals must meet work location sponsorship requirements prior to employment.
Salary Grade and Ranges
This position is salary grade level 059. Please visit Harvard's Salary Ranges to view the corresponding salary range and related information.
Benefits
Harvard offers a comprehensive benefits package that is designed to support a healthy work-life balance and your physical, mental and financial wellbeing. Because here, you are what matters. Our benefits include, but are not limited to:
- Generous paid time off including parental leave
- Medical, dental, and vision health insurance coverage starting on day one
- Retirement plans with university contributions
- Wellbeing and mental health resources
- Support for families and caregivers
- Professional development opportunities including tuition assistance and reimbursement
- Commuter benefits, discounts and campus perks
Learn more about these and additional benefits on our Benefits & Wellbeing Page.
EEO/Non-Discrimination Commitment Statement
Harvard University is committed to equal opportunity and non-discrimination. We seek talent from all parts of society and the world, and we strive to ensure everyone at Harvard thrives. Our differences help our community advance Harvard's academic purposes.
Harvard has an equal employment opportunity policy that outlines our commitment to prohibiting discrimination on the basis of race, ethnicity, color, national origin, sex, sexual orientation, gender identity, veteran status, religion, disability, or any other characteristic protected by law or identified in the university's non-discrimination policy. Harvard's equal employment opportunity policy and non-discrimination policy help all community members participate fully in work and campus life free from harassment and discrimination.
- Full-time
- Boston
- Harvard Medical School
- 059
- Information Technology
- Fully Remote
- Exempt
- No
- 00 - Non Union, Exempt or Temporary
Similar Roles
Agentic AI Product Manager, HBS AI Institute
Salary
Location
Boston, MA, United States
Union
00 - Non Union, Exempt or Temporary
Work Format
Hybrid
Department
Digital Data Design Institute
Job Type
Full-time
FLSA Status
Exempt
Location
Boston
Brand
Harvard Business School
Salary Grade
059
Term Appointment
Yes
Harvard Job Function
Information Technology
Description
We are seeking an experienced Agentic AI Product Manager to lead the development, oversight, and optimization of agentic AI applications supporting the HBS AI Institute Executive Education portfolio.
Reference
1a5e71c4-ff4d-40e7-a9e4-bd4ae902cd98
Expiry Date
01/01/0001
Learning Designer
Salary
Location
Cambridge, MA, United States
Union
00 - Non Union, Exempt or Temporary
Work Format
Hybrid
Job Type
Full-time
FLSA Status
Exempt
Location
Cambridge
Brand
Harvard Graduate School of Education
Salary Grade
056
Term Appointment
Yes
Harvard Job Function
Information Technology
Description
Job Summary:The Learning Designer is a member of the Learning Experience and Design (LXD) team, a fast-paced, learner-centered design and development team within Professional Education (PPE) at the Ha
Reference
38ef3b76-3294-4059-9ce4-3c71374cc1dc
Expiry Date
01/01/0001
Machine Learning Engineer
Salary
Location
Boston, MA, United States
Union
00 - Non Union, Exempt or Temporary
Work Format
Hybrid
Department
Biomedical Informatics
Job Type
Full-time
FLSA Status
Exempt
Location
Boston
Brand
Harvard Medical School
Salary Grade
060
Term Appointment
No
Harvard Job Function
Information Technology
Description
The Core for Computational Biomedicine (CCB) in the Department of Biomedical Informatics (DBMI) at Harvard Medical School (HMS) is looking for a Machine Learning Engineer with advanced expertise to le
Reference
57cb11c4-de63-42ce-b733-1c6f9ed6e27a
Expiry Date
01/01/0001
Join Our Talent
Community
Let's keep in touch! Stay connected to learn more about Harvard and future opportunities.
JOIN OUR TALENT COMMUNITY