Senior Systems HPC Engineer

Netherlands – Remote Full-Time

Job Description

Take the helm as a Senior Systems HPC Engineer, focusing on system behavior across multiple layers to pinpoint and resolve performance bottlenecks. Your expertise will directly influence the construction, operation, and refinement of our clusters. You will:
- Investigate and resolve performance issues within GPU clusters under real-world training and inference loads.
- Evaluate and integrate new hardware, system configurations, and tuning approaches via the software stack.
- Provide support for intricate performance escalations from both internal teams and external customers.
- Collaborate closely with infrastructure and software engineering teams and with hardware vendors such as NVIDIA, Mellanox, and Intel.
- Contribute to hardware and cluster acceptance testing, ensuring that systems meet performance benchmarks.

Qualifications

To excel in this role, you should possess:
1. 5+ years of professional experience in system-level software development with a focus on performance optimization and low-level programming.
2. 3+ years of hands-on experience with Linux systems, including administration, troubleshooting, and performance tuning.
3. A comprehensive understanding of server architecture, encompassing PCIe devices, NICs, Linux OS/Kernel, and high-performance computing (HPC) systems.
4. Strong proficiency in one or more performance-oriented programming languages such as C/C++, Go, or Python.

Benefits

As a member of our team, you'll enjoy:
- Competitive salary and a comprehensive benefits package.
- Opportunities for professional advancement within Nebius.
- Flexible working arrangements to promote work-life balance.
- A vibrant and collaborative work environment that values initiative and innovation.

