NVIDIA

Technical Support Engineer, Linux and HPC Admin

Posted 3 Days Ago
Remote
2 Locations
Senior level
The Technical Support Engineer will provide support for Linux-based cluster management software, assisting internal and external customers. Responsibilities include escalating issues, serving as a subject-matter expert, collaborating with development teams, and ensuring best practices are communicated across stakeholders. The role involves working with advanced hardware and software technologies.
The summary above was generated by AI

NVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for over 25 years. It’s a unique legacy of innovation fueled by great technology and dynamic people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing, an era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. NVIDIAns immerse themselves in a diverse, supportive environment that encourages everyone to do their best work. Join the team and see how you can make a lasting impact on the world.

NVIDIA Base Command Manager (BCM) powers thousands of clusters worldwide, ranging from a few nodes to several thousand, and streamlines cluster provisioning, workload management, and infrastructure monitoring. It provides all the tools you need to deploy and run an AI data center. We take great pride in providing excellent, comprehensive support to our customers! The Technical Support Engineer in this role will contribute significantly to the success of both external customers running their clusters with NVIDIA solutions and internal clusters used for research, operations, and next-generation projects.

What you’ll be doing:

  • Support our internal and external customers using our Linux-based cluster management software product, ensuring everyone receives the help they require to support their clusters.

  • Collaborate with the development teams: collect the correct information and escalate issues to the appropriate team.

  • Become and serve as a subject-matter expert in several areas.

  • Perform research and development tasks for customers or for internal use by our development team.

  • Participate in proactive discussions with internal stakeholders to ensure BCM best practices are widely communicated.

  • Work with the latest hardware (e.g. GPUs, AI accelerators, high-speed interconnects) and software technologies such as parallel filesystems (e.g. Lustre, GPFS, WekaIO), Jupyter, various ML frameworks and tools, Spark, Kubernetes, and Ceph.

What we need to see:

  • BS degree or equivalent experience in Electrical Engineering or related field.

  • 5 years of relevant, aligned experience providing support in the HPC realm, ideally in a customer-facing role.

  • Proven research skills and interest in assisting customers to achieve their goals.

  • Experience in a technical customer-facing role.

  • Eagerness to learn and become an authority on our product.

  • Excellent written communication skills, with the ability to distill complex technical information into consumable summaries.

  • In-depth knowledge of Linux.

  • Familiarity with typical Linux installations and their most common software elements.

Ways to stand out from the crowd:

  • Experience with high-performance computing and system administration would be an asset.

  • Previous experience as a system admin running BCM/Bright Cluster Manager/Base Command Manager clusters is a definite plus.

Top Skills

Linux
