Strong Compute

Cluster Ops

Hybrid
Sydney, New South Wales
Senior level
Manage and scale GPU clusters with a focus on reliability and performance, utilizing various tools and interconnects.
The summary above was generated by AI
We manage thousands of GPUs today and need to grow this with reliability, security and performance in mind.

You’ll be working on ops for multi-provider GPU clusters.

When applying please speak to:

  • GPU type and count you’ve managed
  • Providers you’ve worked with, e.g. hyperscalers, neoclouds, on-prem.
  • Interconnects you’ve managed.
  • Tooling you’ve used, e.g. for provisioning, scheduling, storage, monitoring, and cost management.
  • Tooling you’ve developed.

Our culture

  • 🚀 We move fast. We ship weekly: new features, improvements, and fixes go live fast. Our infra runs cluster scale-up tests daily.
  • 👥 We test big. Every month, we stress test with large groups of users face to face, get real-world feedback, and iterate rapidly.
  • 💻 We build together. Weekend hackathons push boundaries, drive innovation, and help us level up as a team.
  • 🔄 We iterate relentlessly. Direct user feedback shapes our roadmap—we release, test, refine, and keep moving.
  • ✈️ We travel when needed. Engineers may travel between SF and Sydney to run events, attend conferences, and meet with clients.

Top Skills

Cost Management Tools
GPUs
Monitoring Tools
Multi-Provider GPU Clusters
Provision Tooling
Scheduling Tools
Storage Tools

Strong Compute Sydney, New South Wales, AUS Office

499-501 Kent St, Sydney, New South Wales, Australia, 2000
