Q-CTRL

Senior Site Reliability Engineer

Reposted 6 Days Ago

Be an Early Applicant

Hybrid

Sydney, New South Wales

Mid level

Hybrid

Sydney, New South Wales

Mid level

As a Senior Site Reliability Engineer, you will enhance reliability processes, lead incident management, and improve observability for Q-CTRL's SaaS platform.

The summary above was generated by AI

About the team

At Q-CTRL, Quantum Computing Engineering is a global team of software engineers and infrastructure experts,combining deep technical expertise with a startup mindset to deliver real impact through software innovation. Our work is underpinned by robust standards, and by embracing the three virtues. Our team excels in areas across back-end, front-end, machine learning, and platform engineering.

We transform Q-CTRL’s world-leading technological breakthroughs into commercial software products with applications across defense, research, and industry. We work closely with Product, Design, and Research teams to accelerate the path to quantum advantage worldwide.

About the role

The Q-CTRL Platform has grown from a single product with one application that could run on a single container to a growing list of products in a rapidly expanding and highly distributed Kubernetes environment. The Q-CTRL Infrastructure team is expanding to bring on an experienced site reliability engineer with a focus on owning initiatives such as leading the establishment of SLOs, incident management and improving the observability platform. They will focus on ensuring our applications remain available, performant, reliable, scalable with strong inspiration from the tenets of Site Reliability Engineering (SRE).

What you'll be doing:

Reporting to the Engineering Manager of the Infrastructure team, you will play a leading role in the reliability processes of the Q-CTRL SaaS platform.
Gather insights using monitoring tools and your past experience with microservice-based applications to review, assess and help improve Q-CTRL’s platform reliability and performance.
As a Site Reliability Engineer, you will play a major role in the ongoing development, maintenance and transformation of the observability platform and SRE operations such as traffic forecasting, on-call, establishing SLOs, incident management and production readiness.
Other duties within the Employee's skills and experience, or with reasonable training.

Ideally you'll have:

Familiarity with site reliability engineering, testing practices and production operations.
Experience operating Kubernetes in production, managing helm charts and operators.
Experience supporting, investigating and resolving issues in production environments using logs, metrics and traces.
Worked with continuous improvement and deployment tools such as GitHub Actions or GitLab.
Excellent written and verbal communication skills, with the ability to present complex technical concepts to both technical and non-technical audiences.
A keen eye for improvement and initiative in implementing new technologies and solutions while building things the right way.

It would be fantastic if you have these skills/experience but not essential:

Experience with OpenTelemetry monitoring stacks such as Grafana, Mimir, Tempo and Loki. Familiarity with Google's Site Reliability books and relevant insights.
Experience operating Kubernetes in production, managing helm charts and operators.
Knowledge on how to configure public clouds (AWS or other) using infrastructure-as-code.
Familiarity with the CNCF and its various projects. Linkerd, OpenTelemetry, and Prometheus in particular.

About Q-CTRL

Q-CTRL is the global leader in AI-powered quantum control infrastructure software. We build the tools that make quantum technology useful, solving the hardest challenges in quantum computing and quantum sensing to deliver real-world impact.

Founded in 2017, we operate globally with offices in Sydney, Los Angeles, San Francisco, Berlin, and Oxford. Our teams bring together technical and multi-disciplinary expertise across the product lifecycle, and we’re hiring talent to help scale every part of the business. We work quickly to turn cutting-edge science into deployable technology.

In 2024 we raised US$113 million in Series B funding, the largest aggregate investment for a quantum software company. Six months later we delivered the first commercial quantum advantage with Ironstone Opal, our field-validated quantum navigation solution for defense and industry.

At Q-CTRL, we prioritize outcomes over hours. We offer flexibility, equity potential, and competitive benefits that reflect our high-performance culture. If you’re ready to help shape the future of quantum, we’d love to hear from you!

Please be advised that our communications will only come from the @q-ctrl.com domain. All our active job postings are available on our company website.

To recruitment agencies, we do not accept unsolicited branded profiles and are not responsible for any fees related to unsolicited resumes.

Top Skills

AWS

Github Actions

Gitlab

Grafana

Helm

Kubernetes

Loki

Mimir

Opentelemetry

Prometheus

Tempo

Similar Jobs

Atlassian

Senior Site Reliability Engineer

10 Days Ago

In-Office or Remote

Sydney, New South Wales, AUS

Senior level

Cloud • Information Technology • Productivity • Security • Software • App development • Automation

As a Senior Site Reliability Engineer, you'll work on scaling cloud services, improving reliability, mentoring engineers, and automating tasks for Jira Cloud infrastructure.

Top Skills: AWSAzureGCPGoJavaLinuxPythonUnix

Culture Amp

Senior Site Reliability Engineer

15 Days Ago

In-Office

Sydney, New South Wales, AUS

Senior level

Software

Design, build, and manage the infrastructure for a real-time analytics platform while ensuring reliability and security. Collaborate with teams to understand needs and evolve the platform toward self-service capabilities, while mentoring others and setting quality standards.

Top Skills: AWSCdktfClickhouseTerraformTypescript

Citadel Securities

Site Reliability Engineer

10 Days Ago

In-Office

Sydney, New South Wales, AUS

Mid level

Information Technology • Software • Financial Services

Responsible for real-time support, problem diagnosis, capacity planning, and application migrations in a distributed environment.

Top Skills: BashPythonSQLTcp/IpUdpUnix/Linux

What you need to know about the Sydney Tech Scene

From opera to comedy shows, the Sydney Opera House hosts more than 1,600 performances a year, yet its entertainment sector isn't the only one taking center stage. The city's tech sector has earned a reputation as one of the fastest-growing in the region. More specifically, its IT sector stands out as the country's third-largest, growing at twice the rate of overall employment in the past decade as businesses continue to digitize their operations to stay competitive.