Q-CTRL Logo

Q-CTRL

Senior Site Reliability Engineer

Posted 6 Days Ago
Be an Early Applicant
Hybrid
Sydney, New South Wales
Senior level
Hybrid
Sydney, New South Wales
Senior level
The Senior Site Reliability Engineer will focus on enhancing application reliability, performance, and availability within a Kubernetes environment through quality processes and testing tools, while leading incident management and production readiness efforts.
The summary above was generated by AI

About us

Founded in 2017, Q-CTRL has grown to become the global leader in quantum.  We’re using control to solve the hardest problems facing quantum technology, improving hardware performance and accelerating pathways to useful quantum computers and other technologies. As a product-led company, we bring together diverse teams such as product, design, engineering and research to help achieve our mission of making quantum technology useful.  Join us to help shape the quantum future.


As one of the fastest growing companies in the quantum sector, we’ve had a number of key milestones:


- In November 2023, we announced an industry-first partnership with IBM Quantum Services, natively integrating our performance management software with all IBM quantum computers.  Building off of this relationship, in September 2024 we started offering two services via IBM’s new Qiskit Functions Catalog as an inaugural partner.

- Designed and moved our Global HQ offices and lab space into the first purpose-built (and award winning) commercial and research facility for a quantum technology company in Australia.

- Continued to deliver real world outcomes across the quantum sectors, with our work with Australian Defence on software-ruggedized quantum sensing for navigation without GPS, as featured in the New York Times.

- In October 2024, we announced our record breaking expansion of our Series B funding round to USD $113M, with $59M USD of new capital.

- Grew our global presence to include Los Angeles, Berlin, and Oxford - as well as the recently announced office in San Francisco.


From educating the workforce on how quantum computing works, to building the next generation of quantum sensors, to delivering massive performance gains for end-users, it all starts with hiring the right talent. If you want to help us build the Quantum future, read on.


About the role

The Q-CTRL Platform has grown from a single product with one application that could run on a single container to a growing list of products in a rapidly expanding and highly distributed Kubernetes environment. The Q-CTRL Infrastructure team is expanding to bring on an experienced software developer with a focus on quality, performance testing and an interest in cloud infrastructure. They will focus on ensuring our applications remain available, performant, reliable, scalable with strong inspiration from the tenets of Site Reliability Engineering (SRE).

What you'll be doing:

  • Reporting to the Engineering Manager of the Infrastructure team, you will play a leading role in our quality processes, such as our testing guild, and planning for the Q-CTRL SaaS platform. As a result, engineering managers and their teams will have state-of-the-art performance testing tooling and resources. 
  • Gather insights using monitoring tools and your past experience with microservice-based applications to review, assess and improve Q-CTRL’s platform reliability and performance. You will have the opportunity to push our software to its limits.
  • Bringing our kubernetes-based testing environment to the next level and becoming a leader supporting continuous improvement of our quality and testing practices within the whole of Engineering.
  • As a Site Reliability Engineer, you will play a major role in the ongoing development, maintenance and transformation of the observability platform and SRE operations such as traffic forecasting, on-call, incident management and production readiness.
  • Other duties within the Employee's skills and experience, or with reasonable training.

Ideally you'll have:

  • Honed software engineering skills and familiarity with site reliability engineering, testing practices and production operations.
  • Experience developing and testing software for distributed microservice applications as well as experience supporting, investigating and resolving issues in production environments using logs, metrics and traces.
  • Past experience with performance testing tools, such as Grafana, k6 and Locust, which has led to demonstrable improvements in performance and reliability.
  • Experience in identifying potential problems and performance “pinch points” in software architecture in order to influence design, monitor, test, as well as mentor others.
  • Worked with continuous improvement and deployment tools such as GitHub Actions or GitLab.
  • Excellent written and verbal communication skills, with the ability to present complex technical concepts to both technical and non-technical audiences.
  • A keen eye for improvement and initiative in implementing new technologies and solutions while building things the right way.

It would be fantastic if you have these skills/experience but not essential:

  • Experience with OpenTelemetry monitoring stacks such as Grafana, Mimir, Tempo and Loki. Familiarity with Google's Site Reliability books and relevant insights.
  • Experience operating Kubernetes in production, managing helm charts and operators.
  • Knowledge on how to configure public clouds (AWS or other) using  infrastructure-as-code.
  • Familiarity with the CNCF and its various projects. Linkerd, OpenTelemetry, and Prometheus in particular.

Why Q-CTRL?


Flexibility: We embrace workplace flexibility so you worry more about your impact vs a rigid work schedule.

Attractive salary: You’ll get to have the start-up impact without the start-up wages.

Equity: We want people to have a sense of ownership in what they do and offer the potential for equity share and annual bonuses.

Cash bonus: We recognize exceptional performance and impact by offering annual discretionary cash bonuses.

Resources: We are well funded by the world’s best technology investors, letting us chase our ambitions with minimal constraints.

Parental support: We offer paid parental leave to support you and your loved ones.

Diversity:  We’re an equal opportunity employer and actively support initiatives like the ‘Global Women in Quantum’ program to help expand the quantum workforce.

Unique culture: You’ll be surrounded by some of the world’s leading physicists, engineers, product, marketing and design people (to name a few!) with a strong desire to learn and transfer knowledge.

Meaningful values: You’ll work with an incredibly supportive team who work consistently to deliver our core values to be real, be trusted, be just and to be revered. 

Personal development: We provide you with a personal development and wellness budget. 

Make a dent: Last but not least you’ll have the unique opportunity to help set the direction for this revolutionary technology and truly make an impact that matters!


Q-CTRL aims to bring together cross-functional teams from many different backgrounds to help achieve our goals - we  strongly encourage you to apply even if you do not meet all of the requirements mentioned in the job posting.


Please be advised that our communications will only come from the @q-ctrl.com domain. All our active job postings are available on our company website.


To recruitment agencies, we do not accept unsolicited branded profiles and are not responsible for any fees related to unsolicited resumes.

Top Skills

AWS
Github Actions
Gitlab
Grafana
K6
Kubernetes
Locust
Opentelemetry
Prometheus

Similar Jobs

13 Days Ago
Sydney, New South Wales, AUS
Senior level
Senior level
Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Join Atlassian as a Senior Site Reliability Engineer to enhance Jira Cloud's infrastructure and ensure high reliability and performance by leveraging cloud services and mentoring team members.
Top Skills: AWSAzureGCPGoJavaLinuxPythonUnix
19 Days Ago
Hybrid
3 Locations
Senior level
Senior level
Cloud • Fintech • Information Technology • Machine Learning • Software
Lead incident management processes, troubleshooting AWS services, and promoting SRE principles. Drive operational reliability and foster a culture of continuous learning within the team.
Top Skills: AWSBgpDnssecIpsecPythonSsl/TlsTcp/Ip
9 Days Ago
Sydney, New South Wales, AUS
Mid level
Mid level
Cloud
As a Senior Site Reliability Engineer, you'll ensure production systems are reliable and operational, collaborating with multiple teams to improve incident response and infrastructure resilience.
Top Skills: AWSAzureDockerGoHTTPKey/ValueKubernetesNo-SqlShell ScriptingSQLTerraformWeb Sockets

What you need to know about the Sydney Tech Scene

From opera to comedy shows, the Sydney Opera House hosts more than 1,600 performances a year, yet its entertainment sector isn't the only one taking center stage. The city's tech sector has earned a reputation as one of the fastest-growing in the region. More specifically, its IT sector stands out as the country's third-largest, growing at twice the rate of overall employment in the past decade as businesses continue to digitize their operations to stay competitive.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account