Aussie Broadband

Site Reliability Engineer

Posted 8 Days Ago
Australia
Mid level
The Site Reliability Engineer ensures system availability and performance, automates solutions, collaborates on design and documentation, and enhances observability for critical services.
The summary above was generated by AI
Aussie Broadband’s (ABB) purpose is to change the game.
As our Site Reliability Engineer, you'll support this by ensuring the availability, reliability and performance of our systems and infrastructure.

At Aussie Broadband we believe difference is something to celebrate. Being advocates for Inclusion and Diversity means our team can bring their whole selves to work and allows us to better represent our customers and the communities that we serve. As a proud Equal Opportunity Employer, supporting and celebrating difference is just one way that we demonstrate our value of ‘Be good to people’ every day.

Join us as we continue to grow and make a mark as the 5th largest telco in Australia!

Why work for Aussie? 

Founded in regional Victoria almost 20 years ago, we are local from the ground up. What started in a living room in Morwell has now expanded to every corner of Australia - we’re growing fast and not slowing down!

Our fantastic culture lives and breathes our values: 

  • Don't be ordinary, be awesome

  • Think BIG

  • No bullsh*t

  • Be good to people

  • Have fun

We are proud to be a B Corp Certified company, which means we’re good to our people, our customers, and the planet by maintaining the highest standards for social and environmental performance, transparency, and accountability.

We care about our community through our Pledge 1% commitment, sponsorship programs and our paid staff community service leave offering.  

But don’t just take our word for it – we have been named one of the top employers in Australia by HRD magazine.

The good stuff

  • 26 weeks paid parental leave for both primary and secondary caregivers (in addition to any government-paid leave)

  • Discounted internet up to the value of $109 per month

  • 20% off our Mobile services 

  • Day-to-day benefits like flexible working arrangements, Employee Assistance Program (EAP), discounts with big names like Specsavers, HCF and many more

  • Celebrating you! With monthly rewards and recognition

  • Internal training and resources for you to continue to learn, grow and achieve your career goals

  • Yearly allowance for amazing Aussie merch

  • Fitness Passport for access to multiple gyms and pools across Australia

Let’s talk about you

As our Site Reliability Engineer, you will play a key role in ensuring automated, scalable, resilient systems, which are critical to enabling the next wave of growth for Aussie Broadband. To be successful in this role you will possess:

  • An understanding of the telco industry, and reference platform architectures for cloud, telco, IaaS and CaaS.

  • A Software Engineering background, or suitable experience writing well-tested and maintainable code following best practices.

  • Proficiency in either Go or Python. PowerShell experience is a bonus.

  • Experience with cloud platforms (AWS, Azure) and container orchestration (Kubernetes).

  • Experience with SUSE Rancher, Harvester and related components is highly desirable.

  • Experience with PostgreSQL and other SQL databases.

  • Strong knowledge of Linux systems and networking.

  • Experience with distributed systems, HA architectures and fault-tolerant systems.

  • Familiarity with CI/CD tools (GitLab).

  • Experience with observability tools (Prometheus, Grafana, Datadog).

  • Knowledge of security best practice for infrastructure and software.

How will you support our “Why?”

Our Site Reliability Engineer will collaborate with other teams to design and maintain reliable, scalable, and resilient systems for a variety of workloads. In addition, your responsibilities will include:

  • Designing, building and implementing automated solutions that enhance the reliability and uptime of critical services.

  • Enhancing observability by implementing and improving monitoring, alerting and logging solutions to proactively detect issues before they create impact.

  • Participating in on-call rotations to troubleshoot and resolve production issues quickly and efficiently.

  • Analysing and improving the performance of services and systems, balancing varying business-driven outcomes (cost, utilisation, throughput, etc.) with reliability.

  • Identifying bottlenecks at various levels of the stack and collaborating with other teams to fix them.

  • Designing and maintaining our infrastructure platforms through the use of IaC toolsets like ArgoCD, Ansible or Terraform to ensure consistency and automation.

  • Monitoring resource usage and collaborating with other teams to plan for growth, ensuring our infrastructure and applications scale with demand.

  • Maintaining clear, detailed documentation for processes, incident responses, post-incident reviews and infrastructure configurations.

  • Increasing the capability of Aussie Broadband by mentoring staff and collaborating with software development and other operations teams to foster a culture of shared ownership and continuous improvement.

Ready to join?

Hit the apply button to submit your application and our fantastic team will be in touch!

Even if you feel you don’t meet all the requirements, we’d still love to hear your story. We like to think outside the box with the people we hire.

If you have any questions, get in touch today with our team at [email protected]

Just a heads up, we can’t take applications through email, so make sure you apply via the job link we've set for this role, so you don't miss out!

Top Skills

Ansible
ArgoCD
AWS
Azure
Datadog
GitLab
Go
Grafana
Kubernetes
Linux
Postgres
PowerShell
Prometheus
Python
SQL
SUSE Rancher
Terraform
