AlayaCare Logo

AlayaCare

Senior Site Reliability Engineer

Posted 4 Days Ago
Be an Early Applicant
In-Office
Sydney, New South Wales, AUS
Senior level
In-Office
Sydney, New South Wales, AUS
Senior level
The Senior Site Reliability Engineer will enhance SaaS reliability, automate infrastructure, monitor systems, and support incident response, collaborating with product and engineering teams.
The summary above was generated by AI

We’re hiring a Senior Site Reliability Engineer!

 

  • 🗓️ Full-time | Permanent 
  • 📍 Preferred locations: Sydney, Brisbane, or Melbourne. Open to Perth based for the right candidate! 
  • 🏡 Hybrid working: 2 days in office, 3 days WFH  

 

Does a competitive salary package with company stock, five wellness days per year, a flexible benefits package of $1000 per year and a fantastic team culture spark your interest? 

 

👋 Meet AlayaCare! We’re a fast-growing SaaS scale-up on a mission to transform aged and disability care across Australia, Canada, the US and beyond. Our platform helps care providers deliver exceptional service in homes, communities, and residential settings. 

We’re big on Tech with Purpose and passionate about improving lives - all while having a little fun along the way (we’ve been known to enjoy a team lunch or three). 

 

 The Role: 

 

We’re on the lookout for a Senior Site Reliability Engineer who’s ready to bring their self-starting nature, AWS experience & analytical mind to the table. Reporting to the SRE, Engineering Manager, you'll help drive the reliability of our live SaaS solutions across the region. 

 

Your days will involve: 

Development, Automation, and Tooling 

  • Design, build, and maintain infrastructure and platform services, including Kubernetes and observability tooling 
  • Implement infrastructure as code, configuration management, and automated testing to ensure reliable, repeatable environments 
  • Contribute to code and configuration reviews to improve scalability, maintainability, and reuse. 
  • All the above using AI first mindset and development tooling such as Cursor and Kiro. 
  • Build and tune AI Agents to accelerate delivery, and automate repetitive tasks. 

 

Reliability and Operations 

  • Monitor production systems, troubleshoot issues, and improve logging, monitoring, alerting, and runbooks 
  • Participate in on-call rotations, incident response, and post-incident reviews to improve long-term reliability.  

 

Requirements and Collaboration 

  • Partner with Product, Engineering, and development teams to translate requirements into practical infrastructure solutions. 
  • Identify risks related to operability, security, performance, and cost, and recommend appropriate trade-offs.  

 

Continuous Improvement 

  • Contribute to operational quality through runbooks, security practices, performance tuning, and process improvements 
  • Proactively identify issues, raise concerns, and stay current with emerging SRE practices and technologies. 

 

You’ll thrive in this role if you: 

  • Bring 5+ years of experience in SRE/DevOps or a similar role 
  • Believe that AI agents can be better than humans at certain tasks and are interested in maximizing their usage 
  • Have solid hands-on experience with AWS and Terraform 
  • Have practical experience running workloads on Docker & Kubernetes  
  • Are known for your problem-solving skills and your ability to work well autonomously 
  • Are proficient in at least one development or scripting language (such as Python, Go, Bash) 
  • Have practical knowledge of infrastructure as a code (CloudFormation or Terraform) 
  • Have some knowledge of APM, logging & metrics systems (New Relic, Prometheus or ELK)  
  • Have some background knowledge of system & network security fundamentals 
  • Have some experience participating in incident management 

 

Bonus points if you: 

  • Have knowledge of databases (such as MySQL/PostgreSQL) 
  • Have Azure experience 
  • Have knowledge of the aged or disability care sector 
  • Have previous experience with AlayaCare/Procura products 

 

We believe great work should be rewarded. Here’s how we show our appreciation: 

 

  • 🏡Choose your own 2 days/week in office, 3 days WFH 
  • 💰 Competitive salary + company stock (RSUs) 
  • 🧘 5 Wellness days per year  
  • 💳 $1,000/year flexible benefits package  
  • 👶 22 weeks company-paid parental leave 
  • 🧡 2 days company-paid volunteer leave to support causes you care about 
  • 🍕 Team lunches, events & wellness activities 
  • 🤝 A genuinely open, inclusive, and collaborative culture 
  • 💡 A chance to do purposeful work in the fast-paced tech sector, whilst making real impact in the care space.  

 

Belonging matters. 

 

We’re committed to building an organisation that reflects the communities we serve. Diversity, equity, inclusion, and accessibility aren’t just buzzwords here, they’re woven into everything we do.  

 

Need adjustments to participate in the recruitment process? We’ve got you. Just reach out to our HR team: hr[email protected]. We do not accept unsolicited CVs from Recruitment Agencies. 

Top Skills

AI
AWS
Bash
Docker
Elk
Go
Kubernetes
New Relic
Prometheus
Python
Terraform

Similar Jobs

11 Days Ago
In-Office
Sydney, New South Wales, AUS
Senior level
Senior level
Information Technology
The role involves building and leading a Site Reliability Engineering team, overseeing platform health, coordinating incident management, and ensuring operational efficiency through metrics-driven decision making.
Top Skills: Multi-Cloud SolutionsOpen-Source Technology
21 Days Ago
In-Office
Sydney, New South Wales, AUS
Senior level
Senior level
AdTech
Design and scale a global network platform, troubleshoot complex network issues, lead postmortems, build automation tools, and collaborate with teams to enhance operational excellence.
Top Skills: Arista EosBgpCalicoCiliumCisco IosDockerEnvoyGoGrafanaHaproxyJunosKubernetesNginxNokia Sr LinuxOsi ModelOspfPrometheusPythonSonicTcp/Ip
11 Days Ago
Remote or Hybrid
Sydney, New South Wales, AUS
Senior level
Senior level
Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Big Data Analytics • Automation
Join Dynatrace as a Sr Site Reliability Engineer to automate tasks, optimize capacity, manage product releases, and ensure system reliability.
Top Skills: AWSAzureCGCPGoJavaKubernetesPythonShell

What you need to know about the Sydney Tech Scene

From opera to comedy shows, the Sydney Opera House hosts more than 1,600 performances a year, yet its entertainment sector isn't the only one taking center stage. The city's tech sector has earned a reputation as one of the fastest-growing in the region. More specifically, its IT sector stands out as the country's third-largest, growing at twice the rate of overall employment in the past decade as businesses continue to digitize their operations to stay competitive.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account