Solace Logo

Solace

Senior Cloud Site Reliability Engineer

Reposted 16 Days Ago
Be an Early Applicant
In-Office
Ottawa, ON
Senior level
In-Office
Ottawa, ON
Senior level
The Senior Cloud Site Reliability Engineer will ensure the health of Solace Cloud services, manage production incidents, optimize operations, and implement infrastructure tooling across multiple cloud platforms.
The summary above was generated by AI

Solace helps companies connect and integrate all of their assets through the power of event-driven architecture. Our technology makes it easy to unlock data silos and capture events occurring across large enterprises; stream information about those events everywhere it needs to be in real-time; and give the apps, AI agents and people who receive it the power to immediately react with decisive actions and smart decisions. 

  

Many of the world’s biggest companies trust Solace to modernize their IT infrastructure by embracing trends like AI, cloud and IoT so they can create awesome experiences for their customers, partners and employees. 

  

So, the next time you drive a car, order furniture online, fly in a plane, check your bank balance on your phone, your positive experience could be a direct result of our technology—and your hard work 
 

Overview 

This position is for a Senior Cloud Site Reliability Engineer. You will be responsible for the daily operations of
Solace Cloud, our market-leading SaaS offering, across leading cloud providers and platforms such as Amazon Web Services, Microsoft Azure, Google Cloud Platform, Kubernetes, etc. 

What You Will Do: 

  • Ensuring that the Solace Cloud Services are healthy and reliable, and that SLAs are being met 
  • Design and implement our infrastructure tooling, observability, and automation 
  • Contribute to making the production operations more efficient, less error-prone, etc. 
  • Expert-level knowledge in handling production Incidents in production-grade multi-cloud environments according to industry-standard Incident management process 
  • Process handling service requests and provisioning by the customers. 
  • Proven ability to manage customer escalations and drive resolution in mission-critical, high-impact production environments 
  • Work directly with customers to identify, troubleshoot, and resolve operational issues. 
  • Expert debugging knowledge in Linux and Kubernetes to detect operational issues. 
  • Be on-call rotation and provide 24x7 off-hours support 

 

Ideally, You Will Be: 

  • Highly technical, excited by technology, and eager to stay up to date in a rapidly evolving environment. 
  • Expert-level knowledge in Cloud Networking Solutions 
  • Knowledgeable in demonstrating the ability to debug at a system level and resolve incidents in complex cloud-based environments 
  • Expert in Site reliability engineering and Incident response 
  • A strong communicator who can articulate complex technical issues clearly and concisely & get on the phone with customers. 
  • Experienced in SaaS operations and customer-facing technical support 

 

Required Skills: 

  • Proven expertise with public cloud providers (AWS, Azure, GCP) services & features
  • Proven expertise with cloud Kubernetes infrastructure platforms such as AWS Elastic Kubernetes Service, Azure Kubernetes Service, Google Kubernetes Service 
  • Hands-on experience with Monitoring tools like Datadog, Kibana, Prometheus etc. 
  • Hands-on experience with Infrastructure Automation using Terraform, Cloud Formation 
  • Hands-on expertise in debugging production alerts  
  • Expert-level understanding of Linux Operating Systems 
  • Programmer in languages such as Groovy, Python, and Go 
  • Certified Kubernetes Administrator 
  • Certified Cloud Administrator (AWS, Azure, or GCP) 

 

Why You’ll Love Working at Solace

At Solace, we’re all about smart people, meaningful work, and good vibes.

  • Work with brilliance – Our team is packed with some of the sharpest minds in the industry.
  • Balance matters – We believe work should fit into your life, not the other way around.
  • Hybrid-first – Flexibility is built into how we work, so everyone feels included and empowered.
  • Values-driven – We live and breathe our core values: craftsmanship, trust, courage, freedom, momentum, humility, and human experience.
  • Growth mindset – Our training programs are designed to help you level up, fast.
  • Customer love – We’re proud of our world-class customer lineup (and we’re not shy about it).
  • Keep it fun – We’re social, we keep things simple, and we know how to have a good time.
  • Creative culture – We’ve got a great sense of humour and we make cool videos on topics like MITT and this (check them out!).

At Solace, we are committed to a fair, inclusive, and transparent recruitment process.
To help identify candidates whose qualifications best align with the role, we use artificial intelligence (AI) tools during the initial stage of resume screening. These tools compare submitted resumes to the job description, focusing on education, experience, and skills.


Importantly, all decisions beyond this initial screening—including interviews and final hiring—are made by our human recruitment team. AI is never used to make final hiring decisions.


Let’s Talk

Not sure you meet every requirement? That’s okay — we’re more interested in your potential and passion. If this role excites you, we’d love to hear from you.


Need accommodations during the hiring process? Just let us know — we’re here to support you.


Thanks to everyone who applies! While we wish we could connect with every candidate, only those selected to move forward will be contacted.


At Solace, we believe that diversity and inclusion drive innovation and growth, both in business and in life. We strive to create an enriching and safe workplace where you can be who you are. If you want to do the best work of your career and feel supported every step of the way, we encourage you to join us!


 

 

Top Skills

AWS
Azure
Cloud Formation
Datadog
GCP
Go
Groovy
Kibana
Kubernetes
Prometheus
Python
Terraform

Similar Jobs

52 Minutes Ago
In-Office
Toronto, ON, CAN
Senior level
Senior level
Food • Retail • Agriculture • Manufacturing
The Director of Innovation will develop and lead the innovation strategy and processes, manage projects, and engage teams to drive consumer-focused innovation aligned with brand objectives.
Top Skills: Consumer InsightsMarketingProject Management
53 Minutes Ago
Hybrid
2 Locations
Mid level
Mid level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The Marketing Segment Manager will develop strategic plans for the Cadillac brand, analyze market dynamics, and collaborate with various teams to optimize marketing efforts and product launches.
Top Skills: ExcelMicrosoft Powerpoint
2 Hours Ago
Remote or Hybrid
Toronto, ON, CAN
Expert/Leader
Expert/Leader
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Principal Engagement Manager oversees project delivery, manages engagement governance, and collaborates with teams to ensure successful outcomes. They lead delivery teams, track progress, mentor staff, and align resources to achieve customer goals while leveraging AI and managing project scope, risks, and finances.
Top Skills: AIServicenow

What you need to know about the Sydney Tech Scene

From opera to comedy shows, the Sydney Opera House hosts more than 1,600 performances a year, yet its entertainment sector isn't the only one taking center stage. The city's tech sector has earned a reputation as one of the fastest-growing in the region. More specifically, its IT sector stands out as the country's third-largest, growing at twice the rate of overall employment in the past decade as businesses continue to digitize their operations to stay competitive.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account