Site Reliability Engineer

Posted 8 Hours Ago
Sydney, New South Wales
Hybrid
3-5 Years Experience
eCommerce • Fintech • Information Technology • Insurance • Software
Cover Genius protects millions of customers of the world’s largest online companies. Our goal is to protect all of them.
The Role
Site Reliability Engineers at Cover Genius improve the reliability and performance of production systems, automate platform operations, and collaborate with software engineers on deployments and monitoring. Their tasks include developing observability tools, troubleshooting production issues, and managing cloud infrastructure on AWS and GCP.
Summary Generated by Built In

The Company


Cover Genius is a Series E insurtech that protects the global customers of the world’s largest digital companies including Booking Holdings, owner of Priceline, Kayak and Booking.com, Intuit, Uber, Hopper, Ryanair, Turkish Airlines, Descartes ShipRush, Zip and SeatGeek. We’re also available at Amazon, Flipkart, eBay, Wayfair and SE Asia’s largest company, Shopee. Our partners integrate with XCover, our award-winning insurance distribution platform, to embed protection for millions of customers worldwide each year.

 

Our team and products have been recognized with dozens of awards, including by the Financial Times, which ranked Cover Genius as the #1 fastest-growing company in APAC in 2020. Our diverse team, spread across 20+ countries and many language groups, commits itself to diverse cultural programs, in particular “CG Gives”, which makes social entrepreneurs out of us all and funds development initiatives in global communities.


Our People are

Bold, Authentic, Purposeful and Inspired


Our People are not

Perfect, Traditional, Complacent or Cautious


About the role:


The primary responsibility of Site Reliability Engineers is to ensure the reliable operation of production systems. In addition, Site Reliability Engineers work across a wide range of technical areas to automate and improve platforms and operations, including:

- Release processes

- Observability

- Security

- Core Network & Infrastructure

- Datastores & Disaster Recovery


They continually monitor the system’s health and control security, sharing ownership of production workloads with software engineering teams. Along with Software Engineers, SREs are responsible for writing and maintaining technical documentation such as tutorials, guides, and blameless post-mortems. SREs also design and create information dashboards based on logging and monitoring data. They are key team members in helping automate, scale and drive efficiency across the technology products & platforms.

Main Duties & Responsibilities:

  • Analyze, test and modify systems to improve reliability and optimize performance, particularly at an architectural/infrastructure level
  • Develop and maintain observability tooling and dashboards
  • Implement automation tools, frameworks and CI/CD pipelines, and reduce toil
  • Troubleshoot production issues and coordinate with the development team to streamline code deployments
  • Apply AWS and GCP knowledge and skills to create & maintain cloud infrastructure for software projects
  • Design, develop and implement software integrations
  • Collaborate with Software Engineers and other team members with the goal of improving engineering tools, systems, procedures and data security
  • Develop and maintain design and troubleshooting documentation and runbooks
  • Optimize and control costs of the company’s computing infrastructure

To be successful in this role you will bring:

  • Understanding of SRE Principles and best practices
  • Experience using & configuring modern observability tools such as ELK/EFK, Prometheus, Grafana
  • Comfortable scripting & developing internal tooling with Bash and at least one programming language (e.g. Python, Go)
  • Experience working with infrastructure & configuration as code tools such as Terraform, CloudFormation, Chef, Puppet etc.
  • Experience with container technology such as Docker, and ideally with using and managing Kubernetes clusters
  • Experience working with Linux
  • Solid understanding of networking and system architecture
  • Solid understanding of how to deploy, scale and monitor web applications and databases
  • Good knowledge of AWS and/or GCP platforms and associated best practices
  • Bachelor’s degree in Computer Science/Engineering or equivalent practical experience
  • Strong communication and documentation skills
  • Curious and self-motivated learner
  • Professional approach
  • Good team member
  • Organisational and time management skills
  • Excellent attention to detail
  • Positive approach to change

Top Skills

Go
Python
The Company
New York, NY
600 Employees
Hybrid Workplace
Year Founded: 2014

What We Do

Cover Genius is the insurtech for embedded protection. Together, we protect the global customers of the world’s largest digital companies including Booking Holdings, owner of Priceline and Booking.com, Intuit, Hopper, Skyscanner, Ryanair, Turkish Airlines, Descartes ShipRush, Zip and SeatGeek. We’re also available at Amazon, Flipkart, eBay, Wayfair and SE Asia’s largest company, Shopee. Cover Genius’ vision is to protect all the customers of the world’s largest online companies through XCover, an award-winning global distribution platform for any line of insurance or warranty, with an API for instant claims payments that holds an industry-leading NPS of +65.
Cover Genius and its partners co-create solutions that embed protection that’s licensed or authorized in over 60 countries and all 50 US States.

Why Work With Us

We are a vibrant international team that promotes inclusivity and celebrates our differences. We are growing fast, provide our employees with professional development opportunities and promote from within through our bi-annual performance review cycles. We are bold enough to take chances, to challenge the status quo and to inspire each other.
