ING

Lead Engineer Site Reliability Engineering (Observability)

Posted 9 Days Ago
Sydney, New South Wales
Mid level
The Lead Site Reliability Engineer - Observability is responsible for ensuring the resilience and reliability of customer-facing systems, focusing on observability. This role involves designing engineering solutions, managing SRE Observability and Incident Response platforms, and evaluating technological developments to support the organization's strategy.
The summary above was generated by AI

The Centre of Expertise (CoE) for Site Reliability Engineering (SRE) supports the organisation's strategy by enabling SRE capabilities towards continuous focus on system health, reliability, availability, capacity, performance, continuity, and management of IT services.

The Lead Site Reliability Engineer - Observability role is a key hands-on, multi-skilled role responsible for ensuring the resilience and reliability of key journeys on our customer-facing systems, with a specific focus on observability.

The bank is undergoing major digital and cloud transformations, and modern observability is central to their success.

This role leads the engineering of the new observability platform and the migration to it from existing technologies.

What you’ll do 

  • Assist in designing complex and/or innovative engineering solutions and the associated validation process to enable the realization of a problem solution or design brief.

  • Discuss and recommend more complex or innovative technical developments to improve the quality of all workloads and supporting infrastructure to better meet our customers’ needs.

  • Own and run the SRE Observability and Incident Response platforms, including feature planning and design/build, platform workload onboarding, agent release planning and more.

  • Identify new external developments and/or emerging issues within an area of technology or business function and evaluate their potential impact on, or usefulness to, the organization.

  • Communicate the actions needed to implement the function's strategy and business plan within the team; explain the relationship to the broader organization's mission, vision and values.

What we’re looking for 

  • Expert-level Splunk Enterprise experience

  • 4-5 years of functional Azure DevOps experience; Azure Cloud infrastructure experience highly regarded

  • Deep understanding and experience with Linux Platforms & Containerization Platforms

  • Experience with deploying and scaling OpenTelemetry software infrastructure (agents, collectors)

  • Expertise in Infrastructure as Code (Terraform, Ansible, etc.) and Fluent Bit

  • A proven track record of curiosity, problem solving, adaptability and excellent communication skills.

What’s in it for you?

Drop everything and learn with over 16,000 professional and personal development courses to choose from

Discounted ING Health Insurance

An additional Rest Day to support your wellbeing

An IMPACT day to volunteer on an approved sustainability activity

About Us
At ING, we want to make life simpler and more worthwhile – for everyone who banks with us, for the people who work with us, and the community at large, too. 
 
When you come to work at ING, you’re joining a team where individuality isn’t just accepted, it’s encouraged. We’ve built a culture that’s fun, friendly and supportive – it’s the kind of place where you can be yourself and make the most of whatever you have to offer. 
 
We give people the freedom to think differently, take ownership of their work, and make great things happen. We’re here to help you get ahead. And with our global network, there’s plenty of scope to take your career in new directions, perhaps even ones you’ve never considered. 
 
We are all about celebrating success and as a result we are proud to be a WGEA Employer of Choice for Gender Equality and a certified Family Inclusive workplace.
 
Sound like the kind of place you’d feel at home? We’d love to hear from you. 
 
(One last thing, ING operates a direct talent sourcing model. So no agency introductions, please.) 

Applications close 16th December 2024
 
Before you apply
 
Here at ING we consider employee development to be important and encourage existing employees to apply for suitable internal positions. It is expected that any employee applying for a vacant position would have been in their current role for a minimum of twelve (12) months before applying. This may be waived in special circumstances and after consultation with your manager.

Still in two minds?

At ING, we know that diversity drives innovation. Research reveals that 60% of women and underrepresented groups may pause at this stage, even after starting their application. Don’t miss out on the opportunity to bring your unique perspective to our team - submit your application today!

Top Skills

Ansible
Azure
Linux
OpenTelemetry
Splunk
Terraform
