Site Reliability Engineer

 

Recruiter: Summit Africa Recruitment

Job Ref: 2278161761

Date posted: Thursday, June 16, 2022

Location: Cape Town, South Africa


JOB DESCRIPTION:

Site Reliability Engineering is an engineering discipline devoted to helping an organization sustainably achieve the appropriate level of reliability in their systems, services, and products. The team plays a crucial role in our mission to reduce emergency response times and improve public safety.

Our client is looking for a Site Reliability Engineer to join a team responsible for monitoring our production systems 24/7. We are hiring for our US-based overnight shifts, with weekend flexibility. Your primary responsibility will be to provide support during incidents and to manage communications and escalations around them. You will monitor our entire platform infrastructure and applications. You must be comfortable performing under pressure with tight deadlines and communicating with larger audiences. Your other responsibilities will be to build monitoring and alerting tools that track the availability, performance, and overall health of our services, with scalability and automation in mind.

Responsibilities:
• Work with DevOps and DBA teams to support Cloud infrastructure.
• Work with Analytics team to support Eclipse Analytics.
• Work with Platform and other Development teams to support Nimbus/Radius front end applications and back end services.
• Work with IoT Team to support IoT Devices.
• Work with Customer Support team to provide technical support for customer reported issues.
• Work with QA and Implementation teams to provide insight on application and infrastructure performance with future releases.
• Take part in a scheduled on-call rotation, which includes receiving alerts from monitoring systems as well as internal escalations.
• Build and improve monitors and alerts to increase visibility of system health.
• Build tools or automation that improve the efficiency of the SRE role or increase monitoring capabilities.
• Troubleshoot technical issues with infrastructure and applications.
• Act as Incident Commander when incidents are created: escalate to other teams, serve as the central communication channel across teams, and record a detailed timeline of the actions taken during the incident.
• Produce Root Cause Analysis reports for customers.
• Write post-mortems for Incidents and review with internal teams.

Skills/Experience
• Bachelor's degree in Computer Science, Management Information Systems, or equivalent field with 1-2 years’ experience as a Site Reliability Engineer
• Experience with Cloud services, preferably Azure (Application Insights, Logging, and Monitoring)
• Background as a reliability engineer, DevOps engineer, or software engineer
• Familiarity with distributed systems and microservices
• Understanding of front end and back end architecture
• Experience with SQL databases
• Experience with Datadog or other monitoring and logging tools
• Programming/scripting skills in a major language or platform such as .NET or PowerShell
• Experience with deployment tools such as Terraform, Ansible, or Puppet
• Experience with Kubernetes
• Strong communication skills

Behavioural competencies required
• Work well under pressure
• Good communication skills (written and verbal)
• A good problem solver
• Have an inquisitive nature

 

NB! This job is now closed. You can apply for other jobs by uploading your CV.



 

 

 
