Site Reliability Engineer - (CONTRACT)

 

Recruiter:

PM Connection

Job Ref:

2383262979

Date posted:

Wednesday, September 29, 2021

Location:

Johannesburg, South Africa

Salary:

Negotiable


SUMMARY:
-

POSITION INFO:

Key Purpose:

The Site Reliability Engineer is responsible for driving initiatives proactively leading to high service and platform availability, improved performance and customer experience, enhancing and optimizing monitoring coverage and working with cross-functional teams to proactively build and maintain more reliable services and platforms.

Â

Areas of responsibility may include but not limited to:

  • Design and implement an observability framework across infrastructure, application and services deployed that can be centrally configuration managed and be deployed to all environments.
  • Instrumenting specific java methods or querying values stored in Java Objects for test validation or specific Business metrics.
  • Work with cross-functional teams to identify, evaluate and establish initiatives for improvement to services or processes with the purpose of increased availability, improved service levels, reduced costs, and improved customer satisfaction by reducing the number of operational problems.
  • Participate in various cross-functional forums and lead work streams to contribute to the improvement and implementation of policies, frameworks and standards.
  • Responsible for driving initiatives regarding software automation and reliability.
  • Develop and optimize monitoring framework based on industry best practices by developing metrics (SLI`s), monitoring, and alerting (SLO`s) to observe the health of the production system.
  • Maximize value of tooling and leverage metrics for data driven insights and problem management.
  • Gather and analyse metrics from various monitoring tools covering but not limited to operating systems, infrastructure and applications to assist in performance tuning and fault finding.
  • Facilitate discussions (including technical discussions) to establish root causes and solutions to any infrastructure, application or process related issues.
  • Conduct research to establish more efficient ways of performing day to day activities using new technologies or frameworks and identify opportunities for automation.
  • Conduct trend analysis of data, both systematically and manually to determine common occurrences and recurring issues to feed into the Problem Management processes.
  • Perform impact assessments to determine priority of a problem relative to other problems and business activities.
  • Clear, concise and timely communication with emphasis on expressing technical issues in a non-technical manner to clients and executives.
  • Drive infrastructure and application performance and availability initiatives.
  • Produce and present regular reports on availability, capacity and service performance.
  • Participation and facilitation of Incident Post-mortems.
  • Build and integrate metric collectors such as Prometheus for required metrics.
  • Drive the improvement of availability and performance using quality gates of the build pipeline
  • Liaise with application development teams to improve services and customer experience.

Â

Personal Attributes and Skills:

  • Statistical analysis and reporting
  • Problem solving
  • Root Cause Analysis
  • Business writing (reports) and presentation
  • Tenacity
  • Stress Management
  • Persuasion
  • Coaching
  • Client orientation

Â

Education:

  • Relevant Tertiary qualification (Bachelors’ Degree in IT or Engineering)

Knowledge and Experience:

  • 5 or more years’ experience in a Software Engineering, DevOps Engineer, SRE or Architecture role
  • APM and Infrastructure Monitoring Tool Experience (Prometheus, DynaTrace and Cloudwatch beneficial)
  • Knowledge of Architecture Frameworks, Tools and Standards
  • Experience in Application Performance Monitoring, JVM profiling and Prometheus
  • Extensive experience managing complex and high-volume applications
  • Experience optimizing database, infrastructure and application configurations
  • Experience supporting microservices based applications on a Kubernetes platform
  • Experience with AWS technologies and event driven architecture
  • Experience with event driven orchestration

Â

Kindly regard your application as unsuccessful if you have not heard from the agency within 2 weeks.



 

NB! This job is now closed. You can apply for other jobs by uploading your CV.



 

 

 

Similar jobs you might be interested in:

Platform / DevOps / Site Reliability Engineer
Location: Johannesburg
Salary: 35 000 Monthly
Role: Platform / DevOps / site reliability engineerLocation: Remote but ideally based in Johannesburg, Cape Town, DurbanCompany: Part of a large ICT group, this company offers globally available cloud services, solutions, and platforms for all. Their expertise empowers clients to adopt and migrate to any cloud, wherever they choose. The purpose of the role is to create and manage platforms to gua...
24 days ago


Senior Automation Engineer
Location: Johannesburg
Salary:
9 days ago


Civil Engineer - Regional Lead
Location: Centurion
Salary: 950 000 Annually
A telecommunications company based in Centurion, Gauteng is looking for skilled Civil engineers to join their team.
6 days ago


Senior Solar Electrical Engineer
Location: Johannesburg
Salary:
Are you passionate about renewable energy and are looking for an opportunity to make a significant impact in the solar power industry? We are currently seeking a Senior Electrical Solar engineer to join our dynamic team and contribute to the development of cutting-edge solar energy solutions.
9 days ago


Accountant
Location: Pretoria
Salary:
19 days ago


Technical Site Technician 2 Month Contract
Location: Johannesburg
Salary:
26 days ago


Regional Maintenance Manager
Location: Johannesburg
Salary:
Are you an experienced professional in the logistics industry with strong leadership skills?Our global client is in search of a Regional Maintenance Manager to oversee and optimize maintenance activities across the designated region. If you're adept at planning, coordinating, and ensuring the execution of all maintenance tasks while maximizing asset availability, we invite you to apply!!
20 days ago


Electrical Engineer
Location: Cape Town
Salary:
Salary:  Negotiable, depending on experience + Discretionary Bonusses Please note:  The salaries offered by our clients are determined in accordance with market standards, while considering the candidate's qualifications, skills, and level of experience. Working hours: Monday – Friday 07:30- 16:30 Opportunity for experienced Intermediate - Senior Electrical engineer ...
28 days ago


Electrical Engineer
Location: Pretoria
Salary:
Salary:  Negotiable, depending on experience + Discretionary Bonusses Please note:  The salaries offered by our clients are determined in accordance with market standards, while considering the candidate's qualifications, skills, and level of experience. Working hours: Monday – Friday 07:30- 16:30 Opportunity for experienced Senior Electrical engineer with Solar indu...
29 days ago


Cloud Database Engineer (Senior)
Location: Midrand
Salary:
Join our dynamic team as a Senior Cloud Database engineer, where you'll play a pivotal role in designing and implementing cutting-edge cloud-based database solutions. With a focus on Oracle database architecture and cloud technologies, you'll collaborate closely with cross-functional teams to ensure the reliability, security, and compliance of our database systems. If you're passionate about stayi...
2 days ago


Create a free job alert for Site Reliability Engineer - (CONTRACT) in Johannesburg

Enter your email address below and we will email you similar jobs when they become available:

You can cancel at any time. We will not spam you.
By giving us your email address your agree to our Terms and Conditions