Staff Site Reliability Engineer |SRE DevOps

RANDSTAD PTE. LIMITED
5 hours ago
Posted date5 hours ago
N/A
Minimum levelN/A
EngineeringJob category
Engineeringabout company
I am currently working with a highly regulated financial services platform specializing in digital assets and offering custody services
Salary budget wide open! 4 rounds of interview to offer. 1 day in office, 4 days WFH.
about job
• Spearheading primary operational support and engineering for various platform services.
• Driving improvements in reliability, quality, and time-to-market across all system offerings.
• Developing, building, and maintaining robust operational tooling and automation to streamline workflows.
• Defining and tracking key performance indicators (SLIs/SLOs) in collaboration with development teams.
• Creating "Production-ready Scorecards" to formally evaluate system health before deployment.
• Providing education and mentorship to engineering teams on resiliency principles, including chaos testing and blue/green deployments.
skills and requirements
• Min 10 years of experience.
• Utilizing monitoring, alerting, and automation tools to resolve performance issues in systems at scale.
• Expert proficiency in developing automated solutions using Infrastructure as Code (Terraform).
• Expert-level knowledge of containerization technologies such as EKS (k8s), Nomad, and Docker.
• Expertise in Configuration Management tools like Ansible, Chef, or Puppet.
• Proficiency in writing scripts or CLI tools in high-level languages like Python or Go to enhance developer productivity.
• Proven experience as a Technical Leader, contributing to technical decision-making and architectural recommendations.
To apply online please use the 'apply' function, alternatively you may contact Stella at 96554170 (EA: 94C3609 /R1875382)
I am currently working with a highly regulated financial services platform specializing in digital assets and offering custody services
Salary budget wide open! 4 rounds of interview to offer. 1 day in office, 4 days WFH.
about job
• Spearheading primary operational support and engineering for various platform services.
• Driving improvements in reliability, quality, and time-to-market across all system offerings.
• Developing, building, and maintaining robust operational tooling and automation to streamline workflows.
• Defining and tracking key performance indicators (SLIs/SLOs) in collaboration with development teams.
• Creating "Production-ready Scorecards" to formally evaluate system health before deployment.
• Providing education and mentorship to engineering teams on resiliency principles, including chaos testing and blue/green deployments.
skills and requirements
• Min 10 years of experience.
• Utilizing monitoring, alerting, and automation tools to resolve performance issues in systems at scale.
• Expert proficiency in developing automated solutions using Infrastructure as Code (Terraform).
• Expert-level knowledge of containerization technologies such as EKS (k8s), Nomad, and Docker.
• Expertise in Configuration Management tools like Ansible, Chef, or Puppet.
• Proficiency in writing scripts or CLI tools in high-level languages like Python or Go to enhance developer productivity.
• Proven experience as a Technical Leader, contributing to technical decision-making and architectural recommendations.
To apply online please use the 'apply' function, alternatively you may contact Stella at 96554170 (EA: 94C3609 /R1875382)
JOB SUMMARY
Staff Site Reliability Engineer |SRE DevOps

RANDSTAD PTE. LIMITED
Singapore
5 hours ago
N/A
Full-time
Staff Site Reliability Engineer |SRE DevOps