I am currently working with a highly regulated financial services platform specializing in digital assets and offering custody services
Salary budget wide open! 4 rounds of interview to offer. 1 day in office, 4 days WFH.
about job
● Spearheading primary operational support and engineering for various platform services.
● Driving improvements in reliability, quality, and time-to-market across all system offerings.
● Developing, building, and maintaining robust operational tooling and automation to streamline workflows.
● Defining and tracking key performance indicators (SLIs/SLOs) in collaboration with development teams.
● Creating "Production-ready Scorecards" to formally evaluate system health before deployment.
● Providing education and mentorship to engineering teams on resiliency principles, including chaos testing and blue/green deployments.
skills and requirements
● Min 10 years of experience.
● Utilizing monitoring, alerting, and automation tools to resolve performance issues in systems at scale.
● Expert proficiency in developing automated solutions using Infrastructure as Code (Terraform).
● Expert-level knowledge of containerization technologies such as EKS (k8s), Nomad, and Docker.
● Expertise in Configuration Management tools like Ansible, Chef, or Puppet.
● Proficiency in writing scripts or CLI tools in high-level languages like Python or Go to enhance developer productivity.
● Proven experience as a Technical Leader, contributing to technical decision-making and architectural recommendations.
To apply online please use the 'apply' function, alternatively you may contact Stella at 96554170 (EA: 94C3609 /R1875382)