about the company
A fast-growing technology organisation operating in the blockchain and digital assets space, building large-scale distributed systems and modern platform infrastructure to support high-throughput, security-critical workloads. They are deeply invested in improving reliability, operational excellence, and developer productivity across all engineering teams, and are building a culture focused on resilience, scalability, and strong engineering fundamentals.
about the job
You will be a senior reliability engineer driving modern DevOps and SRE practices across the organisation. This includes hands-on operational support, building automation and tooling, improving reliability and performance, guiding teams on best practices, and influencing architectural decisions.
...
skills and experience required
7+ years supporting and operating large-scale systems with strong monitoring, alerting and automation experience
Deep experience with cloud platforms (AWS / GCP / Azure)
Expert in infrastructure-as-code (Terraform) and operational automation
Strong containerisation knowledge (Kubernetes, Nomad, Docker)
Experience with configuration management (Ansible / Chef / Puppet)
Proficient in scripting or building tools (Python, Go, etc.)
Proven ability to analyse performance, identify bottlenecks, and improve reliability
Experience mentoring or guiding engineering teams on reliability and operational best practices
To apply online please use the 'apply' function, alternatively you may contact Rumi Mohd.
(EA: 94C3609/ R1550851 )