Job Description
Job Overview
As a Systems Reliability Engineer (SRE) at Cloudflare, you will play a key part in building and operating our Edge platform, which spans more than 320 cities worldwide. The role suits engineers who care about automation, scalability, and operational excellence, and who want to improve service availability and performance. You will use a range of monitoring and diagnostics tools and contribute to enhancing the Cloudflare platform.
Technical Requirements
Required Skills
- Linux systems experience
- Software development skills in Go or Python
- Understanding of distributed software systems and large-scale system design tradeoffs
- Intermediate knowledge of common network protocols such as DNS and HTTP
Preferred Skills
- Experience with the Linux kernel and Linux software packaging
- Configuration management systems such as SaltStack, Chef, Puppet, or Ansible
- Load balancers and reverse proxies such as Nginx, Varnish, HAProxy, Squid, or Apache
- SQL databases
- Time series databases and monitoring tools such as OpenTSDB, Graphite, Prometheus, or Grafana
Experience Level
3 years of experience in an SRE role or a role with similar responsibilities
Responsibilities
- Build and operate the Edge platform
- Improve service availability, performance, and operational velocity
- Develop and enhance the Cloudflare platform and its capabilities
- Leverage monitoring, alerting, and diagnostics tools
Additional Information
- Location: London, UK
- Type: Hybrid