Systems Reliability Engineer (SRE), Edge

Cloudflare

Job Description

Job Overview

As a Systems Reliability Engineer (SRE) at Cloudflare, you will be instrumental in building and operating our Edge platform, which spans over 320 cities globally. This role calls for individuals who are passionate about automation, scalability, and operational excellence, working to improve service availability and performance. The position involves leveraging an array of monitoring and diagnostics tools while contributing to the enhancement of the Cloudflare platform.

Technical Requirements

Required Skills
  • Linux systems experience
  • Software development skills in Go or Python
  • Understanding of distributed software systems and large-scale system design tradeoffs
  • Intermediate knowledge of common network protocols such as DNS and HTTP
Preferred Skills
  • Experience with the Linux kernel and Linux software packaging
  • Configuration management systems such as SaltStack, Chef, Puppet or Ansible
  • Load balancers and reverse proxies such as Nginx, Varnish, HAProxy, Squid or Apache
  • SQL databases
  • Time series databases and monitoring tools such as OpenTSDB, Graphite, Prometheus or Grafana
Experience Level

3 years of experience in an SRE role or a role with similar functions

Responsibilities

  • Build and operate the Edge platform
  • Improve service availability, performance, and operational velocity
  • Develop and enhance the Cloudflare platform and its capabilities
  • Leverage monitoring, alerting, and diagnostics tools

Additional Information

Location
London, UK
Type
Hybrid

About Cloudflare

We make websites, apps, and networks faster and more secure. Our developer platform is the best place to build modern apps and deliver AI initiatives.

Company Size
1001-5000
Categories
Cloud Computing, Cybersecurity, Internet Security, Web Services