Job Description
Job Overview
Cloudflare is seeking a Systems Reliability Engineer (SRE) for their Edge team to help build a better Internet. The role involves managing and improving Cloudflare's Edge platform with a focus on reliability, performance, and scalability. The ideal candidate will work on large-scale systems, contribute to open-source projects, and mentor other engineers while being part of an on-call rotation to support distributed systems.
Technical Requirements
Required Skills
- • Managing distributed systems
- • Proficiency in distributed Linux/Unix environments
- • High-level programming (e.g., Golang, Python)
- • Configuration management (e.g., Saltstack, Chef, Puppet, Ansible)
- • Networking protocols Layer 3-7 of the OSI model
- • Performance analysis, debugging, and troubleshooting
- • SQL databases (e.g., Postgres, MySQL)
- • Load balancing and reverse proxies (e.g., Nginx)
Preferred Skills
- • Continuous integration and delivery (CI/CD)
- • 24/7/365 service environment experience
- • High-bandwidth transit Internet working and routing
- • Passion for tooling and automation
Experience Level
Up to 8 years of experience managing distributed systems
Responsibilities
- • Design, write, and deliver software that improves Cloudflare's Edge platform
- • Scale and evolve systems through software and automation to improve reliability and velocity
- • Manage and be part of the on-call rotation that supports the largest distributed edge system in the world
- • Collaborate with other engineers to design and implement scalable solutions
- • Participate in the constant cycle of knowledge sharing and mentoring
- • Research and introduce cutting-edge technologies
- • Develop and maintain sustainable tools that work on an extremely large scale
- • Contribute to open-source projects
Additional Information
- Location
-
Bengaluru
- Type
-
In-Office
- Compensation
-
Not specified