Kandji, Inc. logo

Principal Site Reliability Engineer

Kandji, Inc.

Apply Now
Location
Miami

Job Description

Job Overview

As a Principal Site Reliability Engineer at Kandji, you will play a critical role in ensuring the reliability, scalability, and performance of our platform. You will work cross-functionally to build and evolve the systems, tools, and processes that keep our services resilient and performant, especially as we scale to meet the demands of a growing customer base. This strategic position requires a deep understanding of distributed systems, incident management, observability, and automation.

Technical Requirements

Required Skills
  • • Infrastructure as Code (Terraform)
  • • Kubernetes
  • • Automation
  • • Scripting (Python, Go, Bash)
  • • CI/CD pipelines
Experience Level

10+ years in Site Reliability Engineering, DevOps, Infrastructure or related roles

Responsibilities

  • • Design and implement fault-tolerant, scalable, and highly available systems across AWS-hosted platform
  • • Define and uphold SLIs/SLOs, perform root cause analyses, and drive post-incident reviews
  • • Build and maintain automation for deployment, incident response, and remediation workflows
  • • Implement DevSecOps practices including secure IaC and policy-as-code
  • • Develop comprehensive observability solutions including metrics, logging, tracing, and alerting
  • • Contribute to and improve Terraform-based infrastructure management
  • • Lead efforts in system tuning, load testing, and capacity forecasting
  • • Embed reliability thinking into engineering and product workflows
  • • Mentor engineers in SRE best practices and incident response

Technical Environment

Languages

Benefits & Perks

  • • Competitive salary
  • • 100% individual and dependent medical + dental + vision coverage
  • • 401(K) with a 4% company match
  • • 20 days PTO
  • • Flexibility to work from anywhere for up to 30 days per year
  • • Kandji Wellness Week the first week in July
  • • Equity for full-time employees
  • • Lunch stipend provided Monday through Friday
  • • Up to 16 weeks of paid leave for new parents
  • • Paid Family and Medical Leave
  • • Modern Health mental health benefits
  • • Fertility benefits
  • • Working Advantage employee discounts
  • • Onsite fitness center
  • • Free parking
  • • Exciting opportunities for career growth

Additional Information

Location
Miami, on-site 5x a week
Type
Full-Time
Compensation
Not specified

About Kandji, Inc.

Kandji is an automation-forward Apple device management (MDM) software that integrates device management and EDR into one platform, empowering secure and productive work on Mac, iPad, iOS, and tvOS devices.

Company Size
201-500
Categories
Apple device management Automation Biotech, Pharmaceuticals & Healthcare Business Productivity & Collaboration Endpoint detection and response Enterprise Software HR & Staffing Information Technology and Services Information Technology & IT Services IT security MDM Security Semiconductor & Hardware Software & SaaS vulnerability management