Job Description

Title: Senior DevOps Engineer

Location: anywhere US/CAN

InfluxData is the creator of InfluxDB, the leading time series platform. More than 1,900 customers use InfluxDB to collect, store, and analyze all time series data at any scale. Developers can query and analyze their time-stamped data in real-time to discover, interpret, and share new insights to gain a competitive edge. InfluxData is a remote-first company with a globally distributed workforce. For more information, visit

You’ll be using your operations skill, software crafting abilities, and teamwork to maintain and extend foundational technologies to provide highly available, distributed services to customers who rely on InfluxData’s time series platform where metrics and events are first-class citizens.

You’ll join an awesome globally-distributed team, bringing experience operating multi-cloud infrastructure built on Kubernetes and the public cloud. You appreciate that great products require great culture: you design and implement with respect and kindness across multiple engineering teams.

We embrace an empathetic, supportive, and communicative environment pulling from one another’s strengths and perseverance through failure and the resulting lessons learned.

What you’d bring to the table:

  • You can spot manual toil and design automation to make it work while you sleep.
  • Observability. You don’t consider your systems complete until they’re monitored and metricized.
  • Continuous deployment isn’t just for applications. You’ve worked with infrastructure as code and seek to replace click-ops with Git-ops.
  • Experience with Kubernetes and supporting tooling. You’ve worked with operators, Helm, ArgoCD, and other CNCF projects.
  • You’ve used Terraform, Vault, Packer, and other Hashicorp products in production environments.
  • You have experience using AWS, GCP, and/or Azure.
  • You’re very comfortable with Linux, a shell or two, and have a strong opinion about a text editor.
  • You don’t need to be a software engineer, but being comfortable looking over some Go and Python will help you a lot here. If you’re able to hack together a Python script or two, even better.

Bonus if you’ve had experience:

  • Implementing and abiding by SLOs dictated by mature SLIs in production.
  • Being an incident lead, engaging multiple groups simultaneously to resolve critical problems.
  • Thriving in a remote-first, distributed team environment that shares on-call duties.