CDN Site Reliability Engineer - Deployments

  • On-site
  • Prague
  • Full-time

About CDN77

Think global, move fast, and make every millisecond count. That’s the CDN77 way.

We’re a Content Delivery Network (CDN) company, providing critical internet infrastructure for some of the world’s most trafficked websites, apps, and live events for over 14 years. Our network is massive: 270 Tbps of global capacity, with over a hundred Points of Presence across 130+ countries. Built for speed, not show.

Who uses our services? Udemy, ESL Gaming, the European Space Agency, and Factorio, to name a few. We firmly believe that the best investor is a happy customer, and the best investment is a satisfied employee. We give people the space to work on interesting projects and technologies that have real impact.


What does an SRE in the Deployment team do?

We manage all CDN edges and handle continuous software deployments across thousands of servers globally. Our main priority is ensuring that all the software our CDN runs on, produced by various engineering teams, is deployed efficiently while maintaining the CDN's stability and reliability at scale.

We also ensure our hardware serves traffic consistently and reliably (we do not use the cloud; all infrastructure is ours), debug performance issues, and benchmark and tune the CDN’s distributed infrastructure.

We contribute to the design of various products and services, leveraging our deep technical understanding of the systems that power our CDN.

What to expect:

  • Planning and executing canary deployments of our proprietary software to edges worldwide, which primarily run Debian. This includes testing and assessing new Nginx features and enhancements.

  • Managing and maintaining high-availability internal services/tools needed for CDN management and operation (e.g. coding and reviewing new features, suggesting further improvements).

  • Automating existing and new tasks in Ansible.

  • Monitoring logging and telemetry pipelines to ensure the reliability and safety of ongoing deployments.

  • Analyzing incident reports from the support team and debugging potential performance issues.

Tasks are not limited to a single stack or segment of the CDN; they span multiple systems (developed in different programming languages) that work together to deliver a seamless experience for our CDN customers and their end users.


What we expect:

  • Logical and analytical thinking, and a passion for automation and for solving complex problems

  • Experience developing and operating applications in a Linux environment

  • Knowledge of any scripting/programming language (we use PHP, Bash, Go, Python, Lua)

  • English is required; the team communicates mainly in English

  • Ability to work full-time in our villa in Vinohrady, Prague


What we welcome:

  • Lua knowledge

  • Strong networking foundation, particularly concerning TCP/IP, HTTP, and DNS

  • Understanding of Nginx internals

  • Experience with debugging (top, lsof, strace, ...)

  • Experience with Debian, Ansible, GitLab, collectd, Grafana, curl…


What we can offer you:

  • Fair financial compensation based on experience, performance, and expectations

  • Free food (breakfast, lunch, fruit, snacks, lemonade, coffee, etc.)

  • Great, pet-friendly working environment in our Vinohrady villas

  • Equipment of your choice (laptop, monitors, headphones, etc.)

  • Ergonomic setup: Spinalis chairs and an adjustable standing desk

  • Company barber