Senior Site Reliability Engineer
San Jose
4 hours ago

Our Company

Changing the world through digital experiences is what Adobe’s all about. We give everyone from emerging artists to global brands everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen.

We’re on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity.

We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!

Join a globally diverse team that both builds and finds best-of-breed tools to bring critical observability services to all of Adobe.

Our team embodies DevOps: our responsibilities range from crafting new tools and UIs to maintaining and supporting one of the largest Cortex deployments around, with Prometheus and Grafana in the mix as well.

We’re also on the cusp of launching Jaeger for distributed tracing.

We’re a close-knit team dedicated to providing a robust platform and plenty of support - for both our customers and each other.

We need a new engineer to help us deploy and manage best-of-breed observability platforms, as well as partake in coding projects to extend their features and improve our customers’ experience.

We’re primarily a Go shop but also use Python.

If you enjoy having many different interesting tasks where it’s easy to draw a line from your efforts to real accomplishments, come talk to us.

Responsibilities:

  • Build our own tools, such as APIs, self-service portals, and helper UIs, to improve platform performance
  • Extend observability platform feature sets with our own coding and tools
  • Support Cortex and Jaeger platforms, including on-call shifts
  • Help internal customers get the most from our platforms, both by guarding against inefficient usage and by encouraging best practices

Requirements:

  • BS in Computer Science or equivalent, with 5-7 years of related experience
  • Proficiency with either Python or Go
  • Experience providing operational support for a production service
  • Understanding of Linux and networking fundamentals
  • Understanding of foundational monitoring concepts: setting sane thresholds, avoiding noise, etc.
  • Familiarity with Terraform, Chef, Ansible or a similar automation platform
  • Experience with a modern metrics platform such as Graphite, InfluxDB, or Prometheus/Cortex