LINUX Systems Engineer - Opportunity for Working Remotely
Heredia, CR
hace 6 días

Job Description

To be a member of the Command Center (SRE Team) with the focal point for the success of our enterprise class SaaS service offerings across all of VMware.

The SRE Team increases overall confidence in the services that are being delivered and ensures proper communication to VMware customers during any disruption of normal operations.

As an Site Reliability Engineer, you will Provide 24 / 7 high availability support for Virtualized Linux environment, build robust self healing automation to reduce operational efforts.

This role offers an exciting opportunity to work across multiple technology domains and engage in the latest and greatest technology services being developed at a world class company.

Team Responsibility

The Site Reliability Engineer is a member of the highly visible Cloud Services Operations team and is a core member of the VMware Engineering Services (VES).

As a member of the SRE Team, you will be functioning in a world-class team respected for its innovation, execution and collaboration operating a world-wide organization.

The team ensures continuity of VMware SaaS Services that impacts any significant disruption of normal operations of our enterprise service offerings and operates 24 / 7 / 365 days a year.

The SRE Team is expected to provide a reliable service with an enterprise level SLA and must strive for 100% customer support satisfaction.

The primary objective of this team is to oversee and ensure critical applications and services provided are available and working as expected for customers and subscribers.

The secondary objective is to develop and improve existing service monitoring tools through additional integrations, automation and collaboration.

Role Responsibility

The ideal candidate serves as the focal point for the success of our enterprise class SaaS service offerings across all of VMware providing technical skill and knowledge to the command center.

Along with working on complex issues where analysis of situations or data requires an in-depth evaluation of various factors this role will be required to help level up the technical skills of the team, develop tools and automation for VMware services, and assist services in automated problem resolution.

The ideal candidate will have in-depth knowledge in Linux, Virtualization, Networking concepts and VMware Products. Should have development skills in modern scripting language like Python, Ansible or Good in shell scripting.

Required Experience

  • Experience in providing 24 / 7 high availability support for Virtualized Linux environment
  • Build robust, self-healing automation that reduce operational effort and improve service up-time using automation tools / language like Python, Shell scripting
  • Must have very good understanding of Networking concepts and ability to trouble-shoot network problems
  • Strong analysis and problem solving abilities with meticulous attention to details
  • A passion for performance excellence, robustness and an engineering mind-set
  • Triaging incidents, troubleshooting hardware, software and network issues and resolution
  • Strong working knowledge of tools like BigPanda, PagerDuty, Uptime, Jira Service Desk etc.
  • Proactively identify and communicate potential problems and issues to project team members / leaders
  • Consistently meets or exceeds established Command Center key performance indicators (KPI’s)
  • Ability to work global teams
  • Job Responsibilities include providing 24 / 7 remote support.
  • Required Skills

  • Minimum 3-5 years of experience with Red Hat, CentOS, SUSE Linux
  • Minimum 2 years of experience in Virtualization is preferred vSphere, KVM
  • 2 years of experience in one of the following languages : Python, PowerShell, Shell scripting
  • Knowledge of data storage protocols including CIFS, NFS
  • Knowledge of TCP / IP networking, DNS, LDAP, SMTP, VMware enterprise products
  • Good working knowledge of at least one public cloud such as AWS or GCP
  • Experience with containers, such as with Docker or Kubernetes is plus
  • Good troubleshooting skills in VMware vSphere, NSX, vSAN and Linux is preferred
  • Domain knowledge of systems management and ITIL is strongly desired.
  • Excellent written and verbal communication skills
  • Required Qualifications

  • BS Degree in Computer Science, or a related field
  • RHCE certification is plus
  • Category : Engineering and Technology

    Subcategory : Site Reliability

    Experience : Manager and Professional

    Full Time / Part Time : Full Time

    Posted Date : 2021-07-26

    Reportar esta oferta

    Thank you for reporting this job!

    Your feedback will help us improve the quality of our services.

    Mi Correo Electrónico
    Al hacer clic en la opción "Continuar", doy mi consentimiento para que neuvoo procese mis datos de conformidad con lo establecido en su Política de privacidad . Puedo darme de baja o retirar mi autorización en cualquier momento.
    Formulario de postulación