Publication date: 28 November 2024
We are looking for a Lead Site Reliability Engineer who takes the initiative in developing and maintaining the systems and services for our Cash Management Platform: automating the deployment process, ensuring the system scales, investigating and resolving outages, proactively identifying and implementing preventive measures, collaborating with key stakeholders, and continuously looking for ways to provide real-time visual feedback on all metrics and statuses.
- Location:
- Hybrid (CDMX, GDL, MTY, SLW)
- Skills:
- Bachelor's degree in computer science, or equivalent experience relevant to SRE or automation/development.
- 7+ years of experience focused on Site Reliability Engineering or a related position on one of the major cloud platforms.
- Experience automating multi-tenant systems, preferably in a cloud environment.
- Good understanding of Site Reliability Engineering (SRE) philosophies, technologies, platforms and tools, SLO management, incident resolution, and automation.
- Ability to explain technical concepts in clear, non-technical language
- Experience building Infrastructure as Code.
- Experience with Docker, Kubernetes, and networking concepts.
- Experience with Grafana and Prometheus.
- Integration experience with PagerDuty, ServiceNow, and Datadog.
- Expertise with system and performance monitoring tools (Dynatrace, Splunk, etc.)
- **English level: Advanced**
- Responsibilities:
- Proactively build and implement services that help IT and support teams do their jobs better.
- Design and implement dashboards that provide valuable real-time insight into key platform metrics.
- Lead engagement with software developers, DevOps, and other infrastructure engineers to integrate software development and delivery from inception to full operation, ensuring robust software releases and systems.
- Optimize on-call rotations and processes.
- Ensure incidents assigned to the team are managed within agreed SLAs.
- Ensure alarms are documented in up-to-date Knowledge Base articles.
- Conduct post-incident reviews to identify platform status.
- Nice to have:
- Benefits:
- 100% of salary paid through payroll
- Grocery vouchers
- Savings fund
- Year-end bonus (aguinaldo) of 15 days
- 15 vacation days
- Vacation premium of 25%
- 5 floating days
- Major and minor medical expense insurance
- Life insurance
- PTU (employee profit sharing)
- Training
- Annual performance bonus