Job details

**Job Category**: Lead

**Job Type**: Remote - Full Time

**Job Location**: Guadalajara - México

We are a young American-based company that produces high-quality software using the latest disruptive technologies available in the market.
We're looking for a Lead SRE/DevOps to join our dynamic team.

**Responsibilities**:

- Lead and mentor a team of SREs to ensure operational excellence and maximize the reliability and availability of client systems.
- Architect and design highly scalable and available infrastructure solutions, integrating best practices in reliability engineering and automation.
- Collaborate with cross-functional teams (DevOps, Development, IT) to implement SRE principles throughout the software development life cycle.
- Establish and manage Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for critical services, monitoring and maintaining performance against defined targets.
- Implement and enhance observability, alerting, and incident response processes to proactively address issues and minimize downtime.
- Develop and maintain documentation related to system architecture, configuration, and procedures.
- Stay current with industry trends, recommending and adopting new tools and practices to enhance system reliability.

**Requirements**:

- Strong background in designing and implementing highly available and scalable infrastructure.
- Proficiency in scripting and automation using Python or Shell.
- Experience with container orchestration platforms, serverless architectures, CI/CD pipelines, and IaC implementations (Ansible and Terraform).
- Experience with observability tools (preferred: Datadog, CloudWatch).
- In-depth knowledge of cloud computing platforms (preferred: AWS).
- Solid understanding of SRE/DevOps principles and practices.
- Excellent problem-solving skills with the ability to troubleshoot complex issues in production environments.
- Strong communication and leadership skills, fostering effective collaboration with cross-functional teams.
- Minimum 10 years of work experience in DevOps/SRE, including leadership roles.
- Advanced English.
- Mexico (Guadalajara) resident.

**Preferred**:

- Relevant certifications in SRE, DevOps, Cloud, etc., are a plus.


Nominal salary: Negotiable

Source: Whatjobs_Ppc

