Principal Site Reliability Engineer

Job details

**Responsibilities**

- Solve complex problems related to Linux infrastructure and Oracle Cloud Infrastructure.
- Act as the escalation point for critical issues that may not have a documented procedure, and provide Root Cause Analysis (RCA).
- Understand the end-to-end configuration, technical dependencies, and characteristics of production infrastructure and services.
- Quickly grasp and analyze new technologies that are complex and rapidly changing, and integrate them into automation and infrastructure support.
- Design and deliver mission-critical automation, with a focus on security, resiliency, scale, and performance.
- Identify opportunities for automation and drive its implementation to improve service health, availability, and reliability.
- Author functional and technical documentation and standard operating procedures (SOPs).
- Collaborate with development teams in defining and implementing improvements in service architecture.
- Articulate technical characteristics of services and technology areas, and guide cross-functional teams to engineer and add capabilities to internal tools.
- Partner with DevOps, Oracle Cloud Infrastructure deployment, and development teams to identify and resolve issues.

**Knowledge and Skills**

- 6 to 12 years of experience in Site Reliability Engineering and automation.
- Experience in Linux administration, with good knowledge of kernel-level debugging.
- Experience in debugging operating system performance issues and in performance tuning.
- Experience working with fault-tolerant, highly available, high-throughput, distributed, and scalable systems.
- Expertise in developing scripts, utilities, and tools to automate routine or manually intensive tasks.
- Experience in cloud infrastructure technologies.
- Experience in operations and problem management.
- Development experience using Python and building infrastructure using Terraform.
- Experience working with global teams across different time zones.
- Strong logical-thinking skills, intellectual curiosity, and a high drive for self-development.
- Aptitude to be a good team player and the desire to learn and implement new cloud technologies as needed.
- Good understanding of Agile software development principles, including common tools such as JIRA.
- Good understanding of cloud security and compliance management, including patching.
- Excellent organizational, verbal, and written communication skills.

**Qualifications required**

- 6 to 12 years of experience working in an IT Operations/Infrastructure team.
- A Bachelor's degree in Computer Science, Computer Engineering, Software Engineering, or a related area is preferred.


Nominal salary: Negotiable

Source: Jobtome_Ppc
