Are you someone with a passion for taking on big challenges? Are you interested in operating and working on the operations infrastructure for a large-scale, cutting edge cloud database service? If so, Oracle's MySQL HeatWave Service team on Oracle Cloud Infrastructure (OCI) can provide you the opportunity to build and operate a cloud service on a broadly distributed, multi-tenant cloud environment. OCI is committed to providing the best in cloud products that meet the needs of customers who are tackling some of the world's biggest challenges.
MySQL is the world's most popular open source database. OCI is the industry's broadest and most integrated public cloud and helps organizations increase business agility, lower costs, and reduce IT complexity. MHS is built, operated and supported by the Oracle staff responsible for the MySQL products. MHS offers secure, stable, and performant MySQL services for those requiring an enterprise-class experience. The MHS team is responsible for developing, deploying and operating the cloud service framework powering Oracle's MySQL Database Service and HeatWave, MySQL's in-memory, query accelerator. We are a worldwide team of problem-solvers who are driven to deliver MySQL at cloud scale to meet the real-world needs of our customers. As a key leader on our DevOps our team, you will partner with Control Plane, Data Plane, Console and SRE colleagues to provide a secure, integrated, seamless, User Experience to customers managing their MySQL Database Systems.
**Responsibilities**
- Build observability, automation and tooling for a set of modern, cloud native, fault tolerant and scalable cloud database management services
- Contribute to operational activities such as writing runbooks, troubleshooting, operations automation, and instrumentation for metrics and events
- Develop infrastructure tooling and code to automate deployment and continuous verification of healthy service levels
- Solve reliability issues across the entire service architecture and its deployments
- Mentor junior team members
- Work productively in a fast-paced, team-oriented agile development environment
- Contribute to a healthy, supportive and inclusive team culture
- Work with geographically distributed teams and contribute to the success of your team and other related teams
**Qualifications**
- BE/BS/MS degree in Computer science or Computer Engineering or 4+ years related experience
- 3+ years experience including DevOps, Site Reliability Engineer (SRE), on-call rotations, working on highly scalable, distributed systems
- Highly proficient in at least one programming and/or scripting language (Python, Ruby, Java etc.), shell scripting, ssh, git, etc.
- Proficient in Linux/Unix systems administration
- Skilled at debugging and troubleshooting complex software and/or networking issues, performing root cause analysis
- Proficient in cloud development tools/infrastructure, and experienced in infrastructure automation through Terraform, Chef, Ansible, Puppet or similar
- Hands-on experience on at least one of the following cloud platforms: AWS, Azure, Google or OCI
- Procedurally oriented and willing a contributor to documentation and runbooks
- Proven ability to quickly learn new technical domains and then train others
- A productive, proactive team-player and good communicator who thrives when collaborating with others
**Preferred Qualifications**
- MySQL DBA experience a huge plus
- Experienced deploying code within change management procedures