Senior Site Reliability Engineer - SRE - 12 months rolling contractWe want to make work and study more efficient and enjoyable, by providing the best digital paper solution possible. Our digital paper and learning ecosystem inspires anyone to take notes, share what they know, collaborate with others, and learn as a community.
Our Values:Dream big
—Be visionary, strategic, and open to innovation
Build great things
—Work in service of our users, always improving and pushing higher
Take ownership
—Take responsibility with bold decision-making and bias for action
Win like a sports team
—Be trusting and collaborative while empowering others
Learn and grow fast
—Never stop learning and iterate fast
Share our passion
—Share ideas and practice enthusiasm and joy
Be user obsessed
—Empathetic, inquisitive, practical
About the team:Our engineering teams are mainly distributed across Europe and Asia. You will be among the first SREs based in the Americas, working with the Platform Team to support the various product teams.
About the role:This role is for you if you're excited to work on the following:
Design, build, and maintain the Goodnotes infrastructure, ensuring it adheres to Dickerson's Hierarchy of Reliability.Design, refine, and execute new and existing playbooks.Educate the various teams in SRE best practices, aiding them from design to capacity planning and rolling out new features.Be the go-to person for higher-level escalation for applications.Improve existing SLAs and optimize latency and error rates.Enhance system monitoring, health reporting, and logging.Design and implement security, assisting in maintaining information security practices and procedures.Participate in on-call rotation during the Americas Timezone UTC-8 to UTC-5.Open to working 5 shifts a week, which may include weekends.The skills you will need to be successful:Strong experience working in an AWS-hosted environment.Experience supporting production workloads and firefighting.Knowledge of SRE best practices and common issues.Experience with system monitoring tools.Understanding and experience with distributed databases.Solid understanding of Linux and Networking fundamentals.Background in back-end development, including API usage and creation.Knowledge of Security for network and containers.Understanding of container orchestration, particularly Kubernetes.Experience managing Relational and Non-relational databases, including backup and restore operations.Familiarity with automation/configuration management tools, preferably CDK and/or Terraform.The interview process:An introductory call with someone from our talent acquisition team.A hands-on take-home challenge to verify fundamental infrastructure-management skills.A 2-hour technical interview with one of our engineers covering low-level questions and practical exercises.A call with your hiring manager.Values interview with another member of the leadership.What's in it for you:Budget for home office setup, personal development, professional training, and health & wellness.Sponsored visits to our Hong Kong or London office every 2 years.Company-wide annual offsite.Medical insurance for you and your dependents.This is a 12-month renewable fixed-term contract.We expect 40 hours of work per week (adjusted with local laws) across 5 days per week covering day hours in American timezones during weekends and 3 weekdays.Note: Employment is contingent upon successful completion of background checks, including verification of employment, education, and criminal records.
Apply for this jobindicates a required field
#J-18808-Ljbffr