Who are we?

Portainer is on a mission to make container management simple, quick, and easy. Whether it's Kubernetes, Swarm, Docker, or Edge computing, the drive to create expert, elegant, simple, yet powerful tools that make the complex simple is what makes us tick. In its first three years, Portainer has experienced staggering global uptake of its open-source product, with hundreds of thousands of active users and many hundreds of millions of downloads. Now we're making the transition to our first commercial product, backed by an awesome group of global investors.

To help us bring our vision for Portainer to life, we're searching for a highly skilled, go-getting, self-driven and experienced Platform Engineer to join our remote team. You will have extensive experience in Kubernetes/Swarm administration, troubleshooting across all components, infrastructure, observability, and platform engineering. This role will involve managing large-scale Kubernetes environments and implementing, maintaining, and ensuring the reliability and scalability of the platform. You will also be part of an on-call rotation to handle critical incidents.

What does our Platform Engineer - Kubernetes do?

Kubernetes Management: Manage and optimize large-scale Kubernetes clusters. Perform version updates, configuration changes, and troubleshoot issues. Assist with and maintain container orchestration using Kubernetes.

Platform Engineering Services: Maintain and expand the platform solution to meet SLA/OLS requirements. Perform platform moves/adds/changes and monitor core platform metrics. Manage load across components and ensure normal operating parameters. Implement component updates for defect resolution and preventive maintenance.

Operational Onboarding: Create and maintain documentation for service levels, roles, and responsibilities. Conduct platform reviews and tooling deployments.

DevOps and SRE: Aid in the use of GitOps pipelines and assist in application deployment strategies.
Provide guidance on namespace, cluster, access control, and isolation best practices. Implement blue/green deployment strategies and assist with performance issues.

Automation and DR Planning: Develop automations for preventive maintenance and operational efficiency. Create and validate cluster recovery guides to ensure infrastructure recoverability.

Emergency Support: Be part of a team that provides 24/7 emergency engineering support with a 1-hour response SLA. Analyze alerts and perform root cause analysis to prevent recurrence.

This section sets out the previous experience, technical abilities, and professional qualifications required to perform the role.

Experience: 6 years of total experience in IT and platform engineering. 4 years managing Kubernetes environments. Experience with Docker Swarm is an advantage. Experience in operations, virtualization, cloud infrastructure (AWS, Azure, GCP), and DevOps practices.