.Rockwell Automation is a global technology leader focused on helping the world's manufacturers be more productive, sustainable, and agile. With more than 28,000 employees who make the world better every day, we know we have something special. Behind our customers - amazing companies that help feed the world, provide life-saving medicine on a global scale, and focus on clean water and green mobility - our people are energized problem solvers that take pride in how the work we do changes the world for the better.We welcome all makers, forward thinkers, and problem solvers who are looking for a place to do their best work. And if that's you we would love to have you join us!**Job Description**:Rockwell Automation is a global technology leader focused on helping the world's manufacturers be more productive, sustainable, and agile. With more than 28,000 employees who make the world better every day, we know we have something special. Behind our customers - amazing companies that help feed the world, provide life-saving medicine on a global scale, and focus on clean water and green mobility - our people are energized problem solvers that take pride in how the work we do changes the world for the better.We welcome all makers, forward thinkers, and problem solvers who are looking for a place to do their best work. And if that's you we would love to have you join us!Sr Engineer - Observability**Executive Summary****Key Responsibilities**:- Analyzes, designs, programs, debugs, and modifies observability tools and interfaces.- Code may be used to enrich and correlate telemetry from many data sources in order to isolate events that indicate future or immediate IT availability issues.- Will interact with users to define system requirements and/or necessary modifications.- Design and Implement Observability Solutions: Develop and implement comprehensive observability solutions utilizing industry-standard tools and technologies such as Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Jaeger, and Open Telemetry.- Distributed Tracing: Implement distributed tracing techniques to trace and visualize the flow of requests across microservices architectures. Utilize tracking data to identify performance bottlenecks and optimize system performance.- Performance Analysis and Optimization: Analyze system performance metrics and identify opportunities for optimization. Collaborate with development teams to implement performance improvements and ensure scalability of systems.- Incident Response and Post-Mortems: Actively participate in incident response activities, providing expertise in diagnosing and resolving complex issues. Conduct thorough post-incident reviews to identify root causes and recommend preventive measures.- Documentation and Knowledge Sharing: Document observability best practices, standards, and procedures