.SRE Software Engineer is responsible for designing, configuring, monitoring, implementing, and maintaining our observability solutions and troubleshooting Ford Credit IT systems and applications to ensure optimal performance and reliability.MAJOR RESPONSIBILITIESUtilize observability and monitoring tools to detect and resolve issues affecting positive user experience.Automate alerting and remediation processes to reduce mean time to resolution (MTTR) and improve system uptime.Use Splunk query language and monitor database connection health by using Splunk DB connect health dashboards, log parsing, complex Splunk searches, including external table lookups, Splunk data flow, components, features, and product capability.Observability: Implement comprehensive monitoring and alerting solutions using GCP monitoring services and external services.Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding.Build vital and efficient tooling to lower the barrier of entrance for engineering teams to plug in and enjoy the benefits of reliability focused on observability.Configure dashboards, alerts, and notifications to ensure timely identification and resolution of issues.Troubleshoot issues and outages, working closely with development and operations teams to identify root causes and develop solutions.Monitor server, network infrastructure, and application performance metrics, and identify patterns and trends to improve system performance and reliability.Develop and integrate tools for logging, monitoring, and alerting to enhance visibility into system performance.Participate in strategic planning for the technology roadmap, including scalability, cost-effectiveness, and risk management considerations related to observability infrastructure.EXPERIENCE AND BACKGROUND REQUIREMENTS6+ years of SRE observability engineering experience.6+ years of experience in observability best practices working with Dynatrace or similar tools (NewRelic, DataDog, AppDynamics, or other similar APM suites), delivering solutions across all environments, and integrating platforms and applications with monitoring and APM tools.Knowledge of CI/CD tools such as Puppet, Jenkins, Terraform, Ansible.Minimum 4 to 5 years of working experience in OpenShift and Docker/K8s.Proficiency in implementing monitoring and observability solutions using GCP monitoring services such as Cloud Monitoring, Logging, and Tracing.Deep understanding of IT infrastructure monitoring and observability best practices.Experience with gathering and organizing large amounts of data to use for instrumentation into an enterprise monitoring solution.Experience with recommending baseline monitoring thresholds and performance monitoring KPIs and SLAs.4+ years of experience in the development of Grafana dashboards, metrics/monitoring standardization - metrics, collection, dashboards with Grafana a must