
Principal Engineer Platform Engineering & Production Support

Job Description
Principal Engineer Platform Engineering & Production Support

Team Overview
This role supports a critical Platform Engineering team responsible for stabilizing, scaling, and operating applications as they move closer to production release. The team plays a key role post-deployment, ensuring reliability, performance, and operational excellence across a portfolio of applications.
This is not traditional infrastructure support; it is application-focused production engineering, requiring deep technical expertise, proactive issue prevention, and strong ownership of application health in cloud environments.

Role Summary
We are seeking a Principal Engineer to backfill a key contractor position within our Platform Engineering team. This individual must be Day 1 ready, capable of operating in fast-paced, production-critical environments, and able to seamlessly balance multiple priorities.
The ideal candidate is a strong DevOps and Site Reliability Engineering (SRE) professional with hands-on expertise in observability, incident management, and cloud platforms (OpenShift). They will play a leading role in supporting production systems, preventing outages, and improving system reliability through automation and intelligent monitoring.

Key Responsibilities
• Lead production support efforts across a portfolio of 20+ applications, ensuring stability, performance, and rapid issue resolution
• Design and build advanced monitoring, alerting, and observability dashboards using tools such as Splunk, Grafana, AppDynamics, and Prometheus
• Proactively identify risks through gap analysis, anomaly detection, and predictive alerting, preventing production incidents before they occur
• Troubleshoot complex production issues across distributed microservices environments, reducing MTTR through deep technical expertise
• Drive adoption of modern SRE practices, including automation, AIOps, and intelligent monitoring solutions
• Support applications running on OpenShift and cloud-native platforms, with a focus on reliability and scalability
• Collaborate closely with development teams during release cycles, providing production-readiness guidance and operational support
• Participate in 24x7 on-call rotation, demonstrating urgency and ownership during incidents
• Mentor and guide engineers, helping elevate team capabilities in SRE, DevOps, and platform engineering practices
• Act as a trusted technical leader, able to quickly switch priorities and manage competing demands in a high-pressure environment

What We're Looking For
• A genuine, hands-on engineer who can operate across multiple roles (SRE, DevOps, Production Support)
• Strong ability to shift priorities quickly and respond with urgency in critical situations
• Deep understanding of application support in cloud environments, especially OpenShift
• Experience in the financial services industry strongly preferred
• Prior development experience is a plus, particularly in Java-based ecosystems

Required Qualifications:
• 10+ years of platform and production support experience
• 5 years of Red Hat Linux, OpenShift, Kubernetes, Java, microservices, Spring Boot, and Python experience
• 5 years of observability dashboard creation experience
  - Grafana, Splunk, SPLOC, AppDynamics
• 5 years of observability alerting and incident handling experience
  - AIOps, ServiceNow, BigPanda, etc.
• 4 years of React.js, Apache, Kafka, and relational database experience
• 4 years of distributed systems, microservices architectures, and cloud-native platforms experience

EEO:
Mindlance is an Equal Opportunity Employer and does not discriminate in employment on the basis of Minority/Gender/Disability/Religion/LGBTQI/Age/Veterans.
