Role Value Proposition:
Responsible for supporting Container Platforms, specifically for OpenShift Container Platform, hosting a fleet of Kubernetes clusters, with multitenancy and Hybrid considerations. The team proactively assesses the overall health of systems, performance, and incident trending reports and develops automation to facilitate self-serviceability, address system resilience and stability by using site reliability engineering principles and practices.
The Senior Enterprise Infrastructure Engineering role for the Container Platform teams is responsible for advancing cloud native technologies that align specifically to our On-premises and regulatory system strategy. This role and team support multiple countries and are responsible for designing and maintaining the OpenShift Platform standards globally. This is an exciting role that will exercise a working knowledge of varying application frameworks running on various platforms, physical and virtual environments, with portability capabilities to effectively run on different cloud providers, hypervisors, operating systems, Network devices, Storage devices, and Database servers. This position supports critical business activity within a 24/7 Servicing DevOps agile team. Must align with Agile methodology, understand the ITSM process, operations data, and real-time visibility requirements, and how each interacts with the operation of complex production environments. We are looking for an individual who can use critical thinking, collaborate across teams, and have an automation-first mindset, with a desire to work with modernized and often transformational workforce toolsets and methods.
Success in this role requires Site Reliability Engineering principles and practices while exhibiting a continuous learning mindset. The Senior Infrastructure Engineer is responsible for accelerating agile delivery by adopting a platform-oriented Infrastructure and Operations (I&O) operating model while driving automation with Infrastructure as Code as the cornerstone approach to enabling this. You'll collaborate closely with an array of global colleagues and customers at all levels in an environment where every contribution is respected, and every perspective is heard.
Key Responsibilities:
- Exhibits the skills to collaborate with and achieve actionable results, build healthy and sustainable relationships, and the capability to interact within all levels of the organization.
- Working experience in OpenShift, K8s distros and Docker.
- Coordinate, check, and apply critical patches in OpenShift/Kubernetes
- Knowledge of CI/CD methodology and tooling (AzDo, GIT, Tekton).
- Manage the container platform ecosystem (installation, upgrade, patching, monitoring)
- Reliability Management mindset / Troubleshooting
- Takes lead role in projects and provides technical expertise for integrating products and technologies.
- Leads and participates in incident analysis and provides root cause updates to management.
- Provides SME knowledge in enterprise-wide projects or initiatives. Exhibits an automation-first mindset.
- Works with peers in Global Technology to ensure the stability and performance of the infrastructure
- Develops ongoing relationships and routines with industry partners and vendors regarding thought leadership, industry trends, product directions, and required product enhancements.
- Has responsibility for operations, participation in the evaluation of new and existing software products, and integrations.
- Willing to work in rotational shifts and occasionally serve in rotation on-call capacity.
- Provides technical guidance and mentors team members.
- Manages service levels provided by the managed service provider.
Essential Business Experience and Technical Skills:
Required:
- Proven experience in supporting distributed applications both on-prem and in the cloud (Including Azure, AWS, and/or GCP).
- 5+ years of experience in administering and supporting OpenShift, varying Kubernetes distros, containerization, Linux / UNIX technologies
- Strong experience in a lead role and mentoring team members
- Knowledge and usage of CI/CD Tools (i.e., AzDO, ArgoCD)
- Scripting: Python or Bash, or PowerShell
- A broad general knowledge of systems, operations, security, network, and storage management processes and their interdependence and implementations
Preferred:
- Container Runtimes (Podman / Docker), Kubernetes (OpenShift) / Swarm Orchestration, GoLang framework, and/or Microservices Architecture
- Good understanding of Site Reliability Engineering principles and exhibits a continuous learning mindset.
- Good knowledge of using GIT / AzDo / Tekton and related Version Control tools.
- Good working knowledge of "Infra-as-code" toolsets such as Ansible.
- Good working knowledge of Observability / Monitoring toolsets such as Grafana, ELK (Elastic), AppDynamics, and/or Zenoss.
- Excellent oral and written communication skills to communicate technical concepts to a technical and non-technical audience.
- Demonstrated ability to establish relationships and build rapport to influence colleagues at all levels, uncover business issues, and identify needs.
- Bachelor's degree in computer science, Computer Engineering, Information Systems, or related field, or ten years of equivalent work experience.
At MetLife, we're leading the global transformation of an industry we've long defined. United in purpose, diverse in perspective, we're dedicated to make a difference in the lives of our customers.
The salary range for applicants for this position is $90,000 to $117,500.
Benefits We Offer
Our U.S. benefits address holistic well-being with programs for physical and mental health, financial wellness, and support for families. We offer a comprehensive health plan that includes medical/prescription drug and vision, dental insurance, and no-cost short- and long-term disability. We also provide company-paid life insurance and legal services, a retirement pension funded entirely by MetLife and 401(k) with employer matching, group discounts on voluntary insurance products including auto and home, pet, critical illness, hospital indemnity, and accident insurance, as well as Employee Assistance Program (EAP) and digital mental health programs, parental leave, volunteer time off, tuition assistance and much more!
About MetLife
Recognized on Fortune magazine's list of the 2025 "World's Most Admired Companies", Fortune World's 25 Best Workplaces for 2024, as well as the 2025 Fortune 100 Best Companies to Work For , MetLife, through its subsidiaries and affiliates, is one of the world's leading financial services companies; providing insurance, annuities, employee benefits and asset management to individual and institutional customers. With operations in more than 40 markets, we hold leading positions in the United States, Latin America, Asia, Europe, and the Middle East.
Our purpose is simple - to help our colleagues, customers, communities, and the world at large create a more confident future. United by purpose and guided by empathy, we're inspired to transform the next century in financial services. At MetLife, it's #AllTogetherPossible. Join us!
MetLife is an Equal Opportunity Employer. All employment decisions are made without regards to race, color, national origin, religion, creed, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity or expression, age, disability, marital or domestic/civil partnership status, genetic information, citizenship status (although applicants and employees must be legally authorized to work in the United States), uniformed service member or veteran status, or any other characteristic protected by applicable federal, state, or local law ("protected characteristics").
If you need an accommodation due to a disability, please email us at .... This information will be held in confidence and used only to determine an appropriate accommodation for the application process.
MetLife maintains a drug-free workplace.
$90,000 to $117,500