Senior Manager, Systems and Infrastructure Engineering
- Walmart
- Bentonville, Arkansas
- Full Time
Senior Manager, Systems and Infrastructure Engineering
at Walmart in Bentonville, Arkansas, United States
Job Description
Position Summary
Senior Manager, Systems & Infrastructure Engineering GenAI Enablement
What you'll do In this role, you'll lead the infrastructure and platform engineering function responsible for enabling enterprise-scale GenAI and agentic AI solutions at Walmart. You'll build and mentor a high-performing team, define technical strategy, and deliver foundational systems that support AI agent orchestration, platform governance, and deep system interoperability.
About the Role: This position reports into the Productivity and Collaboration Platforms organization, overseeing the design and operational enablement of AI and productivity services across Microsoft 365, Zoom, Slack, Jira, ServiceNow, and more. You'll work across cloud and hybrid environments, ensuring that the systems powering AI experiences are secure, resilient, and scalable. This role combines technical platform ownership with strong people leadership to grow our capabilities in delivering trusted, high-impact GenAI experiences across the enterprise.
What you'll do
Infrastructure Maintenance: Requires knowledge of: Infrastructure maintenance tools and methodologies; Infrastructure maintenance plans and schedule; Infrastructure performance metrics. To perform routine maintenance tasks for infrastructure systems such as backups, patch management and hot fixes. Escalate any issue that occurs in the backup media. Audit desktops for compliance with IT policies. Conduct regular database integrity checks to ensure minimal data loss. Technology Solution Automation: Requires knowledge of: Automation tools and technologies; Scripting Languages. To analyze data and metrics to identify areas of automation to drive faster response to operational issues. Maintain up-to-date documentation on deployments, processes, and standard operating procedures/runbooks with a goal minimize runbooks by automation. Coding: Requires knowledge of: Coding standards and guidelines; Coding languages (E.g. JavaScript, Python, C# etc.), frameworks(E.g. ActiveX,.Net, Cocoa, Android application framework etc.), tools(E.g. Monday, Linx, Embold etc.) and Platforms (E.g. Microsoft Azure, AWS , Apple IOS etc.); Quality, Safety and Security (PCI etc.) standards; Emerging tools and technologies; Telemetry. To adhere to all relevant coding guidelines (Ex: code review processes, code branching strategies, reusability etc.) while writing/configuring code. Create/configure minimalistic (Less Complex, Highly Robust and high quality) code for a component/module under guidance. Maintain records by documenting program development and revisions. Stay updated on the prevalent coding languages and frameworks in the industry outside the immediate scope of delivery. Identify repetitive and routine tasks in (Continuous Integration/Continuous Delivery) CI/CD, Testing or any other process that can be automated. Implement telemetry features as required under guidance. Apply security policy requirements to component/module during code development/configuration. Requirement And Scoping Analysis: Requires knowledge of: Traceability matrix; Risk analysis methodologies; Cost Analysis; Business objectives; Classification of requirements; User stories To analyze the requirements/updates/modifications for alignment with business objectives and priorities. Articulate the impact of the proposed solution on business and its ability to address requirements. Mediate conflicting requirements of the various stakeholders. Guide teams to assess feasibility of new requirements. Prioritize the product/solution requirements to drive creation of Minimum Viable Product (MVP) to meet the core requirements. Proactively identify areas for product enhancements, new features and updates based on customer requirements/ feedback. Contribute to the creation of user stories for complex requirements across the domain (For agile methodology). Capacity Management: Requires knowledge of: Capacity management Current and future business requirements Walmart service levels Computing resources; Infrastructure performance; Infrastructure growth plans. To right-sizing IT resources to meet current and future business requirements in a cost-effective within a domain/ pillar. Optimize IT resource utilization. Create a model of infrastructure performance to manage current resource needs. Manage demand for computing resources. Produce a capacity plan that covers current and forecasted needs. Issue Resolution: Requires knowledge of: Issue resolution techniques; Escalation scale parameters. To identify and elaborate possible and feasible solutions to the issues raised for complex projects/Systems and Infra configurations. Implement the most feasible option in collaboration with the team. Track registered issues for all ongoing projects and assess on their progress. Assess if the issue needs to be escalated. Configuration Management: Requires knowledge of: Configuration management tools and processes; Configuration and release management in environment(s). To conduct analysis for complex configuration changes. Perform all the operations to prepare a system for assembly and transfer to the environment on which it will be run in internal / external environment for deployment. Cloud Migration: Requires knowledge of: Data Center Operations (Production operations, Server, Storage, Systems and Infra, Data Base Management, Systems Management, Helpdesk, DR/Business recovery practices; Platform as a service (PaaS) service for public cloud services (Cosmos, EventHub, Cloudspanner) (Applicable only to the Cloud Enablement Team). To support decommission of the on-prem hardware or convert it into a hybrid set-up or complete cloud setup as per business needs. Monitor the resiliency and reliability of the new cloud environment. Gauge the bandwidth needed to handle the internet connections for cloud access. Support in planning and executing data-center consolidations, relocations, and migrations. Version Control: Requires knowledge of: Version management tool; Product deployment tools and processes; Release verification mechanism. To ensure that any modification to code is checked into correct branch and code merges are done properly. Authenticate and authorize of code changes in the version control system / central repository. Oversee the process of code branching and ensure seamless code build. Ensure that all impacted files are included in release communication. Integration Management: Requires knowledge of: Types of middleware; Different types of platforms; Functions of Application Programming Interfaces (APIs); Programming languages used for middleware; Principles and protocols for API-level integration; New and emerging middleware products, tools and methodologies in the industry. To evaluate opportunities for creating connections among various hardware and applications. Evaluate suitable middleware to be used for integrating existing legacy applications with current applications as required including cloud systems. Program middleware or other tools to enable effective integration of applications within. Perform API-level integration. Oversee the end-to-end process of application integration to the target environment. Investigate issues or failures of application integration. Facilitate modifications to improve the success of integration between application programs. Architecture Acumen: Requires knowledge of: Architectural principles; Systems and environment behavior; Architectural Styles, Patterns and plans; Architectural standards; Non-functional System performance parameters; Technology Strategy. To assist in decomposing the product architecture into multiple components and modules and define architectural specifications for each module. Create/Apply the right architectural pattern across the module as indicated in the architectural plan to obtain the right result. Define the architecture blueprint for the various components within a product/solution. Analyze syste
To view full details and how to apply, or
at Walmart in Bentonville, Arkansas, United States
Job Description
Position Summary
Senior Manager, Systems & Infrastructure Engineering GenAI Enablement
What you'll do In this role, you'll lead the infrastructure and platform engineering function responsible for enabling enterprise-scale GenAI and agentic AI solutions at Walmart. You'll build and mentor a high-performing team, define technical strategy, and deliver foundational systems that support AI agent orchestration, platform governance, and deep system interoperability.
About the Role: This position reports into the Productivity and Collaboration Platforms organization, overseeing the design and operational enablement of AI and productivity services across Microsoft 365, Zoom, Slack, Jira, ServiceNow, and more. You'll work across cloud and hybrid environments, ensuring that the systems powering AI experiences are secure, resilient, and scalable. This role combines technical platform ownership with strong people leadership to grow our capabilities in delivering trusted, high-impact GenAI experiences across the enterprise.
What you'll do
Infrastructure Maintenance: Requires knowledge of: Infrastructure maintenance tools and methodologies; Infrastructure maintenance plans and schedule; Infrastructure performance metrics. To perform routine maintenance tasks for infrastructure systems such as backups, patch management and hot fixes. Escalate any issue that occurs in the backup media. Audit desktops for compliance with IT policies. Conduct regular database integrity checks to ensure minimal data loss. Technology Solution Automation: Requires knowledge of: Automation tools and technologies; Scripting Languages. To analyze data and metrics to identify areas of automation to drive faster response to operational issues. Maintain up-to-date documentation on deployments, processes, and standard operating procedures/runbooks with a goal minimize runbooks by automation. Coding: Requires knowledge of: Coding standards and guidelines; Coding languages (E.g. JavaScript, Python, C# etc.), frameworks(E.g. ActiveX,.Net, Cocoa, Android application framework etc.), tools(E.g. Monday, Linx, Embold etc.) and Platforms (E.g. Microsoft Azure, AWS , Apple IOS etc.); Quality, Safety and Security (PCI etc.) standards; Emerging tools and technologies; Telemetry. To adhere to all relevant coding guidelines (Ex: code review processes, code branching strategies, reusability etc.) while writing/configuring code. Create/configure minimalistic (Less Complex, Highly Robust and high quality) code for a component/module under guidance. Maintain records by documenting program development and revisions. Stay updated on the prevalent coding languages and frameworks in the industry outside the immediate scope of delivery. Identify repetitive and routine tasks in (Continuous Integration/Continuous Delivery) CI/CD, Testing or any other process that can be automated. Implement telemetry features as required under guidance. Apply security policy requirements to component/module during code development/configuration. Requirement And Scoping Analysis: Requires knowledge of: Traceability matrix; Risk analysis methodologies; Cost Analysis; Business objectives; Classification of requirements; User stories To analyze the requirements/updates/modifications for alignment with business objectives and priorities. Articulate the impact of the proposed solution on business and its ability to address requirements. Mediate conflicting requirements of the various stakeholders. Guide teams to assess feasibility of new requirements. Prioritize the product/solution requirements to drive creation of Minimum Viable Product (MVP) to meet the core requirements. Proactively identify areas for product enhancements, new features and updates based on customer requirements/ feedback. Contribute to the creation of user stories for complex requirements across the domain (For agile methodology). Capacity Management: Requires knowledge of: Capacity management Current and future business requirements Walmart service levels Computing resources; Infrastructure performance; Infrastructure growth plans. To right-sizing IT resources to meet current and future business requirements in a cost-effective within a domain/ pillar. Optimize IT resource utilization. Create a model of infrastructure performance to manage current resource needs. Manage demand for computing resources. Produce a capacity plan that covers current and forecasted needs. Issue Resolution: Requires knowledge of: Issue resolution techniques; Escalation scale parameters. To identify and elaborate possible and feasible solutions to the issues raised for complex projects/Systems and Infra configurations. Implement the most feasible option in collaboration with the team. Track registered issues for all ongoing projects and assess on their progress. Assess if the issue needs to be escalated. Configuration Management: Requires knowledge of: Configuration management tools and processes; Configuration and release management in environment(s). To conduct analysis for complex configuration changes. Perform all the operations to prepare a system for assembly and transfer to the environment on which it will be run in internal / external environment for deployment. Cloud Migration: Requires knowledge of: Data Center Operations (Production operations, Server, Storage, Systems and Infra, Data Base Management, Systems Management, Helpdesk, DR/Business recovery practices; Platform as a service (PaaS) service for public cloud services (Cosmos, EventHub, Cloudspanner) (Applicable only to the Cloud Enablement Team). To support decommission of the on-prem hardware or convert it into a hybrid set-up or complete cloud setup as per business needs. Monitor the resiliency and reliability of the new cloud environment. Gauge the bandwidth needed to handle the internet connections for cloud access. Support in planning and executing data-center consolidations, relocations, and migrations. Version Control: Requires knowledge of: Version management tool; Product deployment tools and processes; Release verification mechanism. To ensure that any modification to code is checked into correct branch and code merges are done properly. Authenticate and authorize of code changes in the version control system / central repository. Oversee the process of code branching and ensure seamless code build. Ensure that all impacted files are included in release communication. Integration Management: Requires knowledge of: Types of middleware; Different types of platforms; Functions of Application Programming Interfaces (APIs); Programming languages used for middleware; Principles and protocols for API-level integration; New and emerging middleware products, tools and methodologies in the industry. To evaluate opportunities for creating connections among various hardware and applications. Evaluate suitable middleware to be used for integrating existing legacy applications with current applications as required including cloud systems. Program middleware or other tools to enable effective integration of applications within. Perform API-level integration. Oversee the end-to-end process of application integration to the target environment. Investigate issues or failures of application integration. Facilitate modifications to improve the success of integration between application programs. Architecture Acumen: Requires knowledge of: Architectural principles; Systems and environment behavior; Architectural Styles, Patterns and plans; Architectural standards; Non-functional System performance parameters; Technology Strategy. To assist in decomposing the product architecture into multiple components and modules and define architectural specifications for each module. Create/Apply the right architectural pattern across the module as indicated in the architectural plan to obtain the right result. Define the architecture blueprint for the various components within a product/solution. Analyze syste
To view full details and how to apply, or
Job ID: 487246375
Originally Posted on: 7/29/2025
Want to find more Construction opportunities?
Check out the 167,807 verified Construction jobs on iHireConstruction
Similar Jobs