AI/ML Architect with AWS Cloud Infrastructure
- Balin Technologies LLC
- Gaithersburg, Maryland
- Full Time
Hi,
Job Title: ML/AI Architect
Location: Gaithersburg, MD (3 days per week)
Job Description:
- This position requires a deep understanding of cloud-native ML/AI Ops methodologies and technologies, AWS infrastructure, State-of-the-art (SOTA) Foundation Models and AWS GenAI Services, and the unique demands of regulated industries, making it a cornerstone of our success in delivering impactful solutions to the pharmaceutical industry.
workloads and processes.
- ML/AI Cloud Operations and Engineering
- Architect and implement scalable AWS ML/AI cloud infrastructure in a multi-tenant SaaS environment.
- Establish governance frameworks for ML/AI infrastructure management and ensure compliance with industry standard processes.
- Ensure principled and methodical validation pathways and a Well Architected Framework for Embryonic Research (WAFER) similar to and building on AWS s Well Architected Framework (WAF) for all early stage product and operational GenAI PoC s across the organization.
- Oversee ML/AI related Kubernetes (k8s) cluster management and provide guidance on alternative ML/AI workflow orchestration options such as Argo vs Kubeflow, and ML/AI data pipeline creation, management and governance with tools like Airflow.
- Employ AWS CDK (TypeScript), Projen, and Argo CD to automate infrastructure deployment and management.
- Help set the strategy and manage the tactical balance between framework and platform experimentation and democratization with standardization and centralized management and governance
- Conduct cost-benefit analyses and formal processes for selection and utilization of foundation models, evaluating their architectures, performance, and costs.
- Work with multiple teams to ensure that the platform meets organizational needs and scales effectively.
Essential Skills/Experience:
- HS Diploma and 5 years of experience in Engineering/IT solutions OR BA/BS
- Minimum of 5 years in cloud infrastructure design and management roles.
- Deep understanding of the Data Science Lifecycle (DSLC) and the ability to shepherd data science projects from inception to production within the platform architecture.
- Expert in Typescript, AWS CDK, Projen, and Argo CD and other Cloud Infrastructure CI/CD Tools
- Extensive experience in managing Kubernetes clusters for ML workflows.
- Solid understanding of foundation models and their applications in ML/AI solutions.
- Strong background in AWS DevOps practices and cloud architecture.
- Deep knowledge of AWS services (Bedrock, Sagemaker, EC2, S3, RDS, Lambda, etc) and hands-on design and implementation cloud systems (microservices architecture, API design, and database management (SQL/NoSQL))
- Experience with monitoring and optimizing cloud infrastructure for scalability and cost-efficiency.
- Ability to collaborate effectively with engineering, design, product, science and security teams.
- Strong written and verbal communication skills for reporting and documentation.
- Demonstrated ability to manage large-scale, complex projects across an organization.
- Proven experience in conducting performance and cost analyses of AWS infrastructure and ML/AI models.
Job ID: 483749690
Originally Posted on: 7/2/2025
Want to find more Construction opportunities?
Check out the 176,632 verified Construction jobs on iHireConstruction
Similar Jobs