Role overview
You will be responsible for designing and implementing scalable, secure, and production-ready AI/ML platforms on AWS. This role combines hands-on engineering, platform thinking, and best-practice leadership in a large-scale enterprise environment.
What you'll work on
- Design and manage AWS infrastructure for AI/ML workloads (Step Functions, Lambda, EventBridge, MWAA)
- Build and optimize big data pipelines using PySpark / Spark (Glue, EMR, EKS)
- Manage large-scale data storage and querying (S3, Athena)
- Implement CI/CD pipelines (Jenkins) and Infrastructure as Code (Terraform)
- Ensure solutions are performant, scalable, and cost-efficient
- Collaborate with data scientists, product teams, and stakeholders
- Contribute to engineering standards and mentor other team members
What we're looking for
- Experience with MLflow, SageMaker or Bedrock
- Knowledge of real-time data processing (e.g. Kafka)
- AWS certifications