Lead MLOps Architect – AWS

NorthBay - Pakistan · Cairo, Egypt · Posted 2026-02-25

Experience: 10–12 Years
Location: Egypt (Onsite Role)
Employment Type: Full-Time

Job Summary

We are seeking a highly experienced Lead MLOps Architect with deep AWS expertise to lead the design, architecture, and governance of enterprise-grade ML platforms. This role requires strong leadership capabilities, hands-on expertise in scalable ML systems, and experience managing large production environments.

Key Responsibilities

- Architect and lead enterprise-scale MLOps platforms on AWS
- Define best practices for ML lifecycle management, deployment standards, and governance
- Lead production deployment of ML models using AWS-native services
- Design automated CI/CD pipelines for ML workflows and infrastructure
- Implement advanced monitoring, drift detection, retraining automation, and observability
- Ensure high availability, scalability, security, and cost optimization
- Establish model versioning, reproducibility, and experiment tracking standards
- Lead troubleshooting of complex production issues
- Mentor and lead a team of MLOps and platform engineers
- Collaborate with stakeholders to align ML platform strategy with business objectives

Required Skills & Qualifications

MLOps & Machine Learning

- 10–12 years of overall experience with strong focus on ML production systems
- Proven experience leading ML platform architecture and large-scale deployments
- Deep understanding of ML lifecycle management, governance, and reproducibility
- Hands-on experience with TensorFlow, PyTorch, Scikit-learn
- Strong experience with MLflow or enterprise model management tools

AWS Cloud (Mandatory)

- Advanced hands-on expertise in:
  - Amazon SageMaker (training, pipelines, endpoints)
  - S3, EC2, Lambda
  - ECR, ECS, EKS
  - IAM, CloudWatch
- Experience designing secure, compliant, and scalable ML architectures
- Experience implementing cost optimization strategies on AWS

DevOps, Containers & IaC

- Strong expertise in Docker and Kubernetes (EKS)
- Advanced CI/CD implementation
- Infrastructure as Code using Terraform and/or CloudFormation
- Experience implementing GitOps practices

Programming & Data

- Expert-level Python skills
- Experience designing robust data pipelines
- Strong understanding of SQL/NoSQL systems
- Exposure to streaming or real-time ML systems

Preferred Qualifications

- AWS Professional-level certifications
- Experience with ML security, explainability, and regulatory compliance
- Experience building enterprise feature stores
- Exposure to real-time inference systems
