Software Engineer III - AI/ML Platform Operations - Remote
Role responsibilities
The Software Engineer III leads operational excellence and reliability for enterprise AI and data platforms, keeping AI/ML solutions stable and scalable. The role focuses on AI platform operations, MLOps, automation, and continuous improvement of enterprise AI capabilities.
Requirements
Candidates should have 3+ years of experience in software engineering or a related field and a bachelor's degree in a relevant discipline. Strong cloud operations and automation skills, along with experience on AI/ML platforms, are essential.
Key skills
AI/ML Platforms, MLOps, Automation, Observability, Cloud Operations, Software Engineering, Incident Management, Problem Resolution, Technical Leadership, Collaboration, Python, Java, JavaScript, CI/CD, AWS, Monitoring
Keywords
AI, ML, Platform Operations, MLOps, Automation, Reliability Engineering, Deployment Support, Observability, Governance, Continuous Improvement, Palantir Foundry, AWS Bedrock, Amazon SageMaker, Cloud-native Services, Generative AI, Incident Management, Root Cause Analysis, Technical Leadership, Collaboration, Python, Java, JavaScript, CI/CD, Monitoring, Datadog, Splunk, Grafana, Prometheus, CloudWatch, OpenTelemetry