This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Telecom Observability Engineer based in the United States.
This is a hands-on, mission-critical engineering role focused on ensuring the reliability, visibility, and performance of complex mobile telecom networks. You will work at the intersection of telecom infrastructure, distributed systems, and real-time monitoring, collaborating closely with engineering teams and customer support escalation groups. The role involves designing and maintaining observability frameworks that detect, analyze, and help resolve network incidents before they impact customers. You will contribute directly to the stability of 2G/3G/4G/5G environments while helping evolve modern monitoring and alerting systems. This is an opportunity to work alongside highly experienced telecom professionals in a fast-paced, technically deep environment. The position is ideal for someone who thrives in operational excellence and wants to deepen expertise in mobile network architecture and observability at scale.
This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Telecom Observability Engineer based in the United States.
This is a hands-on, mission-critical engineering role focused on ensuring the reliability, visibility, and performance of complex mobile telecom networks. You will work at the intersection of telecom infrastructure, distributed systems, and real-time monitoring, collaborating closely with engineering teams and customer support escalation groups. The role involves designing and maintaining observability frameworks that detect, analyze, and help resolve network incidents before they impact customers. You will contribute directly to the stability of 2G/3G/4G/5G environments while helping evolve modern monitoring and alerting systems. This is an opportunity to work alongside highly experienced telecom professionals in a fast-paced, technically deep environment. The position is ideal for someone who thrives in operational excellence and wants to deepen expertise in mobile network architecture and observability at scale.
Accountabilities:
- Design, implement, and continuously improve end-to-end observability strategies across telecom and distributed network environments, ensuring high visibility into system performance and reliability.
- Monitor, troubleshoot, and support core mobile network infrastructure (2G/3G/4G/5G), identifying performance issues and ensuring rapid escalation and resolution of incidents.
- Deploy and manage monitoring and observability tools such as Grafana, SevOne, ELK Stack, and CloudPak for AI-driven operations, ensuring system health visibility and actionable alerting.
- Develop and refine KPIs, SLIs, and SLOs to measure network performance, reliability, and service quality across critical telecom systems.
- Build dashboards, alerting systems, and real-time visualizations to support NOC engineers, engineering teams, and support escalation groups in incident response and root cause analysis.
- Implement distributed tracing, log aggregation, and anomaly detection systems to proactively identify and mitigate network and infrastructure issues.
- Support technical troubleshooting efforts by analyzing incidents, documenting root causes, and defining appropriate escalation paths across teams.
- Create training materials and operational documentation to improve internal understanding of KPIs, dashboards, and observability practices.
- 10+ years of experience in telecom environments, including L3 support or equivalent network operations roles
- Strong understanding of mobile network architectures (2G, 3G, 4G, 5G) and core telecom infrastructure
- Extensive experience with observability and monitoring tools such as Grafana, Prometheus, ELK Stack, SevOne, or CloudPak for AIOps
- Solid understanding of distributed systems, microservices architectures, and large-scale infrastructure environments
- Basic proficiency in scripting or programming (Python, Linux, Bash) for automation and troubleshooting
- Experience with cloud platforms such as AWS, GCP, or Azure
- Strong understanding of SLOs, SLIs, and SLAs in production environments
- Excellent problem-solving, communication, and documentation skills
- Ability to work collaboratively with engineering, operations, and support teams in high-pressure situations
- Competitive compensation package aligned with experience
- Comprehensive health, dental, and vision insurance
- Flexible work arrangements and remote-friendly environment
- Opportunity to work on large-scale telecom infrastructure and cutting-edge observability systems
- High-impact role with direct influence on network reliability and customer experience
- Collaborative engineering culture with strong technical mentorship
- Professional development opportunities in telecom, cloud, and observability technologies
- Inclusive workplace culture that values diversity and continuous learning.
Requirements
This role requires deep telecom domain expertise combined with strong observability engineering and systems monitoring experience. The ideal candidate is highly analytical, operationally strong, and comfortable working in complex, distributed environments with high availability requirements.