Back to all rolesSenior Site Reliability EngineerApplyJob detailsBusiness / SupercomputerMilanFull-time€50,000 EUR - €70,000 EURWe are looking for an experienced Site Reliability Engineer to join our growing team in Milan and help shape the future of our flagship project, Colosseum, one of Europe’s most powerful AI supercomputers, currently in development.Designed to run our proprietary AI models at scale, it forms the compute backbone behind the intelligence we deliver to the world’s most demanding industries.In this role, you will design and implement observability and control mechanisms that extract operational data from infrastructure and feed it into automated systems to enable continuous optimization, including key system budgets such as power, cooling and service level, security-level objectives. You will be responsible for actively guarding and maintaining these operational budgets as part of day-to-day system reliability and performance management.You will also contribute to operational excellence through blameless post-mortem analysis and structured incident learning, ensuring continuous improvement of system behavior and resilience. As a part of the team, you will work closely with Platform Engineering in a shared cybersecurity model, where SRE focuses on detection and monitoring, while Platform Engineering ensures the secure design and operation of the underlying infrastructure.What You HaveBachelor’s or Master’s degree in Computer Science, Computer Engineering, Electrical Engineering, or a related field.At least 6 years of experience as a Site Reliability Engineer or in similar roles.Strong experience with observability and monitoring systems such as Prometheus, Thanos, Grafana, and OpenTelemetryExperience with low-level system instrumentation and performance visibility using technologies such as eBPFExperience with security monitoring and threat detection tools such as Zeek, Wazuh, or equivalent SIEM / security observability platformsStrong experience with containerized and cloud-native environments, particularly KubernetesStrong software development skills, particularly in Python, with the ability to build automation, integrations, and custom toolingExperience integrating heterogeneous infrastructure systems across multiple vendors, APIs, and evolving tool ecosystemsFamiliarity with modern infrastructure automation and emerging agent-based frameworks such as MCP / A2A (or equivalent technologies)Exposure to digital twin technologies and simulation platforms such as NVIDIA Omniverse or equivalentStrong ability to design, build, and maintain software-driven infrastructure solutions in complex, large-scale environmentsWho You AreA versatile engineer, comfortable operating in complex and fast-paced environments.Driven and fearless, you proactively tackle challenges and overcome obstacles with determination.A systems thinker, capable of understanding the broader architecture and identifying dependencies across platforms and technologies.A collaborative team player who is enthusiastic, curious, and passionate about problem-solving, thriving both independently and within cross-functional teams.An effective communicator with strong interpersonal skills, able to engage with diverse stakeholders and foster collaboration.Fluent in English and eager to contribute in a multicultural and international environment.BenefitsPerksLearning Friday. If our team members know more, so do we. That’s why we give everyone a training budget that they can spend on books, online courses or other training materials.Smart Working. Trains can be a drag, you can save some commuting time by working from home.Salary is based on experience and topped up with other bonuses.We offer a competitive salary, as well as an opportunity to receive company equity. The typical salary for this role ranges between € 50.000 and € 70.000. As you gain experience and make more significant contributions to the business, your compensation will be reviewed to match...
Want jobs like this matched to you?
Swoopd scores fresh postings against your résumé so you only see the matches that matter.