Leads improvements to production infrastructure, platforms, and services. Collaborates closely with other team leads and proactively designs new, scalable solutions for fast-changing environments. Leads deep drill-down analysis into production support operations and identifies and provides necessary fixes. Investigates errors across teams and proactively enhances troubleshooting processes. Builds complex enhancements to resolve defects. Leverages the standard template to improve production infrastructure, platforms, and/or services. Leads engagement with stakeholders and drives the design, development, and configuration of enterprise supporting systems. Takes proactive steps to identify server architecture and networking solutions. Leads code and architecture reviews and capitalizes on opportunities to improve the health, scalability, and resiliency of products and services.
Production Infrastructure:
- Leads the maintenance of production infrastructure, platforms, and services that support the growing client base, and designs and implements highly scalable environments.
- Collaborates closely with other team leads in the organization to direct work, and provides and adapts solutions to their needs.
- Proactively designs new, scalable solutions for fast-changing environments with advanced needs in various fields (e.g., configuration deployments, monitoring, and logging).
Troubleshooting, Testing, and Improvements - Drilldown Analysis/Troubleshooting:
- Leads deep drill-down analysis into production support operations and identifies and provides necessary fixes.
- Investigates errors, debugs source code and performance bottlenecks, and helps determine whether root cause analyses (RCAs) are needed across teams.
- Proactively enhances troubleshooting processes.
Troubleshooting, Testing, and Improvements - Testing and Improvements:
- Builds complex enhancements (e.g., automating alerts) and reviews proposed enhancements that resolve defects.
- Works with cross-functional teams globally to ensure service health through pre-production testing of new releases, post-change validation, and automation work to improve the testing framework and methodology.
- Identifies and implements changes and new ideas for the improvement of production infrastructure, platforms, and/or services.
- Leverages the standard template to create user acceptance test plans and participates in operational readiness activities for new services.
System Design and Architecture - System Design:
- Drives the design, development, configuration, and/or deployment of enterprise supporting systems.
- Builds working relationships with external or internal customers to implement large-scale solutions, business process architecture, and application system design and implementation, impacting results within and beyond the team.
- Leads engagement in stakeholder meetings for requirements gathering.
System Design and Architecture - System Architecture:
- Takes proactive steps to identify solutions in application or server architecture and networking.
- Drives the development of architecture standards and practices in alignment with business goals.
Product/Service Development:
- Drives the design, development, and enhancement of new and existing products and services from inception through design, advocating for adherence to business feature requirements.
- Leads code and architecture reviews of the products and services.
- Resolves complex performance and scalability issues related to products and services.
- Identifies and capitalizes on opportunities to improve health, scalability, and resiliency of products and services.
- Participates in performance benchmarking.
Planning & Execution:
- Manages and coordinates moderately complex tasks, monitoring timelines and deliverables to ensure timely completion and adherence to requirements for a moderately sized project or initiative. Efficiently delegates, monitors, and prioritizes work across multiple projects, providing technical oversight and adjusting plans to address shifts in resources or timelines.
Continuous Improvement:
- Develops ideas, recommends updates, and/or collaborates on the implementation of process improvements to increase the efficiency and effectiveness of processes, protocols, and workflows across teams, and evaluates the impact on key stakeholders. Solicits feedback from others on ideas for alternative approaches and methods for continued improvement.
Qualifications
- 11 years of experience in the application of server architecture, system administration, software development, or cloud application delivery.
- Demonstrated ability in or knowledge of configuration management and deployment tools, including automating infrastructure provisioning and management.
- Demonstrated ability in or knowledge of scripting languages, including writing and maintaining scripts to improve system efficiency.
- Demonstrated ability to coordinate releases and deployments to deliver enhancements with minimal disruption.
- Demonstrated ability in or knowledge of change control, including assessing risks and coordinating structured modifications to products or services.
- Demonstrated ability to architect end-to-end solutions that meet both business and technical requirements.
- Demonstrated ability in or knowledge of cloud architecture, including designing scalable, reliable, and performant cloud services.
- Demonstrated ability to conduct root cause analysis and implement corrective action plans.
- Demonstrated ability in or knowledge of virtualization, including designing, managing, or troubleshooting virtual environments and infrastructure.
Career Level - IC4