Job Description
Uber
Engineering Manager II, M3 Metrics Job Description
About the role
At Uber, we provide a centralized, reliable, and interactive observability platform that includes metrics, logging, and tracing. This platform empowers engineers with the tools needed for monitoring, troubleshooting, and performing root cause analysis at scale. Currently, the platform handles more than 4 billion metrics per second, 3 million tracing spans per second, and 25 million logs per second-and it’s continuously evolving to support Uber’s growth.
The M3 Metrics team, a core part of the telemetry platform, is responsible for delivering an end-to-end distributed metrics solution at Uber scale. As Engineering Manager II for the Metrics team, you will play a crucial role in shaping the development of Uber’s observability platforms and directly impacting the company’s overall reliability. We are seeking an experienced leader with a proven track record in building large-scale distributed systems. You will own multiple large-scale platforms, setting the vision in collaboration with product teams and senior engineers. You will be tasked with building the next-generation observability platform, rethinking how our collectors, aggregators, and query layers are constructed, and enabling our engineers to scale quickly and operate reliably.
What You’ll Need
- Technical Leadership: Lead and manage high-performing engineering teams.
- Hands-on engineering background: A strong foundation in software engineering, with a track record of designing, developing and maintaining robust backend services.
- Cross-functional Collaboration: Proven ability to work cross-functionally, effectively communicating and advocating for the team across different departments. Experience collaborating closely with Product, Infrastructure, and DevOps teams to deliver end-to-end, scalable solutions.
Basic qualifications
- BS or higher degree in Computer Science, or a related technical discipline, or equivalent experience.
- Around 5+years of experience managing a high performance engineering team.
- Strong problem solving skills, with relevant experience in designing and implementing large scale distributed backend services
Preferred qualifications
- Proven record of building and operating highly reliable distributed systems at scale.
- Familiarity with observability tools and technologies: Experience with OpenTelemetry, and/or building and operating monitoring infrastructure at large scale. E.g. PB sized ES clusters, Prometheus, Kibana, Grafana, Jaeger, etc..
- Global-scale operations experience: Exposure to operating metrics systems across regions.
- This role requires a lot of organizational alignment and consensus building – excellent written and verbal communication skills is a huge plus.
- Passionate about pursuing technical excellence and mentoring engineers.
- Deep expertise in observability: Experience in metrics systems, logging and tracing at scale is a significant plus.
What will the candidate do
- Set the Vision: Define and drive the development of Uber’s next-generation observability platform, managing one of the largest distributed systems in the industry with over 4.5 billion metrics per second-scaling at levels 100-1000x larger than standard systems.
- Ensure Operational Excellence: Lead key reliability initiatives, ensuring the platform is scalable, resilient, and supports Uber’s global engineering needs.
- Empower the Team: Coach, develop, and manage a high-performing team of engineers, fostering technical excellence and driving impactful results.
- Collaborate Globally: Work with diverse teams across the US and Europe, aligning with stakeholders, program teams, and product partners to deliver end-to-end solutions.
- Engage with Open Source: Contribute to and collaborate with the broader open-source community, advancing M3’s initiatives and fostering innovation.
- Drive Future Growth: Shape the team’s trajectory with opportunities to expand the charter, influence Uber’s engineering excellence, and tackle challenges at a truly global scale.
Public Information
- https://eng.uber.com/m3/
- https://eng.uber.com/logging/
- https://github.com/jaegertracing/jaeger
We welcome people from all backgrounds who seek the opportunity to help build a future where everyone and everything can move independently. If you have the curiosity, passion, and collaborative spirit, work with us, and let’s move the world forward, together.
Offices continue to be central to collaboration and Uber’s cultural identity. Unless formally approved to work fully remotely, Uber expects employees to spend at least half of their work time in their assigned office. For certain roles, such as those based at green-light hubs, employees are expected to be in-office for 100% of their time. Please speak with your recruiter to better understand in-office expectations for this role.
*Accommodations may be available based on religious and/or medical conditions, or as required by applicable law. To request an accommodation, please reach out to accommodations@uber.com.
To apply, please visit the following URL:https://www.themuse.com/jobs/uber/staff-software-engineertlm-backend-observability→