Austin, TX, US
21 days ago
Principal Technical Program Manager, Enterprise Engineering Availability
Amazon strives to be the world’s most customer centric company. To succeed, our products and services must be available at all times to our customers. The Enterprise Engineering Availability (EEA) team is responsible for improving the availability of internal systems (software, hardware, network) used by millions of Amazonians around the world.

A Principal Technical Program Manager on the EEA team will create multi-year programs which drive engineering best practices (testing, deployments, resilience, incident management) across multiple orgs resulting in a best-in-class IT experience which boosts employee productivity. Your programs will improve the performance of software teams and bolster the resilience of the software built by those teams. You will create a closed loop between incident response and incident prevention by analysis of top root causes for problems and then designing programs to eradicate those classes of problems going forward. Your analyses will identify opportunities for continual reduction of MTTR by improving automatic detection, diagnosis and mitigation recommendations. You will drive efforts to improve system telemetry and observability which will result in better prediction, detection and triage of customer-impacting outages.

This role is a perfect fit for an experienced technologist who is passionate about availability (alerting, metrics, monitoring, observability), incident management and machine learning. You thrive in a fast-paced, startup-like environment, are comfortable with full-stack applications, communicate effectively to all types of stakeholders (tech, non-tech), enjoy learning new technology and lead through others to ship complex software at scale in fast iterations.


A day in the life
* Deliver high-impact, high-visibility projects that improve the productivity of millions of Amazonians around the world
* Invent processes, tools, and technology to force multiply the effect of your contributions across many organizations.
* Be responsible for owning, scoping, leading and delivering projects and experiments end-to-end, leveraging statistical evaluation, pattern recognition, and machine learning.

We are open to hiring candidates to work out of one of the following locations:

Austin, TX, USA
Confirm your E-mail: Send Email