Sr Engineer, Site Reliability

T-Mobile USA, Inc.
Atlanta, US

Job Description

At T-Mobile, we invest in YOU! Our Total Rewards Package ensures that employees get the same big love we give our customers. All team members receive a competitive base salary and compensation package - this is Total Rewards. Employees enjoy multiple wealth-building opportunities through our annual stock grant, employee stock purchase plan, 401(k), and access to free, year-round money coaches. That’s how we’re UNSTOPPABLE for our employees!

Job Overview

Are you ready to join the Un-carrier movement?

This role ensures the reliability and resilience of digital infrastructure to support highly critical Credit and Collections new project initiatives while continuously driving innovation. It involves automating processes and reducing manual effort to prevent operational incidents and improve system performance. The role requires expertise in programming, scripting, incident response management, and various technical tools to maintain system robustness. Success is measured by system stability, incident reduction, and continuous improvement in operational efficiency. The work directly impacts organizational stability and customer experience by maintaining high-performing and reliable systems.

Job Responsibilities:

Improve system reliability and resilience by identifying issues and implementing preventive measures to reduce downtime

Automate processes to accelerate software development and deployment while minimizing manual interventions using sophisticated agentic AI methods and tools

Design and maintain GitLab CI/CD pipelines to automate build, test, and deployment processes across multiple environments

Conduct root cause analysis and collaborate with problem management to prevent incident recurrence and improve system operations

Apply problem-solving and analytical skills to prevent operational incidents and maintain system stability

Leverage programming, scripting, and incident response expertise to improve system robustness and efficiency

Continuously learn new skills and technologies to adapt to changing environments and drive innovation

Also responsible for other duties/projects as assigned by business management as needed

Education and Work Experience:

Bachelor's Degree plus 3 years of related work experience

OR advanced degree with 1 year of related work experience

OR combination of education and experience deemed equivalent (Required)

Acceptable areas of study include Computer Science, Engineering or related field (Required)

4-7 years Working in operations or develops environments (Preferred)

4-7 years Troubleshooting customer related issues and managing customer relationships (Preferred)

4-7 years Developing software solutions using Python or similar programming languages (Preferred)

Knowledge, Skills and Abilities:

Programming Proficiency in programming and scripting languages such as Python and Bash. (Required)

Automation Ability to automate processes and reduce manual effort. (Required)

Incident Management Understanding of incident response management and operational support. (Required)

Experience with designing and maintaining CICD Pipelines. (Required)

Ability to learn new skills and technologies quickly and adapt to changing circumstances. (Required)

Understanding of system reliability and resilience principles. (Required)

Development and automation experience using Agentic AI and ML tools (preferred)

Familiarity with Billing and Credit business applications and platforms (preferred)

Licenses and Certifications:

AWS Certified DevOps Engineer

This certification validates technical expertise in provisioning, operating, and managing distributed application systems on the AWS platform. (Preferred)

Certified Kubernetes Administrator

This certification validates the skills required for day-to-day administration of Kubernetes environments. (Preferred)

Google Cloud Certified - Professional DevOps Engineer

This certification validates the ability to efficiently develop and deploy applications using Google Cloud technologies and to manage operations. (Preferred)

At least 18 years of age

Legally authorized to work in the United States

Travel:

Travel Required (Yes/No): No

DOT Regulated:

DOT Regulated Position (Yes/No): No

Safety Sensitive Position (Yes/No): No

Base Pay Range: $107,300 - $193,500

Corporate Bonus Target: 15%

The pay range above is the general base pay range for a successful candidate in the role. The successful candidate’s actual pay will be based on various factors, such as work location, qualifications, and experience, so the actual starting pay will vary within this range.

At T-Mobile, employees in regular, non-temporary roles are eligible for an annual bonus or periodic sales incentive or bonus, based on their role. Most Corporate employees are eligible for a year-end bonus based on company and/or individual performance and which is set at a percentage of the employee’s eligible earnings in the prior year. Certain posi

Skills & Requirements

Technical Skills

PythonBashGitLab CI/CD pipelinesAgentic AI and ML toolsproblem-solvinganalytical skillsincident response managementoperational supportCredit and CollectionsBilling

Salary

$107,300 - $193,500

year

Level

mid

Posted

4/11/2026

Continue to Glassdoor

You will be redirected to the job posting on Glassdoor.