about 5 hours ago

Logo of T-Mobile

Sr Engineer, Systems Reliability

$107k - $193k

T-Mobile

USFrisco, TXRemote

At T-Mobile, we invest in YOU!  Our Total Rewards Package ensures that employees get the same big love we give our customers.  All team members receive a competitive base salary and compensation package - this is Total Rewards. Employees enjoy multiple wealth-building opportunities through our annual stock grant, employee stock purchase plan, 401(k), and access to free, year-round money coaches. That’s how we’re UNSTOPPABLE for our employees!

As a Senior Engineer, Systems Reliability, you will lead the continuous enhancement of reliability, scalability, performance, and security across our technology infrastructure and software delivery processes. Your expertise in automation, CI/CD practices, cloud-native architectures, observability, and mentoring, along with your critical thinking skills and growth mindset, will directly influence our ability to deliver high-quality services quickly, reliably, and securely.

*** This position will sit in Frisco, TX only. This is not a remote role - this is a hybrid environment requiring 3 days a week in office.

Key Responsibilities:

  • Automation and CI/CD:
    • Develop, maintain, and optimize CI/CD pipelines (Jenkins, GitLab CI/CD) for efficient and reliable software deployment.
    • Automate tasks and operational workflows using scripting languages (Bash, Python) and infrastructure-as-code (Ansible, Terraform).

  • Infrastructure Reliability and Management:
    • Design and handle scalable, reliable infrastructure leveraging cloud services (AWS, Azure, GCP) and container technologies (Docker, Kubernetes).
    • Ensure high system availability, optimal performance, and robust security practices are implemented and continuously improved.

  • Monitoring and Observability:
    • Implement and maintain comprehensive monitoring, logging, and alerting solutions using tools such as OpenTelemetry, Prometheus, Grafana, and Splunk.
    • Leverage insights from monitoring systems to proactively identify issues and implement improvements.

  • Incident Response and Prevention:
    • Lead incident response efforts, including root cause analysis and post-incident reviews, driving effective preventive measures.
    • Establish and refine standards and best practices to minimize downtime and enhance reliability.

  • Mentorship and Leadership:
    • Mentor and guide junior engineers, promoting a culture of continuous learning, critical thinking, and innovation.
    • Collaborate cross-functionally, influencing and aligning teams toward reliability best practices and DevOps principles.

  • Continuous Improvement and Innovation:
    • Contribute significantly to strategic initiatives such as cloud migration, microservices architecture adoption, and modernization efforts.
    • Balance system reliability with agility and innovation, continuously refining our approach to enhance user experience.

  • Strategic Technical Leadership:
    • Participate actively in architectural decisions and contribute to the strategic technology roadmap.

  • Performance and Capacity Planning:
    • Conduct capacity planning and performance profiling to proactively identify scaling issues and optimize resource utilization.

  • Disaster Recovery:
    • Lead efforts in developing, maintaining, and testing comprehensive disaster recovery and business continuity strategies.

  • Financial and Operational Efficiency:
    • Evaluate infrastructure costs and efficiency, proactively implementing optimizations to balance cost-effectiveness with system reliability.

  • Documentation and Communication:
    • Maintain accurate and comprehensive technical documentation, ensuring effective knowledge sharing and clear team communication.

Qualifications:

  • Bachelor’s degree in Computer Science, Engineering, or related field; Masters degree preferred.
  • 4+ years experience in SRE, DevOps, or related roles.
  • Deep expertise in CI/CD tools (Jenkins, GitLab), scripting languages (Python, Bash), and version control (Git).
  • Deep expertise with infrastructure-as-code tools (Terraform, Ansible), containerization technologies (Docker), and orchestration platforms (Kubernetes).
  • Experience with cloud platforms (AWS, Azure, GCP).
  • Solid experience in monitoring and logging solutions (OpenTelemetry, Splunk, Prometheus).
  • Proven experience managing and resolving critical incidents effectively.
  • Strong problem-solving skills, critical thinking ability, and the capability to troubleshoot complex issues.
  • Exceptional collaboration, communication skills, and a growth mindset, capable of influencing cross-functional teams.
  • Familiarity with DevSecOps practices and security integration strongly preferred.


Licenses and Certifications:

  • AWS Certified DevOps Engineer
    This certification validates technical expertise in provisioning, operating, and managing distributed application systems on the AWS platform. (Preferred)
  • Certified Kubernetes Administrator
    This certification validates the skills required for day-to-day administration of Kubernetes environments. (Preferred)
  • Google Cloud Certified - Professional DevOps Engineer
    This certification validates the ability to efficiently develop and deploy applications using Google Cloud technologies and to manage operations. (Preferred)

We pride ourselves on encouraging a culture of innovation, advocating for agile methodologies, and promoting transparency in all that we do. Join us in embodying the spirit of the Un-carrier and make a tangible impact!

Our team is dynamic where no day is the same, and we are a diverse and inclusive team passionate about growth and innovation! If youre up to the challenge, apply today!

  • At least 18 years of age
  • Legally authorized to work in the United States

Travel:
Travel Required (Yes/No): No

DOT Regulated:
DOT Regulated Position (Yes/No): No
Safety Sensitive Position (Yes/No): No

Base Pay Range: $107,300 - $193,500

Corporate Bonus Target: 15%

The pay range above is the general base pay range for a successful candidate in the role. The successful candidate’s actual pay will be based on various factors, such as work location, qualifications, and experience, so the actual starting pay will vary within this range.

At T-Mobile, employees in regular, non-temporary roles are eligible for an annual bonus or periodic sales incentive or bonus, based on their role. Most Corporate employees are eligible for a year-end bonus based on company and/or individual performance and which is set at a percentage of the employee’s eligible earnings in the prior year. Certain positions in Customer Care are eligible for monthly bonuses based on individual and/or team performance. To find the pay range for this role based on hiring location, https://paylookup.t-mobile.com/paylookup?reqID=REQ318664&paradox=1

At T-Mobile, our benefits exemplify the spirit of One Team, Together! A big part of how we care for one another is working to ensure our benefits evolve to meet the needs of our team members. Full and part-time employees have access to the same benefits when eligible. We cover all of the bases, offering medical, dental and vision insurance, a flexible spending account, 401(k), employee stock grants, employee stock purchase plan, paid time off and up to 12 paid holidays - which total about 4 weeks for new full-time employees and about 2.5 weeks for new part-time employees annually - paid parental and family leave, family building benefits, back-up care, enhanced family support, childcare subsidy, tuition assistance, college coaching, short- and long-term disability, voluntary AD&D coverage, voluntary accident coverage, voluntary life insurance, voluntary disability insurance, and voluntary long-term care insurance. We dont stop there - eligible employees can also receive mobile service & home internet discounts, pet insurance, and access to commuter and transit programs! To learn about T-Mobile’s amazing benefits, check out www.t-mobilebenefits.com.

Never stop growing!
As part of the T-Mobile team, you know the Un-carrier doesn’t have a corporate ladder–it’s more like a jungle gym of possibilities! We love helping our employees grow in their careers, because it’s that shared drive to aim high that drives our business and our culture forward. By applying for this career opportunity, you’re living our values while investing in your career growth–and we applaud it. You’re unstoppable!

T-Mobile USA, Inc. is an Equal Opportunity Employer. All decisions concerning the employment relationship will be made without regard to age, race, ethnicity, color, religion, creed, sex, sexual orientation, gender identity or expression, national origin, religious affiliation, marital status, citizenship status, veteran status, the presence of any physical or mental disability, or any other status or characteristic protected by federal, state, or local law. Discrimination, retaliation or harassment based upon any of these factors is wholly inconsistent with how we do business and will not be tolerated.

Talent comes in all forms at the Un-carrier. If you are an individual with a disability and need reasonable accommodation at any point in the application or interview process, please let us know by emailing ApplicantAccommodation@t-mobile.com or calling 1-844-873-9500. Please note, this contact channel is not a means to apply for or inquire about a position and we are unable to respond to non-accommodation related requests.