We manage critical infrastructure systems across storage, computing, and databases. As a leading SRE team at TikTok's Technical Infrastructure, we solve complex scalability challenges through expert system design and automation. You'll work independently on diverse projects while receiving strong mentorship, applying your skills in coding, algorithms, and systems architecture to maintain reliable infrastructure at scale.
Our team combines technical excellence with a collaborative culture that values intellectual curiosity and diverse perspectives. We provide an environment where engineers can take ownership of projects while growing their expertise in large-scale system design and operations.
Responsibilities
- Grow and lead a high-performing SRE engineering team focused on developing and maintaining scalable, production-grade AI Platform systems, with emphasis on reliability, performance, and operational excellence
- Balance hands-on technical contributions in system design and implementation with effective management of team growth, performance, and professional development
- Provide strategic technical direction and architectural guidance to strengthen engineering practices, mentor team members, and drive technical excellence across collaborative projects
- Build and maintain strong partnerships across engineering teams, business units, and external stakeholders to align technical initiatives with organizational goals and drive successful outcomes
- Champion innovation within the team by evaluating emerging technologies, implementing modern engineering practices, and driving continuous improvements to the AI Platform infrastructure