Site Reliability Engineer-Cloud Infrastructure

TikTok
New South Wales
Full time
4 days ago
Team Introduction
The team is responsible for infrastructure systems, including Storage/Computing/DB. We aim to be the leading SRE team across the industry. In the SRE team, you will have the opportunity to manage the complex challenges of scale, while using expertise in coding, algorithms, complexity analysis, and large-scale system design. We embrace a culture of diversity, intellectual curiosity, openness, and problem-solving. We also encourage ownership, self-governance and independence to work on various projects, and an environment that provides the support and mentorship needed to learn and grow as an engineer.

Responsibilities
- Reliability: Ensuring the reliability and efficiency of our core infrastructure, focusing on system capacity and stability; setting up reliability standards and recovery SOP.
- Reliability: Troubleshooting and locating technical issues, bottleneck analysis, managing system high availability architecture transformation and upgrading.
- Efficiency: Building automated operation solutions for large-scale systems; partnering with system development teams for system iteration.
- Efficiency: Designing and implementing software platforms and monitoring frameworks for efficient, automated, and intelligent service-oriented architecture (SOA) governance.
- Cost: There are millions of CPUs. We should build delivery standards, and monitor and budget systems to optimize the cost of the company.
- Compliance: Designing and setting up new IDC; designing and implementing a data protection plan to meet the standard requirement.
Apply
Other Job Recommendations:

Site Reliability Engineer, Spanner

Google
Sydney, New South Wales
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed,...
2 days ago

Shift Manager, Site Reliability Engineering

ServiceNow
New South Wales
  • Team management, career development, project prioritization...
  • Drive a culture of intolerance to manual activities that...
3 days ago

Reliability Engineer

Qantas
Perth, Western Australia
  • Strong demonstrated skills in problem solving and analytical...
  • Minimum 12 months experience in a similar role...
1 week ago

Senior Customer Reliability Engineer

Replicated
Remote
USD 140,250 - USD 235,000
  • Provide expert support to customers, resolving issues...
  • Strong problem-solving skills and the ability to think...
1 week ago

Senior Integrity Monitoring, Maintenance & Reliability (IMMR) Engineer

Worley
About the job We are seeking an experienced Senior IMMR Engineer to lead the development and execution of integrity monitoring,...
2 weeks ago