Site Reliability Engineer - Infrastructure Engineering

ByteDance logo

ByteDance

View Salaries, Reviews, and more  

Job Summary


Salary
S$9,072 - S$18,036 / Monthly EST

Job Type
Permanent

Seniority
Mid

Years of Experience
At least 3 years

Tech Stacks
HTTP TCP RPC ELK Shell UNIX Azure openstack Kubernetes C++ Hadoop Linux Go Python AWS Java

Job Description

The Datacenter Infrastructure Engineering team supports the company's fast growth by building and operating hyperscale datacenters. The team manages the end to end lifecycle of server fleet, providing cloud solutions and various infrastructure services ensuring that they are scalable and are reliable.

[Responsibilities]
As the [Site Reliability Engineer - Infrastructure Engineering], you would be responsible for at least one if not all of these areas:

Infrastructure:
- Build, expand and operate global infrastructures, including large-scale systems in public and private clouds, data centers and content delivery networks.
- Build tools, automations, visualizations and monitors to facilitate the operation and optimization of the global infrastructure.
- Help improve the whole lifecycle of infrastructure services from inception and design throughout development, to deployment, user support and refinement.
- Supporting end-to-end to production environment by responding to performance and reliability issues and participating in rotational on-calls.

Security:
- Conduct security reviews of core corporate and production infrastructure.
- Carry out security updates and protect enterprise infrastructure in system and network level.
- Drive enterprise focused security improvements to products and services.
- Build security tools and processes for critical infrastructure protection, monitoring and remediation.

Traffic:
- Build tools, automations, visualizations and monitors to facilitate the operation and optimization of the traffic infrastructure.
- Provide primary operational support and engineering for traffic infrastructure systems.
- Gather and analyze metrics to assist in performance tuning and fault finding.

[Minimum Qualifications]
- Bachelor’s degree in Computer Science or equivalent with 3+ years of relevant experience.
- Experience in one or more programming languages such as Java, Python C++, Go, or scripting experience in Shell and Python.
- Ability to thrive in a fast-paced environment.
- Relevant experience working in a Datacenter setup or environment with large scale infrastructure setup featuring high traffic.

As a Site Reliability Engineer with the Infrastructure Engineering team, you would be expected to be an expert in at least one if not all of these areas as well:
Infrastructure:
- Experience working with Cloud infrastructure
- Experience in building solutions with AWS, Google, Azure and other cloud services.
- Experience in developing and operating one or more following systems: OpenStack, Kubernetes, Nginx, ipvs, ELK stack, Hadoop, etc.
- Experience working with Unix Linux systems, from kernel to shell and beyond.
- Experience working with system libraries, file systems, and client-server protocols.
- Experience in designing, analyzing, and building automation and tools for large scale systems.
- Experience in networking technologies such TCP/IP, BGP, DNS, etc. in a carrier grade environment.

Security:
- Experience in networking security like DDoS and WAF protection.
- Experience in security protocols like TLS protocol features and updates.
- Experience in VPNs and building encrypted communication channel.
- Conducted infrastructure security review, patch and update potential security vulnerabilities.
- Experience in one or more programming languages such as Java, C++, Go, or scripting experience in Shell and Python.

Traffic:
- Experience working with traffic systems from CDNs to loadbalancers and beyond.
- Experience working with network devices, remote management systems, and client-server protocols.
- Knowledge of network infrastructure and/or routing.
- Experience with Layer 4 / Layer 7 loadbalancers.
- Knowledge of protocols like TCP/IP, HTTP, RPC, TLS etc.
- Experience working with containerized environment.
- Experience in one or more programming languages such as Java, C++, Go, or scripting experience in Shell and Python.

banner icon
Prepare For Your Interview in 1 Week?
Equip yourself with possible questions that interviewers might ask you, based on your work experience and job description.
Get Started!

Achieve your dream job with our top-notch tools!

Resume Checker Illustration

Resume Checker

Our free resume checker analyzes the job description and identifies important keywords and skills missing from your resume in just a minute!

Check Now
Resume Checker Illustration

Interview Preparation

Utilizing advanced AI, our tool generates tailored interview questions based on your industry, role, and experience. Practice and receive feedback on your answers in real time!

Let's Prepare
Resume Checker Illustration

Resume Builder

Let us show you the differences between a bad, good, and great resume, and guide you in building a resume that helps you stand out to employers, ensuring you land your next position faster!

Build Resume