Site Reliability Engineer
8.0/10
Jump Trading
$98,000 – $162,000 USD
Office / on-site
mid
13 days ago
cryptotechLinuxCC++GolangRust
AI Summary
The vacancy is well-structured with clear responsibilities and requirements, though some areas could use more detail.
Check Match — Just drop your CV
See your fit for Site Reliability Engineer in seconds.
Description
What You'll Do
- •Develop deep technical expertise in your assigned product area and tech stack.
- •Own production deployment, configuration, and release processes.
- •Drive performance, reliability, and operability through continuous improvement.
- •Build and maintain production tooling that supports deployment, orchestration, monitoring, and system diagnostics.
- •Define and maintain observability, SLI/SLOs, and performance metrics in partnership with product owners.
- •Leverage metrics and capacity planning to ensure scalability and uptime.
- •Collaborate across engineering teams to troubleshoot and resolve complex production incidents.
- •Lead and coordinate incident response, root cause analysis, and post-mortems.
- •Influence architecture and promote best practices by aligning with global SRE teams.
- •Document processes and procedures; provide mentorship and cross-training to peers.
- •Actively manage operational risk for production changes.
- •Other duties as assigned or needed.
Requirements
Skills You'll Need
- •Degree in Computer Science, a related field, or equivalent professional experience.
- •At least 5+ years of relevant work experience in an IT ops role, such as DevOps, SRE, Linux Systems Engineering, or Network Engineering.
- •At least 5+ years of experience using systems programming language (C/C++/Golang/Rust).
- •A rigorous, detail-oriented approach to operations.
- •Strong understanding of the Linux operating system, including network and system configuration, kernel internals, scheduling, performance tuning.
- •Strong understanding of networking concepts such as routing, multicast, LLDP, VLANs, and Ethernet.
- •A deep sense of ownership and desire to meet business priorities with urgency.
- •Ability to handle shared operational and periodic on-call duties.
- •Reliable and predictable availability.
Loading similar jobs...