Remote Contract
PUBLISHED
Jan 16, 2026
Kentik is seeking a seasoned Staff Site Reliability Engineer to optimize and maintain our cloud infrastructure, ensuring scalable, reliable, and high-performance network observability services in a remote US-based role.
Kentik, a leader in network observability and analytics, is looking for a Staff Site Reliability Engineer specializing in cloud technologies to join our innovative team. In this critical role, you will be responsible for designing, implementing, and maintaining highly available cloud infrastructure that powers our cutting-edge platform used by enterprises worldwide to monitor and manage their networks.
Your day-to-day will involve collaborating with development and operations teams to automate deployments, optimize system performance, and troubleshoot complex issues in real-time. You will leverage SRE best practices to achieve 99.99% uptime, implement scalable monitoring solutions, and contribute to on-call rotations to ensure rapid incident resolution. With a focus on cloud-native architectures, you will work extensively with services like AWS EC2, Lambda, and VPCs, or equivalents in other clouds, to build resilient systems that handle massive data volumes.
This position offers the opportunity to influence the strategic direction of our infrastructure while working remotely from anywhere in the United States. If you are passionate about reliability engineering and thrive in a dynamic environment, Kentik provides the tools and culture to help you excel and grow your career.