Site Reliability Engineer


Not Specified, United States Full-time in Other
    • Job ID 1488863

    Job Description

    • Serve as an SRE who proactively builds the tooling needed to monitor, analyze, report on, and observe the health and upkeep of systems and environments.
    • Establish key practices, implemented through automation, to ensure that availability, stability, scalability, performance, monitoring, and incident response are handled appropriately.
    • Participate in the on-call rotation to field and support issues as they arise.
    • As a senior engineer, provide technical guidance and mentorship to less experienced team members.
    • Collaborate with SMEs from various teams to investigate, troubleshoot, and resolve issues.
    • Implement automation to mitigate risks and faults, both reactively and proactively.
    • Construct and maintain an incident response playbook with documented corrective actions.

