Hot Jobs


San Mateo, CA 94403

Post Date: 03/08/2018 Job ID: 8267 Industry: IT Operations Pay Rate: Not Specified
We are looking for SREs with strong technical chops and to help our client establish a true SRE capability. As the Site Reliability Engineer, you will be partnering with the software engineering team, so the ability to influence and provide operational guidance is key. Initially, the SREs focus will be contributing to the development of operational tools and practices that help maintain service availability across hosted and cloud-based infrastructure. You must have an understanding of the full stack and how systems are built as well as a grasp of operational best practices.
Required Technical Skills: * Experience in DevOps, Site Reliability, or backend/infrastructure engineering for a company experiencing fast-paced growth * Expert knowledge of Linux operating systems and environment * Scripting (Python preferred, we use it a lot!) * Expert at troubleshooting complex system and application stacks * Expertise in configuration management with a framework such as Puppet (Salt and/or Ansible highly desired) * Operational Database Experience (MySQL a plus) * Experience with Message Bus/Queueing Systems (ActiveMQ, RabbitMQ, Qpid, etc.) * Operational experience troubleshooting network/server communication * Knowledge of network, system and service redundancy methods * Experience with cloud computing services, particularly deploying and running services in AWS Additional skills that would be a big plus: * Productive habits, healthy process awareness, and good teamwork skills and instincts * Excellent written and verbal communication, able to collaborate and rally support * Use of Monitoring/Event Management Systems (Sumo, Zenoss and WaveFront * Tomcat * Experience supporting Java apps * Development Background * Familiarity with continuous integration and deployment * experience leading SRE/DevOps/Ops teams

Not ready to apply?

Send an email reminder to:

Share This Job:

Related Jobs: