Do you have a passion for learning and applying modern software engineering practices? Are you excited to use these practices to deliver and manage mission critical services at scale? Are you a quick thinker? If so, please read on.We’re looking for a Site Reliability Engineer, in Raleigh, North Carolina. Site Reliability Engineers take a different approach it comes to hosting and managing infrastructure, services, and applications. We approach this as a software problem instead of as an operations problem. Applying software engineering practices to hosting and managing enables our services to better adapt to all types of changes and failure scenarios. To help us accomplish this approach we use the latest techniques, practices and technology from the industry while hosting large scale mission critical infrastructure. Some of the techniques, practices, and technology we use are: Docker, IaaS, PaaS, ChatOps, Continuous Delivery, Continuous Deployment, DevOps, and Immutable Infrastructure. We strive to keep learning and improving, and we work to share the knowledge we have learned throughout IBM.On our Site Reliability Engineering team, we have a great engineering-oriented culture including:Transparency via internal open source - we code software components, automated operation, software delivery pipelines, and even our video game score board.Action on data feedback - we work with thousands of metrics per minute and tens of GBs of logs per day and we optimize for measurements out of tons of data feedback from tens of thousands of users.Iterate fast - we are motivated to deploy (and/or repair things) fast and frequently.Security from day one - we use several modern security services and tools to enable our secure engineering practices.
Experience in troubleshooting complex systems, including OS, Network, and Application code
Experience in coding in at least one modern language such as Python, Ruby, NodeJS
Experience with DevOps, Continuous Delivery, Continuous Deployment
Knowledge of UNIX/Linux systems