SENIOR SOFTWARE ENGINEER

Walmart Stores SUNNYVALE, CA

About the Job

Position Description


The Senior Operations Engineer is responsible for proactively monitoring, detecting, and resolving site issues before they impact customers or availability. Technically, you will understand the full end-to-end stack and use this knowledge to detect errors and failures and take corrective action to mitigate them. During a major incident, you will draw on your technical skills and knowledge to triage, differentiating between symptom and cause, to help resolve impacting issues. Your ability to continuously challenge yourself and develop a strong network within your peer group will see you excel in this role. Our goal is to protect the customer experience and deliver outstanding levels of availability.

Minimum Qualifications


• 5+ years in an infrastructure, systems, engineering, or development environment delivering operational excellence for highly complex distributed systems.
• Experience managing and scaling 24/7 enterprise-level applications.
• Methodical and systematic problem-solving approach, combined with a strong sense of ownership, initiative, and drive.
• Experience investigating, analyzing, and troubleshooting large-scale enterprise systems.
• Experience with tools such as Jenkins, Nexus, Maven, Docker, Kubernetes.
• Experience with automation tools such as Ansible, Chef, Puppet, Salt.
• Experience working with enterprise monitoring tools and metrics, such as ELK, Prometheus, Graylog, Grafana.
• Experience administering Unix/Linux in a production environment.
• Networking knowledge and understanding of network concepts, such as different protocols (TCP/IP, UDP, ICMP, etc.), MAC addresses, IP packets, DNS, OSI layers, and load balancing.
• Programming experience in one or more of the following languages: Shell, Python, Ruby, Groovy.

Additional Preferred Qualifications



Company Summary


The Walmart eCommerce team is rapidly innovating to evolve and define the future state of shopping. As the world’s largest retailer, we are on a mission to help people save money and live better. With the help of some of the brightest minds in technology, merchandising, marketing, supply chain, talent and more, we are reimagining the intersection of digital and physical shopping to help achieve that mission.
