Staff Data Scientist - Machine Learning

Walmart Stores SUNNYVALE, CA

About the Job

Position Description


This position is in the data science team under the Advertising Technology organization. The mission of the Advertising Technology organization is to advance Walmart eCommerce by driving higher value for our customers and vendor partners. Walmart is investing in building a world class advertising platform and the Ads team is responsible for defining and performance advertising products that drive discovery, sales and profits. The team operates an end to end advertising platform that includes a scalable ad service that serves hundreds of millions of impressions each day, sophisticated ad matching algorithms, real-time reports, self-service interface for end to end program management etc.

We are a highly motivated group of Big Data Geeks, Data Scientists and Applications Engineers, working in small agile group to solve sophisticated and high impact problems. We are building smart data systems that ingest, model and analyze massive flow of data from online and offline user activity. We use cutting edge machine learning, data mining and optimization algorithms underneath it all to analyze all this data on top of Hadoop and Spark.

Join us if you want to be spending your time on:
- Gathering and analyzing data, identifying key prediction/classification problems, devising solutions and building prototypes;
- Formulating machine learning/statistical approaches while paying attention to business metrics, designing features from the rich data available from many sources, training, evaluating, and deploying models;
- Researching and implementing methodologies to measure the impact of the technologies;
- Initiating and proposing unique and promising modeling projects, developing new and innovative algorithms and technologies, pursuing patents where appropriate;
- Developing high-performance algorithms for precision targeting, testing and implementing these algorithms in scalable, product-ready code; Interacting with other teams to define interfaces and understanding and resolving dependencies;
- Staying current on published data mining, machine learning and modeling techniques and competing technologies and sharing these findings with scientists and engineers in the organization;
- Maintaining world-class academic credentials through publications, presentations, external collaborations and service to the research community.

Minimum Qualifications


- Masters or equivalent degree in a computational science with 4+ years of experience in Machine Learning or Data Science or PhD with 2+ yrs of relevant exp
- Experience with traditional as well as modern statistical techniques, including Regression, Support Vector Machines, Regularization, Boosting, Random Forests, and other Ensemble Methods;
- Strong implementation experience with high-level languages, such as R, Python, Perl, Ruby, Scala or similar scripting languages;
- Familiarity with Linux/Unix/Shell environments;
- Strong hands-on skills in sourcing, cleaning, manipulating and analyzing large volumes of data;
- Strong written and oral communication skills.

Additional Preferred Qualifications


- Ph.D. in a computational science with an emphasis in Machine Learning; Bachelors or higher in Computer Science;
- 2+ years of experience of writing production quality code;
- Experience with end-to-end modeling projects emerging from research efforts;
- Excellent academic or industrial track record of proposing, conducting and reporting results of original research, plus collaborative research with publications;
- Knowledge of data processing on Hadoop programming environments (e.g. Spark/Hive/Pig).

Company Summary


The Walmart eCommerce team is rapidly innovating to evolve and define the future state of shopping. As the world’s largest retailer, we are on a mission to help people save money and live better.  With the help of some of the brightest minds in technology, merchandising, marketing, supply chain, talent and more, we are reimagining the intersection of digital and physical shopping to help achieve that mission.

Position Summary


This position is in the data science team under the Advertising Technology organization. The mission of the Advertising Technology organization is to advance Walmart eCommerce by driving higher value for our customers and vendor partners. Walmart is investing in building a world class advertising platform and the Ads team is responsible for defining and performance advertising products that drive discovery, sales and profits. The team operates an end to end advertising platform that includes a scalable ad service that serves hundreds of millions of impressions each day, sophisticated ad matching algorithms, real-time reports, self-service interface for end to end program management etc.

We are a highly motivated group of Big Data Geeks, Data Scientists and Applications Engineers, working in small agile group to solve sophisticated and high impact problems. We are building smart data systems that ingest, model and analyze massive flow of data from online and offline user activity. We use cutting edge machine learning, data mining and optimization algorithms underneath it all to analyze all this data on top of Hadoop and Spark.

Join us if you want to be spending your time on:
- Gathering and analyzing data, identifying key prediction/classification problems, devising solutions and building prototypes;
- Formulating machine learning/statistical approaches while paying attention to business metrics, designing features from the rich data available from many sources, training, evaluating, and deploying models;
- Researching and implementing methodologies to measure the impact of the technologies;
- Initiating and proposing unique and promising modeling projects, developing new and innovative algorithms and technologies, pursuing patents where appropriate;
- Developing high-performance algorithms for precision targeting, testing and implementing these algorithms in scalable, product-ready code; Interacting with other teams to define interfaces and understanding and resolving dependencies;
- Staying current on published data mining, machine learning and modeling techniques and competing technologies and sharing these findings with scientists and engineers in the organization;
- Maintaining world-class academic credentials through publications, presentations, external collaborations and service to the research community.