Archive

Archive for the ‘Predictive Analytics’ Category

Building a One-Way ANOVA R Extension in SAP Predictive Analytics

One of the first statistical tools many new analyst use is the Analysis of Variance, a collection of statistical methods used to decompose and understand the causes of variation within a set of data. One-way ANOVA is perhaps the most basic of these methods and a staple of most statistical software. While SAP has included many predictive algorithms in their SAP Predictive Analytics Expert application, it is missing many of the common descriptive algorithms used by data scientists to better understand their data. Luckily, it is relatively easy to build in custom R extensions to accommodate any descriptive statistical needs.

What is One-Way ANOVA?

One-Way Analysis of Variance is used to compare means within 3 or more samples to evaluate whether or not all groups have the same means (in effect, there is no difference in the monitored statistic between the groups). Of course there is natural variation in data, so the actual means of the groups may vary slightly, but the age-old question persists: is the difference statistically significant???

One-Way ANOVA is an omnibus test, which means that if the null hypothesis (all means are the same) is rejected, it offers no additional information on which of the group(s) are different from each other, simply that at least one of them is different enough to reject the hypothesis that they are all the same.

For additional background on One-Way ANOVA, see the relevant Wikipedia article.

To download full PDF, and Continue Reading…

Hilary BlissAbout Hillary Bliss
Hillary is a Senior Manager – Data & Analytics at Protiviti, and specializes in data warehouse design, ETL development, statistical analysis, and predictive modeling. She works with clients and vendors to integrate business analysis and predictive modeling solutions into the organizational data warehouse and business intelligence environments based on their specific operational and strategic business needs. She has a master’s degree in statistics and an MBA from Georgia Tech.

Categories: Predictive Analytics

50 Business Problems I’ve Addressed with Predictive Analytics, Data Science, and Advanced Analytics

I was reading Vincent Granville’s recent blog post and thought I might add a few of the problems I’ve addressed in my career. While these are not all data science they do fall within advanced analytics.

  1. Estimating hybrid yields and crop characteristics across multiple geographies, soil types, climates, and ecosystems based on performance of limited field trials. Estimating the same for heretofore uncrossed inbreds.
  2. Accurately forecasting the demand for promotional items driving the market basket 18 months into the future in order to accommodate an extended supply chain.
  3. Estimating the brand capital associated with consumer brands in the marketplace. E.g. What is the value of a brand in the marketplace in terms of both goodwill on the balance sheet and an organizations ability to leverage the brand capital through marketing to deliver sales.
  4. Network optimization for optimizing supply chains. Optimizing supply chain routing in near real-time.
  5. Optimal scheduling of ship dates for seasonal goods based upon stochastic analysis of probable events along the chain in order to assure supply without clogging the pipe.
  6. Call center optimization.
  7. Forecasting the demand for retail store associates across a large retail chain and optimizing the schedule based upon that forecast.
  8. Predicting what goods are most likely to be purchased during a hurricane warning.
  9. Predicting the annual sales of a prospective retail site based upon demographic, market, competitor and other data.
  10. Optimizing locations for retail site selections using game theory and accounting for cannibalization and impact on key competitors.
  11. Optimal routing for transportation fleets given uncertainty.
  12. Pricing optimization, including setting pricing strategies for every day pricing.
  13. Estimating the impact of pricing changes on demand for high affinity items. Understanding expected impact of promotional pricing on over-all revenues based on affinity items and additional trips.
  14. Estimating the impact of operational activities on competitors and the likelihood of cannibalization.
  15. Estimating sell through dates for seasonal goods and understanding the risk profile for lost sales opportunity and clearance/write-off.
  16. Identifying potentially fraudulent transactions at the register and in the back office regarding cash and check deposits.
  17. Identifying potentially fraudulent workman’s comp claims.
  18. Identifying primary causes associate with employee injury accidents and prioritizing limited resources to prevent and mitigate lost productivity, workman’s comp expenses, and long term liability associated with miss-handled claims.
  19. Identifying potential tax savings due to missed tax benefits across multiple tax jurisdictions.
  20. Analyzing production data to identify root cause of quality issues and productions slow downs in manufacturing environments.
  21. Analyzing clinical trial data for safety and efficacy of treatment protocols.
  22. Analyzing system performance logs to understand bottlenecks in production and analytical lab computing environments.
  23. Analyzing consumer behavior to understand the impact of marketing message theme, channel preference, pricing sensitivity, seasonal good purchase cycle, brand affinities, product affinities, loyalty engagement, net promoter score, customer satisfaction, lifetime value, purchase driver, style preferences, color preferences, size preferences and other brand specific factors.
  24. Integrating consumer behavior data with attitudinal and demographic data to make  cohort level inferences regarding behavior.
  25. Response and uplift modeling to understand the impact of direct marketing efforts in a test and learn environment.
  26. Establishing value of information models to map response/uplift to the financial benefit they bring to the organization.
  27. Establishing a champion/challenger approach to model deployment and consistently measuring the impact the model brings to the business.
  28. Leveraging machine learning to assess when a model needs to be re-scored, refit, remodeled, or replaced.
  29. Integrating insurer, practitioner, and population data, formulary status, and negotiated pricing levels for branded prescription medicines to analyze the impact on long term sales and profits.
  30. Applying artificial intelligence techniques to assist in optimum model selection across predictive analytics solutions.
  31. Forecasting sales and returns in the publishing industry this requires forecasting at the distributor and retailer level and understanding revenue recognition, probability of returns and the publisher’s liability associated with returns.
  32. Integrating omni-channel data in order to model customer response to brand treatments across multiple touch points.
  33. Estimating the most likely customer segment for cash baskets in a retail environment with a high percentage of cash transactions.
  34. Optimizing the retail supply chain for demand driven pull.
  35. Individual store assortment planning for large chain retailers based upon customer behavioral profiles.
  36. Dealer performance estimation and visualization (GIS) in the automotive industry.
  37. OMNI Channel retail performance marketing delivering uplift modeling in a champion/challenger environment and integration into a marketing automation system.
  38. Estimating the customer response to postpaid plan upgrade offers (propensity/uplift) by micro audience and offer theme.
  39. Marketing mix modeling at the individual store level for a larger retailer.
  40. Leveraging early IOT data sources to improve forecasting results.
  41. Analysis of disparate data sources on Hadoop to understand the impact on quality of management decisions.
  42. Automation and application of artificial intelligence techniques in the champion and challenger process for on-going predictive model performance management.
  43. Stochastic optimization of project outcomes (based on data science generated predictors) for the purpose of project portfolio management.
  44. Preparing stochastic based financial projections based upon known cause and effect and predictive relationships (tactics driving KPI’s driving financial performance) for complex businesses.
  45. Estimating future gross margin contribution across a large research and development portfolio for new technology introduction, new product introduction, and continuous product improvements. Application of those estimates to identify key success drivers. Stochastic modeling of the key drivers and estimation models to allocate and optimize annual R&D budgets.
  46. Optimizing balance sheet and off-balance sheet labor costs in a manufacturing environment based on key production predictors including outside economic variables, internal scheduling constraints and forecast demand.
  47. Estimating future resource utilization based on forecasts, historical forecast accuracy, sales pipeline, and existing contract terms.
  48. Predicting customer churn and developing optimal retention policies to prevent it.
  49. Identifying cross sell and up-sell opportunities in a business to business environment.
  50. Identifying root cause of sub optimal brand performance across a portfolio of brands and determining optimal corrective action to improve financial performance.

Patrick McDonald HeadshotAbout Patrick McDonald 

Patrick McDonald is an Associate Director with Protiviti focused on advising clients in the Retail, Manufacturing and Telecommunications industries on analytical solutions. Over a 20 year career in advanced analytics, Patrick completed tours in a big four firm and leading analytics technology and software companies.

 

Categories: Predictive Analytics

How Can I Increase the Value of My Marketing Investments Using Predictive Analytics – A Real Life Use Case

In my last blog, I discussed how predictive analytics can increase your marketing bang for the buck by giving you clear insights into where to spend your marketing dollars. In this entry, I’ll give you a real-world example of how a retail department store chain with multiple product categories decided who to target for a store mailer using analytics and customer segmentation.

To determine their target audience, the retailer wanted to gain a better understanding of their customer segments and where money was being spend across those segments. The first step was determining which customers shop which categories. Working together, we mapped over 2 million customers, identified spend by product category and then clustered customers by product category and category spend.

Predictive Product Category

To download full PDF and Continue Reading…

Patrick McDonald HeadshotAbout Patrick McDonald

Patrick McDonald is an Associate Director with Protiviti focused on advising clients in the Retail, Manufacturing and Telecommunications industries on analytical solutions. Over a 20 year career in advanced analytics, Patrick completed tours in a big four firm and leading analytics technology and software companies.

Categories: Predictive Analytics

How Can I Increase the Value of My Marketing Investments Using Predictive Analytics?

If your marketing strategies cost more than they earn, they obviously aren’t good long-term marketing strategies. One of the most useful tools at your fingertips for ensuring and increasing your marketing investments’ value is predictive analytics. Specifically, using predictive analytics to anticipate an individual customer’s needs and wants. Predictive modeling can provide profound insights into customer preferences and trends, allowing you to tailor your strategies around the customer. This is customer experience optimization. Customer experience is a major revenue driver!

If you understand which questions you’re trying to answer or issues you’re trying to resolve from a business perspective, you can build models that will help you understand a customer response to a particular treatment, allowing you to address those key business questions and engage customers more personally.

Some key questions or issue you might want to begin with are:

  • Not enough customers
  • Customers not buying enough
  • Engaging the wrong customer
  • Haven’t found the right customer
  • What new markets can we engage and how

To download full PDF and Continue Reading…

Patrick McDonald HeadshotAbout Patrick McDonald

Patrick McDonald is an Associate Director with Protiviti focused on advising clients in the Retail, Manufacturing and Telecommunications industries on analytic solutions. Over a 20 year career in advanced analytics, Patrick completed tours in a big four firm and leading analytics technology and software companies.

Categories: Predictive Analytics

Using People Analytics to Increase Employee Loyalty

The ability to attract and retain a loyal employee base and understand root causes for employee disengagement and disloyalty are key strategic objectives for every organization – big or small. If you want to improve employee productivity and/or decrease the cost associated with attracting and retaining employees, you need to move along the analytics maturity curve and start leveraging People Analytics.

What is People Analytics?

Put simply, people analytics is a predictive, data-driven approach to managing people at work. Analytics centered around your employees. It is used to address people-related issues, such as talent acquisition, performance evaluations, leadership positioning, hiring and promotion, job and team design, and employee compensation.

Increasing Employee Loyalty Using People Analytics

People analytics help you merge employee data, company data, and market data to predict and interpret valuable employees’ behaviors, as well as operations-level insights, giving you competitive vision for developing your retention strategies.

To download full PDF and Continue Reading...

 

John Harris Headshot About John Harris

John Harris, Senior Manager – Predictive Modeling and Advanced Analytics, has over 16 years of industry experience applying strategic thinking and advanced analytical skill set to optimize resources, improve processes and develop quantitative models that turn data into decision-aid information for all levels of leadership. Airline and energy utility employers have attempted to patent his deliverables related to predictive and optimization modeling.

Categories: Predictive Analytics

HANA Sentiment Analysis with SAP Predictive Analytics

One of the new features I’m most excited about with the new SAP Predictive Analytics 2.4 update is the HANA Sentiment Analysis module that’s been added for HANA online mode.

The HANA Text Analytics engine has been available for several years, but has remained somewhat inaccessible due to the relatively complex interface required to use it.

I’ve written in the past about how much I love the SAP HANA (and SAP Data Services) text analtytics engine for analyzing social media data surrounding the Sharknado movie, and I also co-authored an SAP Press E-Book about the various text analytics tools available within the SAP ecosystem, but I’m so excited because this new component makes this powerful tool available to business users for the first time. For full details on how SAP text analytics works, how to create and use full text indexes within SAP HANA, and a long list of business-related applications for text analysis, checkout the SAP Press E-Book Text Analytics with SAP, released in October 2015.

HANA Sentiment Analysis Module

With Expert Analytics within Predictive Analytics 2.4, SAP has opened up a portion of this functionality to business users within the Expert Analytics tool. The HANA Sentiment Analysis module is found under the Data Preparation function category.

To download full PDF and Continue Reading…

Hilary BlissAbout Hillary Bliss 
Hillary Bliss is a Senior ETL Consultant at Decision First Technologies, and  specializes in data warehouse design, ETL development, statistical analysis, and  predictive modeling. She works with clients and vendors to integrate business  analysis and predictive modeling solutions into the organizational data  warehouse and business intelligence environments based on their specific operational and strategic business needs. She has a master’s degree in statistics and an MBA from Georgia Tech.

 

Categories: HANA, Predictive Analytics

What’s New in SAP Predictive Analytics 2.4?

Released just before Thanksgiving 2015, SAP’s latest enhancements to the Predictive Analytics suite introduce some exciting new features and algorithms.

Automated Analytics

The first major enhancement for the Automated Analtyics side in quite some time really enables Automated users with HANA to take full advantage of the power of HANA for the first time. I don’t have any screeenshots to share for this feature, but essentially this allows Automated Analytics to leverage the APL (Automated Predictive Library) within HANA to perform all model training on the HANA server; previously HANA was used simply as a data source and the data was fully transferred either to the local machine (desktop version) or the Automated Analytics Engine server for processing.

It will be interesting to see how this affects licensing for the Predictive Analytics products going forward; for example if an organization only intends to use data on HANA for predictive activities, it could possibly mean that they no longer need to license the Engine component and can perform predictive analysis on big datasets through the HANA connection, if all the desired functionality is available within HANA.

Expert Analytics

With this update, SAP has introduced 2 new HANA components to the Expert Analytics platform and 1 new custom component option.

To download full PDF and Continue Reading…

Hilary BlissAbout Hillary Bliss 
Hillary Bliss is a Senior ETL Consultant at Decision First Technologies, and  specializes in data warehouse design, ETL development, statistical analysis, and  predictive modeling. She works with clients and vendors to integrate business  analysis and predictive modeling solutions into the organizational data  warehouse and business intelligence environments based on their specific operational and strategic business needs. She has a master’s degree in statistics and an MBA from Georgia Tech.

Categories: HANA, Predictive Analytics