R concrete analysis
Business Objective
The business goal is predict the strength of the concrete based on several inputs so that we can make the strongest concrete = More $.
​
Data Science Task
We will perform a neural network analysis on the variables
​
Select Data
We will use 1030 observations with 8 inputs and 1 output
​
Data Analysis
​
We applied the following steps in our analysis.
-
We performed an multiple regression with a correlation to the true data of .75
-
To improve the analysis we created a NN of 1 hidden layer with 1 node with a correlation to the true data of .80
-
To improve the NN we changed the hidden layer from 1 node to 5 nodes with a correlation to the true data of .924
-
To improve the NN we added another hidden layer of 5 nodes with a correlation to the true data of .935
​
Apply Analysis
Her is the original strength, the predicted strength, and the error.
​
actual pred pred_new error
774 30.14 0.2860639 23.62888 -6.511121
775 44.40 0.4777305 39.46054 -4.939464
776 24.50 0.2840964 23.46636 -1.033635
Deploy Model
We would run an experiment for the next month, 50% of jobs will be using old composition and 50% will use the new improved
composition. Distribution of assignment of the composition should be blind.
​
Assess Results
We will evaluate the strength measures of the old vs new and see if new was statistically better.
​
Strengths of XYZ Analysis
NN has many strengths: Approximate any function, become accurate.
Weaknesses: Needs lots of data, takes a lot of computing time, takes a lot of effort to create the structure