All the Steps Needed for Model Building and Residual Analysis with the Aid of Minitab



Need Help with Minitab?

We can help! Your satisfaction is guaranteed.

Our rate starts at $35/hour. Free quote in hours. Quick turnaround!


\[\begin{array}{cc} & {{H}_{0}}:{{\mu }_{1}}\le {{\mu }_{2}} \\ & {{H}_{A}}:{{\mu }_{1}}>{{\mu }_{2}} \\ \end{array} \]

F-Test Two-Sample for Variances

0Profit

Mean

148.4100304

111.993007

Variance

120560.9905

320738.6231

Observations

3290

858

df

3289

857

F

0.375885478

P(F<=f) one-tail

0

F Critical one-tail

0.915852505


t-Test: Two-Sample Assuming Unequal Variances

0Profit

0Profit (missing age and/or income)

Mean

148.4100304

111.993007

Variance

120560.9905

320738.6231

Observations

3290

858

Hypothesized Mean Difference

0

df

1031

t Stat

1.797487565

P(T<=t) one-tail

0.036275383

t Critical one-tail

1.646333203

P(T<=t) two-tail

0.072550766

t Critical two-tail

1.962266651



Multiple Regression Model

  • 9Age_d1, 9Age_d2, 9Age_d3, 9Age_d4, 9Age_d5, 9Age_d6 (we need 6 dummy variables for 7 age categories)
  • 9Inc_d1, 9Inc_d2, 9Inc_d3, 9Inc_d4, 9Inc_d4, 9Inc_d5, 9Inc_d6, 9Inc_d7, 9Inc_d8 (we need 8 dummy variables for 9 income categories)
  • 9district_d1, 9district_d2 (we need 2 dummy variables for 3 geographical regions)

Minitab results from the regression analysis

  • The following is the ANOVA analysis:

ANOVA table

  • Now proceed to test the significance of the predictors included in the model:

Linear Regression Coefficients

\[S = 339.179 R-Sq = 29.5% R-Sq(adj) = 29.0%\]
  • 9Profit, 9ccd, 9mortga, 9Inc_d3, 9Inc_d5 (This means that Income is a significant variable)
  • The rest of the variables are not statistically significant
  • We observe a lack of normality of residuals, and some degree of lack of homogeneity in the residuals.

Best Regression Model

Reduced Regression Model

  • Minitab has a very powerful tool called “Best Subsets” which suggests the best model out of the whole set of variables.
  • The model with 5 variables: 9Profit, 9ccd, 9mortga, 9Inc_d3, 9Inc_d5

  • The model with 3 variables: 9Profit, 9ccd, 9mortga:

Conclusions from Minitab's Bets Subsets

  • 852 clients left the bank in 2000.
  • The variables that are clearly significant and should definitely be included in the model are 9Profit, 9ccd, 9mortga.
  • Income appears to have a significant role in predicting 0Profit, but it depends of the significance we choose.
  • The rest of the variables don’t appear to have influence in the dependent variable.
  • The overall fit is not too good. Only 29.1% of the variation in 0Profit is explained by the predictors. This suggests to look for more meaningful predictors, or to look a model other than linear.
  • The best model is written as
\[0\text{Profit}=39.4+0.777\times 9\text{Profit}+36.0\times \text{9ccd}+60.9\times 9\text{mortga}\]
  • Pilgrim Bank should increase and diverse its efforts in those areas that affect customer profitability, like expand its credit card and mortgage operations.