Quick Instructions for Top Assignment Expert: Please answer ALL questions below in American English spelling. (No format, just answer questions) In the Analysis Problem section, please use the .csv link provided to download the data for analysis. Use any method you want (excel, R, etc.) to answer the questions. The solution can be submitted by using this same document since you are just answering questions. |
Short-Answer Problems
Some of these questions will appear on the short-answer part of the tests. As part of this homework, answer the following questions, usually just several sentences that include the definition.
Text, Ch 12
Identify the steps in the data preparation process.
Discuss the processes of validating and coding data.
Describe the processes of data entry and data cleaning.
Discuss the advantages of pre-coding questionnaires.
Compare a one-way tabulation vs. a two-way tabulation.
Explain the descriptive statistics that should be used with each type of scale.
Data Analysis
What does the business analyst hope to accomplish with a regression analysis? That is, what are the two goals of regression analysis?
What is Y? What are the two primary situations in which it is applied?
How does the graph of X compare with Y vs the graph of X with Y ?
What is the meaning of the slope coefficient in Y = b0 + b1X1
In regression analysis computer output, there are one or more t-tests presented. Describe the purpose of the test(s), including the null and alternative hypotheses. What do you conclude if the associated p-value for a test is less than?
In computer output for regression analysis there are one or more confidence intervals presented. Describe the purpose of the confidence interval, and its interpretation.
What is the meaning of the residual variable e ?
What is the criterion of ordinary least squares regression to obtain the estimated model?
Model Fit: The standard deviation of the residuals to interpret model fit
Model Fit: R-squared as a relative index of fit
Analysis Problem
Note: This problem is similar to what you do for your project, use regression analysis to analyze the contribution of, here just one, product attribute to satisfaction. All responses obtained with a survey. Next week we do multiple regression where we have many product attributes in the equation, which is exactly the situation for your project, as well as the worked problem on the Final.
Consider a marketing survey of 253 customers of a restaurant called SFG.
Data: http://web.pdx.edu/~gerbing/data/SFGsfg.csv
Responses to the individual items are in a 7-pt Likert format, from 1 to 7. Assess the outcome variable of Satisfaction (x22) with the following item:
How satisfied are you with the SFG?
Not Satisfied Very
At All Satisfied
1 2 3 4 5 6 7
What are the customer experiences that lead to customer satisfaction? One potential such variable is the customer’s perceived tastiness of the food. Consider the product attribute of taste, Tastiness (x18).
The food at the SFG tastes excellent.
Strongly Strongly
Disagree Agree
1 2 3 4 5 6 7
Research Question: To what extent does perceived Tastiness contribute to Overall Satisfaction at Restaurant SFG?
Do questions a through q and s, t, u, and x from the template.
Use the following information for Questions c and d.
Scatterplot/Correlation MatrixIdentify the response variable and the predictor variable(s). Show the scatterplot matrix (just one scatterplot for a single predictor) and correlation coefficients of the relationship of each of the variables in the model with each other. From only this visual information, develop some intuition for the subsequent analysis. Relevance: Do the predictor variables relate to the target (response) variable? Explain. Uniqueness: [If multiple predictor variables] Could collinearity be a problem? Explain. Model Selection: [If multiple predictor variables] Given the correlations, what is the most likely candidate for the final model? Explain. Estimated ModelWrite the estimated regression model. Specify and interpret the sample slope coefficient. Manually calculate the fitted/predicted value for the given values of predictor variables X. Manually calculate the associated residual. Interpret for the given values of predictor variables X and response variable y. Hypothesis Test: Applied to the one specified predictor variableSpecify the null hypothesis and its alternative for the hypothesis test of the slope coefficient. Show and label the calculation of how many (estimated) standard errors the estimated slope coefficient, b, is from the hypothesized population value. Include and apply the definition of the p-value with the relevant numbers for this specific analysis. Specify the basis for the statistical decision for the hypothesis test and the resulting statistical conclusion for alpha=0.05. Hypothesis Test: Interpretation, as an executive summary you would report to management. Confidence Interval: Applied to the one specified one predictor variableSpecify the value that the confidence interval estimates. Apply the definition of the 95% margin of error for its computation using the relevant numbers of this analysis with 2 approximating the t-cutoff. Show the computations of the 95% confidence interval illustrated with the specific numbers from this analysis. Confidence Interval: Interpretation, as an executive summary you would report to management. Demonstrate the consistency of the confidence interval and hypothesis test using the specific numbers for this analysis for both results. Model FitEvaluate fit with the standard deviation of residuals. Evaluate fit with R2 and PRESS R2, including their comparison. Does this value indicate reasonable fit? Show any potential outliers and explain why they are outliers. Model Selection [if multiple predictor variables]Consider all the predictor variables simultaneously. Based on the p-values of the slope coefficients, are any predictor variables much less useful for predicting the response variable (target)? Why or why not? Any collinearity problems? Why or why not? Based on this information and the best subset analysis, which model do you recommend? Why? Prediction IntervalsFor the 95% prediction interval of [response variable y] for [the values of predictor variables X], show the interval including its calculation (can approximate with the t-cutoff of 2). Interpret the prediction interval. ConclusionWhat decision do you recommend to management based on these results? |
c. value of the predictor variable: 4
d. value of the response variable: 5