-
- QUESTION
Discussion: Correlation and Bivariate Regression
Similar to the previous week’s Discussion, this Discussion assists in solidifying your understanding of statistical testing by engaging in some data analysis. This week you will once again work with a real, secondary dataset to construct a research question, perform a correlation and bivariate regression model, and interpret the results.
Whether in a scholarly or practitioner setting, good research and data analysis should have the benefit of peer feedback. For this Discussion, you will post your response to the hypothesis test, along with the results. Be sure and remember that the goal is to obtain constructive feedback to improve the research and its interpretation, so please view this as an opportunity to learn from one another.
To prepare for this Discussion:
- Review this week’s Learning Resources and media program related to regression and correlation.
- Review Magnusson’s web blog found in the Learning Resources to further your visualization and understanding of correlations between two variables.
- Construct a research question using the General Social Survey dataset, which can be answered by a Pearson correlation and bivariate regression.
By Day 3
Use SPSS to answer the research question. Post your response to the following: Please following this outline.
- What is your research question?
- What is the null hypothesis for your question?
- What research design would align with this question?
- What dependent variable was used and how is it measured?
- What independent variable is used and how is it measured?
- If you found significance, what is the strength of the effect?
- Explain your results for a lay audience; explain the answer to your research question.
Be sure to support your Main Post and Response Post with reference to the week’s Learning Resources and other scholarly evidence in APA Style.
Learning Resources
Required Readings
Frankfort-Nachmias, C., & Leon-Guerrero, A. (2018). Social statistics for a diverse society (8th ed.). Thousand Oaks, CA: Sage Publications.
- Chapter 12, “Regression and Correlation” (pp. 325-371)
Wagner, W. E. (2016). Using IBM® SPSS® statistics for research methods and social science statistics (6th ed.). Thousand Oaks, CA: Sage Publications.
- Chapter 8, “Correlation and Regression Analysis”
Walden University Library. (n.d.). Course Guide and Assignment Help for RSCH 8210. Retrieved from http://academicguides.waldenu.edu/rsch8210
For help with this week’s research, see this Course Guide and related weekly assignment resources.
Magnusson, K. (n.d.). Welcome to Kristoffer Magnusson’s blog about R, Statistics, Psychology, Open Science, Data Visualization [blog]. Retrieved from http://rpsychologist.com/index.html
As you review this web blog, select New d3.js visualization: Interpreting Correlations link, once you select the link, follow the instructions to view the interactive for interpreting correlations. This interactive will help you to visualize and understand correlations between two variables.
Note: This is Kristoffer Magnusson’s personal blog and his views may not necessarily reflect the views of Walden University faculty.
Subject | Education Systems | Pages | 9 | Style | APA |
---|
Answer
Correlation and Bivariate Regression
#1: Research Question
What is the association between family incomes in constant dollars and the level of education attained?
#2: Hypotheses Statement
H1: There is no association between family incomes in constant dollars and the level of education attained.
H2: There exists an association between family incomes in constant dollars and the level of education attained.
#3 Research Design
The framework created to find out the answer to the research questions involved employing secondary data on a quantitative study. The research data on family incomes in constant dollars and the level of education attained were collected from a secondary source.
The research question focuses on associations among variables which is a test of the relationship of the variables. Therefore, correlation was used to test the association in order to show whether family income can relate with the level of education using a regression analysis. Correlation is an association of variables (McMaster, Rague, Wolthuis, & Sambasivam, 2018).
. According to Frankfort-Nachmias and Leon-Guerrero (2018), regression analysis is a measure on how a response variable changes relevant to the independent variable. The Author also states that correlation analysis is used to check on the relationship of the variables under study. A correlation value of 1 shows a perfect relationship and a 0 value of correlation means there change of one variable is not related to the change of the other.
Regression analysis contains the R square which shows how family income explains the level of education. R square shows if the equation has goodness of fit. ANOVA analysis was done to test the statistical significant in the regression equation that can be formed by the predictor and the response variables of study. P value is used to determine the confidence interval of a result of a test (Chapman, 2012). The p value was used to establish the level of statistical significance of the formed equation and the residuals at 99% level of significance.
#4 Dependent Variable Used
The dependent variable, also called response variable, is predicted by an independent variable. In this case, there are two variables, namely between family incomes in constant dollars and the level of education attained. The level of education is expected to respond to the change in the family income. Therefore, the dependent variable is the level of education attained by the student. The variable is measured by the level of education attained.
#5 Independent Variable Used
The family income is expected to predict the level of education and hence, can be termed as the predictor variable. Predictor variable is the independent variable. The independent or predictor variable is the family income of the respondent. The variable is measured by the amount of income in dollars that the family of the respondent has.
#6 The Results and Significance
The table below indicates correlation analysis between family incomes in constant dollars and the level of education attained. The results show a positive but weak correlation of 0.442. The result is an indicator that in as much as the association is weak, the increase of one variable parallels with an increase in the other.
The correlation result is statistically significant (P<0.01) and therefore there is an association between the variables, Pearson correlation being 0.442.
Correlations |
|||
|
|
|
|
|
FAMILY INCOME IN CONSTANT DOLLARS |
RS HIGHEST DEGREE |
|
FAMILY INCOME IN CONSTANT DOLLARS |
Pearson Correlation |
1 |
.442** |
Sig. (1-tailed) |
|
.000 |
|
Sum of Squares and Cross-products |
4344814177000.000 |
54696261.070 |
|
Covariance |
1878432415.000 |
23647.324 |
|
N |
2314 |
2314 |
|
RS HIGHEST DEGREE |
Pearson Correlation |
.442** |
1 |
Sig. (1-tailed) |
.000 |
|
|
Sum of Squares and Cross-products |
54696261.070 |
3869.387 |
|
Covariance |
23647.324 |
1.525 |
|
N |
2314 |
2538 |
**. Correlation is significant at the 0.01 level (1-tailed). |
Regression Analysis
The regression analysis indicates that F(1, 2312)=474.772, p<0.000, R square=0.170. This shows that there is an association between family incomes in constant dollars and the level of education attained is statistically significant (p<0.05).
Model Summary |
||||
Model |
R |
R Square |
Adjusted R Square |
Std. Error of the Estimate |
1 |
.413a |
.170 |
.170 |
2.756 |
a. Predictors: (Constant), FAMILY INCOME IN CONSTANT DOLLARS |
|
||||||
ANOVA The ANOVA analysis indicates that the regression equation is statistically significant with the p<0.000. ANOVAa |
|||||||
Model |
Sum of Squares |
Df |
Mean Square |
F |
Sig. |
||
1 |
Regression |
3607.181 |
1 |
3607.181 |
474.772 |
.000b |
|
Residual |
17565.905 |
2312 |
7.598 |
|
|
||
Total |
21173.086 |
2313 |
|
|
|
||
The table below is the results on the residuals. The residuals table indicates that the weights by the independent variable income are statistically significant with both p<0.000 respectively. A unit change in family income causes a 2.881E-5 change in the level of education according to the results.
a. Dependent Variable: HIGHEST YEAR OF SCHOOL COMPLETED |
|
||||||
b. Predictors: (Constant), FAMILY INCOME IN CONSTANT DOLLARS |
|
||||||
Coefficientsa |
|||||||
Model |
Unstandardized Coefficients |
Standardized Coefficients |
t |
Sig. |
|||
B |
Std. Error |
Beta |
|||||
1 |
(Constant) |
12.354 |
.086 |
|
143.470 |
.000 |
|
FAMILY INCOME IN CONSTANT DOLLARS |
2.881E-5 |
.000 |
.413 |
21.789 |
.000 |
||
a. Dependent Variable: HIGHEST YEAR OF SCHOOL COMPLETED |
#7 Explanation of the Results
The correlation and the bivariate regression are very important in finding out the association between variables. In this case the study sought to find out the association between family incomes in constant dollars and the level of education attained.
The result of this study shows that the correlation is positive and weak. A correlation of more than 0.7 is known to be strong association and therefore for a change in one variable is sensitive to the change in the other. A correlation of 0.4 is regarded as low.
The results shows that the regression equation is statistically significant with p<0.000, however, the low R square of 0.17 is an indicator of low good of fit in the equation. The good fit shows the percentage of variation of the dependent variable based on the independent variable (Wagner, 2016). Therefore, the family income is responsible for 17% variability of the level of education of individuals. According to Stoica 2013), a good fit is over 80%, yet again a low good fit does not mean the equation is not valid or the predictor variable is not effective.
In conclusion, although there is a weak association between family income and the level of education, family income shows significant effect on the level of education of an individual according to the study.
References
Chapman, J.L. (2012). Introduction to Mathematical Statistics (7th ed.) Robert V. Hogg Joseph W. McKean Allen T. Craig. Journal of the American Statistical Association, (499), 1255. Retrieved from http://165.193.178.96/login?url=http%3a%2f%2fsearch.ebscohost.com%2flogin.aspx%3fdirect%3dtrue%26db%3dedsjsr%26AN%3dedsjsr.23427431%26site%3deds-live Frankfort-Nachmias, C., & Leon-Guerrero, A. (2018). Regression and Correlation. In Social statistics for a diverse society (8th ed.) (pp. 325-371). Thousand Oaks, CA: Sage Publications. McMaster, K., Rague, B., Wolthuis, S. L., & Sambasivam, S. (2018). A Comparison of Key Concepts in Data Analytics and Data Science. Information Systems Education Journal, 16(1), 33–40. Stoica, I. (2013). Statistical Analysis. Research Starters: Education (Online Edition). Retrieved from http://165.193.178.96/login?url=http%3a%2f%2fsearch.ebscohost.com%2flogin.aspx%3fdirect%3dtrue%26db%3ders%26AN%3d89164464%26userlogin.asp%26site%3deds-live Wagner, W. E. (2016). Correlation and Regression in Analysis. Using IBM® SPSS® statistics for research methods and social science statistics (6th ed.). Thousand Oaks, CA: Sage Publications.
|
Related Samples
The Role of Essay Writing Services in Online Education: A Comprehensive Analysis
Introduction The...
Write Like a Pro: Effective Strategies for Top-Notch Explication Essays
Introduction "A poem...
How to Conquer Your Exams: Effective Study Strategies for All Learners
Introduction Imagine...
Overcoming Writer’s Block: Strategies to Get Your Essays Flowing
Introduction The...
Optimizing Your Online Learning Experience: Tips and Tricks for Success
The world of education...