August 7, 2017
Overview Week 3 Draft
Review Data. Data must be reviewed before it is analyzed. Look for missing values, consider the limits (end points), and extreme outliers.
Visual. Create several charts (graphs) to provide a visual of the data distribution.
Normal. Determine if the numeric data is normally distributed. Use the Anderson-Darling normality test. Knowing the data's normality will make a difference in how the data is described and which hypothesis test to use.
Descriptive. Descriptive statistics are created.
Submit this worksheet with the Excel file for review.
Overview Week 4
Interpret. The last step is to interpret the results in everyday terms.
APA formatting is not required.
Submit this worksheet for team collaboration.
Your teammate will provide a review of your work.
It might be better to start with a fresh worksheet. Many of the checks were not made, and the questions were not answered. This missing information may confuse your team members.
The data is from another dataset. I need to see this dataset and the source. It is not approved right now.
The histogram and bar charts need to be created.
1. Variables and Research Question
Write in the Independent (IV) and Dependent (DV) variables. Identify the level of measurement, Categorical or Numeric, with an X.
| Variable Names | Categorical | Numeric |
| IV: Ethnicity | x | |
| DV: SAT verbal | | x |
Write your Research Question
Research Question:
RQ: Is there an average difference in SAT verbal scores (DV) based on ethnicity (IV)?
Since RStudio was not installed on my computer, I did the analysis using STATA version 13. The descriptive statistics for each numeric variable are presented below (see Table 1).
2. Review the Data
Missing Values
A. Were any data rows removed? Explain.
Evaluate Limits
B. Were any data removed? Explain.
Evaluate Outliers
C. Were extreme outliers found? If yes, list their values. Were they removed?
Categorical Data
D. Were changes made to the categorical data? Explain.
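The checks asked for in this section (missing values, limits, extreme outliers) can be sketched in code. The Python below is only an illustration, not the STATA workflow used in this worksheet. The sample scores, the 200 to 800 limits, the function name `review_scores`, and the 3 × IQR rule for "extreme" outliers are all assumptions made for the example.

```python
import statistics

def review_scores(raw, low, high, k=3.0):
    """Sort raw values into missing, out-of-limit, extreme-outlier and kept groups.

    low/high are the assumed valid end points; k is the assumed IQR multiplier.
    """
    present = [x for x in raw if x is not None]
    missing = len(raw) - len(present)

    # Limit check: values outside the valid end points
    out_of_limits = [x for x in present if not (low <= x <= high)]
    in_limits = [x for x in present if low <= x <= high]

    # Extreme-outlier check: beyond k * IQR from the quartiles
    q1, _, q3 = statistics.quantiles(in_limits, n=4)
    iqr = q3 - q1
    outliers = [x for x in in_limits if x < q1 - k * iqr or x > q3 + k * iqr]
    kept = [x for x in in_limits if x not in outliers]

    return {
        "missing": missing,
        "out_of_limits": out_of_limits,
        "extreme_outliers": outliers,
        "kept": kept,
    }

# Hypothetical scores; None marks a missing value
sample = [520, 480, None, 610, 14, 550, 1278, 500, 530, 470]
report = review_scores(sample, low=200, high=800)
for name, value in report.items():
    print(name, value)
```

Each entry of the returned dictionary maps onto one of the questions A to C, so the removed values can be listed in the answers.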
3. Excel Setup
The Excel file is to be submitted with this worksheet.
4. Descriptive Statistic - Graphics
Histogram
Paste (Ctrl V) the image in Appendix A in this paper.
E. What did your observation tell you about the numeric variable?
Bar Chart
Right click and Copy. Paste the bar chart to Appendix B in this paper.
F. What did your observation tell you about the categorical variable?
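The counts behind a histogram and a bar chart can be sketched as follows. This is a minimal Python illustration, assuming hypothetical scores and labels and an arbitrary bin width of 100. The helper names `histogram_counts` and `category_counts` are invented for the example, and the charts themselves would still be drawn in Excel or STATA.

```python
from collections import Counter

def histogram_counts(values, bin_width):
    """Count values per bin; each bin is keyed by its lower edge."""
    counts = Counter((v // bin_width) * bin_width for v in values)
    return dict(sorted(counts.items()))

def category_counts(labels):
    """Count how many rows fall in each category (the bar heights)."""
    return dict(Counter(labels))

# Hypothetical data, not taken from the worksheet's dataset
scores = [480, 520, 610, 550, 500, 530, 470, 690]
ethnicity = ["Black", "White", "White", "Black", "Black", "White", "Black", "White"]

bins = histogram_counts(scores, 100)
groups = category_counts(ethnicity)

# Text rendering of the histogram, one '#' per observation
for edge, count in bins.items():
    print(f"{edge}-{edge + 99}: {'#' * count}")
print(groups)
```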
Scatterplot
Right click and Copy. Paste the scatterplot to Appendix C in this paper.
G. What did your observation tell you about the relationship between these two variables?
Not Applicable
Normality
Place the output in Appendix D in this document.
H. What was your observation for normality? Explain.
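A normal Q-Q plot pairs each sorted observation with the value a normal distribution would put at the same rank. The Python sketch below computes those pairs with the standard library. The plotting position (i - 0.5) / n is an assumption made here, and STATA's `qnorm` may use a different convention. The scores and the function name `qq_points` are hypothetical.

```python
from statistics import NormalDist, mean, stdev

def qq_points(values):
    """Return (expected normal value, observed value) pairs for a Q-Q plot."""
    ordered = sorted(values)
    n = len(ordered)
    # Normal distribution with the sample's own mean and standard deviation
    fitted = NormalDist(mean(ordered), stdev(ordered))
    return [
        (fitted.inv_cdf((i - 0.5) / n), x)
        for i, x in enumerate(ordered, start=1)
    ]

# Hypothetical scores, not taken from the worksheet's dataset
scores = [470, 480, 500, 520, 530, 550, 610]
points = qq_points(scores)
for expected, observed in points:
    print(f"{expected:8.1f} {observed:6d}")
```

If the data are close to normal, the pairs fall near the line where expected equals observed. Systematic bending away from that line is what supports a "not normal" observation.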
5. Descriptive Statistics – Calculations
Fill in Table 1 from the Xuru normality test output in Appendix D.
The SAT Verbal Descriptive Statistics are not from the dataset.
Table 1
Descriptive Statistics on Numeric Variables
| | Numeric Variables | |
| Statistic | SAT scores | Variable Name |
| Shapiro-Wilk, p-value = | W = 0.94541, p-value = 0.00000 | |
| Normal (Yes/No) | No | |
| Mean, M = | 650.0303 | |
| Median = | 683 | |
| Std. Dev., s = | 362.3033 | |
| IQR ÷ 2 = | 311 | |
| Sample Size, n = | 231 | |
| Minimum = | 14 | |
| Maximum = | 1278 | |
| Mode = | 154 | |
| Confidence Interval = | | |
Note. The confidence interval is applicable for normally distributed data.
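The normality row in Table 1 drives which statistics get reported. A minimal Python sketch of that decision rule follows. It assumes the conventional 0.05 significance level and assumes that non-normal data are summarized with the median and IQR ÷ 2 while normal data use the mean, standard deviation and confidence interval. The function name `choose_summary` is hypothetical.

```python
def choose_summary(p_value, alpha=0.05):
    """Decide how to describe a numeric variable from a normality-test p-value."""
    if p_value >= alpha:
        # Fail to reject normality: parametric summary, CI is applicable
        return {
            "normal": True,
            "center": "mean",
            "spread": "standard deviation",
            "report_ci": True,
        }
    # Reject normality: rank-based summary, CI left blank
    return {
        "normal": False,
        "center": "median",
        "spread": "IQR / 2",
        "report_ci": False,
    }

# Shapiro-Wilk p-value as printed in Appendix D (0.00000 to five decimals)
decision = choose_summary(0.00000)
print(decision)
```

Under these assumptions, the SAT scores column would be described by its median (683) and IQR ÷ 2 (311), with the confidence interval row left empty.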
The sample data did not come from a normally distributed population. The p-value from the Shapiro-Wilk test was below the 0.05 level of significance and so falls in the rejection region; we therefore reject the null hypothesis that the population is normally distributed. The Q-Q plot supports this finding.
6. Descriptive Statistics - Interpretation
Interpretation
Numeric Variable Name
The numeric variable is the SAT scores collected from both Black and White students in colleges. It is a continuous variable. The minimum score was 14 and the maximum was 1278. The most frequent score (mode) was 154. Because the data were not normally distributed, the mean (650.03) is not a good measure of the center; the median (683) describes it better.
Categorical Variable Name
There were 32 students picked from each college; each had taken the SAT and the scores had been recorded. Of these, 16 were Black and 16 were White. Of the Black students, 8 had taken a foreign language and 8 had not.
Next Steps
1. Post your paper to the team by midnight Day 5 for member review.
2. Review member papers and post your critique by midnight Day 6.
3. Make corrections to this paper based on the critique and member collaboration.
4. Update your Business Research Paper (time permitting) by midnight Day 7.
5. Submit this paper and your created member review papers by midnight Day 7.
Appendix D: STATA findings
1. Shapiro-Wilk normality test
. swilk SAT_scores
                  Shapiro-Wilk W test for normal data
   Variable |   Obs      W          V        z      Prob>z
-------------+--------------------------------------------------
 SAT_scores |   231   0.94541     9.237    5.152   0.00000
2. QQ plot
. qnorm SAT_scores
3. Summary statistics
. summarize SAT_scores
   Variable |      Obs       Mean   Std. Dev.      Min       Max
-------------+--------------------------------------------------------
 SAT_scores |      231   650.0303   362.3033        14      1278
4. Median and quantiles
. tabstat SAT_scores, statistics( mean iqr median p95 p5 )
   variable |     mean      iqr      p50      p95       p5
-------------+--------------------------------------------------
 SAT_scores | 650.0303      622      683     1165       79
----------------------------------------------------------------
5. Confidence intervals
. ci SAT_scores
   Variable |       Obs       Mean   Std. Err.      [95% Conf. Interval]
-------------+---------------------------------------------------------------
 SAT_scores |       231   650.0303   23.83781       603.0619   696.9987
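The figures in this appendix can be cross-checked against each other with a little arithmetic. The Python sketch below uses only the summary numbers printed above (it does not recompute anything from the raw data, which is not included here) and checks that the standard error, IQR ÷ 2 and confidence interval are mutually consistent. The variable names are chosen for the example.

```python
import math

# Values copied from the STATA output in this appendix
n = 231
mean = 650.0303
sd = 362.3033
iqr = 622
reported_se = 23.83781
ci_low, ci_high = 603.0619, 696.9987

# Standard error of the mean: s / sqrt(n)
se = sd / math.sqrt(n)

# Table 1 reports half of the interquartile range
half_iqr = iqr / 2

# A symmetric interval should be centered on the mean
midpoint = (ci_low + ci_high) / 2
half_width = (ci_high - ci_low) / 2

# Critical value implied by the interval: half-width divided by SE
implied_multiplier = half_width / se

print(f"SE = {se:.5f} (reported {reported_se})")
print(f"IQR / 2 = {half_iqr}")
print(f"CI midpoint = {midpoint:.4f}, implied multiplier = {implied_multiplier:.4f}")
```

The implied multiplier is close to 1.97, which is what a 95% t-based interval with 230 degrees of freedom would use, so the printed interval agrees with the mean and standard error.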
Appendix E: Table of percentages
| Students | | Percentage |
| White | | 50% |
| Black | Taken foreign language | 25% |
| | Not taken foreign language | 25% |