Assessment 3: Statistical Data Analysis
Students will perform statistical data analysis based on a data set provided in this assessment task. The data can be downloaded from Canvas under the Assessment 3 Task.
equiv. – equivalent word count based on the Assessment Load Equivalence Guide. It means this assessment is equivalent to the normally expected time requirement for a written submission containing the specified number of words.
Assessment 3: Statistical Data Analysis
Due date:  Week 12 
Group/individual:  Group 
Word count / Time provided:  1500 – 2000 Words 
Weighting:  30% 
Unit Learning Outcomes:  [ULO1], [ULO2], [ULO3], [ULO4] 
Assessment Details:
The BUS1003 assessment task 3 is worth 30% of the overall assessment in the unit. This assignment is group work.
Timeframe and Submission:
The assessment must be uploaded no later than 11:59 pm on Monday (2/05/2022) of Week 12 on the Canvas assessment submission link. Unless approval for an extension is given on medical grounds (supported by a medical certificate), there will be a penalty of 5% of the maximum marks per calendar day for late submission of assignments. Although you will be provided with guidance about addressing the assignment tasks, you will need to complete the tasks in your own time.
Assessment Presentation
 Your answers must be presented in task number order and be clearly labelled with the appropriate task number. Answers to each task must start on a new page.
 Your assignment must be presented in Microsoft (MS) Word or pdf. Copy and paste any relevant Excel outputs to this document immediately before any relevant written answers to each task.
 If you are unfamiliar with using the MS Word Equations Editor, you may write algebraic/mathematical/statistical symbols and notation in neat handwritten form.
 Your answers must be clear. You must highlight relevant items on any required Excel outputs and refer to them in your written answers.
 When asked to perform a manual calculation (i.e., MS Excel is not specified), you must show all working. This must include intermediate steps where relevant. Failure to do so will result in a loss of marks.
 An Assessment Declaration is required and must be attached to the front of your assignment.
The dataset included with this assignment is a random sample of 450 persons from the population survey of Australia in a particular year (2020). The population consists of persons who were working and drawing salaries during the survey year. You can access the dataset from the Assessment Information page on the unit website. You need to select a random sample of 100 IDs, each containing observations, where appropriate, of the eight variables, V1 to V8. The variables in the data set are as follows:
V1 = Salary (dollars per hour)
V2 = Occupational category (1=Management, 2=Sales, 3=Clerical, 4=Service, 5=Professional, 6=Other)
V3 = Sector (0=Other, 1=Manufacturing, 2=Construction)
V4 = Indicator variable for Residency Ownership (1=Homeowner, 0=Tenant)
V5 = Educational level (0=Other, 1=Diploma, 2=Graduate Certificate, 3=Bachelor, 4=Master, 5=Doctorate)
V6 = Number of years of work experience
V7 = Age (years)
V8 = Indicator variable for sex (1=Female, 0=Male).
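The sample selection described above could be sketched in Python as follows. This is only an illustration: it assumes the 450 records carry the IDs 1 to 450, and the function name `draw_sample_ids` and the seed value are hypothetical. The brief itself expects the sample to be prepared in Excel.

```python
import random


def draw_sample_ids(population_size=450, sample_size=100, seed=None):
    """Return a sorted simple random sample of IDs, drawn without replacement."""
    rng = random.Random(seed)
    # Assumption: IDs are the integers 1..population_size.
    return sorted(rng.sample(range(1, population_size + 1), sample_size))


# A fixed seed (hypothetical value) makes the selection reproducible for the group.
sample_ids = draw_sample_ids(seed=2022)
print(len(sample_ids), sample_ids[:5])
```

Fixing the seed lets every group member reproduce the same 100 IDs.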
Assessment Tasks
Answers to the Assessment 3 tasks must be based on the sample data file you created in Part I of the assignment. In addition, most tasks in Assessment 3 require you to obtain an Excel output before performing some analysis. There are five tasks in Assessment 3. You must meet all task requirements to receive full marks.
Task 1 (20 marks)
 (a) Find the frequency distribution for the Educational level (0=Other, 1=Diploma, 2=Graduate Certificate, 3=Bachelor, 4=Master, 5=Doctorate). Use Excel to produce a Descriptive Statistics table for your sample “Educational level” data and paste it into your MS Word assignment document.
 (b) Use the relative frequency approach to find the probability distribution for the Educational level.
 (c) Draw a pie chart for the probability distribution of the Educational level.
 (d) Define the probability distribution based on part (b) (you must calculate it from your data). Show your results in the format below:
x     0  1  2  3  4  5
P(x)
 (e) Based on the probability distribution calculated in part (d), find the following:
  i. The probability of exactly two.
  ii. The probability of more than three.
  iii. The probability of at least three.
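As an optional cross-check on the Excel work for Task 1, the relative frequency approach can be sketched in Python. The ten education codes below are made up purely for illustration, and the helper name `probability_distribution` is hypothetical. Your answers must come from your own sample of 100.

```python
from collections import Counter


def probability_distribution(values, categories):
    """Relative frequency of each category: count divided by sample size."""
    counts = Counter(values)
    n = len(values)
    return {x: counts.get(x, 0) / n for x in categories}


# Hypothetical V5 codes for ten people (not real data).
education = [3, 1, 3, 0, 2, 3, 4, 1, 5, 3]
dist = probability_distribution(education, range(6))

p_exactly_two = dist[2]                                          # P(X = 2)
p_more_than_three = sum(p for x, p in dist.items() if x > 3)     # P(X > 3)
p_at_least_three = sum(p for x, p in dist.items() if x >= 3)     # P(X >= 3)

for x, p in dist.items():
    print(x, round(p, 2))
```

Note that "more than three" excludes level 3, while "at least three" includes it.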
Task 2 (20 marks)
 Find the frequency distribution for the indicator variable for Residency Ownership (1=Homeowner, 0=Tenant). Then, use Excel to produce a Descriptive Statistics table for your sample “Residency Ownership” data and paste it into your MS Word assignment document.
 Use the relative frequency approach to find the probability distribution for the Residency Ownership.
 Draw the bar chart for the probability distribution of Residency Ownership.
 According to a sample data report, 26% of the people are homeowners (you need to treat the Residency Ownership proportion as the probability of success). Assume that a sample of 8 people is studied:
 i. Find the probability that exactly five are homeowners.
 ii. Find the probability that fewer than five are homeowners.
 iii. Find the probability that at least six are homeowners.
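The three binomial probabilities in Task 2 can be checked with a short Python sketch. It assumes the number of homeowners in the sample of 8 follows a binomial distribution, and uses p = 0.26 as quoted above. Substitute the proportion from your own data if it differs. The function name `binomial_pmf` is illustrative.

```python
from math import comb


def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


n, p = 8, 0.26  # p as quoted in the task; replace with your sample proportion

p_exactly_five = binomial_pmf(5, n, p)                                # P(X = 5)
p_less_than_five = sum(binomial_pmf(k, n, p) for k in range(5))       # P(X <= 4)
p_at_least_six = sum(binomial_pmf(k, n, p) for k in range(6, n + 1))  # P(X >= 6)

print(p_exactly_five, p_less_than_five, p_at_least_six)
```

The three events cover every possible outcome from 0 to 8, so their probabilities should sum to 1, which is a quick sanity check on manual working.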
Task 3 (20 marks)
 Use Excel and your sample data file to produce a suitable output, and test, at the 5% level of significance, the hypothesis that the mean Salary (dollars per hour) in the population is $25.
 Is this a one-tailed or two-tailed test? Briefly explain the reasoning behind your answer.
 Write, in precise symbolic form, the null and alternative hypotheses.
 Define the Z test and calculate the value of the test statistic.
 Define the critical values based on the nature of the problem.
 Find a 95% confidence interval for the mean Salary (dollars per hour) in the population.
 Make the decision based on the critical value.
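The calculations in Task 3 could be sketched in Python as below. The sketch assumes a two-tailed alternative, uses the sample standard deviation in place of the unknown population value, and runs on twelve made-up salary figures. The function name `z_test_mean` and the data are hypothetical, and your own reasoning about the tails of the test must still be given in your answer.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def z_test_mean(sample, mu0, alpha=0.05):
    """Two-tailed z test of H0: mu = mu0, with a (1 - alpha) confidence interval."""
    n = len(sample)
    x_bar = mean(sample)
    se = stdev(sample) / sqrt(n)                  # estimated standard error of the mean
    z = (x_bar - mu0) / se                        # test statistic
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)  # about 1.96 when alpha = 0.05
    ci = (x_bar - z_crit * se, x_bar + z_crit * se)
    return {"mean": x_bar, "z": z, "z_crit": z_crit, "ci": ci,
            "reject_h0": abs(z) > z_crit}


# Hypothetical hourly salaries (not real data); the real sample has 100 rows.
salaries = [22.5, 27.0, 31.2, 18.4, 25.6, 29.9, 24.1, 35.0, 21.7, 26.3, 28.8, 23.5]
result = z_test_mean(salaries, mu0=25)
print(result)
```

For a two-tailed test, rejecting the null hypothesis at the 5% level is equivalent to the hypothesised mean falling outside the 95% confidence interval.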
Task 4 (20 marks)
 Use Excel and your sample data file to produce a descriptive summary output (remember to include the confidence bound “e” at the 1% level of significance) for the indicator variable for sex (1=Female, 0=Male) according to your sample data from Task 1.
 Define the mean proportion.
 At the 1% level of significance, test the hypothesis that the mean proportion for the male population is 0.45, using the indicator variable for sex (1=Female, 0=Male) from your sample data from Task 1.
 Write, in precise symbolic form, the null and alternative hypotheses.
 Is this a one-tailed or two-tailed test? Briefly explain the reasoning behind your answer.
 State the conclusion based on the sample evidence.
 Find a 99% confidence interval for the proportion of males, based on the indicator variable for sex.
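A minimal Python sketch of the Task 4 calculations follows. It assumes a two-tailed one-sample z test for a proportion, with the standard error computed under the null value 0.45 for the test statistic and from the sample proportion for the confidence bound. The data (52 males out of 100) and the function name `proportion_z_test` are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def proportion_z_test(successes, n, p0, alpha=0.01):
    """Two-tailed z test of H0: p = p0, plus a (1 - alpha) confidence interval."""
    p_hat = successes / n
    z = (p_hat - p0) / sqrt(p0 * (1 - p0) / n)    # standard error under H0
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)  # about 2.576 when alpha = 0.01
    e = z_crit * sqrt(p_hat * (1 - p_hat) / n)    # confidence bound "e"
    return {"p_hat": p_hat, "z": z, "z_crit": z_crit,
            "ci": (p_hat - e, p_hat + e), "reject_h0": abs(z) > z_crit}


# Hypothetical V8 column: 0 = Male, 1 = Female (not real data).
sex = [0] * 52 + [1] * 48
males = sex.count(0)  # males are coded 0, so count zeros rather than summing
result = proportion_z_test(males, len(sex), p0=0.45)
print(result)
```

Because V8 codes females as 1, the mean of the column is the female proportion. The male proportion is one minus that mean.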
Task 5 (20 marks)
 Find the relationship between Salary (dollars per hour) as the response variable and Education level as the explanatory variable. Use Excel to produce the linear regression output. The belief is that as the education level increases, the Salary (dollars per hour) increases. (You have to calculate according to your data.)
 State the slope coefficient of the least squares regression equation.
 State the intercept coefficient of the least squares regression equation.
 Determine the least squares regression equation representing the approximately linear relationship between Salary (dollars per hour) as the response variable and Education level as the explanatory variable.
 Estimate the Salary when the education level is Diploma.
 Construct the 95% confidence interval for the slope parameter of the least squares regression equation.
Please provide all the necessary screenshots from Excel and paste them into your Word solution file.
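For Task 5, the quantities reported in the Excel regression output can be reproduced with the sketch below. It is an illustration only: the six data points are made up, the function name `least_squares` is hypothetical, and the t critical value is passed in by hand from a t-table (2.776 for 4 degrees of freedom at 95%) because the Python standard library has no t distribution. With a sample of 100 you would look up the value for 98 degrees of freedom instead.

```python
from math import sqrt
from statistics import mean


def least_squares(x, y):
    """Return slope, intercept and the standard error of the slope."""
    n = len(x)
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    # Residual sum of squares around the fitted line.
    sse = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    se_slope = sqrt(sse / (n - 2)) / sqrt(sxx)
    return slope, intercept, se_slope


# Hypothetical data: education code (V5) and salary in dollars per hour (V1).
education = [0, 1, 2, 3, 4, 5]
salary = [15, 18, 20, 26, 27, 32]

slope, intercept, se_slope = least_squares(education, salary)
salary_at_diploma = intercept + slope * 1  # Diploma is coded 1

t_crit = 2.776  # assumption: t-table value for df = n - 2 = 4 at 95% confidence
slope_ci = (slope - t_crit * se_slope, slope + t_crit * se_slope)
print(slope, intercept, salary_at_diploma, slope_ci)
```

A positive slope whose 95% interval excludes zero would be consistent with the stated belief that salary rises with education level.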
Marking Information: The case study assessment will be marked out of 100 and weighted 30% of the total unit marks.
Marking Criteria
Excellent: 85–100% of the criterion mark
Very Good: 75–84% of the criterion mark
Good: 65–74% of the criterion mark
Satisfactory: 50–64% of the criterion mark
Not satisfactory: 0–49% of the criterion mark
Task 1: Frequency distribution, descriptive statistics, probability distribution for education level. (20 Marks)
Excellent: The submission on the requirements of Task 1 is assessed as excellent.
Very Good: The submission on the requirements of Task 1 is assessed as very good.
Good: The submission on the requirements of Task 1 is assessed as good.
Satisfactory: The submission on the requirements of Task 1 is assessed as satisfactory.
Not satisfactory: The submission on the requirements of Task 1 is assessed as not satisfactory.
Task 2: Frequency distribution, descriptive statistics, probability distribution for residency ownership. (20 Marks)
Excellent: The submission on the requirements of Task 2 is assessed as excellent.
Very Good: The submission on the requirements of Task 2 is assessed as very good.
Good: The submission on the requirements of Task 2 is assessed as good.
Satisfactory: The submission on the requirements of Task 2 is assessed as satisfactory.
Not satisfactory: The submission on the requirements of Task 2 is assessed as not satisfactory.
Task 3: Hypothesis testing; z- and t-test; confidence interval. (20 Marks)
Excellent: The submission on the requirements of Task 3 is assessed as excellent.
Very Good: The submission on the requirements of Task 3 is assessed as very good.
Good: The submission on the requirements of Task 3 is assessed as good.
Satisfactory: The submission on the requirements of Task 3 is assessed as satisfactory.
Not satisfactory: The submission on the requirements of Task 3 is assessed as not satisfactory.
Task 4: Hypothesis testing; null and alternative hypotheses. (20 Marks)
Excellent: The submission on the requirements of Task 4 is assessed as excellent.
Very Good: The submission on the requirements of Task 4 is assessed as very good.
Good: The submission on the requirements of Task 4 is assessed as good.
Satisfactory: The submission on the requirements of Task 4 is assessed as satisfactory.
Not satisfactory: The submission on the requirements of Task 4 is assessed as not satisfactory.
Task 5: Regression; least squares regression. (20 Marks)
Excellent: The submission on the requirements of Task 5 is assessed as excellent.
Very Good: The submission on the requirements of Task 5 is assessed as very good.
Good: The submission on the requirements of Task 5 is assessed as good.
Satisfactory: The submission on the requirements of Task 5 is assessed as satisfactory.
Not satisfactory: The submission on the requirements of Task 5 is assessed as not satisfactory.