Examination Period Faculty of Business and Economics EXAM CODES:

Office Use Only Semester One 2019 Examination Period Faculty of Business and Economics EXAM CODES: ETC2410-ETW2410-BEX2410 TITLE OF PAPER: Introductory Econometrics – PAPER 1 EXAM DURATION: 2 hours writing time READING TIME: 10 minutes THIS PAPER IS FOR STUDENTS STUDYING AT: (tick where applicable)  Caulfield  Clayton  Parkville  Peninsula  Monash Extension  Off Campus Learning  Malaysia  Sth Africa  Other (specify) During an exam, you must not have in your possession any item/material that has not been authorised for your exam. This includes books, notes, paper, electronic device/s, mobile phone, smart watch/device, calculator, pencil case, or writing on any part of your body. Any authorised items are listed below. Items/materials on your desk, chair, in your clothing or otherwise on your person will be deemed to be in your possession. No examination materials are to be removed from the room. This includes retaining, copying, memorising or noting down content of exam material for personal use or to share with any other person by any means following your exam. Failure to comply with the above instructions, or attempting to cheat or cheating in an exam is a discipline offence under Part 7 of the Monash University (Council) Regulations, or a breach of instructions under Part 3 of the Monash University (Academic Board) Regulations. AUTHORISED MATERIALS OPEN BOOK  YES  NO CALCULATORS  YES  NO Only HP 10bII+ or Casio FX82 (any suffix) calculator permitted SPECIFICALLY PERMITTED ITEMS  YES  NO if yes, items permitted are: one A4 sheet of paper with hand written notes on both sides Candidates must complete this section if required to write answers within this paper STUDENT ID: __ __ __ __ __ __ __ __ DESK NUMBER: __ __ __ __ __ INSTRUCTIONS TO STUDENTS • Answer all FOUR questions. All questions are of equal value (15 marks). This paper is worth 60 marks in total and constitutes 60% of the final assessment. • For multiple choice questions write the question number and only one letter (a), (b), (c), (d) or (e) for each question in your answer book (not on the question sheet). • When testing a hypothesis, to obtain full marks you need to specify the null and the alternative hypotheses, the test statistic and its distribution under the null, and then perform the test and state your conclusion. • If a question does not specify the level of significance of a hypothesis test explicitly, use 5%. • Statistical tables are provided after Question 4. Question 1 (15 marks) This question has 15 multiple choice questions. Make sure that you clearly specify the question number and only one letter for each multiple choice question in your answer book (not on the question sheet). 1. Consider two datasets. In dataset A, we have data on consumption expenditure, income and hours of work for every year from 2000 to 2017 for a group of individuals who were randomly selected in the year 2000. In data set B, we have data on consumption per capita, income per capita and unemployment rate for Australia, Indonesia, Malaysia, New Zealand, Thailand and Vietnam for every year from 2000 to 2017. (a) Both datasets are examples of time series data. (b) Both datasets are examples of cross-sectional data. (c) Both datasets are examples of panel data. (d) Dataset A is an example of panel data, dataset B is an example of time series data. (e) Dataset A is an example of cross-sectional data, dataset B is an example of time-series data. (1 mark) 2. Which of the following statements is NOT true? (a) Randomised controlled trials are the best means for measuring causal relationships. (b) In predictive modelling, the variables that are used as predictors need not cause the variable that they try to predict. (c) Correlation is not causation. (d) Time series observations are always i.i.d. (e) Time series data are ordered whereas cross section data are not. (1 mark) Page 2 of 15 3. Let  denote the weight of a newborn baby immediately after birth.  is a random variable with mean , i.e. () =  and variance 2 i.e.  (−)2 = 2. We denote weights of 5 newborn babies selected at random by 12 3 4 and 5, and their sample average by ̄ Which of the following statements is NOT true (a) P5 =1( − ̄) = 0 (b) P5 =1  = 5̄ (c) (̄) =  (d) ̄ =  (e) ̄ is a linear combination of 12 3 4 and 5 (1 mark) 4. Let  and  denote returns to two risky assets. We are told that  () =  () =  and  () =   () = 2 If we invest half of our savings in one of these assets and the other half in the other asset, then the variance of the return to our investment will be (a)  2 4 if  and  are uncorrelated (b)  2 2 if  and  are uncorrelated (c)  2 2 always (d) 2 always (e) (−)2+(−)2 4 if  and  are uncorrelated (1 mark) Questions 5 and 6 refer to the following p.d.f.: According to an expert, the annual growth rate of the real GDP and the inflation rate for Malaysia in 2019 are governed by the following joint probability density function: Inflation rate ↓ , GDP growth rate → 4% 5% 6% 1% 0.1 0.1 0.0 2% 0.1 0.2 0.0 3% 0.1 0.1 0.1 4% 0.0 0.1 0.1 5. The expected growth rate of real GDP in Malaysia in 2019 according to this expert is: (a) a random variable (b) 500% because 4+5+6 3 = 5 (c) 490% because 4×03+5×05+6×02 = 49 (d) 250% because 1×02+2×03+3×03+4×02 = 25 (e) 492% because 1 4 ×{(4× 01 01+01 +5× 01 01+01 )+(4× 01 01+02 +5× 02 01+02 )+ (4× 01 01+01+01 +5× 01 01+01+01 +6× 01 01+01+01 )+ (5× 01 01+01 +6× 01 01+01 )} = 492 (1 mark) Page 3 of 15 6. Conditional on 5% GDP growth rate, the expected inflation rate in Malaysia in 2019 according to this expert is: (a) a random variable (b) 250% because 1+2+3+4 4 = 25 (c) 250% because 1×02+2×03+3×03+4×02 = 25 (d) 120% because 1×01+2×02+3×01+4×01 = 12 (e) 240% because 1× 01 05 +2× 02 05 +3× 01 05 +4× 01 05 = 24 (1 mark) Questions 7, 8 and 9 refer to the multiple regression model  = 0 + 11 + 22 + · · ·+  +   = 12      (1) which in matrix notation is y ×1 = X ×(+1) β (+1)×1 + u ×1 7.  (u | X) = 0 implies that (a) (X0u) = 0 (b) X0bu = 0 where bu is the vector of OLS residuals of regression of y on X (c)  (u | X) = 2I where I is the identity matrix of order  (d) X0X is invertible (e) Columns of X are linearly independent (1 mark) 8. Which one of the following statements is correct? (a) Xy is an  ×1 vector (b) X0X is an  ×  matrix (c) X0u = 0 (d) X0u is a ( +1)×1 vector (e) X0β is a ( +1)×1 vector (1 mark) 9. Assuming that this model satisfies all assumptions of the Classical Linear Model (CLM) and denoting the OLS estimator of β by bβ, which of the following statements is NOT correct? (a) bβ is an unbiased estimator of β (b) bβ is a consistent estimator of β (c) Conditional on X bβ is normally distributed (d) bβ is the best linear unbiased estimator of β (e) bβ is equal to β (1 mark) Page 4 of 15 10. We have chosen a random sample of 100 publicly listed companies and recorded their average share price, profits, revenues and total costs in 2017-2018 financial year. Note that profits = revenue – total cost. In a regression model with the share price as the dependent variable and a constant, profit, revenue and total cost as independent variables, the OLS estimator (a) cannot be computed because X0X matrix is not invertible (b) will be biased because share price is not normally distributed (c) will be unbiased (d) will be BLUE (e) will be unbiased but not BLUE (1 mark) Questions 11 to 13 refer to the following problem: We would like to model the relationship between the price of an apartment with its area and its number of bedrooms. We postulate the following population regression model  = 0 + 1 + 2 +  Suppose all assumptions of the Classical Linear Model applies to this model. We have collected data on price (in 1000 dollars), area (in square metres) and number of bedrooms for 120 randomly selected apartments and estimated the parameters of this models using OLS. This resulted in 31899 135 and 6237 for estimates of 0, 1 and 2 respectively. 11. Which of the following equations reports the results appropriately? (a) d = 31899+135  +6237  (b) d = 31899+135  +6237  + ̂ (c) d = 31899+135  +6237  +  (d)  = 31899+135  +6237  +  (e)  ( |  ) = 31899+135  +6237  (1 mark) 12. Which of the following statements is correct? (a)  ( |  ) = 31899+135  +6237  (b)  ( |  ) = 31899+135  +6237  +  (c)  ( |  ) = 31899+135  +6237  + ̂ (d)  ( |  ) = 0 + 1 + 2 (e)  ( |  ) = 0 + 1 + 2 +  (1 mark) 13. The null hypothesis for testing that given the area of an apartment, its number of bedrooms is not a significant predictor of its price, is: (a) 0 :  = 0 (b) 0 : ( | ) = 0 (c) 0 : b2 = 0 (d) 0 : 2 = 0 (e) 0 : b2 6= 0 (1 mark) Page 5 of 15 Questions 14 and 15 relate to the following econometric model: Some economists believe that the relationship between greenhouse gas emission and income is nonlinear. Denote a country’s emission of CO2 per capita by 2 and its GDP per capita by  and consider the following model: 2 = 0 + 1 + 2 2 +  (2) 14. The hypothesis that the relationship between 2 and  is linear versus the al- ternative that it is an inverted U shape relationship can be written as: (a) 0 : 2 = 0 against 1 : 2  0 (b) 0 : 2 = 0 against 1 : 2  0 (c) 0 : 1 = 0 against 1 : 1  0 (d) 0 : 1 = 0 against 1 : 1  0 (e) 0 : 1 = 2 = 0 against 1 : at least one of 1 or 2 not equal to zero (1 mark) 15. If we know that in the model shown in equation (2)   ( | ) = 2, but all other assumptions of the Classical Linear Model are satisfied, then (a) we can still use the OLS estimator because it is unbiased, and we can use the usual OLS standard errors to perform  tests (b) we can still use the OLS estimator because it is unbiased, but we need to use heteroskedas- ticity robust standard errors to perform  tests (c) we cannot use the OLS estimator because the OLS estimator is biased in this case (d) we can still use the OLS estimator because it is the best linear unbiased estimator in this case (e) we can still use the OLS estimator because the OLS estimator is the same as the “weighted least squares” estimator in this case (1 mark) Question 2 (15 marks) 2.a. Suppose we have a sample of  observations on a variable . Show that if we run a regression of  on a constant only, the OLS estimate of the constant will be the sample average of  (3 marks) 2.b. From the World Development Indicators database, we have extracted data on the following variables for 121 countries in 2015: Variable Definition Range UNDER5 Mortality rate in children under 5 (per 1000 live births) 2.4 – 130.9 GDPPC GDP per capita in PPP adjusted dollars (as defined in assignment 1) 626 – 80892 SANITATION People using basic sanitation services (% of population) 7 – 100 WATER People using basic drinking water services (% of population) 0 – 100 The “Range” column provides the range of these variable in our sample. Page 6 of 15 From these 121 countries, 35 are in sub-Saharan Africa. We have created a dummy variable called SUBSAHARA which is equal to 1 if the country is a sub-Saharan country and 0 otherwise. Using this data set, we have estimated the following regressions using OLS (standard errors are provided in parentheses below parameter estimates) d5 = 172 (21) +596 (38)  (3) d5 = 1590 (145) − 72 (22) log()− 06 (01)  − 02 (01)  (4) i. From the information provided, compute the average under-5 mortality rate (a) for the 35 sub-Saharan countries, (b) for the remaining 86 countries, and (c) for all 121 countries in this sample. (3 marks) ii. Explain the estimated coefficients of log() in equation (4) in a way that a person with no econometric training would understand. (2 marks) iii. Suppose we want to test the hypothesis that after controlling for log(), a 1 percentage point increase in the proportion of population with access to basic sanitation has the same effect on under-5 mortality as a 1 percentage point increase in the proportion of population with access to drinking water, against the alternative that these effects are not equal, at the 5% level of significance. Explain how we could do that. For full marks, you need to state the null, the alternative, the test statistic and its distribution under the null, any additional regressions that we may have to estimate to calculate the test statistic, and how to come up with a conclusion using this procedure. All of these need to be explained in the context of this question where appropriate. (4 marks) iv. We have added  to equation (4) and re-estimated it and obtained the following equation: d5 = 1354 (145) − 74 (22) log()− 04 (01)  − 01 (01)  +182 (58)  (5) Use this information to test the hypothesis that after controlling for GDP per capita and access to sanitation and water services, there is no difference between the mean of under-5 mortality in sub-Saharan countries and the rest of the world, against the alternative that sub-Saharan countries have a higher mean, at the 5% level of significance. Remember that you need to state all steps of hypothesis testing to obtain full marks. (3 marks) Page 7 of 15 Question 3 (15 marks) 3.a. In predictive modelling, when we want to find the best subset of  explanatory variables {1 2     } to predict a target variable  we do not use 2 to compare models. Explain why, and provide the formula of an alternative statistic (only one) that we can use for selecting the best predictive model, highlighting specifically how this statistic overcomes the deficiency of 2 for model selection. (3 marks) 3.b. We have randomly selected a sample of 249 employed men and collected the following infor- mation: Variable Definition Range Median WAGE hourly wage in dollars 7.5 – 125 30 EDUC years of education 2 – 18 12 EXPER years of experience 0 – 38 13 The “Range” and “Median” columns show the range and the median of each variable within our sample, and zero years of experience means people who have less than 6 months experience. Consider the following population regression model for the logarithm of wage given education and experience: log() = 0 + 1 ( −12)+ 2  + 3 2 +  (6) We have estimated the following regression using OLS: dlog() = 2837 (0066) + 0095 (0010) ( −12)+ 0055 (0009) − 0001 (00003) 2 (7) 2 = 0394 standard error of the regression = 0420  = 249 Note that we have subtracted 12 from years of education in order to make the results more readily interpretable. i. Interpret the estimated coefficients in this regression, including its intercept. (4 marks) ii. Can we interpret the coefficient of ( −12) as the estimate of the “return to education”, i.e. proportional increase in wage caused by an extra year of education? Explain. (2 marks) iii. In order to test the hypothesis that the errors of this model are homoskedastic against a specific alternative, we have estimated the following auxiliary regression: ̂2 = 0096 (0031) + 0004 (0006) ( −12)+ 0005 (0002)  2 = 0039 standard error of the regression = 0262  = 249 where ̂ is the estimated residual of equation (7). Use this information to perform the test at the 5% level of significance. Remember that you need to write down the null and the alternative and all steps of hypothesis testing to obtain full marks. (4 marks) iv. Suppose we are told that the conditional variance of the error in model (6) is proportional to experience, i.e.  ( | ) = 2 × . Explain how we can use this information to transform model (6) in such a way that the transformed model will have the same parameters but no heteroskedasticity. (2 marks) Page 8 of 15

 

Order a Similar Paper

You didn't find what you were looking for? Upload your specific requirements now and relax as your preferred tutor delivers a top quality customized paper

Order Now