Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.

Slides:



Advertisements
Présentations similaires
What does en mean? The object pronoun en usually means some or of them.
Advertisements

How to solve biological problems with math Mars 2012.
Les choses que j aime Learning Objective: To know how to use j aime to talk about things I like to do.
3 Les Verbes -ER Talking about people’s activities Les normes: –Communication 1.2: Understanding the written and spoken language –Comparisons 4.1: Understanding.
What’s the weather like?. Look at the verb phrase fait-il above Turn it around and you have il fait The phrase Il fait can be used to describe lots of.
Making PowerPoint Slides Avoiding the Pitfalls of Bad Slides.
PERFORMANCE One important issue in networking is the performance of the network—how good is it? We discuss quality of service, an overall measurement.
An Introduction To Two – Port Networks The University of Tennessee Electrical and Computer Engineering Knoxville, TN wlg.
 Components have ratings  Ratings can be Voltage, Current or Power (Volts, Amps or Watts  If a Current of Power rating is exceeded the component overheats.
CNC Turning Module 1: Introduction to CNC Turning.
Usage Guidelines for Jeopardy PowerPoint Game
Leçon 6: Une Invitation Unité 7.
Theme Three Speaking Questions
CONJUGAISON.
Notes for teacher. You can just use slides 2-5 if you wish. If you want to do the practical activity (slides 6-8) you will need to: print off Slide 6.
Quel est le pays le plus heureux ?
Speaking Exam Preparation
L’impératif ( = command forms)
Les pentes sont partout.
Chapter 6- the verb ‘to go’ question words places time
Reflective verbs or Pronominal verbs
Strengths and weaknesses of digital filtering Example of ATLAS LAr calorimeter C. de La Taille 11 dec 2009.
Quantum Computer A New Era of Future Computing Ahmed WAFDI ??????
the Periodic Table the Periodic Table 2017/2018 Made by : NEDJAR NASSIMA Made by : NEDJAR NASSIMA MS:HAMZA 1.
- User case - 3D curve length optimization
ÊTRE To be (ou: n’être pas!).
Conjugating regular –er verbs en français
Les Fruits :.
Theme One Speaking Questions
Objectifs: To revise telling the time in French
© 2004 Prentice-Hall, Inc.Chap 4-1 Basic Business Statistics (9 th Edition) Chapter 4 Basic Probability.
F RIENDS AND FRIENDSHIP Project by: POPA BIANCA IONELA.
Quiz What are the different Copper cable types ? How is STP better than UTP ? What type of cable should we use between : Router-Switch, PC-Router, Hub-Switch.
P&ID SYMBOLS. P&IDs Piping and Instrumentation Diagrams or simply P&IDs are the “schematics” used in the field of instrumentation and control (Automation)
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 1-1 Chapter 1 Introduction and Data Collection Basic Business Statistics 10 th Edition.
G. Peter Zhang Neurocomputing 50 (2003) 159–175 link Time series forecasting using a hybrid ARIMA and neural network model Presented by Trent Goughnour.
Author : Moustapha ALADJI PhD student in economics-University of Guyana Co-author : Paul ROSELE Chim HDR Paris 1-Pantheon Sorbonne Economics / Management.
Essai
Français - couleurs - pays - drapeaux
Le soir Objectifs: Talking about what you do in the evening
Talking about people’s activities
La famille ER conjugaison
Qu’est-ce que tu as dans ta trousse?
Pablo Picasso P____ P_______
les instructions Bonjour la classe, sortez vos affaires
Qu’est-ce que tu as dans ta trousse?
MATLAB Basics With a brief review of linear algebra by Lanyi Xu modified by D.G.E. Robertson.
HOW DATA SCIENCE IS HELPING IN ARTIFICIAL INTELLIGENCE ? BOUAZIZ – AZZIZIGROUP A 01/10/20181.
Definition Division of labour (or specialisation) takes place when a worker specialises in producing a good or a part of a good.
Roots of a Polynomial: Root of a polynomial is the value of the independent variable at which the polynomial intersects the horizontal axis (the function.
Mettez vos devoirs dans la boîte rouge prennez les devoirs 2.2 B
Manometer lower pressure higher pressure P1P1 PaPa height 750 mm Hg 130 mm higher pressure 880 mm Hg P a = h = +- lower pressure 620 mm Hg.
WRITING A PROS AND CONS ESSAY. Instructions 1. Begin your essay by introducing your topic Explaining that you are exploring the advantages and disadvantages.
What’s the weather like?
Lesson 3.
Making PowerPoint Slides Avoiding the Pitfalls of Bad Slides.
POWERPOINT PRESENTATION FOR INTRODUCTION TO THE USE OF SPSS SOFTWARE FOR STATISTICAL ANALISYS BY AMINOU Faozyath UIL/PG2018/1866 JANUARY 2019.
© by Vista Higher Learning, Inc. All rights reserved.4A.1-1 Point de départ In Leçon 1A, you saw a form of the verb aller (to go) in the expression ça.
les formes et les couleurs
les instructions Bonjour la classe, sortez vos affaires
1 Sensitivity Analysis Introduction to Sensitivity Analysis Introduction to Sensitivity Analysis Graphical Sensitivity Analysis Graphical Sensitivity Analysis.
Avoiding the Pitfalls of Bad Slides Tips to be Covered Outlines Slide Structure Fonts Colour Background Graphs Spelling and Grammar Conclusions Questions.
Lequel The Last Part.
Chapter 6- the verb ‘to go’ question words places time
L’orchestre des animaux
Savoir et Connaître La norme: Communication 1.2 Comparisons 4.1
Will G Hopkins Auckland University of Technology Auckland NZ Quantitative Data Analysis Summarizing Data: variables; simple statistics; effect statistics.
D’Accord 1 Leçon 3A.1 Descriptive adjectives (irregular adjectives, adjective placement-BAGS, and physical description.)
IMPROVING PF’s M&E APPROACH AND LEARNING STRATEGY Sylvain N’CHO M&E Manager IPA-Cote d’Ivoire.
3. Descriptive Statistics Describing data with tables and graphs (quantitative or categorical variables) Numerical descriptions of center, variability,
Transcription de la présentation:

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics for Economist Chap 7. The Error for Regression 1.Difference between Actual and Predict values 2.Computing RMSE Using the Correlation. 3.The Residual Plot 4.The Vertical Strips 5.Approximating to the Normal Curve Inside a Vertical Strip

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 2/24 INDEX 1 Difference between Actual and Predict Values 2 Computing RMSE Using the Correlation 3 The Residual Plot 4 The Vertical Strips 5 Approximating to the Normal Curve Inside a Vertical Strip

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 3/24 1. Difference between Actual and Predict Values Root-Mean-Square-Error (RMSE) Root-Mean-Square Error (RMSE) Standard Error of Estimate Standard Error of Regression Actual value Estimate Error 회귀직선

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 4/24 Estimation error1 height141cm. average weight of height 141cm is 38.7kg residual = actual weight – predicted weight = 54.5kg – 38.7kg = +15.8kg 67.4kg – 84.0kg = -16.6kg Residual of A Residual of B Korean men 4514 with age Average height = 167.5cm - SD of height = 8.5cm - Average weight = 63.5kg - SD of weight = 11.9kg - Correlation coefficient = Difference between Actual and Predict Values

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 5/24 Estimation error actual weight – predicted weight generally called, residual. The overall size of these errors in measured by taking their root mean square. Vertical distance from the line Estimation error 2 predicted error actual weight height 1. Difference between Actual and Predict Values

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 6/24 A typical point on a scatter plot is above or below the regression line by 8.9kg. (vertical distance) meaning The divisor degrees of freedom = = 4512 Computing the errors are based on the regression line. The regression line is defined by slope and intercept (lowering the degree of freedom) Computing the RMSE 1. Difference between Actual and Predict Values

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 7/24 Group average  height of the regression line Distance from the center(RMSE) The Normal curve. Following rule. Regression line & RMSE vs. Average & SD 1. Difference between Actual and Predict Values

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 8/24 Regression and rule of thumb 68% regression 1RMSE 95% regression 2RMSE About 68% of the points on a scatter diagram will be within 1RMSE of the regression line; about 95% of them will be within 2RMSE. 1. Difference between Actual and Predict Values

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 9/24 Elementary method for RMSE actual y residual= (actual y) – (average y) estimate = (average y) x Estimate y ignoring x → a horizontal line for estimates. This elementary RMSE is SDy. 1. Difference between Actual and Predict Values

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 10/24 INDEX 1 Difference between Actual and Predict Values 2 Computing RMSE Using the Correlation 3 The Residual Plot 4 The Vertical Strips 5 Approximating to the Normal Curve Inside a Vertical Strip

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 11/24 2. Computing RMSE Using the Correlation RMSE of the regression line and SDy yy xx RMSE SD y Regression lines Average y RMSE of regression is about RMSE of regression < SDy  because the regression line get closer to the points than the horizontal line. ref: Regression line is for ‘ much closer to the more scatters ’. r = 1 → RMSE = 0 r = -1 → RMSE = 0 r = 0 → RMSE  SD y Degrees of freedom

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 12/24 RMSE and Correlation coefficient Correlation coefficient Measures spread relative to the SD without units. RMSE Measures vertically spread around the regression line in absolute y-terms. We can get the RMSE from SDy using the correlation coefficient.. 2. Computing RMSE Using the Correlation

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 13/24 Regression analysis and correlation coefficient  r describes the clustering of the points around the SD line, relative to the SDs  Associated with each 1SD increase in x there is an increase of only r SDs in y, on the average  r determines the accuracy of the regression predictions, through the formula RMSE =  SD y.  RMSE describes how the regression line summarize data well. 2. Computing RMSE Using the Correlation

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 14/24 INDEX 1 Difference between Actual and Predict Values 2 Computing RMSE Using the Correlation 3 The Residual Plot 4 The Vertical Strips 5 Approximating to the Normal Curve Inside a Vertical Strip

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 15/24 3. The Residual Plot Plotting the Residual Plot  The residuals average out to 0.  The regression line for the residual plot is horizontal x-axis. The reason is that all the trend up or down has been taken out of the residual, and is in the residuals.

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 16/24 A residual with a strong pattern With a mistake to use a regression line, such a pattern appears. The residual plot should not have a strong pattern. 3. The Residual Plot

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 17/24 INDEX 1 Difference between Actual and Predict Values 2 Computing RMSE Using the Correlation 3 The Residual Plot 4 The Vertical Strips 5 Approximating to the Normal Curve Inside a Vertical Strip

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 18/ The Vertical Strips Scatter plot and histogram inside the vertical strips The two histograms have similar shapes, and their SDs are nearly the same. Group with height about 165cm people Group with height about 170 cm people

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 19/24 Homoscedasticity and Heteroscedasticity HomoscedasticityHeteroscedasticity All the vertical strips in a scatter plot show similar amounts of spread and the SDs of weight are not related to x-value. The size of it is about RMSE. The SDs of income in groups vary to the vertical strips. In this case, the RMSE of the regression line only gives a sort of average error across all the different x- values. 4. The Vertical Strips

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 20/24 INDEX 1 Difference between Actual and Predict Values 2 Computing RMSE Using the Correlation 3 The Residual Plot 4 The Vertical Strips 5 Approximating to the Normal Curve Inside a Vertical Strip

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 21/24 5.Approximating to the Normal Curve inside a Vertical Strip Impossible to approximate Estimates are meaningless themselves, The errors does not follow normal curve. The regression method uwing RMSE is off by different amounts in different parts of the scatter plot.

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 22/24 Ex) Midterm and final scores of econometrics in spring semester year 2002 midterm average = 27.9 midterm SD = 8.5 final average = 56.4 final SD = 13.8 r = 0.49 an oval shaped scatter plot. (1) What percentage of students got 66 or over on the final? (2) What percentage of students whose midterm score is 33 got 66 or over on the final? example1 5.Approximating to the Normal Curve inside a Vertical Strip

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 23/24 example 1 (1)Even Midterm related statistics or correlation coefficient are not necessary. z=0.7 By standard normal curve, 24% ☞ ☞ 5.Approximating to the Normal Curve inside a Vertical Strip

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 24/24 example 1 (2) We get new average using the regression analysis, new SD from RMSE of regression line. Regression Analysis Method 1. Midterm score is above the average by 0.6 SDx. 2. r= 0.49; 0.6  0.49 = Final score is above by 0.3 SDy = New average is = z = 0.5 By standard normal curve, 31 % 5.Approximating to the Normal Curve inside a Vertical Strip