regressÃo logÍstica e indicadores de … · o objetivo deste trabalho é pesquisar a regressão...

17
PONTIFÍCIA UNIVERSIDADE CATÓLICA DE SÃO PAULO FEA Faculdade de Economia e Administração Programa de Estudos PósGraduados em Administração REGRESSÃO LOGÍSTICA E INDICADORES DE GOVERNANÇA GLOBAL Disciplina: Métodos Quantitativos Prof. Dr. Arnoldo Jose de Hoyos Carlos Barbosa Correa Junior

Upload: ngodan

Post on 18-Nov-2018

216 views

Category:

Documents


0 download

TRANSCRIPT

PONTIFÍCIA UNIVERSIDADE CATÓLICA DE SÃO PAULO FEA ‐ Faculdade de Economia e Administração 

Programa de Estudos Pós‐Graduados em Administração  

 

 

 

 

REGRESSÃO LOGÍSTICA  

E  INDICADORES DE GOVERNANÇA GLOBAL  

 

 

Disciplina: Métodos Quantitativos 

Prof. Dr. Arnoldo Jose de Hoyos 

Carlos Barbosa Correa Junior 

 

1. INTRODUÇÃO 

O objetivo deste trabalho é pesquisar A Regressão Logistica.. Serão utilizados dimensionadores 

do WGI  (Worldwide Governance  Indicators)  que  envolve  207  países.  Será  considerada  uma 

amostra de 30 países e analisados os dois primeiros componentes principais, para cada uma 

das  variáveis  apresentadas  pelo  índice  e  a  seguir  a  análise  dos  conglomerados. O  software 

estatístico utilizado é o MINITAB. 

  Os dimensionadores são: 

– Transparência e representatividade 

 – Estabilidade política e ausência de violência 

 – Efetividade governamental 

 – Qualidade regulatória 

 – Regras da Lei 

 – Controle da corrupção 

 

Os dados iniciais do universo são os listados a seguir: 

 

 

 

2‐ Amostras 30 e 60 

paises va60 ps60 ge60 rq60 rl60 cc60TAIWAN 0,7024 0,7244 0,8798 1,0720 0,7657 0,5504SENEGAL ‐0,1620 ‐0,1560 ‐0,1234 ‐0,2924 ‐0,3124 ‐0,4535BENIN 0,3403 0,3452 ‐0,5181 ‐0,4600 ‐0,5450 ‐0,4153NORWAY 1,5306 1,3288 1,9518 1,3405 1,9649 1,8767THAILAND ‐0,5575 ‐1,1877 0,1098 0,2607 ‐0,0305 ‐0,3844MACAO 0,1376 0,4576 1,0258 1,0038 0,4490 ‐0,0333Congo, Dem. Rep. ‐1,4753 ‐2,3392 ‐1,8931 ‐1,4268 ‐1,6831 ‐1,3070BOTSWANA 0,5450 0,9559 0,6682 0,5203 0,6433 0,9979ZAMBIA ‐0,0867 0,2880 ‐0,6628 ‐0,3343 ‐0,4985 ‐0,4784PAKISTAN ‐1,0106 ‐2,6138 ‐0,7278 ‐0,4713 ‐0,9189 ‐0,7734LIBERIA ‐0,2904 ‐0,9878 ‐1,3613 ‐1,3169 ‐1,2279 ‐0,5978MALTA 1,2125 1,2963 1,2570 1,1686 1,5897 1,0115MAURITIUS 0,8832 0,8373 0,6038 0,9536 0,8850 0,5318AUSTRIA 1,3576 1,2997 1,7064 1,6420 1,9182 1,8216SURINAME 0,5657 0,1522 0,0034 ‐0,6677 ‐0,3335 ‐0,0866NIGER ‐0,4134 ‐0,7470 ‐0,7927 ‐0,5178 ‐0,7988 ‐0,8243ARGENTINA 0,3211 ‐0,0385 ‐0,1805 ‐0,6454 ‐0,6116 ‐0,4425BRAZIL 0,5070 ‐0,1194 ‐0,0098 0,1918 ‐0,3027 ‐0,0326KOREA, SOUTH 0,5858 0,4077 1,2588 0,7297 0,7908 0,4477ST. KITTS AND NEVIS 1,1211 0,8535 0,6580 0,5007 0,7510 0,9996HONDURAS ‐0,2904 ‐0,3587 ‐0,5740 ‐0,2717 ‐0,8937 ‐0,8194INDONESIA ‐0,1445 ‐1,0021 ‐0,2909 ‐0,2699 ‐0,6559 ‐0,6421NETHERLANDS 1,5291 0,9547 1,8557 1,7506 1,7602 2,1929CUBA ‐1,8520 0,0385 ‐0,5071 ‐1,5875 ‐0,8514 ‐0,0644ISRAEL 0,6945 ‐1,3932 1,2971 1,1969 0,8823 0,8741SWITZERLAND 1,4532 1,2282 2,0631 1,6581 1,8648 2,1546VENEZUELA ‐0,6247 ‐1,2336 ‐0,8503 ‐1,4412 ‐1,5876 ‐1,1347PERU 0,0181 ‐0,8390 ‐0,3037 0,3347 ‐0,7380 ‐0,2619PUERTO RICO 1,3055 0,4877 0,6569 1,0186 0,6600 0,4989ANGUILLA 1,0013 1,1925 1,5621 1,3467 1,7165 1,3209ZIMBABWE ‐1,5226 ‐1,5633 ‐1,5556 ‐2,1764 ‐1,8106 ‐1,3734TIMOR‐LESTE 0,1533 ‐1,1305 ‐0,9985 ‐1,3986 ‐1,1507 ‐0,8936NEPAL ‐0,7884 ‐1,6894 ‐0,7532 ‐0,6555 ‐0,7632 ‐0,6766TRINIDAD AND TOBAGO 0,5339 0,0803 0,2984 0,6211 ‐0,2516 ‐0,1703LIECHTENSTEIN 1,3235 1,3978 1,8615 1,3467 1,4669 1,2557MEXICO 0,0752 ‐0,6244 0,1760 0,4487 ‐0,6415 ‐0,2587KUWAIT ‐0,5291 0,4520 0,1141 0,0356 0,7020 0,4975CONGO ‐1,1637 ‐0,6092 ‐1,3378 ‐1,1924 ‐1,1552 ‐1,1613UNITED KINGDOM 1,3288 0,5633 1,7361 1,7901 1,6834 1,7657CAMEROON ‐1,0192 ‐0,5337 ‐0,7992 ‐0,6621 ‐0,9932 ‐0,9001IRELAND 1,4044 1,1583 1,6122 1,9150 1,7415 1,7551GAMBIA ‐0,9699 0,1444 ‐0,7670 ‐0,4358 ‐0,2518 ‐0,7778LESOTHO 0,0383 ‐0,0253 ‐0,3114 ‐0,6304 ‐0,2954 0,0407UZBEKISTAN ‐1,8983 ‐0,9114 ‐0,6828 ‐1,4079 ‐1,1822 ‐1,0762PANAMA 0,5851 0,1077 0,1645 0,6290 ‐0,2043 ‐0,1515REUNION 1,2172 0,4694 1,0258 1,0999 0,8930 0,8400NAURU 1,0353 1,1829 ‐0,4158 ‐0,9070 0,1409 ‐0,2976CAYMAN ISLANDS 0,7854 1,1925 1,2940 1,0999 1,1675 0,8400PHILIPPINES ‐0,2005 ‐1,4134 0,0006 ‐0,0535 ‐0,4891 ‐0,7461HUNGARY 1,0009 0,5868 0,6600 1,2571 0,8245 0,5492CHAD ‐1,4518 ‐1,9248 ‐1,4777 ‐1,2632 ‐1,5738 ‐1,4536PALAU 1,2408 1,1829 ‐0,5606 ‐0,7301 0,6789 ‐0,2976MOZAMBIQUE ‐0,0167 0,2923 ‐0,3841 ‐0,4705 ‐0,6619 ‐0,5457KYRGYZSTAN ‐0,7201 ‐0,6833 ‐0,7010 ‐0,3228 ‐1,2635 ‐1,0601DOMINICAN REPUBLIC 0,1404 0,0990 ‐0,3953 ‐0,2391 ‐0,6006 ‐0,6181LIBYA ‐1,8962 0,4837 ‐0,8400 ‐0,9262 ‐0,6468 ‐0,8058CZECH REPUBLIC 1,0245 0,9341 1,0681 1,0890 0,8451 0,3650COLOMBIA ‐0,2571 ‐1,6629 0,1297 0,2431 ‐0,5015 ‐0,2491DJIBOUTI ‐1,1183 ‐0,1259 ‐0,9816 ‐0,7476 ‐0,5360 ‐0,3274GUYANA 0,1682 ‐0,5640 ‐0,1726 ‐0,5458 ‐0,6958 ‐0,4735  

3‐ análise de 30 

Variables

Sim

ilari

ty

ps30rq30cc30rl30ge30va30

89,92

93,28

96,64

100,00

Dendrogram with Single Linkage and Correlation Coefficient Distance

Cluster Analysis of Variables: va30; ps30; ge30; rq30; rl30; cc30 Correlation Coefficient Distance, Single Linkage Amalgamation Steps Number Number of obs. of Similarity Distance Clusters New in new Step clusters level level joined cluster cluster 1 5 98,6210 0,027581 3 5 3 2 2 4 97,8582 0,042836 3 6 3 3 3 3 96,9453 0,061093 3 4 3 4 4 2 93,7445 0,125109 1 3 1 5 5 1 89,9162 0,201677 1 2 1 6

Principal Component Analysis: va30; ps30; ge30; rq30; rl30; cc30 Eigenanalysis of the Correlation Matrix Eigenvalue 5,2983 0,3605 0,1875 0,0965 0,0393 0,0179 Proportion 0,883 0,060 0,031 0,016 0,007 0,003 Cumulative 0,883 0,943 0,974 0,990 0,997 1,000 Variable PC1 PC2 PC3 PC4 PC5 PC6 va30 -0,403 0,082 -0,803 0,391 -0,172 -0,057 ps30 -0,369 0,863 0,099 -0,305 0,086 0,098 ge30 -0,422 -0,221 0,256 -0,168 -0,783 0,259 rq30 -0,409 -0,422 -0,225 -0,545 0,476 0,282 rl30 -0,428 -0,136 0,222 -0,052 0,068 -0,861 cc30 -0,416 -0,066 0,424 0,652 0,345 0,314

 

Component Number

Eige

nval

ue

654321

6

5

4

3

2

1

0

Scree Plot of va30; ...; cc30

  First Component

Seco

nd C

ompo

nent

543210-1-2-3-4

1,0

0,5

0,0

-0,5

-1,0

-1,5

-2,0

Score Plot of va30; ...; cc30

 

 

First Component

Seco

nd C

ompo

nent

0,0-0,1-0,2-0,3-0,4

1,00

0,75

0,50

0,25

0,00

-0,25

-0,50

cc30rl30

rq30

ge30

ps30

va30

Loading Plot of va30; ...; cc30

 

– dendograma inicial dos 30 paises

Observations

Sim

ilari

ty

24251031727115182822211615179323012262314462081929131

74,85

83,23

91,62

100,00

Dendrogram with Single Linkage and Euclidean Distance

4 – construindo um cluster de 6 grupamentos com dois componentes principais

pc1  pc2  cluster 

‐1,48078  0,06081  1 

0,92532  0,22236  2 

0,82634  0,83553  2 

‐3,56491  0,05861  1 

1,01073  ‐0,9325  2 

‐0,81594  ‐0,12789  1 

4,22847  ‐0,59547  3 

‐1,33557  0,47558  1 

1,04937  0,73118  2 

2,79164  ‐1,51137  4 

2,54092  0,32697  2 

‐2,58044  0,31547  1 

‐1,48178  0,25626  1 

‐3,45821  ‐0,03883  1 

0,45355  0,63106  2 

1,91816  0,03984  2 

0,93928  0,54359  2 

0,21476  0,06862  2 

‐1,30763  ‐0,13967  1 

‐1,58204  0,44255  1 

1,58722  0,22763  2 

1,46752  ‐0,36721  2 

‐3,60982  ‐0,37534  1 

2,28333  0,85271  5 

‐1,14874  ‐1,78569  6 

‐3,73869  ‐0,18165  1 

2,97338  0,12471  2 

0,99006  ‐0,47363  2 

‐1,49548  0,00945  1 

‐2,81541  0,04687  1 

4,2154  0,25945  3 

Observations

Sim

ilari

ty

24251031727115182822211615179323012262314462081929131

74,85

83,23

91,62

100,00

Dendrogram with Single Linkage and Euclidean Distance

 

cluster

pc1

654321

5

4

3

2

1

0

-1

-2

-3

-4

Boxplot of pc1 by cluster

 

 

cluster

pc1

654321

5

4

3

2

1

0

-1

-2

-3

-4

Individual Value Plot of pc1 vs cluster

 

Cluster Analysis of Observations: va30; ps30; ge30; rq30; rl30; cc30 Euclidean Distance, Single Linkage Amalgamation Steps Number Number of obs. of Similarity Distance Clusters New in new Step clusters level level joined cluster cluster 1 30 95,4895 0,38017 23 26 23 2 2 29 95,4024 0,38751 1 13 1 2 3 28 94,8688 0,43249 4 14 4 2 4 27 94,4780 0,46543 4 23 4 4 5 26 94,3286 0,47802 3 9 3 2 6 25 93,6851 0,53226 16 21 16 2 7 24 93,5886 0,54040 12 30 12 2 8 23 93,4899 0,54871 3 17 3 3 9 22 93,1373 0,57843 3 15 3 4 10 21 92,9370 0,59532 8 20 8 2 11 20 92,8887 0,59939 1 29 1 3 12 19 92,6293 0,62125 1 19 1 4 13 18 91,9121 0,68170 2 3 2 5 14 17 91,5964 0,70831 1 8 1 6 15 16 91,5754 0,71008 16 22 16 3 16 15 91,2988 0,73339 4 12 4 6 17 14 91,0421 0,75503 16 28 16 4 18 13 90,2099 0,82517 1 6 1 7 19 12 89,8432 0,85608 2 16 2 9 20 11 89,2561 0,90556 2 18 2 10 21 10 88,9567 0,93080 11 27 11 2 22 9 87,5302 1,05104 2 5 2 11 23 8 86,6271 1,12716 2 11 2 13 24 7 86,4691 1,14047 7 31 7 2 25 6 85,3790 1,23235 1 4 1 13 26 5 82,9727 1,43517 2 7 2 15 27 4 80,2670 1,66322 1 2 1 28 28 3 78,0239 1,85228 1 10 1 29 29 2 77,2872 1,91438 1 25 1 30 30 1 74,8522 2,11962 1 24 1 31

Final Partition Number of clusters: 6 Within Average Maximum cluster distance distance Number of sum of from from observations squares centroid centroid Cluster1 13 19,3793 1,17146 1,71859 Cluster2 13 14,4639 0,97740 1,84780 Cluster3 2 0,6503 0,57023 0,57023 Cluster4 1 0,0000 0,00000 0,00000 Cluster5 1 0,0000 0,00000 0,00000 Cluster6 1 0,0000 0,00000 0,00000 Cluster Centroids Variable Cluster1 Cluster2 Cluster3 Cluster4 Cluster5 Cluster6 va30 1,02807 -0,062887 -1,49895 -1,01064 -1,85204 0,69447 ps30 0,92496 -0,452650 -1,95124 -2,61375 0,03846 -1,39319 ge30 1,24210 -0,427254 -1,72437 -0,72784 -0,50713 1,29713 rq30 1,13116 -0,417688 -1,80164 -0,47127 -1,58751 1,19694 rl30 1,21225 -0,656622 -1,74684 -0,91891 -0,85143 0,88234 cc30 1,10547 -0,505642 -1,34019 -0,77343 -0,06435 0,87410 Grand Variable centroid va30 0,238105 ps30 -0,055835 ge30 0,232467 rq30 0,155162 rl30 0,091660 cc30 0,166249 Distances Between Cluster Centroids Cluster1 Cluster2 Cluster3 Cluster4 Cluster5 Cluster6 Cluster1 0,00000 3,78973 6,84083 5,58564 5,01589 2,37799 Cluster2 3,78973 0,00000 3,12904 2,40880 2,24718 3,36316 Cluster3 6,84083 3,12904 0,00000 2,10879 2,83559 5,92106 Cluster4 5,58564 2,40880 2,10879 0,00000 3,08937 4,15204 Cluster5 5,01589 2,24718 2,83559 3,08937 0,00000 4,84040 Cluster6 2,37799 3,36316 5,92106 4,15204 4,84040 0,00000

Dendrogram One-way ANOVA: pc1 versus cluster Source DF SS MS F P cluster 5 137,825 27,565 32,62 0,000 Error 25 21,126 0,845 Total 30 158,950 S = 0,9193 R-Sq = 86,71% R-Sq(adj) = 84,05% Individual 95% CIs For Mean Based on Pooled StDev Level N Mean StDev --+---------+---------+---------+------- 1 13 -2,2513 1,0670 (-*-)

2 13 1,2997 0,7887 (-*-) 3 2 4,2219 0,0092 (----*----) 4 1 2,7916 * (------*-------) 5 1 2,2833 * (------*-------) 6 1 -1,1487 * (------*-------) --+---------+---------+---------+------- -2,5 0,0 2,5 5,0 Pooled StDev = 0,9193

One-way ANOVA: pc1 versus cluster Source DF SS MS F P cluster 5 137,825 27,565 32,62 0,000 Error 25 21,126 0,845 Total 30 158,950 S = 0,9193 R-Sq = 86,71% R-Sq(adj) = 84,05% Individual 95% CIs For Mean Based on Pooled StDev Level N Mean StDev --+---------+---------+---------+------- 1 13 -2,2513 1,0670 (-*-) 2 13 1,2997 0,7887 (-*-) 3 2 4,2219 0,0092 (----*----) 4 1 2,7916 * (------*-------) 5 1 2,2833 * (------*-------) 6 1 -1,1487 * (------*-------) --+---------+---------+---------+------- -2,5 0,0 2,5 5,0 Pooled StDev = 0,9193

Boxplot of pc1 by cluster

One-way ANOVA: pc1 versus cluster Source DF SS MS F P cluster 5 137,825 27,565 32,62 0,000 Error 25 21,126 0,845 Total 30 158,950 S = 0,9193 R-Sq = 86,71% R-Sq(adj) = 84,05% Individual 95% CIs For Mean Based on Pooled StDev Level N Mean StDev --+---------+---------+---------+------- 1 13 -2,2513 1,0670 (-*-) 2 13 1,2997 0,7887 (-*-) 3 2 4,2219 0,0092 (----*----) 4 1 2,7916 * (------*-------) 5 1 2,2833 * (------*-------) 6 1 -1,1487 * (------*-------) --+---------+---------+---------+------- -2,5 0,0 2,5 5,0 Pooled StDev = 0,9193

Individual Value Plot of pc1 vs cluster

Boxplot of pc1 by cluster

One-way ANOVA: pc1 versus cluster Source DF SS MS F P cluster 5 137,825 27,565 32,62 0,000 Error 25 21,126 0,845 Total 30 158,950 S = 0,9193 R-Sq = 86,71% R-Sq(adj) = 84,05% Individual 95% CIs For Mean Based on Pooled StDev Level N Mean StDev --+---------+---------+---------+------- 1 13 -2,2513 1,0670 (-*-) 2 13 1,2997 0,7887 (-*-) 3 2 4,2219 0,0092 (----*----) 4 1 2,7916 * (------*-------) 5 1 2,2833 * (------*-------) 6 1 -1,1487 * (------*-------) --+---------+---------+---------+------- -2,5 0,0 2,5 5,0

 

 

5 – Dendograma com 3 

cl mudado

Freq

uenc

y

321

14

12

10

8

6

4

2

0

Histogram of cl mudado

 

 

cl mudado

pc1

321

3

2

1

0

-1

-2

-3

-4

Individual Value Plot of pc1 vs cl mudado

 

cl mudado

pc1

321

3

2

1

0

-1

-2

-3

-4

Boxplot of pc1

 

 

Results for: COMPO PRINCIPAIS.MTW

Discriminant Analysis: cl mudado versus va30; ps30; ... Linear Method for Response: cl mudado

Predictors: va30; ps30; ge30; rq30; rl30; cc30 Group 1 2 3 Count 7 13 6 Summary of classification True Group Put into Group 1 2 3 1 7 0 0 2 0 13 0 3 0 0 6 Total N 7 13 6 N correct 7 13 6 Proportion 1,000 1,000 1,000 N = 26 N Correct = 26 Proportion Correct = 1,000 Squared Distance Between Groups 1 2 3 1 0,0000 20,5030 16,3269 2 20,5030 0,0000 67,6176 3 16,3269 67,6176 0,0000 Linear Discriminant Function for Groups 1 2 3 Constant -4,320 -3,295 -18,997 va30 4,562 5,700 2,267 ps30 1,417 -1,726 -0,113 ge30 4,520 1,242 5,170 rq30 2,163 1,364 -2,000 rl30 -0,213 -8,091 11,929 cc30 -1,952 -3,867 4,388

 

 

6 – Regressão Logística 

Ordinal Logistic Regression: cl mudado versus va30; ps30; ... Link Function: Logit Response Information Variable Value Count cl mudado 1 7 2 13 3 6 Total 26 Logistic Regression Table Odds 95% CI Predictor Coef SE Coef Z P Ratio Lower Upper Const(1) -1,89793 0,848183 -2,24 0,025 Const(2) 1,07268 0,756560 1,42 0,156 va30 0,921029 1,76307 0,52 0,601 2,51 0,08 79,57 ps30 1,46828 1,30723 1,12 0,261 4,34 0,33 56,29 ge30 0,846606 1,79358 0,47 0,637 2,33 0,07 78,42 rq30 2,11946 1,46754 1,44 0,149 8,33 0,47 147,80 rl30 -2,31006 2,43212 -0,95 0,342 0,10 0,00 11,67 cc30 -2,75825 1,74333 -1,58 0,114 0,06 0,00 1,93 Log-Likelihood = -22,131 Test that all slopes are zero: G = 9,726, DF = 6, P-Value = 0,137 Goodness-of-Fit Tests Method Chi-Square DF P Pearson 52,3938 44 0,180 Deviance 44,2627 44 0,461 Measures of Association: (Between the Response Variable and Predicted Probabilities) Pairs Number Percent Summary Measures Concordant 160 75,8 Somers' D 0,52 Discordant 51 24,2 Goodman-Kruskal Gamma 0,52 Ties 0 0,0 Kendall's Tau-a 0,34 Total 211 100,0

   

7 – Conclusões 

A análise discriminante oferece maior qualidade de resposta em comparação com a regressão 

logística devido à pouca quantidade de dados.