How Data Quality Affects our Understanding of the Earnings Distribution
This open access book demonstrates how data quality issues affect all surveys and proposes methods that can be utilised to deal with the observable components of survey error in a statistically sound manner. This book begins by profiling the post-Apartheid period in South Africa's history when the s...
Guardat en:
| Autor principal: | |
|---|---|
| Format: | Online |
| Idioma: | anglès |
| Publicat: |
Springer Nature
2022
|
| Matèries: | |
| Accés en línia: | ONIX_20220713_9789811936395_44 |
| Etiquetes: |
Sense etiquetes, Sigues el primer a etiquetar aquest registre!
|
| _version_ | 1865099975915995136 |
|---|---|
| author | Daniels, Reza Che |
| author_browse | Daniels, Reza Che |
| author_facet | Daniels, Reza Che |
| author_sort | Daniels, Reza Che |
| collection | Directory of Open Access Books |
| description | This open access book demonstrates how data quality issues affect all surveys and proposes methods that can be utilised to deal with the observable components of survey error in a statistically sound manner. This book begins by profiling the post-Apartheid period in South Africa's history when the sampling frame and survey methodology for household surveys was undergoing periodic changes due to the changing geopolitical landscape in the country. This book profiles how different components of error had disproportionate magnitudes in different survey years, including coverage error, sampling error, nonresponse error, measurement error, processing error and adjustment error. The parameters of interest concern the earnings distribution, but despite this outcome of interest, the discussion is generalizable to any question in a random sample survey of households or firms. This book then investigates questionnaire design and item nonresponse by building a response propensity model for the employee income question in two South African labour market surveys: the October Household Survey (OHS, 1997-1999) and the Labour Force Survey (LFS, 2000-2003). This time period isolates a period of changing questionnaire design for the income question. Finally, this book is concerned with how to employee income data with a mixture of continuous data, bounded response data and nonresponse. A variable with this mixture of data types is called coarse data. Because the income question consists of two parts -- an initial, exact income question and a bounded income follow-up question -- the resulting statistical distribution of employee income is both continuous and discrete. The book shows researchers how to appropriately deal with coarse income data using multiple imputation. The take-home message from this book is that researchers have a responsibility to treat data quality concerns in a statistically sound manner, rather than making adjustments to public-use data in arbitrary ways, often underpinned by undefensible assumptions about an implicit unobservable loss function in the data. The demonstration of how this can be done provides a replicable concept map with applicable methods that can be utilised in any sample survey. |
| format | Online |
| id | doab-20.500.12854ir-87693 |
| institution | Directory of Open Access Books |
| language | eng |
| publishDate | 2022 |
| publishDateRange | 2022 |
| publishDateSort | 2022 |
| publisher | Springer Nature |
| publisherStr | Springer Nature |
| record_format | ojs |
| spelling | doab-20.500.12854ir-876932025-03-15T07:55:03Z How Data Quality Affects our Understanding of the Earnings Distribution Daniels, Reza Che Methodology for Collecting Estimating and Organizing Microeconomic Data Survey Methods Total Survey Error Response Propensity Models Multiple Imputation Income Distribution This open access book demonstrates how data quality issues affect all surveys and proposes methods that can be utilised to deal with the observable components of survey error in a statistically sound manner. This book begins by profiling the post-Apartheid period in South Africa's history when the sampling frame and survey methodology for household surveys was undergoing periodic changes due to the changing geopolitical landscape in the country. This book profiles how different components of error had disproportionate magnitudes in different survey years, including coverage error, sampling error, nonresponse error, measurement error, processing error and adjustment error. The parameters of interest concern the earnings distribution, but despite this outcome of interest, the discussion is generalizable to any question in a random sample survey of households or firms. This book then investigates questionnaire design and item nonresponse by building a response propensity model for the employee income question in two South African labour market surveys: the October Household Survey (OHS, 1997-1999) and the Labour Force Survey (LFS, 2000-2003). This time period isolates a period of changing questionnaire design for the income question. Finally, this book is concerned with how to employee income data with a mixture of continuous data, bounded response data and nonresponse. A variable with this mixture of data types is called coarse data. Because the income question consists of two parts -- an initial, exact income question and a bounded income follow-up question -- the resulting statistical distribution of employee income is both continuous and discrete. The book shows researchers how to appropriately deal with coarse income data using multiple imputation. The take-home message from this book is that researchers have a responsibility to treat data quality concerns in a statistically sound manner, rather than making adjustments to public-use data in arbitrary ways, often underpinned by undefensible assumptions about an implicit unobservable loss function in the data. The demonstration of how this can be done provides a replicable concept map with applicable methods that can be utilised in any sample survey. 2022-07-14T04:01:21Z 2022-07-14T04:01:21Z 2022-07-13T12:27:59Z 2022 book ONIX_20220713_9789811936395_44 OCN: 1334995976 https://library.oapen.org/handle/20.500.12657/57371 9789811936395 https://directory.doabooks.org/handle/20.500.12854/87693 eng open access image/jpeg image/jpeg image/jpeg n/a n/a n/a https://library.oapen.org/bitstream/20.500.12657/57371/1/978-981-19-3639-5.pdf https://library.oapen.org/bitstream/20.500.12657/57371/1/978-981-19-3639-5.pdf https://library.oapen.org/bitstream/20.500.12657/57371/1/978-981-19-3639-5.pdf Springer Nature Springer Nature Singapore 10.1007/978-981-19-3639-5 10.1007/978-981-19-3639-5 9fa3421d-f917-4153-b9ab-fc337c396b5a Universityof CapeTown 94ca1040-a907-4251-98bd-2d9a1734557f 9789811936395 Springer Nature Singapore 114 Singapore [...] open access |
| spellingShingle | Methodology for Collecting Estimating and Organizing Microeconomic Data Survey Methods Total Survey Error Response Propensity Models Multiple Imputation Income Distribution Daniels, Reza Che How Data Quality Affects our Understanding of the Earnings Distribution |
| title | How Data Quality Affects our Understanding of the Earnings Distribution |
| title_full | How Data Quality Affects our Understanding of the Earnings Distribution |
| title_fullStr | How Data Quality Affects our Understanding of the Earnings Distribution |
| title_full_unstemmed | How Data Quality Affects our Understanding of the Earnings Distribution |
| title_short | How Data Quality Affects our Understanding of the Earnings Distribution |
| title_sort | how data quality affects our understanding of the earnings distribution |
| topic | Methodology for Collecting Estimating and Organizing Microeconomic Data Survey Methods Total Survey Error Response Propensity Models Multiple Imputation Income Distribution |
| topic_facet | Methodology for Collecting Estimating and Organizing Microeconomic Data Survey Methods Total Survey Error Response Propensity Models Multiple Imputation Income Distribution |
| url | ONIX_20220713_9789811936395_44 |
| work_keys_str_mv | AT danielsrezache howdataqualityaffectsourunderstandingoftheearningsdistribution |