{"id":111680,"date":"2018-08-22T16:00:00","date_gmt":"2018-08-22T21:00:00","guid":{"rendered":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/decoded\/\/\/validating-2016-voters-in-pew-research-centers-survey-data\/"},"modified":"2024-04-14T04:10:46","modified_gmt":"2024-04-14T09:10:46","slug":"validating-2016-voters-in-pew-research-centers-survey-data","status":"publish","type":"decoded","link":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/decoded\/2018\/08\/22\/validating-2016-voters-in-pew-research-centers-survey-data\/","title":{"rendered":"Validating 2016 voters in Pew Research Center\u2019s survey data"},"content":{"rendered":"\n<figure class=\"wp-block-image size-640-wide\"><a rel=\"attachment wp-att-126060\" href=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/decoded\/\/\/validating-2016-voters-in-pew-research-centers-survey-data\/08-22-2018_featured-png\/\"><img data-dominant-color=\"ecdbd3\" data-has-transparency=\"false\" style=\"--dominant-color: #ecdbd3;\" loading=\"lazy\" decoding=\"async\"  srcset=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png?resize=480,271 480w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png?resize=782,441 782w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png?resize=960,541 960w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png?resize=1200,676 1200w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png?resize=1400,789 1400w\" sizes=\"(max-width: 480px) 480px, (max-width: 782px) 782px, 640px\" height=\"361\" width=\"640\" src=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png?w=640\" alt=\"\" class=\"wp-image-126060 
not-transparent\" \/><\/a><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\"><em>(Related post:&nbsp;<\/em><a href=\"https:\/\/medium.com\/pew-research-center-decoded\/validating-2020-voters-in-pew-research-centers-survey-data-ddb2e2a3c50\"><em>Validating 2020 voters in Pew Research Center\u2019s survey data<\/em><\/a><em>)<\/em><\/p>\n\n\n\n<p class=\"wp-block-paragraph\" id=\"cd1b\">One of the most common challenges facing election surveys is the tendency for some respondents to say they voted when they did not. This so-called overreporting can cause surveys to overstate voter turnout and create biases in the apparent composition of the electorate (this happens because&nbsp;<a href=\"https:\/\/academic.oup.com\/poq\/article-abstract\/65\/1\/22\/1888929?\" rel=\"noreferrer noopener\" target=\"_blank\">some kinds of people are more likely than others<\/a>&nbsp;to overreport voting).<\/p>\n\n\n\n<p class=\"wp-block-paragraph\" id=\"1f55\">Today, Pew Research Center is releasing an&nbsp;<a href=\"http:\/\/www.people-press.org\/dataset\/american-trends-panel-wave-23\/\" rel=\"noreferrer noopener\" target=\"_blank\">updated dataset<\/a>&nbsp;that helps address this issue by matching the people who took our 2016 post-election survey with the turnout records contained in five commercial voter files. This allows researchers to verify which respondents actually voted.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\" id=\"c69d\">This dataset is the basis for a&nbsp;<a href=\"http:\/\/www.people-press.org\/2018\/08\/09\/for-most-trump-voters-very-warm-feelings-for-him-endured\/\" target=\"_blank\" rel=\"noreferrer noopener\">report we issued on Aug. 9<\/a>&nbsp;about the trend over time in opinions about President Donald Trump among 2016 voters and the characteristics of the 2016 electorate. The dataset is available as an SPSS statistics file (.sav) and is accompanied by a ReadMe.txt file with information about the computation of the turnout variable. 
In this post, we\u2019ll discuss the turnout measure in more detail.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\" id=\"3a97\">As a reminder, Pew Research Center releases&nbsp;<a href=\"http:\/\/alpha.pewresearch.org\/pewresearch-org\/short-reads\/2018\/03\/09\/how-to-access-pew-research-center-survey-data\/\" rel=\"noreferrer noopener\" target=\"_blank\">nearly all of its raw survey datasets<\/a>&nbsp;to the public. The release is typically delayed for a period that ranges from a few months to more than a year after collection to allow the Center\u2019s staff to fully analyze and report on the data, as well as to clean and anonymize the files to protect respondents from the risk of being personally identified. All data for release can be found on our website, and a recent improvement in our process allows users to register for an account, after which they can download and manage datasets as often as desired.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"84da\">How we created the turnout variable<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\" id=\"d380\">To validate turnout among members of the&nbsp;<a href=\"http:\/\/alpha.pewresearch.org\/pewresearch-org\/methodology\/u-s-survey-research\/american-trends-panel\/\" target=\"_blank\" rel=\"noreferrer noopener\">American Trends Panel<\/a>&nbsp;(ATP) \u2014 our nationally representative survey panel of U.S. adults \u2014 we attempted to link members to five commercial voter files. Two of the files are from nonpartisan vendors; two are from vendors that work primarily with Democratic and politically progressive clients; and one is from a vendor that works primarily with Republican and politically conservative clients.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\" id=\"7d9c\">Overall, 91% of the 3,985 active members of the ATP who took part in the post-election survey (conducted Nov. 29 to Dec. 12, 2016) and who provided a name yielded a match by at least one of the five vendors. 
We\u2019ll call these individuals \u201cmatched respondents.\u201d To estimate turnout, we used a composite estimate based on records in all five commercial voter files. Voters were defined as matched respondents who were recorded as having voted in&nbsp;<em>at least<\/em>&nbsp;one of the five commercial voter files. Nonvoters were defined as matched respondents who were listed in at least one file but had no record of voting in any files they matched, or respondents who were&nbsp;<em>not&nbsp;<\/em>matched in any of the five files. We assumed this last group were not registered voters and therefore had not voted.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\" id=\"83cf\">Using this approach, the voter file-verified turnout rate among the panelists was 65%, or about 5 percentage points higher than the&nbsp;<a href=\"http:\/\/www.electproject.org\/2016g\" rel=\"noreferrer noopener\" target=\"_blank\">best estimate<\/a>&nbsp;of national turnout among eligible adults. This difference is likely the result of the fact that surveys like this one tend to overrepresent politically engaged individuals.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\" id=\"4be8\">For additional details about the voter file matching and voter verification process, see Pew Research Center\u2019s&nbsp;<a href=\"http:\/\/alpha.pewresearch.org\/pewresearch-org\/2018\/02\/15\/commercial-voter-files-and-the-study-of-u-s-politics\/\" target=\"_blank\" rel=\"noreferrer noopener\">March 2018 report on commercial voter files<\/a>.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"4aae\">The variables of interest<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\" id=\"dbaf\">The new dataset includes a new variable,&nbsp;<code><mark style=\"background-color:#ecece3\" class=\"has-inline-color\">VALIDATED_VOTER_2016_W23<\/mark><\/code>. This variable is coded 0 for nonvoters and 1 for validated voters. 
In computing the variable, noncitizens were excluded (coded as system-missing, or sysmis, in SPSS) since they are not eligible to vote in federal elections. The dataset also includes a variable named&nbsp;<code><mark style=\"background-color:#f0f0e6\" class=\"has-inline-color\">COMPORT_W23<\/mark><\/code>. This variable has four categories, each corresponding to a combination of&nbsp;<code><mark style=\"background-color:#f0f0e6\" class=\"has-inline-color\">VALIDATED_VOTER_2016_W23<\/mark><\/code>&nbsp;and the self-reported voter turnout question,&nbsp;<code><mark style=\"background-color:#ecece3\" class=\"has-inline-color\">VOTED_W23<\/mark><\/code>. The four categories of&nbsp;<code><mark style=\"background-color:#ecece3\" class=\"has-inline-color\">COMPORT_W23<\/mark><\/code>&nbsp;are:<\/p>\n\n\n\n<p class=\"wp-block-paragraph\" id=\"0821\">1=Validated voters who said they voted<br>2=Nonvoters who said they did not vote or were not sure<br>3=Overreporters (nonvoters who said they voted)<br>4=Underreporters (validated voters who said they did not vote or were not sure)<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">To replicate the analyses in the report, it is necessary to code the candidate preference variable for voters exactly as we did. Syntax for doing so is provided in the ReadMe.txt file. Candidate preferences for voters are based on respondents who said they voted for Donald Trump, Hillary Clinton, Gary Johnson or Jill Stein. Those who said they voted for another candidate, who could not recall who they voted for or who refused to say who they voted for are excluded from the tabulation. 
The SPSS syntax is as follows:<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>*FOR VALIDATED VOTERS.\ndo if COMPORT_W23=1.\ncompute candprefvoter = votegenpost_w23.\nmissing values candprefvoter (5,99).\nend if.\nvalue labels candprefvoter 1 'Trump' 2 'Clinton' 3 'Johnson' 4 'Stein' 5 'Other' 99 'DK, Refused'.\nvar labels candprefvoter '2016 vote among validated voters'.<\/code><\/pre>\n\n\n\n<p class=\"wp-block-paragraph\">To replicate the profile of nonvoters, simply restrict the analysis to respondents who are listed as&nbsp;<code><mark style=\"background-color:#ecece3\" class=\"has-inline-color\">COMPORT_W23 = 2,3,4<\/mark><\/code>.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"6d56\">What you can do with the data<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\" id=\"ffb4\">The availability of the validated turnout variable opens the door to many further analyses. One of the most obvious is the ability to compare overreporters with people who accurately reported they did not vote. It also makes comparisons of voters and nonvoters much more accurate, since overreporters change the profile of nonvoters by their absence.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\" id=\"d005\">While our published report this month includes a large number of tabulations among validated voters, many more demographic, political and lifestyle variables are available in this panel wave and in other waves. Among many other topics, the waves conducted near the election included questions about social media, guns, the police, online harassment and feelings about religious groups.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\" id=\"e5b0\">One request: If you happen to use this data, please consider sharing your findings with us. 
We are eager to see what further knowledge arises from this effort, and we may add updates to this piece to share what others have done.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>(Related post: Validating 2020 voters in Pew Research Center\u2019s survey data)<\/p>\n","protected":false},"author":655,"featured_media":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"sub_headline":"","sub_title":"","_prc_public_revisions":[],"_ppp_expiration_hours":0,"_ppp_enabled":false,"ai_generated_summary":"","relatedPosts":[],"_prc_fork_parent":0,"_prc_fork_status":"","_prc_active_fork":0,"datacite_doi":"","datacite_doi_citation":"","_prc_seo_qr_attachment_id":0,"spoken_article_player_enabled":true,"displayBylines":true,"footnotes":"","prc_watchers":[]},"categories":[357,360],"bylines":[958,967],"collection":[],"_post_visibility":[],"decoded-category":[532,533],"formats":[],"_fund_pool":[],"languages":[],"regions-countries":[],"research-teams":[524],"workflow-status":[],"class_list":["post-111680","decoded","type-decoded","status-publish","hentry","category-survey-methods","category-voter-files","bylines-ruth-igielnik","bylines-scott-keeter","decoded-category-survey-methods","decoded-category-voter-files","research-teams-decoded"],"label":"Decoded","post_parent":0,"word_count":1014,"canonical_url":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/decoded\/2018\/08\/22\/validating-2016-voters-in-pew-research-centers-survey-data\/","art_direction":{"A1":{"id":126060,"rawUrl":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png","url":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png?w=564&h=317&crop=1","width":564,"height":317,"caption":"","chartArt":false},"A2":{"id":126060,"rawUrl":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png","url":"https:\/\/al
pha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png?w=268&h=151&crop=1","width":268,"height":151,"caption":"","chartArt":false},"A3":{"id":126060,"rawUrl":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png","url":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png?w=194&h=110&crop=1","width":194,"height":110,"caption":"","chartArt":false},"A4":{"id":126060,"rawUrl":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png","url":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png?w=268&h=151&crop=1","width":268,"height":151,"caption":"","chartArt":false},"XL":{"id":126060,"rawUrl":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png","url":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png?w=720&h=405&crop=1","width":720,"height":405,"caption":"","chartArt":false},"social":{"id":126060,"rawUrl":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png","url":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2022\/08\/08.22.2018_featured.png?w=1200&h=628&crop=1","width":1200,"height":628,"caption":"","chartArt":false}},"_embeds":[],"watchers":[],"table_of_contents":[],"datacite_doi":"","prc_seo_data":{"title":"Validating 2016 voters in Pew Research Center\u2019s survey data","description":"(Related post: Validating 2020 voters in Pew Research Center\u2019s survey data)","og_title":"Validating 2016 voters in Pew Research Center\u2019s survey data","og_description":"(Related post: Validating 2020 voters in Pew Research Center\u2019s survey 
data)","schema_type":"Article","noindex":false,"canonical_url":"","primary_terms":{"category":44},"custom_schema":[],"og_image":126060,"indexnow_submitted_at":null,"gsc_index_status":null},"prepublish_checks":{},"jetpack_sharing_enabled":true,"relatedPostsOrdered":[],"bylinesOrdered":[{"key":"_nc1ezrzmh","termId":967},{"key":"_ydgm6cr18","termId":958}],"acknowledgementsOrdered":[],"_links":{"self":[{"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/decoded\/111680","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/decoded"}],"about":[{"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/types\/decoded"}],"author":[{"embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/users\/655"}],"replies":[{"embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/comments?post=111680"}],"version-history":[{"count":2,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/decoded\/111680\/revisions"}],"predecessor-version":[{"id":138556,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/decoded\/111680\/revisions\/138556"}],"wp:attachment":[{"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/media?parent=111680"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/categories?post=111680"},{"taxonomy":"bylines","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/bylines?post=111680"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/collection?post=111680"},{"taxonomy":"_post_visibility","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/_post_visibility?post=111680"},{"taxonomy":"decoded-category","e
mbeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/decoded-category?post=111680"},{"taxonomy":"formats","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/formats?post=111680"},{"taxonomy":"_fund_pool","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/_fund_pool?post=111680"},{"taxonomy":"languages","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/languages?post=111680"},{"taxonomy":"regions-countries","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/regions-countries?post=111680"},{"taxonomy":"research-teams","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/research-teams?post=111680"},{"taxonomy":"workflow-status","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/workflow-status?post=111680"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}