{"id":88015,"date":"2019-05-23T10:49:38","date_gmt":"2019-05-23T15:49:41","guid":{"rendered":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/2019\/05\/23\/methodology-63-2\/"},"modified":"2024-08-07T16:59:50","modified_gmt":"2024-08-07T20:59:50","slug":"methodology-63-2","status":"publish","type":"post","link":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/journalism\/2019\/05\/23\/methodology-63-2\/","title":{"rendered":"Methodology"},"content":{"rendered":"\n<h4 class=\"wp-block-heading\" id=\"news-outlet-selection\">News outlet selection<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">The study was based on American news outlets whose websites:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>Have a monthly average of more than 20 million unique visitors from July-September of 2018, according to Comscore\u2019s data (Comscore Media Metrix Multi-platform, unique visitors, July-September 2018).<\/li>\n\n\n\n<li>Provide original reporting and news and information content for a general audience.<\/li>\n\n\n\n<li>Cover a variety of topics rather than specializing in a particular topic (such as weather, sports, politics, business or entertainment)<\/li>\n\n\n\n<li>Are based in the U.S.<\/li>\n\n\n\n<li>Focus at least in part on national issues (rather than focusing solely on local issues).<\/li>\n<\/ol>\n\n\n\n<figure class=\"wp-block-image alignright size-420-wide\"><a href=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/journalism\/2019\/05\/23\/methodology-63-2\/pdl_05-23-19_facebook-news-00-05-png\/\"><img data-dominant-color=\"f7f6f8\" data-has-transparency=\"false\" style=\"--dominant-color: #f7f6f8;\" loading=\"lazy\" decoding=\"async\" sizes=\"auto, (max-width: 420px) 100vw, 420px\" height=\"692\" width=\"420\" src=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-05.png?w=622\" alt=\"Most Facebook news photos show three or fewer individuals\" class=\"wp-image-109693 not-transparent\" srcset=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-05.png 846w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-05.png?resize=182,300 182w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-05.png?resize=768,1265 768w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-05.png?resize=622,1024 622w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-05.png?resize=246,405 246w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-05.png?resize=200,329 200w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-05.png?resize=260,428 260w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-05.png?resize=310,510 310w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-05.png?resize=420,692 420w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-05.png?resize=640,1054 640w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-05.png?resize=740,1218 740w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-05.png?resize=160,263 160w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-05.png?resize=320,527 320w\" \/><\/a><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">After applying these rules, researchers included the following outlets: ABC News, BuzzFeed News, CBS News, CNN, Fox News, HuffPost, NBC News, The New York Times, Newsweek, NPR, Time, U.S. News &amp; World Report, USA Today, The Washington Post, Yahoo News, Vice and Vox.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"data-collection\">Data collection<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">To create the dataset used for both analyses, researchers built a data pipeline to streamline image collection, facial recognition and extraction, and facial classification tasks. To ensure that a large number of images could be processed in a timely manner, the team set up a database and analysis environment on the Amazon Web Service (AWS) cloud, which enabled the use of graphics processing units (GPUs) for faster image processing.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Data collection took place in April, May and June of 2018. The information collected about each post includes the title, caption and a brief comment which appeared within the post.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"face-detection\">Face detection<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Researchers used the face detector from the Python library dlib to identify all faces in the image. The program identifies four coordinates of the face: top, right, bottom and left (in pixels). This system achieves 99.4% accuracy on the popular <a href=\"http:\/\/vis-www.cs.umass.edu\/lfw\/\">Labeled Faces in the Wild dataset<\/a>. The research team cropped the faces from the images and stored them as separate files. A total of 44,056 photos were analyzed, 22,342 of which contained identifiable human faces.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"machine-vision-for-gender-classification\">Machine vision for gender classification<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Researchers used a method called \u201c<a href=\"https:\/\/medium.com\/pew-research-center-decoded\/how-social-scientists-can-use-transfer-learning-to-kickstart-a-deep-learning-project-fdceeb565cc5\">transfer learning<\/a>\u201d to train a gender classifier rather than using machine vision methods developed by an outside vendor. In some commercial and noncommercial alternative classifiers, \u201cmultitask\u201d learning methods are used to simultaneously perform face detection, landmark localization, pose estimation, gender recognition and other face analysis tasks. The research team\u2019s classifier achieved high accuracy for the gender classification task while allowing the research team to monitor a variety of important performance metrics.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"gender-classification-model-training\">Gender classification model training<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Recently, research has provided evidence of algorithmic bias in image classification systems from a variety of high profile vendors. This problem is believed to stem from imbalanced training data that often overrepresents white men. For this analysis, researchers decided to train a new gender classification model using a more balanced image training set. However, training an image classifier is a daunting task because collecting a large labeled dataset for training is very time and labor intensive and often is too computationally intensive to actually execute.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">To avoid these challenges, the research team relied on a technique called \u201ctransfer learning,\u201d which involves recycling large pretrained neural networks (a popular class of machine learning models) for more specific classification tasks. The key innovation of this technique is that lower layers of the pretrained neural networks often contain features that are useful across different image classification tasks. Researchers can reuse these pretrained lower layers and fine-tune the top layers for their specific application \u2013 in this case, the gender classification task.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The specific pretrained network researchers used is VGG16, implemented in the popular deep learning Python package Keras. The VGG network architecture was introduced by Karen Simonyan and Andrew Zisserman in their 2014 paper \u201c<a href=\"https:\/\/arxiv.org\/abs\/1409.1556\">Very Deep Convolutional Networks for Large Scale Image Recognition<\/a>.\u201d The model is trained using ImageNet, which has over 1.2 million images and 1,000 object categories. Other common pretrained models include ResNet and Inception. VGG16 contains 16 weight layers that include several convolution and fully connected layers. The VGG16 network has achieved a 90% top-5 accuracy in ImageNet classification.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Researchers began with the classic architecture of the VGG16 neural network as a base and then added one fully connected layer, one dropout layer and one output layer. The team conducted two rounds of training \u2013 one for the layers added for the gender classification task (the custom model), and subsequently one for the upper layers of the VGG base model.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Researchers froze the VGG base weights so that they could not be updated during the first round of training and restricted training during this phase to the custom layers. This choice reflects the fact that weights for the new layers are randomly initialized, so if the VGG weights are allowed to be updated it would destroy the information contained within them. After 20 epochs of training on just the custom model, the team unfroze four top layers of the VGG base and began a second round of training. For the second round of training, researchers implemented an early stopping function. Early stopping checks the progress of the model loss (or error rate) during training and halts training when validation loss value ceases to improve. This serves as both a timesaver and keeps the model from overfitting to the training data.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In order to prevent the model from overfitting to the training images, researchers randomly augmented each image during the training process. These random augmentations included rotations, shifting of the center of the image, zooming in\/out, and shearing the image. As such, the model never saw the same image twice during training.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"selecting-training-images\">Selecting training images<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Image classification systems, even those that draw on pretrained models, require a substantial amount of training and validation data. These systems also demand diverse training samples if they are to be accurate across demographic groups. Researchers took a variety of steps to ensure that the model was accurate when classifying the gender of people from diverse backgrounds.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">First, the team located existing datasets used by researchers for image analysis. These include the \u201c<a href=\"http:\/\/vis-www.cs.umass.edu\/lfw\/\">Labeled Faces in the Wild<\/a>\u201d (LFW) and \u201c<a href=\"https:\/\/www.wilmabainbridge.com\/facememorability2.html\">Bainbridge 10K U.S. Adult Faces<\/a>\u201d datasets. Second, the team downloaded images of <a href=\"https:\/\/www.eleicoes2016.com.br\/\">Brazilian politicians<\/a> from a site that hosts municipal-level election results. Brazil is a racially diverse country, and that is reflected in the demographic diversity in its politicians. Third, researchers created original lists of celebrities who belong to different minority groups and collected 100 images for each individual.<\/p>\n\n\n\n<figure class=\"wp-block-image alignright size-310-wide\"><a href=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/journalism\/2019\/05\/23\/methodology-63-2\/pdl_05-23-19_facebook-news-00-06-png\/\"><img data-dominant-color=\"efefef\" data-has-transparency=\"false\" style=\"--dominant-color: #efefef;\" loading=\"lazy\" decoding=\"async\" sizes=\"auto, (max-width: 310px) 100vw, 310px\" height=\"270\" width=\"310\" src=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-06.png?w=627\" alt=\"Training datasets\" class=\"wp-image-109696 not-transparent\" srcset=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-06.png 627w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-06.png?resize=300,262 300w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-06.png?resize=464,405 464w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-06.png?resize=200,174 200w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-06.png?resize=260,227 260w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-06.png?resize=310,270 310w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-06.png?resize=420,366 420w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-06.png?resize=160,140 160w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-06.png?resize=320,279 320w\" \/><\/a><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">The list of minority celebrities focused on famous black and Asian individuals. The list of famous blacks includes 22 individuals: 11 men and 11 women. The list of famous Asians includes 30 individuals: 15 men and 15 women. Researchers then compiled a list of the most populous 100 countries and downloaded up to 100 images of men and women for each nation-gender combination, respectively (for example, \u201cFrench man\u201d). This choice helped ensure that the training data included images that feature people from a diverse set of countries, balancing out the overrepresentation of white people in the training dataset. Finally, researchers supplemented this list with a set of 21 celebrity seniors (11 men and 10 women) to help improve model accuracy on older individuals. This allowed researchers to easily build up a demographically diverse dataset of faces with known gender and racial profiles.<\/p>\n\n\n\n<figure class=\"wp-block-image alignright size-310-wide\"><a href=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/journalism\/2019\/05\/23\/methodology-63-2\/pdl_05-23-19_facebook-news-00-07-png\/\"><img data-dominant-color=\"ededed\" data-has-transparency=\"false\" style=\"--dominant-color: #ededed;\" loading=\"lazy\" decoding=\"async\" sizes=\"auto, (max-width: 310px) 100vw, 310px\" height=\"257\" width=\"310\" src=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-07.png?w=707\" alt=\"Model performance statistics\" class=\"wp-image-109701 not-transparent\" srcset=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-07.png 707w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-07.png?resize=300,249 300w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-07.png?resize=488,405 488w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-07.png?resize=200,166 200w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-07.png?resize=260,216 260w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-07.png?resize=310,257 310w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-07.png?resize=420,349 420w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-07.png?resize=640,531 640w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-07.png?resize=160,133 160w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-07.png?resize=320,266 320w\" \/><\/a><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Some images feature multiple people. To ensure that the images were directly relevant, a member of the research team reviewed each face in the training datasets manually and removed irrelevant or erroneous faces (e.g., men in images with women). Researchers also removed images that were too blurry, too small and those where much of the face was obscured. In summary, the training data consist of 14,351 men and 12,630 women in images. The images belong to seven different datasets.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"gender-classification-model-performance\">Gender classification model performance<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">To evaluate whether the model was accurate, researchers applied it to a subset of the dataset equivalent to 20% of the image sources: a \u201cheld out\u201d set which was not used for training purposes. The model achieved an overall accuracy of 95% on this set of validation data. The model was also accurate on particular subsets of the data, achieving 0.96 positive predictive value on the black celebrities subset, for example.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">As a final validation exercise, researchers used an online labor market to create a hand coded random sample of 998 faces. This random subset of images overrepresented men \u2014 629 of the images were coded as male by Mechanical Turk (MTurk) coders. Each face was coded by three online workers. For the 920 faces that had consensus across the three coders, the overall accuracy of this sample is 87%. Using the value 1 for \u201cmale\u201d and 0 for \u201cfemale,\u201d the precision and recall of the model were 0.89 and 0.92, respectively, indicating that performance was balanced for both predictions.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"text-classification-and-model-performance\">Text classification and model performance<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">To determine whether news posts mentioned particular topics, researchers used a semi-supervised text classification algorithm. The topics were selected because they appeared in contemporaneous Pew Research Center surveys of U.S. adults either as among the most important problems facing the nation (health care, the economy and immigration) or as topics that individuals seek news about (sports and entertainment). Researchers developed a list of keywords related to each topic as \u201cseed words\u201d that initially classified posts as related to each of the topics or not. To narrow down the possible keywords for the entertainment category, researchers operationalized the concept as news mentioning TV, music or movies.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">These initial positive cases were used as a training data set, which the researchers then used to fit a support vector machines (SVM) model. The SVM model detects words that co-occur with the \u201cseed words\u201d and uses those additional words to predict which posts were likely to be related to the topics of interest. The model also avoids the \u201cseed words\u201d associated with the other topics. The seed words of other topics help the model determine the negative cases. For example, when applying the model for sports, a post might use a seemingly relevant term like \u201cwinner\u201d but <em>also<\/em> use terms associated with the economy like \u201ctrade war.\u201d In such a case, the model is especially unlikely to classify the post as mentioning sports.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">To prepare the data needed to train the model, researchers preprocessed the text by removing stop words. These words include commonly used English words such as \u201cand,\u201d \u201cthe,\u201d or \u201cof\u201d that do not provide much information about the content of the text. Researchers then used the TfidfVectorizer in the sklearn python library to convert the text to tokens, including phrases that were one, two or three words long. The model was then applied to the full dataset, resulting in a prediction about whether every post mentioned one of the topics or not.<\/p>\n\n\n\n<figure class=\"wp-block-image alignright size-420-wide\"><a href=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/journalism\/2019\/05\/23\/methodology-63-2\/pdl_05-23-19_facebook-news-00-08-png\/\"><img data-dominant-color=\"eeeeee\" data-has-transparency=\"false\" style=\"--dominant-color: #eeeeee;\" loading=\"lazy\" decoding=\"async\" sizes=\"auto, (max-width: 420px) 100vw, 420px\" height=\"216\" width=\"420\" src=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-08.png?w=846\" alt=\"Initial text classification validation results\" class=\"wp-image-109706 not-transparent\" srcset=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-08.png 846w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-08.png?resize=300,155 300w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-08.png?resize=768,396 768w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-08.png?resize=786,405 786w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-08.png?resize=200,103 200w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-08.png?resize=260,134 260w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-08.png?resize=310,160 310w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-08.png?resize=420,216 420w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-08.png?resize=640,330 640w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-08.png?resize=740,381 740w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-08.png?resize=160,82 160w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-08.png?resize=320,165 320w\" \/><\/a><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">To validate this approach, researchers selected 1,100 posts for human coding. Since the prevalence of the posts that actually discuss each of the five topics was low, researchers used oversampling \u2013 based on model-based estimates \u2013 to increase the representation of positive cases in the validation sample. Specifically, researchers randomly selected approximately half of the posts that were tagged as positive by the SVM model, and the other half was tagged as negative. After removing duplicate posts, researchers classified 1,061 posts, determining whether they mentioned any of the five topics of interest. Interrater reliability statistics are weighted to reflect the oversampling process.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Two in-house coders classified the same subset of posts (406) to ensure that humans could reasonably agree on whether or not a post mentioned each issue. After conducting the content coding, researchers resolved disagreements and created a consolidated set of human decisions to compare the model against. The vast majority of posts were coded as mentioning to a single topic, 55 posts in total (0.1%) were coded as mentioning to multiple topics.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The performance of the model and the human coders\u2019 agreement with each other is described in the table above. The model can be assessed via precision, recall, and weighted kappa, each of which compares how well the model\u2019s decisions correspond with those of the human coders. The final column shows the weighted kappa for the subset of posts coded by two coders, comparing their decisions against each other.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Researchers found that the health care topic had low precision, suggesting that many posts that the model identified as mentioning the topic did not in fact mention it. A manual review of posts revealed that the model incorrectly classified posts that mentioned personal health and wellness as mentioning health care, while the coders were focused on health care policy or general health care issues. Since the words associated with health care policy were so similar to those associated with personal health and wellness, researchers decided to exclude this topic from further analysis.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The precision value for TV, music or movies was also low, but it was clear in this case that false positives were decreasing model performance, due to both the \u201cseed\u201d keywords associated with the topic and the penalty that the model applied to mislabeled posts. In response, researchers revised the keyword list and also adjusted the model parameter that controls the size of the penalty assigned to mislabeled posts. If that model parameter is smaller, the model will achieve better separation between the positive and negative posts. So researchers lowered the parameter from 1 to 0.05.<\/p>\n\n\n\n<figure class=\"wp-block-image alignright size-420-wide\"><a href=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/journalism\/2019\/05\/23\/methodology-63-2\/pdl_05-23-19_facebook-news-00-09-png\/\"><img data-dominant-color=\"eeeeee\" data-has-transparency=\"false\" style=\"--dominant-color: #eeeeee;\" loading=\"lazy\" decoding=\"async\" sizes=\"auto, (max-width: 420px) 100vw, 420px\" height=\"199\" width=\"420\" src=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-09.png?w=839\" alt=\"Final text classification validation results\" class=\"wp-image-109710 not-transparent\" srcset=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-09.png 839w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-09.png?resize=300,142 300w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-09.png?resize=768,363 768w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-09.png?resize=200,95 200w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-09.png?resize=260,123 260w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-09.png?resize=310,147 310w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-09.png?resize=420,199 420w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-09.png?resize=640,303 640w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-09.png?resize=740,350 740w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-09.png?resize=160,76 160w, https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PDL_05.23.19_Facebook.news-00-09.png?resize=320,151 320w\" \/><\/a><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Since changing the model risks overfitting to the data, researchers separately drew a new sample of 100 posts (50 positive and 50 negative) to conduct model validation. Two in-house coders classified these posts. After reevaluating the models, researchers arrived at the following performance statistics.[6. numoffset=&#8221;6&#8243; Note that the coder to coder weighted kappa for TV, music, or movies is based on a revised validation set of 100 oversampled and double-coded posts.]<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Performance statistics for the topic including TV, music or movies was substantially better in this round of validation: Both precision and recall increased. The model performance of the other topics also changed slightly due to the fact that seed words for one topic identify negative cases for other topics; since the keyword list for TV, music or movies changed in this round, it also affected the results of the other topics.<\/p>\n\n\n<div style=\"border-width:1px;padding-bottom:var(--wp--preset--spacing--20);padding-left:var(--wp--preset--spacing--30);padding-right:var(--wp--preset--spacing--30);padding-top:var(--wp--preset--spacing--20);--block-gap: inherit\" class=\"wp-block-prc-block-collapsible has-background has-ui-beige-very-light-background-color has-border-color has-ui-beige-dark-border-color\" id=\"terminology\" data-wp-interactive=\"{&quot;namespace&quot;:&quot;prc-block\\\/collapsible&quot;}\" data-wp-context=\"{&quot;collapsibleId&quot;:&quot;terminology&quot;,&quot;isOpen&quot;:false}\" data-wp-class--is-open=\"context.isOpen\" data-wp-init--scroll-into-view=\"callbacks.onInitScrollIntoView\"><div class=\"wp-block-prc-block-collapsible__title\" data-wp-on--click=\"actions.onClick\"><div>Terminology<\/div><button class=\"wp-block-prc-block-collapsible__icon\"><span data-wp-bind--hidden=\"context.isOpen\"><i class=\"icon icon-library__light icon__circle-plus\"><svg style=\"width: 1em; height: 1em;\"><use xlink:href=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/plugins\/prc-icon-library\/build\/icons\/sprites\/light.svg#circle-plus\"><\/use><\/svg><\/i><\/span><span data-wp-bind--hidden=\"!context.isOpen\" hidden><i class=\"icon icon-library__light icon__circle-minus\"><svg style=\"width: 1em; height: 1em;\"><use xlink:href=\"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/plugins\/prc-icon-library\/build\/icons\/sprites\/light.svg#circle-minus\"><\/use><\/svg><\/i><\/span><\/button><\/div><div class=\"wp-block-prc-block-collapsible__content\">\n\n<p class=\"wp-block-paragraph\"><strong>Precision and recall<\/strong> are statistics used to quantify the performance of statistical models making predictions. Low values for precision signify that the model is making a positive prediction about a post or image when in fact the prediction should be negative. Low values of recall signify that the model is systematically missing positive cases that ought to be labeled as such.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Cohen\u2019s Kappa<\/strong> is a statistic used to assess interrater reliability. It ranges from 0 (meaning that two separate sets of decisions are only related to each other according to chance) to 1 (meaning that two separate decisions perfectly agree, even adjusting for chance agreement).<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Deep learning <\/strong>is a class of machine learning models that is inspired by how biological nervous systems process information. These kinds of models include multiple layers of information that help make predictions. In this report, researchers used deep learning models to predict whether human faces belonged to men or women.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Support vector machines <\/strong>refers to a common machine learning algorithm. The algorithm uses the decision of multiple models and aims to achieve clear separation between classes of data, or predictions. In this case, researchers used support vector machines to classify whether or not posts mentioned particular news topics.\n<\/p>\n\n<\/div><\/div>","protected":false},"excerpt":{"rendered":"<p>News outlet selection The study was based on American news outlets whose websites: After applying these rules, researchers included the following outlets: ABC News, BuzzFeed News, CBS News, CNN, Fox News, HuffPost, NBC News, The New York Times, Newsweek, NPR, Time, U.S. News &amp; World Report, USA Today, The Washington Post, Yahoo News, Vice and [&hellip;]<\/p>\n","protected":false},"author":367,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_crdt_document":"","sub_headline":"","sub_title":"","_prc_public_revisions":[],"_ppp_expiration_hours":0,"_ppp_enabled":false,"ai_generated_summary":"","bylines":[],"acknowledgements":[],"displayBylines":true,"prc_watchers":[],"relatedPosts":[],"reportMaterials":[],"multiSectionReport":[],"package_parts__enabled":false,"package_parts":[],"_prc_fork_parent":0,"_prc_fork_status":"","_prc_active_fork":0,"datacite_doi":"","datacite_doi_citation":"","_prc_seo_qr_attachment_id":0,"spoken_article_player_enabled":true,"footnotes":""},"categories":[353,275,311,216,36,337,333,307,352],"tags":[],"bylines":[],"collection":[],"datasets":[],"level_of_effort":[],"primary_audience":[],"information_type":[],"_post_visibility":[],"formats":[458],"_fund_pool":[],"languages":[],"regions-countries":[],"research-teams":[521,527],"workflow-status":[],"class_list":["post-88015","post","type-post","status-publish","format-standard","hentry","category-data-science","category-digital-news-landscape-2","category-facebook","category-gender-equality-discrimination","category-methodological-research","category-news-content-analysis","category-news-platforms-sources","category-social-media","category-social-media-news","formats-report","research-teams-data-labs","research-teams-journalism"],"label":false,"post_parent":88027,"word_count":2501,"canonical_url":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/journalism\/2019\/05\/23\/methodology-63-2\/","art_direction":{"A1":{"id":181879,"rawUrl":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PL_19.04.08_NewsImages_featured.png","url":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PL_19.04.08_NewsImages_featured.png?w=564&h=317&crop=1","width":564,"height":317,"caption":"(Brendan Smialowski\/AFP\/Getty Images)","chartArt":false},"A2":{"id":181879,"rawUrl":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PL_19.04.08_NewsImages_featured.png","url":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PL_19.04.08_NewsImages_featured.png?w=268&h=151&crop=1","width":268,"height":151,"caption":"(Brendan Smialowski\/AFP\/Getty Images)","chartArt":false},"A3":{"id":181879,"rawUrl":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PL_19.04.08_NewsImages_featured.png","url":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PL_19.04.08_NewsImages_featured.png?w=194&h=110&crop=1","width":194,"height":110,"caption":"(Brendan Smialowski\/AFP\/Getty Images)","chartArt":false},"A4":{"id":181879,"rawUrl":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PL_19.04.08_NewsImages_featured.png","url":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PL_19.04.08_NewsImages_featured.png?w=268&h=151&crop=1","width":268,"height":151,"caption":"(Brendan Smialowski\/AFP\/Getty Images)","chartArt":false},"XL":{"id":181879,"rawUrl":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PL_19.04.08_NewsImages_featured.png","url":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PL_19.04.08_NewsImages_featured.png?w=720&h=405&crop=1","width":720,"height":405,"caption":"(Brendan Smialowski\/AFP\/Getty Images)","chartArt":false},"social":{"id":181879,"rawUrl":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PL_19.04.08_NewsImages_featured.png","url":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/PL_19.04.08_NewsImages_featured.png?w=1200&h=628&crop=1","width":1200,"height":628,"caption":"(Brendan Smialowski\/AFP\/Getty Images)","chartArt":false}},"_embeds":[],"watchers":[],"table_of_contents":[{"id":88027,"title":"Men Appear Twice as Often as Women in News Photos on Facebook","slug":"men-appear-twice-as-often-as-women-in-news-photos-on-facebook","link":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/journalism\/2019\/05\/23\/men-appear-twice-as-often-as-women-in-news-photos-on-facebook\/","is_active":false},{"id":88008,"title":"Acknowledgments","slug":"acknowledgments-4-6","link":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/journalism\/2019\/05\/23\/acknowledgments-4-6\/","is_active":false},{"id":88015,"title":"Methodology","slug":"methodology-63-2","link":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/journalism\/2019\/05\/23\/methodology-63-2\/","is_active":true}],"report_materials":[{"key":"5fbff053-8bb1-49f4-83f1-cff1e7ee43ca","type":"report","url":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-content\/uploads\/sites\/20\/2019\/05\/Facebook-news-FULL-REPORT-05.23.19.pdf","label":"","icon":"","attachmentId":109729}],"report_pagination":{"current_post":{"id":88015,"title":"Methodology","slug":"methodology-63-2","link":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/journalism\/2019\/05\/23\/methodology-63-2\/","is_active":true,"page_num":3},"next_post":null,"previous_post":{"id":88008,"title":"Acknowledgments","slug":"acknowledgments-4-6","link":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/journalism\/2019\/05\/23\/acknowledgments-4-6\/","is_active":false,"page_num":2},"pagination_items":[{"id":88027,"title":"Men Appear Twice as Often as Women in News Photos on Facebook","slug":"men-appear-twice-as-often-as-women-in-news-photos-on-facebook","link":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/journalism\/2019\/05\/23\/men-appear-twice-as-often-as-women-in-news-photos-on-facebook\/","is_active":false,"page_num":1},{"id":88008,"title":"Acknowledgments","slug":"acknowledgments-4-6","link":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/journalism\/2019\/05\/23\/acknowledgments-4-6\/","is_active":false,"page_num":2},{"id":88015,"title":"Methodology","slug":"methodology-63-2","link":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/journalism\/2019\/05\/23\/methodology-63-2\/","is_active":true,"page_num":3}]},"parent_info":{"parent_title":"Men Appear Twice as Often as Women in News Photos on Facebook","parent_id":88027},"materialsOrdered":[],"chaptersOrdered":[],"partsOrdered":[],"partsEnabled":false,"datacite_doi":"","prc_seo_data":{"title":"Methodology","description":"News outlet selection The study was based on American news outlets whose websites: Have a monthly average of more than 20 million unique visitors from July-September of 2018, according to&hellip;","og_title":"Methodology","og_description":"News outlet selection The study was based on American news outlets whose websites: Have a monthly average of more than 20 million unique visitors from July-September of 2018, according to&hellip;","schema_type":"Article","noindex":false,"canonical_url":"","primary_terms":{"category":307,"research-teams":527},"custom_schema":[],"og_image":181879,"indexnow_submitted_at":null,"gsc_index_status":null},"prepublish_checks":{"prc-image-alt-text":{"status":"incomplete","message":"5 images are missing alt text.","data":{"count":5}},"prc-about-this-research":{"status":"incomplete","message":"Add an \"About this research\" details block.","data":null},"prc-paragraph-count":{"status":"complete","message":"Found 32 paragraphs.","data":{"count":32}},"prc-internal-link":{"status":"complete","message":"Found 5 internal links.","data":{"count":5}}},"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"relatedPostsOrdered":[],"bylinesOrdered":[],"acknowledgementsOrdered":[],"_links":{"self":[{"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/posts\/88015","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/users\/367"}],"replies":[{"embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/comments?post=88015"}],"version-history":[{"count":5,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/posts\/88015\/revisions"}],"predecessor-version":[{"id":184179,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/posts\/88015\/revisions\/184179"}],"wp:attachment":[{"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/media?parent=88015"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/categories?post=88015"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/tags?post=88015"},{"taxonomy":"bylines","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/bylines?post=88015"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/collection?post=88015"},{"taxonomy":"datasets","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/datasets?post=88015"},{"taxonomy":"level_of_effort","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/level_of_effort?post=88015"},{"taxonomy":"primary_audience","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/primary_audience?post=88015"},{"taxonomy":"information_type","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/information_type?post=88015"},{"taxonomy":"_post_visibility","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/_post_visibility?post=88015"},{"taxonomy":"formats","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/formats?post=88015"},{"taxonomy":"_fund_pool","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/_fund_pool?post=88015"},{"taxonomy":"languages","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/languages?post=88015"},{"taxonomy":"regions-countries","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/regions-countries?post=88015"},{"taxonomy":"research-teams","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/research-teams?post=88015"},{"taxonomy":"workflow-status","embeddable":true,"href":"https:\/\/alpha.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/workflow-status?post=88015"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}