{"id":14262,"date":"2018-02-04T09:20:49","date_gmt":"2018-02-04T09:20:49","guid":{"rendered":"https:\/\/www.revoscience.com\/en\/?p=14262"},"modified":"2020-05-27T06:14:23","modified_gmt":"2020-05-27T06:14:23","slug":"unbiased-approach-sifting-big-data","status":"publish","type":"post","link":"https:\/\/www.revoscience.com\/en\/unbiased-approach-sifting-big-data\/","title":{"rendered":"An unbiased approach for sifting through big data"},"content":{"rendered":"<p><span style=\"color: #000000\"><em><strong>A new method could help researchers develop unbiased indicators for assessing complex systems such as population health.<\/strong><\/em><\/span><\/p>\n<figure id=\"attachment_14263\" aria-describedby=\"caption-attachment-14263\" style=\"width: 618px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-14263\" src=\"https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png\" alt=\"\" width=\"618\" height=\"566\" title=\"\" srcset=\"https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png 580w, https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254-300x274.png 300w\" sizes=\"auto, (max-width: 618px) 100vw, 618px\" \/><figcaption id=\"caption-attachment-14263\" class=\"wp-caption-text\">The maximum entropy network (MENet) of all variables of health data analysed. Nodes and lines represent the health-related variables and the strength of interdependence between two variables respectively. MENet helps build the Optimal Information Network (OIN) which indicates the most useful information to accurately characterize systemic health. (Servadio J.L. and Convertino M, Science Advances, February 2, 2018)<\/figcaption><\/figure>\n<p><span style=\"color: #000000\">Researchers have developed a complex system model to evaluate the health of populations in some U.S. cities based only on the most significant variables expressed in available data. Their unbiased network-based probabilistic approach to mine big data could be used to assess other complex systems, such as ranking universities or evaluating ocean sustainability.\u00a0<\/span><\/p>\n<p><span style=\"color: #000000\">Societies today are data-rich, which can both empower and overwhelm. Sifting through this data to determine which variables to use for the assessment of something like the health of a city\u2019s population is challenging. Researchers often choose these variables based on their personal experience. They might decide that adult obesity rates, mortality rates, and life expectancy are important variables for calculating a generalized metric of the residents\u2019 overall health. But are these the best variables to use? Are there other more important ones to consider?\u00a0<\/span><\/p>\n<p><span style=\"color: #000000\">Matteo Convertino of Hokkaido University in Japan and Joseph Servadio of the University of Minnesota in the U.S. have introduced a novel probabilistic method that allows the visualization of the relationships between variables in big data for complex systems. The approach is based on \u2018maximum transfer entropy\u2019, which probabilistically measures the strength of relationships between multiple variables over time.<\/span><\/p>\n<p><span style=\"color: #000000\">Using this method, Convertino and Servadio mined through a large amount of health data in the U.S. to build a \u2018maximum entropy network\u2019 (MENet): a model composed of nodes representing health-related variables, and lines connecting the variables. The lines are darker the stronger the interdependence between two variables. This allowed the researchers to build an \u2018Optimal Information Network (OIN)\u2019 by choosing the variables that had the most practical relevance for assessing the health status of populations in 26 U.S. cities from 2011 to 2014. By combining the data from each selected variable, the researchers were able to compute an \u2018integrated health value\u2019 for each city. The higher the number, the less healthy a city\u2019s population.<\/span><\/p>\n<p><span style=\"color: #000000\">They found that some cities, such as Detroit, had poor overall health during that timeframe. Others, such as San Francisco, had low values, indicating more favorable health outcomes. Some cities showed high variability over the four year period, such as Philadelphia. Cross-sectional comparisons showed tendencies for California cities to score better than other parts of the country. Also, Midwestern cities, including Denver, Minneapolis, and Chicago, appeared to perform poorly compared to other regions, contrary to national city rankings.<\/span><br \/>\n<span style=\"color: #000000\">Convertino believes that methods like this, fed by large data sets and analysed via automated stochastic computer models, could be used to optimize research and practice; for example for guiding optimal decisions about health. \u201cThese tools can be used by any country, at any administrative level, to process data in real-time and help personalize medical efforts,\u201d says Convertino.<\/span><\/p>\n<p><span style=\"color: #000000\">But it is not just for health \u2014 \u201cThe model can be applied to any complex system to determine their Optimal Information Network, in fields from ecology and biology to finance and technology. Untangling their complexities and developing unbiased systemic indicators can help improve decision-making processes,\u201d Convertino added.<\/span> <\/p>\n","protected":false},"excerpt":{"rendered":"<p>A new method could help researchers develop unbiased indicators for assessing complex systems such as population health. Researchers have developed a complex system model to evaluate the health of populations in some U.S. cities based only on the most significant variables expressed in available data. Their unbiased network-based probabilistic approach to mine big data could [&hellip;]<\/p>\n","protected":false},"author":6,"featured_media":14263,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[17],"tags":[],"class_list":["post-14262","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-research"],"featured_image_urls":{"full":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png",580,530,false],"thumbnail":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254-150x150.png",150,150,true],"medium":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254-300x274.png",300,274,true],"medium_large":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png",580,530,false],"large":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png",580,530,false],"1536x1536":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png",580,530,false],"2048x2048":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png",580,530,false],"ultp_layout_landscape_large":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png",580,530,false],"ultp_layout_landscape":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png",580,530,false],"ultp_layout_portrait":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png",580,530,false],"ultp_layout_square":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png",580,530,false],"newspaper-x-single-post":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png",536,490,false],"newspaper-x-recent-post-big":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png",394,360,false],"newspaper-x-recent-post-list-image":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png",71,65,false],"web-stories-poster-portrait":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png",580,530,false],"web-stories-publisher-logo":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png",96,88,false],"web-stories-thumbnail":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2018\/02\/5254.png",150,137,false]},"author_info":{"info":["Amrita Tuladhar"]},"category_info":"<a href=\"https:\/\/www.revoscience.com\/en\/category\/news\/research\/\" rel=\"category tag\">Research<\/a>","tag_info":"Research","comment_count":"0","_links":{"self":[{"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/posts\/14262","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/comments?post=14262"}],"version-history":[{"count":0,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/posts\/14262\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/media\/14263"}],"wp:attachment":[{"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/media?parent=14262"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/categories?post=14262"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/tags?post=14262"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}