{"id":828,"date":"2010-06-09T11:16:36","date_gmt":"2010-06-09T10:16:36","guid":{"rendered":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/?p=828"},"modified":"2010-06-09T11:16:36","modified_gmt":"2010-06-09T10:16:36","slug":"sort-working-the-crowd-galaxy-zoo-the-rise-of-the-citizen-scientist","status":"publish","type":"post","link":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/2010\/06\/sort-working-the-crowd-galaxy-zoo-the-rise-of-the-citizen-scientist\/","title":{"rendered":"SORT &#8211; Working the crowd: Galaxy Zoo &#038; the rise of the citizen scientist"},"content":{"rendered":"<p>I&#8217;ve been looking forward to this session by Chris Lintott on\u00a0<a href=\"http:\/\/www.galaxyzoo.org\/\">Galaxy Zoo<\/a><\/p>\n<p>As our ability to get information about the universe has increased we are challenged to deal with larger and larger amounts of data. In astronomy driven by availability of hi-resolution digital imaging etc &#8211; whereas 20-30 years ago you could get collections of hundreds of galaxies &#8211; now can get collections of millions.<\/p>\n<p>Analysis of galaxy images is about looking at the shape of galaxy. While machine approaches have been developed &#8211; they typically have only an 80% accuracy. However humans are very good at this type of task. This used to be a task students would do &#8211; but the amount of data far outstripped ability of students to keep up.<\/p>\n<p>In astronomy there is a long tradition of &#8216;amateurs&#8217; taking part and spotting things that may not be spotted by professionals. However contibutions have generally been around data collection &#8211; and then passed to experts for analysis. Galaxy Zoo reverses this &#8211; data collection been done and asking public to analyse data.<\/p>\n<p>GalaxyZoo was meant to be a side project &#8211; but picked up by media &#8211; specifically BBC News website &#8211; and sudden burst of publicity got huge boost. However, first thing that happened was server went down &#8211; 30,000 emails telling them that the server had gone down. Luckily able to get that back up and running quickly.<\/p>\n<p>After 48 hours were classifying as many galaxies in 1 hour as a student previously doing in a month.<\/p>\n<p>Found that getting many people to do the classification improves accuracy &#8211; over professional astronomers. Took away all barriers to participating to get as many people involved as possible. Originally had a &#8216;test&#8217; for users &#8211; but took this away.<\/p>\n<p>The huge side effect is that humans can spot unexpected stuff without being told &#8211; much better than machines.<\/p>\n<p>Also built community around people participating &#8211; this community now starting to solve problems &#8211; e.g. discovery of small green galaxies &#8211; started to analyse, recruited programmer to interrogate data and this has eventually resulted in published paper &#8211; these objects have been known since 1960s but never analysed. None of the people in the group were scientists.<\/p>\n<p>When they&#8217;ve talked to users of the site the overwhelming reason for taking part is that they want to do something useful &#8211; want to contribute.<\/p>\n<p>We have responsibility not to waste peoples time &#8211; collective manpower on GalaxyZoo 2 was equivalent to employing a single person for 200 years &#8211; we cannot take this likely.<\/p>\n<p>Don&#8217;t make promises you can&#8217;t keep &#8211; e.g. don&#8217;t offer &#8216;free response&#8217; that you then can&#8217;t actually read &#8211; Galaxy Zoo handles this via the online community forums.<\/p>\n<p>Chris describes three strands of engagement with users<\/p>\n<ul>\n<li>Known knowns<\/li>\n<li>Unknown unknowns<\/li>\n<li>Known unknowns<\/li>\n<\/ul>\n<p>Now JISC funded project to convert information from old ship logs &#8211; because has climate data.<\/p>\n<p>Show pages of ships logs &#8211;<\/p>\n<ul>\n<li>key data you should extract (known knowns &#8211; that stuff the researchers know they want from the logs like weather reports)<\/li>\n<li>unexpected things you might spot (unknown uknowns &#8211; stuff you might spot in the logs &#8211; pictures, unexpected information)<\/li>\n<li>expected things, but not known how much (known unknowns &#8211; events you know will be in there but not how often e.g. encounters with other ships)<\/li>\n<\/ul>\n<p>These strands are generalisable to many projects<\/p>\n<p><a href=\"http:\/\/www.zooniverse.org\/home\">Zooniverse<\/a> &#8211; takes the generalisable stuff from the researchers and provides it &#8211; platform for citizen science.<\/p>\n<p>Can no longer rely on media to get message out and drive engagement &#8211; &#8220;it&#8217;s on the internet isn&#8217;t it amazing&#8221; no longer a story &#8211; need to work out how we get the next 300,000 people involved [my first thought &#8211; Games &#8211; look at Farmville&#8230;]<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I&#8217;ve been looking forward to this session by Chris Lintott on\u00a0Galaxy Zoo As our ability to get information about the universe has increased we are challenged to deal with larger and larger amounts of data. In astronomy driven by availability of hi-resolution digital imaging etc &#8211; whereas 20-30 years ago you could get collections of [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[46],"class_list":["post-828","post","type-post","status-publish","format-standard","hentry","category-uncategorized","tag-sort2010"],"_links":{"self":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts\/828","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/comments?post=828"}],"version-history":[{"count":4,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts\/828\/revisions"}],"predecessor-version":[{"id":832,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts\/828\/revisions\/832"}],"wp:attachment":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/media?parent=828"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/categories?post=828"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/tags?post=828"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}