====== Data Journalism ====== Database journalism \\ Computer-assisted journalism \\
it's not about how many developers you have, but what you do with them. -- Adrian Holovaty
Data journalism outcomes can range from visualization to long form articles. It is about the process how to turn numbers in to a story, whether the story is comprised of words or graphics is irrelevant. By knowing the structure of your team and balancing resources, you can start to think how best to organize your group. * {{youtube>DxvVZe2fnvI|99% vs. 1%}} * [[http://www.theguardian.com/news/datablog/2012/dec/04/government-spending-department-2011-12|Government spending by department, 2011-12: get the data]], The Guardian From [[http://learno.net/courses/doing-journalism-with-data-first-steps-skills-and-tools|Doing Journalism with Data: Frist Steps, Skills and Tolls]]: [[http://learno.net/classroom/doing-journalism-with-data-first-steps-skills-and-tools|Lession]]: Login required
Key-roles, you will probably need to do at least 2 of these: - Research (4:58) * Most important role * On basis of a journalistic hunch, or by digging around in data * Time consuming - Writing (5:58) * Numbers without context are just numbers, words give context * Example: Yearly guide to public spending by each government department (Links to an external site.) (The Guardian) - Development and coding * Coders (programmers), can help with the research and visualization * Coordinate who can write and code * Example: Connected China (Links to an external site.) (Reuters) - Designing and visualizing * Designers can make visualizations happen * Example: 99% vs 1% (Links to an external site.) (The Guardian) ===== Teams ===== - Lone rangers * one person does everything * possible by all the tools of the DDJ ecosystem: * [[http://openrefine.org/|OpenRefine]], * [[https://datawrapper.de/|Datawrapper]], * [[http://www.tableausoftware.com/|Tableau]], * [[http://www.google.com/drive/apps.html#fusiontables|Google Fusion Tables]], * [[http://cartodb.com/|CartoDB]] - Two-person teams * [[http://www.theguardian.com/world/interactive/2012/may/08/gay-rights-united-states|Gay rights in the US, state by state]], the Guardian - Small scale teams * Is able to produce innovative projects quickly * Part of the newsdesk * Example: [[http://project.wnyc.org/flooding-sandy-new/#12.00/40.7378/-74.0702|'Flooding and Flood Zones' map Hurricane Sandy]] (WNYC) - Larger teams * Has a deliberate strategy to create a new kind of online journalism * They help with finding ways to tell the story better * Part of the newsdesk * Example: [[http://london2012.nytimes.com/|2012 Olympic Experience]], New York Times Readings in the Guardian * [[http://www.theguardian.com/news/datablog/2012/may/24/data-journalism-punk|Anyone can do it. Data journalism is the new punk]] * [[http://www.theguardian.com/news/datablog/2012/mar/28/data-visualisation-tools-free|Data visualisation DIY: our top tools]] * {{youtube>h2zbvmXskSE}} Data-journalists are the new punks: Simon Rogers at TEDxPantheonSorbonne * [[http://www.theguardian.com/news/datablog/2010/jul/16/data-plural-singular|Data are or data is?]] ===== Writing ===== [[https://owl.english.purdue.edu/owl/resource/735/04/|The Inverted Pyramid Structure]] at Purdue University For data journalism, the 5 W's have never been so important: - Who? (1:30) * Where did the data come from? * The most important W * Transparency about the source is critical * Do not blindly trust accuracy of supplied data - What? (3:05) * What are you trying to say? * What points are you trying to get across? * Your stories are aimed at the general public * Example: [[http://quoi.info/wp-content/uploads/data_art/en/|Art Market for Dummies]] (Askmedia.fr) * Your job is to bridge the gap between the data and the user - When? (4:28) * How old is your data? * Near real-time data * [[http://homicides.redeyechicago.com/|Chicago homicides database]] * Example story based on this database "[[http://articles.redeyechicago.com/2013-09-04/news/41773469_1_3-homicides-redeye-data-south-side-community-area|Englewood sees 3 homicides in 4 days]], Redeye Chicago * Social media data * [[https://twitter.com/intent/user?screen_name=J_tsar|Twitter account Boston Marathon Bomber]] * read [[wp>Dzhokhar_and_Tamerlan_Tsarnaev|Dzhokhar and Tamerlan Tsarnaev]] * {{ :dzhokhar-tsarnaev2.jpg?200|}} Example story "[[http://qz.com/76442/we-know-when-dzhokhar-tsarnaev-sleeps/|We know when Dzhokhar Tsarnaev sleeps]]", Quartz * Sensor journalism * [[http://project.wnyc.org/cicadas/|Cicada Tracker]], WNYC * Full article, "[[https://source.opennews.org/en-US/learning/people-power-prevails/|People Power Prevails!]]" (Source) * Crowdsourced reporting * [[http://www.theguardian.com/news/datablog/interactive/2011/aug/09/uk-riots-incident-map|UK riots: every verified incident - interactive map]], The Guardian * [[http://www.theguardian.com/news/datablog/2011/dec/09/data-journalism-reading-riots|Data journalism reading the riots: what we know. And what we don't]], The Guardian * [[http://www.theguardian.com/world/datablog/2012/jan/30/occupy-movement-map-protests-active|Occupy movement: Help us map which protests are still active]], The Guardian * [[http://www.theguardian.com/news/datablog/interactive/2012/oct/30/superstorm-sandy-incidents-mapped|Superstorm Sandy: every verified event mapped and detailed]], The Guardian * [[http://www.theguardian.com/news/datablog/interactive/2012/may/11/broadband-map-britain|Broadband Britain mapped: your web speeds]], The Guardian * Each of these projects was based on a simple journalistic tool with a few steps: - A Google form embedded on a page to collate reader responses; - That data edited by journalists to compensate for repeated or offensive data; - Data imported into [[http://www.google.com/drive/using-drive/#fusiontables|Google Fusion Tables]]; - The data feeding a map which updates every time it’s reloaded. - Where? (07:53) * A key part of data journalism is the ability to ‘mash up’ different datasets to create a new story. * [[http://www.theguardian.com/news/datablog/interactive/2012/jul/22/gun-ownership-homicides-map|The gun ownership and gun homicides murder map of the world]], The Guardian - Why? (8:48) * Hardest question * Data journalism is less good at correlating that data to produce a cause and effect analysis. ====== 데이터저널리즘 ====== 오픈데이터저널리즘 빅데이터저널리즘 데이터중심(기반)저널리즘 Computer assisted = fact gathering 여기에 더 나아가 인터액티브한 결과물을 도출하는 과정 따라서 데이터저널리즘 = * __데이터의 수집, 정리, 분석, 시각화, 스토리화 등의 과정을 통해 저널리즘을 실천하는 행위들__ 데이터 기반한 인포그래픽, 스토리텔링 -> 데이터마이닝, 데이터 전문가의 역할이 중요해 짐 __필요성__ * 정보불평등의 해결책 * 저널리즘의 개관성과 신뢰도 향상 * http://hot.coroke.net 충격, 경악, 황당 등 자극적이고 무의미하며 선정적인 단어로 구성된 헤드라인 사용 * 깊이 있는 정보 (사실과 사실 밑에 있는 것들 (embedded)) __역할__ * 저널리즘 객관성 강화 * 데이터의 공공적인 성격을 부각시켜 공개화 추구 (open 데이터) -> 정부, 기업의 투명성 * 국가안보, 국익, 프라이버시, 국민의 알권리 등과 결합하여 충돌 * 개방에 걸림돌이 되고 있음 (Lehtonen, 2011) * wikileaks.com * [[http://www.nature.com/nature/journal/v494/n7436/full/nj7436-271a.html|science communication: from page to screen]] * 데이터 수집과 참여에 개인이 참여 -> 개인의 정치, 사회적인 활동 강화 * 스토리텔링을 보여주기 위한 기술의 강화 (증강현실 활용 등) * {{youtube>hVimVzgtD6w|Video of the Week}} * https://wikileaks.org/ * http://news.coroke.net/ ====== 데이터저널리즘과 관련된 주제 ====== 디지털시대 뉴스 유통과 알고리즘