Exploring the chain of truth in explainers

Site here

Starting off my work for the interview assignment where I explored embedding citations in a page, this is the flip-side, how can you make it easy for other people to cite you as a source, e.g. when doing an explainer.

Of particular interest was following the chain of citation, maybe it’s okay to cite a wikipedia article if you see where that wikipedia article is citing itself from. In the webpage, try clicking cite with text in the paragraph with a citation and without.

Posted in All

Chris Borland & Concussions in the NFL

On Wednesday, March 18th Chris Borland of the San Francisco 49ers retired from professional football, citing fear of future head injuries. This set off a flurry of discussion both on ESPN and Reddit (where it garnered 3000 upvotes, making it one of the most popular threads of the day).

I wanted to investigate how public discussion on the injury and concussions differed from discussion on the Media. To do this, I used a tool I’ve been working on for research inspired by Media Clouds. The tool compares in real time word clouds from Twitter vs AP news sources and tries to highlight differences. The tool is still in BETA so I was unable to give an interactive demo, but here is a screenshot:

Screenshot 2015-03-18 01.16.51

On the left hand side is traditional news outlets, on the right is sources from twitter.

News sources seem to be covering Borland’s impressive career, including his college football record at Wisconsin. However, the discussion on Twitter is two-fold, one is expressing shock at the retirement and the second comparing him to Patrick Willis, another linebacker for the 49ers who retired only days earlier due to religious reasons. This aspect of the story is not covered very much in news outlets, but with Willis and Borland gone, 49ers fans have felt like their season has been derailed. Public discussion seems to focus more on the consequences to the 49ers season whereas news outlets are discussing the repercussions to the NFL as a whole.

Posted in All

Toucans & Turtles – A game of League of Legends

This is a piece reporting on a game of League of Legends (often shorted to League or LoL) on Sunday afternoon featuring Team Solo Mid (TSM) vs Team Liquid streamed live to over 300k viewers. Like most sports, the verbiage is confusing if you’re not a follower, I’ve tried my best to actually report on the game while also including references to help and following the player-based stories.

The following is a brief description of the game itself.

League of Legends is a 5v5 objective-based sport (there is no fixed time per game). Each team picks 5 characters out of a pool of over 100. Games of League can be categorized into 3 phases: early game: where players are pit individually against each other, mid game: where teams start to group and work together, and late game: where objectives and team movements are more important than individual skill.

Teams fight over objectives on the map such as towers or neutral creatures such as the Dragon and Baron. Doing enough damage to an enemy champion results in a ‘kill’ or a short power-play for the other team, during which they can take objectives. Objectives give ‘buffs’ which leads to greater strength for the team that takes them. When the final objective, the enemy’s nexus, is taken the game ends.

The positions for the game are: mid, top, jungle, marksmen and support, with each role playing a unique niche on the team. Each player has a role that they play as well as an in-game nickname, referred to in quotation marks for example, Thariq “trq” Shihipar.

Posted in All

Thariq’s Media Diary

Hypotheses

I come from an engineering/computer science background, so my interest in this assignment was mapping my news data programatically and doing interesting data visualizations. I’ve learned that data visualization is often easiest if you start with a hypothesis, because then you know what you’re looking for, and even the lack of a result tells you something. So I made two hypotheses about my news reading habits over the past week:

Hypothesis #1: I get most of my news from social sources, such as Facebook & Twitter. The ‘legitimate’ news source I read most often is the New York Times, not because I visit the website, but because the links that are shared tend to be from there. However, all of my news reading is dwarfed by Reddit usage (which is mostly not due to  .

Hypothesis #2: A large amount of news that I’ve read has to do with the Chapel Hill tragedy. Being a practicing Muslim and very involved in the community, the story of 3 muslims being killed execution-style in a possible hate crime has dominated my news feed for the past week.

Methods

Empirically, I know that I do almost all of my news reading online and on my computer, so I only analyzed data from my laptop computer.

I installed RescueTime and other existing tracking options, but found that they not useful in tracking the sources of my traffic, especially only for news. Instead, I decided to query my Chrome Search history to be able to trace my activity. This was done using node.js and sqlite3 to make it easily reproducable on other people’s computers if the data is interesting.

The GitHub repo for the code is here: https://github.com/ThariqS/FutureofNews-MediaVisualizations. It is currently poorly documented and doesn’t show you visualizations (only results) but I’m uploading it on GitHub to motivate myself to polish it up.

Results

Finding #1: Sports dominated my history results. 81% of my news-related history entries were sports related from ESPN & Reddit (I read a large amount of e-sports news on Reddit). The nature of the way I read sports news is fundamentally different from how I read traditional news. Sports games change more often than news stories, more time is spent tracking and staying updated (over 80% of my news-related history was for sport stories).

Finding #2: The news sites I visited

Screenshot 2015-02-18 01.14.16

Finding #3: The sources of my news could be divided into 4 different areas: (url bar, google, social and other news stories). Interestingly, I found that the largest trigger for me to read news was to be on a news site, many of the articles I read came as branches off news sites I was already on. Google and Twitter were close seconds. Unsurprisingly, little of my news came from me directly going to the urlbar and typing in ‘newyorktimes.com’.

Disappointingly, though I think much of my news is through Facebook, I was unable to get exact numbers on Facebook usage, due to a limitation of the Chrome history storage and how facebook processes links.

Screenshot 2015-02-18 12.33.33

Finding #4: Word Cloud/Topic Generation

Discerning the topics of the news articles I read was difficult, given that I only had access to the titles of the pages. Without being able to apply some more advanced natural language processing techniques, I simply made a wordcloud out of the title words. Some key highlights that stick out to me are: “Hackers”, “Isis”, “Senate”, “Muslim”, “Storm”

Screenshot 2015-02-18 12.18.09

Further Work & Weaknesses:

I know intuitively that a large amount of my news comes from Facebook, but it appears that this method I used is inaccurate in tracking links originating from Facebook.

Secondly, I would be interested in the news I read through osmosis, i.e. news I see through scrolling through Twitter & Facebook. I may be aware of these headlines even if I don’t know anything more than the introductory blurb for the story.

 

 

Posted in All