L19 network graph
Posted: Sat Jan 29, 2011 11:24 pm
Inspired by the High-dan game network thread, I decided to try to graph the interaction between members of the forum (i.e. how often two members post in the same thread). It seemed like a fun weekend project, and I had just written a simplified html parser in Scheme for school so it was a good chance to put that to further use.
So, after writing some Python scripts to download threads from the forum to my computer and extract the names of the authors of each post, I had to figure out a way to graph it all. This took by far the longest time, as I had never done anything like this before so I had to research my options. After narrowing it down to igraph (which walleye used in the thread above) and Gephi, I decided Gephi would be easier to use. Finally, after figuring out how to format my data so it could be imported into Gephi, I was able to make the graphs below.
(Note: Click on an image to get the full size)
http://upload.wikimedia.org/wikipedia/commons/3/34/L19_January_png.png This is the graph for January. Every member with at least one post is represented, but a line is drawn between two members only if one member has posted in the same thread another member has at least three times. A larger line corresponds with more posts in common threads. The lines are also color-coded: yellow represents only slight interaction between two members, cyan represents moderate interaction, and purple represents a large amount of interaction. Nodes are sized and color-coded similarly. There is one caveat with this data: the information was gathered from only the first 20 posts of each thread.
http://upload.wikimedia.org/wikipedia/commons/c/ca/L19_alltime_png.png This is the graph for all time. Once again, every member with at least one post is represented, but a line is drawn between two members only if they have 30 posts in common threads.
This information may not be too practically useful, but I found it quite interesting to look at.
So, after writing some Python scripts to download threads from the forum to my computer and extract the names of the authors of each post, I had to figure out a way to graph it all. This took by far the longest time, as I had never done anything like this before so I had to research my options. After narrowing it down to igraph (which walleye used in the thread above) and Gephi, I decided Gephi would be easier to use. Finally, after figuring out how to format my data so it could be imported into Gephi, I was able to make the graphs below.
(Note: Click on an image to get the full size)
http://upload.wikimedia.org/wikipedia/commons/3/34/L19_January_png.png This is the graph for January. Every member with at least one post is represented, but a line is drawn between two members only if one member has posted in the same thread another member has at least three times. A larger line corresponds with more posts in common threads. The lines are also color-coded: yellow represents only slight interaction between two members, cyan represents moderate interaction, and purple represents a large amount of interaction. Nodes are sized and color-coded similarly. There is one caveat with this data: the information was gathered from only the first 20 posts of each thread.
http://upload.wikimedia.org/wikipedia/commons/c/ca/L19_alltime_png.png This is the graph for all time. Once again, every member with at least one post is represented, but a line is drawn between two members only if they have 30 posts in common threads.
This information may not be too practically useful, but I found it quite interesting to look at.