I see Reddit mentioned a lot on Hubski, including in conversations about some of the more unsavory subreddits and how they affect the site as a whole. This is just some interesting data to add to those discussions, since it's often hard to quantify how the site is affected by them.
Correct me if I'm wrong, but it mostly seems to show cross-posts. Interestingly enough, the biggest sub I moderate (/r/foodforthought) has damn near as many cross-posts as the biggest subreddit I used to moderate (/r/movies), despite the fact that the former has under 100k subscribers and the latter has like 7m.
Yep, just xposts: the original one works off of users linking within reddit, the later one off of xposts. It's interesting how that overvalues subs based around linking to reddit (like Hailcorporate). My friends and I have one hacked together that works off an account's comments; I'll post it once we can handle reddit's scale. Right now our dataset is only 800k, and we need to do some cleanup before we add more.
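For anyone curious what "works off an account's comments" could look like, here's a rough sketch. It is not the actual tool mentioned above, just one guess at the idea: two subreddits get a stronger link the more accounts have commented in both. The function name `subreddit_edges`, the `(account, subreddit)` input shape, and the sample data are all made up for illustration.

```python
from collections import Counter
from itertools import combinations


def subreddit_edges(comments):
    """Count, for each pair of subreddits, how many accounts commented in both.

    `comments` is an iterable of (account, subreddit) pairs. This input shape
    is an assumption for the sketch, not a real reddit API format.
    """
    # Collect the distinct subreddits each account has commented in.
    subs_by_account = {}
    for account, subreddit in comments:
        subs_by_account.setdefault(account, set()).add(subreddit)

    # Each account adds 1 to every pair of subreddits it is active in.
    edges = Counter()
    for subs in subs_by_account.values():
        for a, b in combinations(sorted(subs), 2):
            edges[(a, b)] += 1
    return edges


# Made-up sample data, purely illustrative.
sample = [
    ("alice", "movies"), ("alice", "foodforthought"),
    ("bob", "movies"), ("bob", "foodforthought"), ("bob", "games"),
    ("carol", "games"), ("carol", "movies"),
]
edges = subreddit_edges(sample)
for pair, weight in edges.most_common():
    print(pair, weight)
```

Counting each account once per pair (instead of once per comment or link) is one way to keep subs that exist mainly to link elsewhere on reddit from dominating the map, though a real dataset would probably also need some normalization by subreddit size.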
Good find. It's so cool to see where all the subreddits are relative to each other. It's also cool to see regions of the map where similar subreddits are lumped together. Here's the nsfw subreddits as a region on the map and here's where all the gaming subreddits are. Kinda funny how there's a void on the left of the map for /r/games, which is where all the nsfw stuff would be.