KDnuggets Social Network in NodeXL, May 2014
We examine the KDnuggets Twitter social network, as generated by NodeXL, looking at clusters, top Twitter accounts, URLs, hashtags, and words, and ask what it all means.
By Gregory Piatetsky, @kdnuggets, May 29, 2014.
NodeXL is a free, open-source template for Microsoft Excel 2007, 2010 and 2013 that makes it easy to explore network graphs. NodeXL is a product/service of Connected Action, which is headed by Marc Smith (@Marc_smith), Director of the Social Media Research Foundation.
Here is the KDnuggets Twitter Social Network, as visualized in NodeXL on May 25, 2014.
The graph represents a network of 1,396 Twitter users whose tweets between Apr 24 and May 25, 2014 contained "kdnuggets", or who were replied to or mentioned in those tweets.
Some of the findings
- The Twitter users most active in the KDnuggets network: @yvesmulkers, @kirkdborne, @hey_anmol
- Top shared content (in cluster G1) consists of infographics, cartoons, or other visual content
- There is a large connected cluster (G1) with 506 of the 1,396 users (about 36%) that carries the top hashtags; the maximum diameter in that cluster is 3 (small world).
- Another cluster (G3) is more associated with #ff and #fs
- Cluster G2 consists of single-vertex components. Marc Smith described it as a "brand" cluster: people are talking about @KDnuggets.
- Better interpretation tools are needed to make sense of such graphs!
An experimental interactive version of this graph is here.
There is an edge for each "replies-to" relationship in a tweet, an edge for each "mentions" relationship in a tweet, and a self-loop edge for each tweet that is not a "replies-to" or "mentions".
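The edge rules above can be sketched in Python. This is only an illustration: the `Tweet` record and its field names are hypothetical and are not NodeXL's import format, and it assumes the `mentions` field lists users other than the one being replied to.

```python
from typing import NamedTuple, Optional, Tuple


class Tweet(NamedTuple):
    # Hypothetical minimal tweet record; the field names are assumptions.
    author: str
    reply_to: Optional[str] = None
    mentions: Tuple[str, ...] = ()  # assumed to exclude the replied-to user


def tweet_edges(tweet):
    """Return the directed (source, target, relationship) edges for one tweet."""
    edges = []
    if tweet.reply_to:
        edges.append((tweet.author, tweet.reply_to, "replies-to"))
    for user in tweet.mentions:
        edges.append((tweet.author, user, "mentions"))
    if not edges:
        # A tweet that neither replies to nor mentions anyone becomes a self-loop.
        edges.append((tweet.author, tweet.author, "tweet"))
    return edges


plain = tweet_edges(Tweet("alice"))
reply = tweet_edges(Tweet("alice", reply_to="bob", mentions=("carol",)))
```

Under these rules a single tweet can contribute several edges, which is one reason the total edge count can exceed the number of unique edges.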
The graph is directed. The graph's vertices were grouped by cluster using the Clauset-Newman-Moore cluster algorithm. The graph was laid out using the Harel-Koren Fast Multiscale layout algorithm.
Overall Graph Metrics:
- Vertices: 1396
- Unique Edges: 1608
- Total Edges: 3487
- Reciprocated Vertex Pair Ratio: 0.06
- Reciprocated Edge Ratio: 0.12
- Connected Components: 336
- Maximum Vertices in a Connected Component: 957
- Maximum Edges in a Connected Component: 2697
- Maximum Geodesic Distance (Diameter): 8
- Average Geodesic Distance: 2.5
- Graph Density: 0.0008
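For readers who want to see how density and the two reciprocity ratios relate, here is a minimal sketch using commonly used definitions. It assumes duplicate edges are collapsed and self-loops are dropped; NodeXL's exact conventions are not stated in this post, so treat the function as illustrative only.

```python
def graph_metrics(edges, n_vertices):
    """Density and reciprocity for a directed graph given as (source, target) pairs."""
    # Assumption: collapse duplicate edges and ignore self-loops.
    unique = {(s, t) for s, t in edges if s != t}
    # Edges whose reverse edge is also present.
    reciprocated_edges = sum(1 for s, t in unique if (t, s) in unique)
    # Vertex pairs joined by an edge in at least one direction.
    connected_pairs = {frozenset(edge) for edge in unique}
    # Each reciprocated pair accounts for exactly two reciprocated edges.
    reciprocated_pairs = reciprocated_edges // 2
    return {
        "density": len(unique) / (n_vertices * (n_vertices - 1)),
        "reciprocated_vertex_pair_ratio": reciprocated_pairs / len(connected_pairs),
        "reciprocated_edge_ratio": reciprocated_edges / len(unique),
    }


# Toy graph: a<->b is reciprocated, a->c and c->d are not, a->a is a self-loop.
toy = [("a", "b"), ("b", "a"), ("a", "c"), ("c", "d"), ("a", "a")]
metrics = graph_metrics(toy, n_vertices=4)
```

With these definitions, a vertex pair ratio r implies an edge ratio of 2r / (1 + r), which is roughly consistent with the reported 0.06 and 0.12 once rounding is allowed for. Likewise, 1608 unique edges over 1396 × 1395 ordered vertex pairs gives about 0.0008, in line with the reported density.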
Top 10 Vertices and their Betweenness Centrality, ranked by Betweenness Centrality
- @kdnuggets, 890,962
- @yvesmulkers, 35,998
- @kirkdborne, 26,751
- @hey_anmol, 22,420
- @dego963, 15,224
- @enterknowledge, 11,410
- @infomgmtexec, 11,404
- @hajozaki, 10,459
- @chazard, 10,380
- @ramirogoncalez, 9,858
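Betweenness centrality counts how often a vertex lies on shortest paths between other vertices, which is why @kdnuggets dominates the ranking. The sketch below uses Brandes' algorithm on an unweighted graph treated as undirected. That treatment is an assumption, as the post does not say how NodeXL handles edge direction or duplicate edges, and the toy graph is invented for illustration.

```python
from collections import deque


def betweenness(adj):
    """Unnormalized betweenness centrality (Brandes' algorithm).

    adj maps each vertex to its neighbours; the graph is assumed to be
    undirected and unweighted, and every neighbour must also be a key of adj.
    """
    score = {v: 0.0 for v in adj}
    for s in adj:
        # Breadth-first search from s, counting shortest paths (sigma).
        order = []
        preds = {v: [] for v in adj}
        sigma = {v: 0 for v in adj}
        dist = {v: -1 for v in adj}
        sigma[s] = 1
        dist[s] = 0
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        # Accumulate path dependencies in reverse BFS order.
        delta = {v: 0.0 for v in adj}
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                score[w] += delta[w]
    # Each undirected shortest path was counted once from each endpoint.
    return {v: c / 2 for v, c in score.items()}


# Toy star graph: every path between two leaves passes through the hub.
star = {"hub": ["a", "b", "c"], "a": ["hub"], "b": ["hub"], "c": ["hub"]}
scores = betweenness(star)
```

In the star graph the hub sits on all three leaf-to-leaf shortest paths, while the leaves sit on none, a small-scale version of the gap between @kdnuggets and the other accounts.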
Top URLs in Tweet in Entire Graph:
- tweetedtimes.com/#!/kdnuggets/analytics-data-mining
- www.kdnuggets.com/2014/05/big-data-landscape-v30-analyzed.html
- www.kdnuggets.com/2014/04/9-free-books-learning-data-mining-data-analysis.html
- www.kdnuggets.com/2014/05/white-house-report-big-data-opportunities-values.html
- www.kdnuggets.com/2014/05/stacking-deck-next-wave-opportunity-big-data.html
- www.kdnuggets.com/2014/05/social-media-web-analytics-summit-san-francisco-talks-day-1.html
- This 1981 Computer Magazine Cover Explains Why We're So Bad at Tech Predictions, Tweeted via Buffer
- www.kdnuggets.com/2014/04/big-data-vs-privacy-fico-interview.html
- www.kdnuggets.com/2014/04/cartoon-data-scientist-salary-negotiation.html
- 9 Free Books for Learning Data Mining and Data Analysis, via twitterfeed
Top Hashtags in Tweet in Entire Graph:
- #bigdata
- #analytics
- #data
- #datamining
- #bigdataco
- #datascience
- #rstats
- #ff
- #hadoop
- #datascientist
Top Word Pairs and Count
- big data, 509
- data mining, 202
- via kdnuggets, 197
- data scientist, 145
- summit 2014, 106
- innovation summit, 101
- day 1, 90
- data analysis, 88
- talks day, 85
- 9 free, 85
- free books, 85
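A word-pair count of this kind can be approximated by counting adjacent words in each tweet. The sketch below is an assumption about the general technique, not NodeXL's implementation (which may, for example, filter stop words), and the sample strings are invented for illustration rather than taken from the collected tweets.

```python
import re
from collections import Counter


def word_pairs(tweets):
    """Count adjacent word pairs (bigrams) across a list of tweet texts."""
    counts = Counter()
    for text in tweets:
        # Lowercase and keep alphanumeric runs; punctuation and "@" are dropped.
        words = re.findall(r"[a-z0-9]+", text.lower())
        counts.update(zip(words, words[1:]))
    return counts


# Invented sample texts, loosely modelled on the post titles above.
sample = [
    "9 Free Books for Learning Data Mining and Data Analysis via @kdnuggets",
    "Big Data vs Privacy, via @kdnuggets",
    "Big Data Innovation Summit 2014: Talks, Day 1",
]
pairs = word_pairs(sample)
top_pairs = pairs.most_common(3)
```

Pairs such as "via kdnuggets" and "talks day" arise naturally from retweeted post titles, which explains why several entries in the list share the same count.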
Cluster G1 (largest)
Summary: This is the main connected component, made up of tweets originating from @kdnuggets.
Details:
- Vertices: 506
- Unique Edges: 490
- Connected components: 1
- Single-vertex components: 0
- Maximum Geodesic Distance (Diameter): 3
- Average Geodesic Distance: 2.0
- Top hashtags: #bigdata #bigdataco #rstats #hadoop #datamining #bigdata2014 #data #datascientist #iembd #watson
- Top Words in Tweet: kdnuggets data big analytics mining via 2014 analysis scientist bigdata
- Top Mentioned in Tweet: @kdnuggets @masstlc @neilraden @mattturck @fivethirtyeight @justincholmes @tamr_inc @ibmbigdata @treycausey @kirkdborne
Top URLs in Tweet in G1
Many of the top URLs here are infographics and cartoons:
- time.com/60505/this-1981-computer-magazine-cover-explains-why-were-so-bad-at-tech-predictions/?utm_content=buffer20664&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer
- feedproxy.google.com/~r/kdnuggets-data-mining-analytics/~3/ztuSUx3F_es/9-free-books-learning-data-mining-data-analysis.html?utm_source=twitterfeed&utm_medium=twitter
- timoelliott.com/blog/2014/04/cartoon-analytics-reinvention.html?utm_content=buffer9679b&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer
- pages.sisense.com/rs/sisense/images/sisense_infographic_cheat_sheet_final_web.jpg?utm_content=bufferb73be&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer
- https://www.linkedin.com/today/post/article/20140416130501-1853953-the-convoluted-world-of-data-scientist?trk=eml-ced-b-art-M-0-8199978374031083480&midToken=AQGU5ahTQ3NsQw&fromEmail=fromEmail&ut=1myx1LozXPDSc1&utm_content=buffer0d84d&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer
- visual.ly/born-2010-how-much-left-me?utm_content=bufferf54e1&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer
- feedproxy.google.com/~r/kdnuggets-data-mining-analytics/~3/MZaqmAz7ZJ8/book-outlier-detection-temporal-data.html?utm_source=twitterfeed&utm_medium=twitter
- dilbert.com/strips/2014-05-05/?utm_content=buffer30ff4&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer
- www.kdnuggets.com/2014/05/big-data-landscape-v30-analyzed.html
- www.businessinsider.com/chocolate-consumption-vs-nobel-prizes-2014-4?utm_content=bufferf72ac&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer
Cluster G2 (single-vertex unconnected components)
Summary: The top URLs here are KDnuggets posts (reports on meetings and interviews) that were perhaps shared via the KDnuggets tweet button rather than via social media.
Details:
- Vertices: 267
- Unique Edges: 204
- Connected components: 267
- Single-vertex components: 267
- Maximum Geodesic Distance (Diameter): 0
- Average Geodesic Distance: 0
- Top hashtags: #bigdata #data #analytics #datascience #bigdataawards #datamining #socialmedia #businessintelligence #learning #tech
- Top Words in Tweet: data big analytics feedback 2014 interview highlights day mining innovation
- Top Mentioned in Tweet: none
Top URLs:
- www.kdnuggets.com/2014/05/social-media-web-analytics-summit-san-francisco-talks-day-1.html
- www.kdnuggets.com/2014/05/white-house-report-big-data-opportunities-values.html
- www.kdnuggets.com/2014/04/9-free-books-learning-data-mining-data-analysis.html
- www.kdnuggets.com/2014/04/book-social-media-mining-free-download.html
- www.kdnuggets.com/2014/04/massachusetts-big-data-report-2014.html
- www.kdnuggets.com/2014/04/elusive-data-scientists-driving-high-salaries.html
- www.kdnuggets.com/2014/04/big-data-vs-privacy-fico-interview.html
- www.kdnuggets.com/2014/05/interview-george-corugedo-trends-skills.html
- www.kdnuggets.com/2014/04/big-data-innovation-summit-2014-talks-day1.html
- www.kdnuggets.com/2014/05/media-embracing-analytics-innovation.html
Cluster G3 (small connected component)
Summary: Note that this cluster includes #ff and #fs tags - KDnuggets was frequently mentioned as part of #ff (Follow Friday) and #fs (Follow Saturday) tweets. The central node in this cluster is @kirkdborne.
Details:
- Vertices: 131
- Unique Edges: 247
- Connected components: 1
- Single-vertex components: 0
- Maximum Geodesic Distance (Diameter): 6
- Average Geodesic Distance: 2.57
- Top hashtags: #bigdata #analytics #datamining #datascience #ff #datascientist #hadoop #datawest14 #fs #rstats
- Top Words in Tweet: data big analytics feedback 2014 interview highlights day mining innovation
- Top Mentioned in Tweet: @kdnuggets @kirkdborne @marcusborba @merv @bigdatagal @sve_sic @data_nerd @yvesmulkers @mphnyc @salfordsystems
Top URLs in Tweet in G3:
- www.kdnuggets.com/2014/05/big-data-landscape-v30-analyzed.html
- www.kdnuggets.com/2014/05/stacking-deck-next-wave-opportunity-big-data.html
- www.kdnuggets.com/2014/04/cartoon-data-scientist-salary-negotiation.html
- www.informationweek.com/big-data/big-data-analytics/10-big-data-pros-to-follow-on-twitter/d/d-id/1252812?_mc=RSS_IWK_EDT&utm_source=dlvr.it&utm_medium=twitter
- www.kdnuggets.com/2014/05/video-series-data-mining-for-statisticians.html
- www.kdnuggets.com/2014/04/9-free-books-learning-data-mining-data-analysis.html
- www.kdnuggets.com/2014/05/data-literacy-education-the-information-economy.html
- www.asis.org/Bulletin/Jun-12/JunJul12_Hu.html
- www.masstech.org/sites/mtc/files/documents/Full%20Report%202014%20Mass%20Big%20Data%20Report_0.pdf
- www.masstech.org/
Cluster G4 (small connected component)
Summary: Central nodes in this cluster are @hey_anmol and @yvesmulkers.
Details:
- Vertices: 53
- Unique Edges: 56
- Connected components: 1
- Single-vertex components: 0
- Maximum Geodesic Distance (Diameter): 5
- Average Geodesic Distance: 2.8
- Top hashtags: #bigdata #analytics #interview #yarn #anaytics #publicpolicy #crowdsourcing #government #masstlc #hadoop
- Top Words in Tweet: data kdnuggets big yvesmulkers via analytics interview 2014 top hey_anmol
- Top Mentioned in Tweet: @kdnuggets @yvesmulkers @hey_anmol @talksumdata @drussell41 @kirkdborne @redpointglobal @georgecorugedo @ramirogoncalez @dutchlight360
Top URLs in Tweet in G4:
- www.kdnuggets.com/2014/05/lavastorm-forrester-transform-organization-strong-data-management.html?utm_content=buffera6590&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer
- https://nodexlgraphgallery.org/Pages/Graph.aspx?graphID=20047
- www.kdnuggets.com/2014/05/stacking-deck-next-wave-opportunity-big-data.html
- www.kdnuggets.com/2014/05/interview-george-corugedo-yarn-analytics.html
- www.kdnuggets.com/2014/05/interview-dale-russell-talksum-startup-award.html
- www.kdnuggets.com/2014/05/employee-churn-202-good-bad-churn.html?utm_content=bufferbea9b&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer
- www.kdnuggets.com/2014/05/nuodb-3-key-trends-dbms-market.html?utm_content=buffer44ad7&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer
- www.kdnuggets.com/2014/04/big-data-vs-privacy-fico-interview.html?utm_content=buffer6e5d6&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer
- www.kdnuggets.com/news/index.html
- www.kdnuggets.com/2014/05/code-for-india-2014-global-hackathon-google.html
Related:
- KDD-2013 NodeXL Twitter Social Network, updated
- KDnuggets Twitter Social Network
- Social Media & Web Analytics Innovation Summit 2014: Day 1 Highlights