Top 2013 LinkedIn Groups for Analytics, Big Data,
Data Mining, and Data Science
We revisit our analysis of the top 30 LinkedIn groups for Analytics, Big Data, Data Mining, and Data Science and identify the largest, fastest growing, and most active groups. In 2013 the growth rate of the top groups more than doubled, and growth rate was correlated with activity level.
By Gregory Piatetsky, Dec 12, 2013.
We revisited our analysis of the Top 30 LinkedIn Groups for Analytics, Big Data, Data Mining, and Data Science from April 2013 and extracted raw weekly data (the number of members, comments, and discussions) up to Nov 25, 2013.
The 7 groups with the largest number of members (as of Nov 25, 2013) were:
- Advanced Business Analytics, Data Mining and Predictive Modeling: 100,437
- Big Data / Analytics / Strategy / FP&A / S&OP ...: 80,778
- Big Data and Analytics: 57,205
- Business Analytics: 47,967
- Data Mining, Statistics, Big Data, and Data Visualization: 32,865
- BIG DATA Professionals - Architects Scientists Analytics Experts: 25,349
- Next Gen Market Research (NGMR): 22,803
Note: You can get the actual data from the HTML source code of the LinkedIn group Statistics/Activity page: look for dataset seriesName="Comments" and parse that data. The same works for Discussions and Members. See the table with more details and a link to download the raw data at the end of this post. Thanks to KDnuggets intern Anmol Rajpurohit for collecting the membership, comments, and discussions data.
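The parsing step in the note can be sketched in a few lines of Python. This is only an illustration: it assumes the page source contains chart XML in which each dataset element holds one set element with a value attribute per week. That layout, the function name extract_series, and the sample fragment are all assumptions, not something confirmed by LinkedIn.

```python
import re


def extract_series(html: str, series_name: str) -> list[float]:
    """Return the numeric values of one chart series found in the page source."""
    # Locate the <dataset seriesName="..."> ... </dataset> block for this series.
    block = re.search(
        r'<dataset[^>]*seriesName="%s"[^>]*>(.*?)</dataset>' % re.escape(series_name),
        html,
        flags=re.DOTALL | re.IGNORECASE,
    )
    if block is None:
        return []
    # Assumed layout: one <set value="..."/> element per week.
    return [float(v) for v in re.findall(r'<set[^>]*\bvalue="([\d.]+)"', block.group(1))]


# Hypothetical fragment of a saved Statistics/Activity page
sample = """
<dataset seriesName="Comments"><set value="12"/><set value="7"/></dataset>
<dataset seriesName="Discussions"><set value="20"/><set value="25"/></dataset>
"""

comments = extract_series(sample, "Comments")
discussions = extract_series(sample, "Discussions")
print(comments, discussions)
```

If the real page escapes the markup or uses a different element layout, the two regular expressions would need to be adjusted accordingly.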
The most interesting finding is the acceleration of growth in 2013. Prior to 2013, the top 30 groups added on average 36 members/month; in 2013 the growth more than doubled, to 80 members/month!
Here are the 7 groups with the fastest growth in 2013 (annualized):
- Data Scientists: 148%
- RDataMining: 125%
- Data Mining, Statistics, Big Data, and Data Visualization: 114%
- BIG DATA Professionals - Architects Scientists Analytics Experts: 96.2%
- Big Data and Analytics: 93%
- Big Data / Analytics / Strategy / FP&A / ...: 92%
- Healthcare Data Mining and Modeling: 84%
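The post does not spell out how the growth figures were annualized. One simple possibility, shown here purely as an assumption, is to scale the growth observed between Jan 1 and Nov 25, 2013 linearly to a full year. The function name and the member counts below are hypothetical.

```python
from datetime import date


def annualized_growth(members_start: int, members_end: int,
                      start: date, end: date) -> float:
    """Linearly scale the observed percentage growth to a 365-day year."""
    observed = (members_end - members_start) / members_start
    days = (end - start).days
    return 100.0 * observed * 365.0 / days


# Hypothetical group that grew from 10,000 to 15,000 members during 2013
g = annualized_growth(10_000, 15_000, date(2013, 1, 1), date(2013, 11, 25))
print(f"annualized growth: {g:.1f}%")
```

A compounding formula would give somewhat different numbers, so figures computed this way are only comparable with each other.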
The chart below shows group growth vs group size. Color corresponds to age - redder is younger, bluer is older.
LinkedIn provides 2 measures of activity: discussions/week (actually posts) and comments/week. In 2013 there were 83% more discussions than comments: on average 2.57 discussions per 1000 members per week, but only 1.40 comments per 1000 members per week. This suggests that many, perhaps most, posts receive no comments.
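The arithmetic behind this comparison can be checked directly from the two averages quoted above. The sketch also computes a lower bound on the share of posts without comments, under the assumption that no post receives more than one comment; since comments usually cluster on a few posts, the real share is likely higher.

```python
discussions_per_1000 = 2.57  # average discussions per 1000 members per week (from the text)
comments_per_1000 = 1.40     # average comments per 1000 members per week (from the text)

# Relative excess of discussions over comments
excess = discussions_per_1000 / comments_per_1000 - 1

# Even if every comment went to a different post, this share of posts gets none
min_share_uncommented = 1 - comments_per_1000 / discussions_per_1000

print(f"{excess:.1%} more discussions than comments")
print(f"at least {min_share_uncommented:.1%} of posts without comments")
```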
The chart below shows average comments/week vs average discussions/week for all 30 groups, with a circle size proportional to group size and darker color corresponding to faster growth. Group name abbreviations and full stats are in the table at the bottom of the post.
We note that most groups have between 1 and 3 discussions / 1000 members / week.
Several groups stand out: KDnuggets has the highest number of discussions per 1000 members, while NGMR (and RDM) have the highest number of comments per 1000 members. Some groups (e.g., "Actuary") seem to have a rather low level of both discussions and comments.
Since most activity is generated by a small number of people, it would be interesting to see how many people contribute to discussions and comments, that is, the active core of the group. Perhaps LinkedIn could add such a statistic in the future?
The groups with the highest activity levels (measured as the sum of discussions/1000 members and comments/1000 members per week) are:
|Group||Activity||Comments/1000 members/week||Discussions/1000 members/week|
|KDnuggets Analytics and Data Mining||12.7||2.5||10.2|
|BIG DATA Professionals - Architects Scientists Analytics Experts (BD Prof)||8.4||1.6||6.8|
|Predictive Analytics Network||7.0||1.0||6.0|
|Data Mining, Statistics, Big Data, and Data Visualization||6.1||2.4||3.7|
|Next Gen Market Research (NGMR)||5.5||3.5||2.0|
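The activity measure is just the sum of the two per-1000-member rates. A minimal sketch, assuming the two right-hand columns of the table are comments and discussions per 1000 members per week, in that order (an inference from the surrounding text), and using the group abbreviations from the full table:

```python
# (comments, discussions) per 1000 members per week, copied from the table
groups = {
    "KDnuggets": (2.5, 10.2),
    "BD Prof": (1.6, 6.8),
    "PAN": (1.0, 6.0),
    "DM Stat": (2.4, 3.7),
    "NGMR": (3.5, 2.0),
}

# Activity level = comments rate + discussions rate
activity = {name: round(c + d, 1) for name, (c, d) in groups.items()}

# Rank groups from most to least active
ranked = sorted(activity, key=activity.get, reverse=True)

for name in ranked:
    print(f"{name:10s} {activity[name]:5.1f}")
```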
Note: one reason for such a high level of discussions (posts) on KDnuggets is that I also post all www.kdnuggets.com/news blog items to the KDnuggets LinkedIn group. However, many others post on KDnuggets as well.
We can also chart growth vs activity level and observe that for most groups, activity seems to be proportional to growth. KDnuggets is again an outlier with a much higher number of discussions. The color corresponds to age, with blue for older groups and red for younger ones.
Another interesting measure of group engagement is the ratio of comments to discussions. According to this measure, the top 7 groups are:
|Group||Comments per discussion||Comments/1000 members/week||Discussions/1000 members/week|
|Advanced Analytics, Predictive Modeling & Statistical Analyses||6.1||2.0||0.3|
|Pattern Recognition, Data Mining, Machine Intelligence||4.3||1.6||0.4|
|Machine Learning Connection||2.3||1.4||0.6|
|Predictive Model Markup Language (PMML)||2.25||0.6||0.2|
|Next Gen Market Research (NGMR)||1.7||3.5||2.0|
|Research Methods and Data Science||1.4||1.8||1.3|
|Big Data and Analytics||1.25||1.9||1.6|
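The engagement ratio can be recomputed from the rounded per-1000-member rates in the same rows. This sketch assumes the last two columns are comments and discussions, in that order, and uses the abbreviations from the full table. The recomputed values differ slightly from the listed ones (for example 2.0 / 0.3 is about 6.7 rather than 6.1), presumably because the listed ratios were computed from unrounded data.

```python
# (listed ratio, comments, discussions), copied from the table
rows = {
    "Adv AP": (6.1, 2.0, 0.3),
    "PR": (4.3, 1.6, 0.4),
    "ML Conn": (2.3, 1.4, 0.6),
    "PMML": (2.25, 0.6, 0.2),
    "NGMR": (1.7, 3.5, 2.0),
    "RMA": (1.4, 1.8, 1.3),
    "Big Data & A": (1.25, 1.9, 1.6),
}

# Comments per discussion, recomputed from the rounded rates
recomputed = {name: c / d for name, (_, c, d) in rows.items()}

for name, (listed, _, _) in rows.items():
    print(f"{name:13s} listed={listed:<5} recomputed={recomputed[name]:.2f}")
```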
Here is the table with all 30 LinkedIn groups, with group abbreviation following the group name.
You can also download the raw data on the top 30 LinkedIn groups, in CSV format. Let me know what else you find!
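|Group (abbreviation)||Members (Nov 25, 2013)||Created||2013 growth (annualized)||Growth ratio, 2013 vs. pre-2013||Comments/1000 members/week||Discussions/1000 members/week|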
|Advanced Business Analytics, Data Mining and Predictive Modeling (Adv BA)||100,437||28-Sep-07||72.6%||3.8||2.1||1.8|
|Big Data / Analytics / Strategy / FP&A / S&OP / Strategic Planning / Predictive & Business Analytics Group (Big Data ASFSSP)||80,778||20-Feb-09||91.8%||3.53||1.5||1.7|
|Big Data and Analytics (Big Data & A)||57,205||1-Mar-12||92.6%||0.77||1.9||1.6|
|Business Analytics (Biz Analytics)||47,967||3-Mar-08||44.5%||2.14||1||1.6|
|Data Mining, Statistics, Big Data, and Data Visualization (DM Stat)||32,865||25-Jul-08||114%||5.05||2.4||3.7|
|BIG DATA Professionals - Architects Scientists Analytics Experts (BD Prof)||25,349||1-Sep-08||96.2%||4.15||1.6||6.8|
|Next Gen Market Research (NGMR) (NGMR)||22,803||26-Sep-07||19.2%||1||3.5||2|
|Business Intelligence & Analytics Group (BI&A)||20,000||6-Jan-08||4%||0.2||2.2||2|
|SAS Analytics & BI (SAS A&BI)||19,686||25-Jun-08||40.9%||1.84||0.8||1.1|
|Machine Learning Connection (ML Conn)||19,215||12-Mar-08||58%||2.77||1.4||0.6|
|Global Analytics Network (Global A)||19,061||23-May-08||14.5%||0.66||0.3||2.4|
|Pattern Recognition, Data Mining, Machine Intelligence (PR)||14,024||2-Oct-08||64.6%||2.73||1.6||0.4|
|Actuary / Actuarial, Predictive Modeling, Data Mining, and Statistics News / Jobs / Careers Group (Actuary)||13,920||24-Sep-08||28.8%||1.22||0.3||1.3|
|SAS & Analytics Users (SAS Users)||13,417||13-Apr-08||32.9%||1.55||0.7||1.1|
|Research Methods and Data Science, formerly Research Methods and Analytics (RMA)||13,329||10-Apr-09||30.2%||1.12||1.8||1.3|
|Text Analytics (Text A)||12,931||2-Jun-08||28.9%||1.32||1.6||1.4|
|Data Science Central (DSC)||11,542||10-Feb-12||46.3%||0.41||1.3||3.6|
|Advanced Analytics (Adv A)||10,346||11-Jan-09||71.9%||2.84||0.7||2.6|
|Data & Text Analytics Professionals (D&TA Prof)||7,557||24-Sep-07||76.3%||4.01||0.6||2.3|
|Visual Analytics (Visual)||7,293||31-Mar-08||52.1%||2.47||1.4||2.3|
|Predictive Analytics Network (PAN) (PAN)||6,474||16-Mar-09||74.8%||2.83||1||6|
|Advanced Analytics, Predictive Modeling & Statistical Analyses (Adv AP)||6,423||10-Jul-08||40.3%||1.8||2||0.3|
|Lavastorm Analytics Community Group (Lavastorm)||6,102||17-Apr-11||24.1%||0.41||0.6||2.7|
|Data Scientists (Dscientists)||5,597||8-Jun-09||148%||5.26||2.4||6.8|
|KDnuggets Analytics and Data Mining (KDnuggets)||3,927||4-Feb-08||61.4%||3||2.5||10.2|
|Data Mining Technology (DMT)||3,688||20-Jun-08||33.2%||1.5||0.3||2.5|
|Predictive Model Markup Language (PMML) (PMML)||3,479||24-Sep-09||20.6%||0.67||0.6||0.2|
|Healthcare Data Mining and Modeling (Healthcare)||3,190||11-Jul-08||83.6%||3.72||0.5||0.5|
|Business Intelligence Tools (BI Tools)||2,823||2-Jul-08||49.1%||2.2||0.3||2.9|
Another interesting aspect is the behavior of subgroups. Two of the largest groups
- Advanced Business Analytics, Data Mining and Predictive Modeling
- Big Data / Analytics / Strategy / FP&A ...