The LIONbook on machine learning and optimization, written by co-founders of LionSolver software, is provided free for personal and non-profit usage. Chapter 15 looks at Dimensionality reduction by linear transformations (projections).
The LIONbook on machine learning and optimization, written by co-founders of LionSolver software, is provided free for personal and non-profit usage. Chapter 14 looks at Self-organizing maps.
Python displacing R as The Language for Data Science; This #BigData application will grow! What distinguishes data science from statistics? Bottom-up (data-driven) vs top-down ; Rifiniti: Sr. Machine Learning Developer, cutting edge tech
International Journal of Big Data Intelligence (IJBDI) is a peer reviewed multidisciplinary international journal publishing original and high-quality articles covering a wide range of topics in big data intelligence.
icrunchdata compiled an interesting index to help visualize the present state and future job growth trends in Analytics, Big Data, Business Intelligence, Data Science, Software Development and Statistics.
The leaders of ASA - American Statistical Association discuss their view on Big Data, 3 reasons why statistical community seems to be disconnected from the Big Data movement, and how they plan to fix it.
Thanksgiving and Big Data Cartoon; Harvard Data Science Course resources - free and online; KDnuggets Poll; What Lies Ahead for Big Data and Analytics; Top research confrences, and other analytics/data mining news.
As traditional techniques often fail to identify fraudulent behavior, social network analytics offers new insights in the propagation of fraud through a network - watch this short overview.
R Guru Ajay Ohri list of 50 R functions to clear a basic interview; Netflix #BigData Platform as a Service architecture; Harvard Data Science Course, free resources online; Google Chairman Eric Schmidt on why Data Analytics is the Future
HedgeChatter has launched an online SaaS dashboard that allows investors, traders and hedge funds to see how key influencers on social media are affecting stock price and view price trends based on real-time social media data (chatter).
Harvard Data Science Course excellent lectures/slides free online; Data Science and Text Mining with R - very useful 27-page overview ; Must read: Deep Learning 101; Top Schools for MS in Data Science
Online assessment and downloadable assessment guide enable organizations to determine the maturity of their big data analytics program; guide offers best practices for moving to the next level. Take the assessment free, online.
Must read for Data Scientists: Deep Learning 101; The "Pythonization" of scientific computing and data analysis; Python for Data Science - a walkthrough of a complete project
New KDnuggets Poll: Where did you apply analytics/data mining in 2013? KDnuggets review of Analytics App Marketplaces; My report on Boston DataFest and the next Big Thing in Big Data, and more.
While big data certainly brings changes to data science, major data science principles remain unchanged regardless of the data size. Watch leading experts David Smith from Revolution Analytics and Gregory Piatetsky from KDnuggets discuss key data science principles.
New book by leading analytics professionals shows how to get the most from IBM SPSS Modeler, using detailed step-by-step examples to help you build the models you can deploy in your business.
We review Analytics App Marketplaces from Alteryx, Amazon (AWS), BigML, Datameer, RapidMiner, and Windows Azure. Who will create the next iTunes for Analytics?
Must read for Data Scientists: Deep Learning 101 (hot algorithm that wins competitions); Huge web graph publicly available for research, 3.5B web pages); Scandal: Due to bad data analysis, ~25% of studies may be false; Statistics is the *least* important part of data science
Data Driven Business recently interviewed forward thinking text analytics professionals from leading companies like Bank of America, Home Depot and PayPal, on challenges they are face, overcoming them, and the industry as a whole.
This guest post examines Insight and Analytic functions and what they need to effectively evolve by addressing key elements of the Insight and Analytics Value Chain(tm).
LIONbook Ch. 13: Bottom-up clustering, part of The LIONbook on machine learning; Databases for text analysis: archive and access texts using SQL and python ; Data Science vs Data Scientists, Data Analysts and BI Practitioners; Top 10 Blogs in 2013 from The #BigData Institute
The LIONbook on machine learning and optimization, written by co-founders of LionSolver software, is provided free for personal and non-profit usage. Chapter 13 looks at Bottom-up (agglomerative) clustering.
Highlights from a significant new report by Decision Management Solutions on Predictive Analytics in the Cloud: Opportunities, Trends and Big Data Impact. The top driver was reduced cost while data security and privacy remain the primary obstacles reported. Download this free report.
5 Fundamental Concepts of Data Science; 11 TED talks explore the dark side of #BigData ; Nobel winner Daniel Kahneman: humans are BAD intuitive statisticians; Data Science Workflow - Overview and Challenges
Twitter and Quantum Physics connection; John Tukey "Badmandments"; Strata 2013 videos; Chordalysis: a new method to discover the structure of data, and more analytics/data mining news.
Booz Allen Field Guide to Data Science (free download); Star Data Scientist Hilary Mason plans on starting her own company; Learn Data Science in 12 Intense Weeks at Zipfian Academy
The guide includes an introductory section, the practitioners guide to Data Science, a first hand account of life as a Data Scientist, tips and tricks, and an overview of successful data science solutions. Free Download.
Google Python Lessons are awesome and available online, for free! ; Data Analysis course now open at Coursera; The worst part of working at Google, for many people: overqualified; Bristol: Senior postdocs in machine learning/data mining, health applications
They contain personal accounts by the Medical Officers and statistical data in the form of graphs, tables and charts, offering a rich source of material for public health research.
Find out 4 Steps to successfully evaluating business analytics software: the differences between BI/Analytics stacks, how to choose technology that will scale to your long-term requirements, and more.
The most read articles include why Why Big Data Won't Cure Us, Data Science and its Relationship to Big Data, The Quantified Self, and Apache Drill: Interactive Ad-Hoc Analysis at Scale.
Venture capital in an age of algorithms: using data science to fund startups; Stanford Big Data Mining, Finance, Statistics Courses Online; How data mining helped GM limit a recall to just 4 (four) cars; 10 strangest data findings: unusual color cars are more reliable
Videos from 2013 Strata Big Data Conference + Hadoop World ; My answer to What are the top 10 data mining or machine learning algorithms? ; Star Data Scientist Hilary Mason on her favorite iPhone data app; Really Big, #BigData Job growth infographic
"Badmandments" from great statistician John Tukey: NEVER plan any analysis before seeing data; DONT consult with a statistician until after collecting data; LARGE enough samples always tell the truth.
How GraphChi algorithm on a Mac Mini outperformed a 1,636 Node Hadoop Cluster; These 6 startups want to disrupt #BigData world; The Mathematical Shape of Big Science Data; 10 #BigData case studies