Statwing, Modern Data Analysis Software
Every decision maker in the organization needs to be capable of analyzing data, but most tools require a lot of mundane and time-consuming data cleaning. Statwing solves that problem and lets you focus on data analysis.
on Jan 30, 2014 in Automating, Data Exploration, General Social Survey, Statwing
Data Mining for Beginners Boot Camp, Salford video series
This series shows how to easily apply SPM software suite to your predictive modeling projects, using a modern banking application as an example. This series is at the beginner level, and is perfect for first-time users or for those who need a refresher course in model building and data analysis.
on Jan 29, 2014 in Beginners, CART, Gradient Boosting, Online Education, Predictive Modeling, Salford Systems, TreeNet
Top Trends in Analytics and Big Data ahead of Strata 2014 Santa Clara
Our survey ahead of Strata 2014 Conference revealed the top 3 trends in Analytics and Big Data for 2014: Analytics for the masses, Apache Spark, and Real Time Analytics. Read the interesting comments and details and get a KDnuggets discount for Strata.
on Jan 28, 2014 in 2014 Trends, Analytics for the masses, Apache Spark, Real-time, Strata
SAS surpasses $3 billion in 2013 revenue
The growth reflects strong sales of SAS Visual Analytics, anti-fraud solutions (44%), cloud computing (20%), data management and certain industry-specific applications (16-18%). For every one of its 38 years, SAS has grown revenue and shown a profit.
on Jan 27, 2014 in Business Analytics, Fraud Prevention, Revenue, SAS
Top stories for Jan 19-25: Learning from Data free online course; MassBigData; Split on Data Science Skills
Tutorial: Data Science in Python; Learning from Data: Caltech free online course; MassBigData to boost Massachusetts Big Data, Analytics Ecosystem; Split on Data Science Skills: Individual vs Team approach.
on Jan 27, 2014 in Caltech, Massachusetts, Online Education, Python Tutorial, Social Network Analysis
Linkurious: Explore and Visualize Graph Data
Linkurious is designed to handle big data and be easy to use; it is focused on local exploration - search any information within a graph and start exploring the connections from this point.
on Jan 22, 2014 in Graph, Graph Visualization, Neo4j, Social Network Analysis
Clustify 4.0 adds Real-Time Predictive Coding
Clustify updates the predicted relevance scores for the entire document population each time a document is categorized, showing the impact on the progress pie and the precision-recall curve instantly.
on Jan 22, 2014 in
Split on Data Science Skills: Individual vs Team Approach
The results of latest KDnuggets poll show an almost equal split between those who favor individual and those who favor the team approach. See the counterintuitive regional differences and interesting comments.
on Jan 21, 2014 in Data Science, Poll, Skills, Team
MassBigData launched to boost Massachusetts Big Data, Analytics Ecosystem
MassBigData has a wealth of information, including publicly available Mass data sets, local #BigData events, regional jobs (over 2300 currently), a map of MassBigData companies, resources, and much more.
on Jan 20, 2014 in Big Data, Jobs, Massachusetts
Top stories for Jan 12-18: Tutorial: Data Science in Python; Data Science Venn Diagram v2.0
Tutorial: Data Science in Python; Data Science Venn Diagram v2.0; Interpreting Model Performance with Cost Functions.
on Jan 19, 2014 in Python Tutorial, Scicast, Venn Diagram
3X: A workbench for computational experiments
3X is an open-source software tool to ease the burden of conducting computational experiments and managing data analytics, providing a standard yet configurable structure to execute a wide variety of experiments in a systematic way.
on Jan 17, 2014 in Computational Experiments, Stanford, Workbench
Data Science for Social Good, Chicago Jun-Aug 2014, Apply by Feb 1
Data Science for Social Good 2014 Summer Fellowship at the U. of Chicago is looking for students, mentors, and project partners. Fellows apply by Feb 1.
on Jan 17, 2014 in Chicago-IL, DSSG, Fellowship, Rayid Ghani, Social Good
SciCast: a Crowdsourced Forecasting Platform for Science and Technology
Join tech people from around the world in the largest collaborative science and technology forecasting effort, ever, to make real-time predictions on future innovations and significant events in science and technology.
on Jan 15, 2014 in
PAN Competition: Plagiarism Detection, Author Identification, Author Profiling
Take part in one of 3 tasks: Plagiarism Detection - given a document, is it an original? Author Identification - given a document, who wrote it? Author Profiling - given a document, what is author age / gender?
on Jan 15, 2014 in Author Detection, Author Profiling, Competition, Plagiarism Detection
Free Tutorial: Data Science in Python
This Data Science in Python tutorial covers importing data, scikit-learn basics, aggregation and grouping, feature engineering, model evaluation, and deployment.
on Jan 14, 2014 in Data Science Tutorial, IPython, Python, Yhat
Top stories for Jan 5-11: MADlib: Big Data Machine Learning in SQL; Rock Stars of Big Data
MADlib: Big Data Machine Learning in SQL for Data Scientists; IEEE Rock Stars of Big Data Presentations; Hadoop Elephants in the Cloud.
on Jan 12, 2014 in Hadoop, MADlib, SQL
Zementis: Teradata and IBM Partnership, Hadoop/ML best practices, James Taylor on standards
Zementis Universal PMML Plug-in (UPPI), which enables the execution of standards-based predictive analytics, now works for Teradata and IBM PureData systems. Also learn about Best practices for Hadoop and Machine Learning and read James Taylor on analytics standards.
on Jan 11, 2014 in
Sand Hill 50 Swift and Strong in Big Data
Here is a list of 50 leaders in Big Data, based on their story, ecosystem, insight, and influence, from Actian to Wibidata.
on Jan 8, 2014 in Companies, ecosystem
December Analytics, Big Data, Data Mining Companies and Startups Activity
December 2013 acquisitions, startups, and company activity in Analytics, Big Data, Data Mining, and Data Science: Answers.com, Talend, Palantir, KPMG, Datameer, Dell
on Jan 7, 2014 in Acquisitions, Startups, VC
New Poll: Data Science Skills – Individual vs Team Approach
Data Science positions and tasks need a rare combination of Statistics, Hacking, Database, Business, and other skills. New KDnuggets Poll is asking which approach is better for filling Data Science Positions - Individual or Team?
on Jan 7, 2014 in Data Science, Poll, Team, Unicorn
MADlib: Big Data Machine Learning in SQL for Data Scientists
MADlib is open source with commercially usable BSD license; supports Postgres and Pivotal Greenplum DBMS, and provides classification, regression, clustering, topic modeling and other analytics for Big Data.
on Jan 6, 2014 in
Top stories for Dec 29 – Jan 4: Unicorn Data Scientists vs Data Science Teams; Top Datasets on Reddit
Unicorn Data Scientists vs Data Science Teams; Top Datasets on Reddit; A Programmer Guide to Data Mining - Free Download
on Jan 5, 2014 in
BigML 2014 Winter Release: Faster, Easier, and more Programmatic Machine Learning
BigML was used to create over 600,000 predictive models in 2013; Winter 2014 release makes big advances in speed and programmability, and new development mode allows you to run unlimited tasks of up to 16 MB for FREE.
on Jan 4, 2014 in
Top stories in December: A Programmer Guide to Data Mining – Free Download; 3 Stages of Big Data
New Book: A Programmer Guide to Data Mining - Free Download; Open Source Data Science MS Curriculum; 3 Stages of Big Data; Top 2013 LinkedIn Groups; R leading, but Python is gaining
on Jan 2, 2014 in
Additions to KDnuggets in December
MicroStrategy Analytics Express and Data Mining Services, Skytree Machine Learning platform, Ubiq Analytics, and more education, meetings, and software.
on Jan 1, 2014 in