- Scalable Select of Random Rows in SQL - Apr 5, 2018.
Selecting random rows, also known as sampling, can boost query performance. Let’s learn how to select random rows in SQL.
Sampling, SQL, Statsbot
- A Beginner’s Guide to Data Engineering – Part II - Mar 15, 2018.
In this post, I share more technical details on how to build good data pipelines and highlight ETL best practices. Primarily, I will use Python, Airflow, and SQL for our discussion.
AirBnB, Data Engineering, Data Science, ETL, Pipeline, Python, SQL
- Want a Job in Data? Learn This - Feb 19, 2018.
Why mastering a 50-year-old programming language is the key to getting a data science job.
Advice, Career, Data Science, SQL
- Calculating Customer Lifetime Value: SQL Example - Feb 15, 2018.
In order to understand how to estimate LTV, it is useful to first think about evaluating a customer’s lifetime value at the end of their relationship with us.
Customer Analytics, Lifetime Value, SQL, Statsbot
- SQL Window Functions Tutorial for Business Analysis - Dec 27, 2017.
In this SQL window functions tutorial, we will describe how these functions work in general, what is behind their syntax, and show how to answer these questions with pure SQL.
Analytics, Business Analytics, SQL, Statsbot
- A Guide for Customer Retention Analysis with SQL - Dec 19, 2017.
Customer retention curves are essential to any business looking to understand its clients, and will go a long way towards explaining other things like sales figures or the impact of marketing initiatives. They are an easy way to visualize a key interaction between customers and the business.
Analytics, Customer Analytics, SQL, Statsbot
- Unlock Machine Learning for the New Speed and Scale of Business - Dec 8, 2017.
Learn how Vertica in-database machine learning supports the entire predictive analytics process with MPP, SQL execution, R, Python, Java and more - get the whitepaper.
Big Data, Database, Machine Learning, MPP Database, SQL, Vertica, White Paper
- Database Bootcamp Webinar Series, Dec 5, 7, 12, 14 - Dec 1, 2017.
The need to be broadly knowledgeable and rapidly understand the existing database ecosystem is growing. Looker has broken down and simplified the differentiators of the main database technologies into this series of four 45-minute webinar sessions.
Databases, Looker, MPP Database, SQL
- PySpark SQL Cheat Sheet: Big Data in Python - Nov 16, 2017.
PySpark is a Spark Python API that exposes the Spark programming model to Python. With it, you can speed up analytic applications. With Spark, you can get started with big data processing, as it has built-in modules for streaming, SQL, machine learning and graph processing.
Apache Spark, Big Data, DataCamp, Python, SQL
- Spark – The Definitive Guide – exclusive preview - Sep 25, 2017.
Get an exclusive preview of "Spark: The Definitive Guide" from Databricks! Learn how Spark runs on a cluster, see examples in SQL, Python and Scala, learn about Structured Streaming and Machine Learning, and more.
Apache Spark, Databricks, Free ebook, Python, Scala, SQL
- 30 Essential Data Science, Machine Learning & Deep Learning Cheat Sheets - Sep 22, 2017.
This collection of data science cheat sheets is not a cheat sheet dump, but a curated list of reference materials spanning a number of disciplines and tools.
Cheat Sheet, Data Science, Deep Learning, Machine Learning, Neural Networks, Probability, Python, R, SQL, Statistics
- 42 Steps to Mastering Data Science - Aug 25, 2017.
This post is a collection of 6 separate posts of 7 steps apiece, each for mastering and better understanding a particular data science topic, with topics ranging from data preparation, to machine learning, to SQL databases, to NoSQL and beyond.
Data Preparation, Data Science, Deep Learning, Machine Learning, NoSQL, Python, SQL
- How To Write Better SQL Queries: The Definitive Guide – Part 2 - Aug 24, 2017.
Most forget that SQL isn’t just about writing queries, which is just the first step down the road. Ensuring that queries are performant or that they fit the context that you’re working in is a whole other thing. This SQL tutorial will provide you with a small peek at some steps that you can go through to evaluate your query.
Algorithms, Complexity, Databases, Relational Databases, SQL
- How To Write Better SQL Queries: The Definitive Guide – Part 1 - Aug 23, 2017.
Most forget that SQL isn’t just about writing queries, which is just the first step down the road. Ensuring that queries are performant or that they fit the context that you’re working in is a whole other thing. This SQL tutorial will provide you with a small peek at some steps that you can go through to evaluate your query.
Databases, Relational Databases, SQL
- The Rise of GPU Databases - Aug 17, 2017.
The recent but noticeable shift from CPUs to GPUs is mainly due to the unique benefits GPUs bring to sectors like AdTech, finance, telco, retail, or security/IT. We examine where GPU databases shine.
Big Data, Database, GPU, Predictive Analytics, SQL, SQream
- Populating a GRAKN.AI Knowledge Graph with the World - Jul 20, 2017.
This updated article describes how to move SQL data into a GRAKN.AI knowledge graph.
GRAKN.AI, Graph, Knowledge Graph, SQL
- Data Science for Newbies: An Introductory Tutorial Series for Software Engineers - May 31, 2017.
This post summarizes and links to the individual tutorials which make up this introductory look at data science for newbies, mainly focusing on the tools, with a practical bent, written by a software engineer from the perspective of a software engineering approach.
Apache Spark, Data Science, Jupyter, Machine Learning, Pandas, Python, Reddit, Scala, SQL
- How to think like a data scientist to become one - Mar 23, 2017.
The author went from securities analyst to Head of Data Science at Amazon. He describes what he learned in his journey and gives 4 useful rules based on his experience.
Amazon, Data Science Skills, Data Scientist, SQL, Statistics
- KDnuggets™ News 17:n11, Mar 22: 50 Companies Leading The AI Revolution; 17 More Must-Know Data Science Q&A, part 3 - Mar 22, 2017.
Also 7 Types of Data Scientist Job Profiles; Email Spam Filtering: An Implementation with Python and Scikit-learn.
AI, Data Scientist, Interview Questions, SQL, Startups
- The Most Underutilized Function in SQL - Mar 20, 2017.
Find out why md5() is an SQL function that's used surprisingly often, and find out how -- and why -- you can use it yourself.
Data Science, SQL
- Grunion, Query Optimization Tool for Data Science and Big Data - Mar 14, 2017.
Grunion is a patent-pending query optimization, translation, and federation framework built to help bridge the gap between data science and data engineering teams. Read more to request access.
Apache Spark, Benchmark, Data Workflow, Datascience.com, NoSQL, SQL
- KDnuggets™ News 17:n06, Feb 15: So What is Big Data? 52 Useful Machine Learning APIs; Data Science finds Perfect Valentines Dates - Feb 15, 2017.
Also Making Python Speak SQL with pandasql; 52 Useful Machine Learning & Prediction APIs, updated; New Poll: Do you support Trump Immigration Ban?
API, Big Data, Clustering, Data Science Platform, Machine Learning, Python, SQL
- Making Python Speak SQL with pandasql - Feb 8, 2017.
Want to wrangle Pandas data like you would SQL using Python? This post serves as an introduction to pandasql, and details how to get it up and running inside of Rodeo.
Pandas, Python, SQL, Yhat
- A Funny Look at Big Data and Data Science - Dec 27, 2016.
A less than serious look at Big Data and Data Science. If you can laugh at all cartoons, then your Data Science skills are in good shape.
Big Data, Cartoon, Humor, SQL
- How to Make Your Database 200x Faster Without Having to Pay More - Nov 22, 2016.
Waiting long for a BI query to execute? I know it’s annoyingly frustrating… It’s a major bottleneck in the day-to-day life of a Data Analyst or BI expert. Let’s learn some easy-to-use solutions and the reasons to use them, along with other, more advanced technological solutions.
BI, Databases, OLTP, Optimization, Performance, Sampling, SnappyData, SQL
- Evaluating HTAP Databases for Machine Learning Applications - Nov 2, 2016.
Businesses are producing a greater number of intelligent applications, which traditional databases are unable to support. A new class of databases, Hybrid Transactional and Analytical Processing (HTAP) databases, offers a variety of capabilities with specific strengths and weaknesses to consider. This article aims to give application developers and data scientists a better understanding of the HTAP database ecosystem so they can make the right choice for their intelligent application.
Big Data, Data Processing, HTAP, Oracle, SAP, Splice Machine, SQL
- Top KDnuggets tweets, Sep 28-Oct 4: 7 Steps to Mastering SQL for #DataScience; Biggest Issues in #DataScience - Oct 5, 2016.
7 Steps to Mastering SQL for #DataScience; New Andrew Ng #MachineLearning #Book Under Construction, #Free Draft Chapters; Top #DataScientist Claudia Perlich on Biggest Issues in #DataScience; Awesome Public Datasets on GitHub
Andrew Ng, Data Science, ebook, SQL, Top tweets
- O’Reilly Live Training–Real-time. Real experts. Real learning. - Sep 26, 2016.
Get intensive, hands-on training from O'Reilly's expert network on critical data topics - from SQL fundamentals to distributed computing; enterprise strategy to data science at scale.
Apache Spark, Courses, Distributed Systems, Hadoop, O'Reilly, scikit-learn, SQL
- Doing Statistics with SQL - Aug 2, 2016.
This post covers how to perform some basic in-database statistical analysis using SQL.
SQL, Statistics
- 5 Big Data Projects You Can No Longer Overlook - Jul 21, 2016.
Check out 5 Big Data projects that you are not likely to have seen before, but which may be useful to you, and perhaps even scratch an itch you didn't know you had.
Big Data, Cloud Computing, Google, Hadoop, Javascript, Overlook, Presto, Spotify, SQL
- KDnuggets™ News 16:n22, Jun 22: Data Science Blog Contest; Free Machine Learning Ebook; Master SQL for Data Science - Jun 22, 2016.
Data Science Blog Contest; New Free Andrew Ng Machine Learning Book Under Construction; 7 Steps to Mastering SQL for Data Science; A Visual Explanation of the Back Propagation Algorithm; Mining Twitter Data with Python Part 1: Collecting Data
Backpropagation, Data Science, Free ebook, Neural Networks, SQL
- 7 Steps to Mastering SQL for Data Science - Jun 16, 2016.
Follow these 7 steps to go from SQL data science newbie to seasoned practitioner quickly. No nonsense, just the necessities.
7 Steps, Data Science, Database, Relational Databases, SQL
- Morpace: SQL Programmer - Jun 10, 2016.
Seeking an SQL Programmer to design, implement and maintain a relational database and reporting system. Will collaborate with other programmers and cross-functional teams to assist in designing and advancing the system in an agile environment.
Developer, Farmington Hills, MI, Morpace, SQL
- R, Python Duel As Top Analytics, Data Science software – KDnuggets 2016 Software Poll Results - Jun 6, 2016.
R remains the leading tool, with 49% share, but Python grows faster and almost catches up to R. RapidMiner remains the most popular general Data Science platform. Big Data tools used by almost 40%, and Deep Learning usage doubles.
Data Mining Software, Data Science Platform, Poll, Python, Python vs R, R, RapidMiner, SQL
- Spark 2.0 Preview Now on Databricks Community Edition: Easier, Faster, Smarter - May 17, 2016.
The preview of Spark 2.0 is here, and it promises to be easier, faster, and smarter.
Apache Spark, Databricks, SQL
- Practical skills that practical data scientists need - May 13, 2016.
Long story short, a data scientist needs to be capable of solving business analytics problems. Learn more about the skill set you need to master to do so.
Business Context, Data Scientist, Mathematics, Skills, SQL
- The MBA Data Science Toolkit: 8 resources to go from the spreadsheet to the command line - Apr 18, 2016.
A great guide for the MBA, or any relatively non-technical convert, for getting comfortable with the command line and other technical skills required to excel in data science.
GitHub, Haskell, Machine Learning, Python, R, SQL
- Fastest Growing Programming Languages and Computing Frameworks - Mar 7, 2016.
A new model for ranking programming languages and predicting the growth of user adoption. Includes current language rankings and predictions.
Data Science, Javascript, Programming Languages, SQL, Trends
- Webinar: Driving Data Democracy: Hadoop and Redshift, Mar 16 - Mar 4, 2016.
The Hadoop ecosystem has improved markedly over the past few years. MPP databases allow analytics teams to easily query massive structured data sets. Learn how these pipelines work on March 16.
Amazon Redshift, Hadoop, Looker, MPP Database, SQL
- Data Science Skills for 2016 - Feb 12, 2016.
As demand for the hottest job grows in the new year, the skill set it requires is growing too. Here, we discuss the skills that will be in high demand for data scientists, including data visualization, Apache Spark, R, Python, and many more.
Apache Spark, CrowdFlower, Data Science, Python, Skills, SQL
- Will Balkanization of Data Science lead to one Empire or many Republics? - Nov 30, 2015.
We examine the “Technoslavia” of the Big Data and Data Science market and consider whether it is likely to lead to a unified empire or a federation of independent republics.
Big Data Market, Data Science, Dataiku, SQL
- Top KDnuggets tweets, Oct 27 – Nov 02: A Framework for Distributed Deep Learning Layer Design in Python - Nov 3, 2015.
A Framework for Distributed #DeepLearning Layer Design in Python; SQL vs. NoSQL- What You Need to Know; Great Tutorial: A Neural Network in 11 lines of #Python; Data Scientist - 2nd Best IT and Engineering Job.
Deep Learning, NoSQL, Python, Salary, SQL
- Spark + SETI: Amping up Spark SQL with Parquets - Oct 21, 2015.
Spark SQL is a great component for data scientists as it simplifies querying large distributed datasets. Learn how to integrate it with Parquets, which we have found to significantly improve the performance of sparse-column queries.
Apache Spark, IBM, Parquets, Python, SETI, Spark SQL, SQL
- Easier Data Prep and Analysis for Data Scientists, Oct 20 Webinar - Oct 6, 2015.
Rapid Insight will show tools that make the data preparation and analysis process significantly faster, without losing the flexibility of advanced programming or SQL tools.
Data Preparation, RapidInsight, SQL
- Dataiku Data Science Studio, now also runs on Apache Spark - Sep 29, 2015.
Dataiku Data Science Studio version 2.1 has many useful features for Data Scientists, including integration with Apache Spark.
Apache Spark, Data Science Platform, Dataiku, R, Spark SQL, SQL
- Spark SQL for Real Time Analytics – Part Two - Sep 22, 2015.
Apache Spark is the hottest topic in Big Data. Part 2 of this series covers basic concepts of Stream Processing for Real Time Analytics and for the next frontier – Internet of Things (IoT).
Ajit Jaokar, Apache Spark, Real-time, SQL, Stream Processing, Streaming Analytics, Sumit Pal
- Data Science for Internet of Things – practitioner course - Sep 14, 2015.
Created by Data Science and IoT professionals, the course covers infrastructure (Hadoop – Spark), Programming / Modelling (R / Time series) and IoT. Course starts Nov 2015, delivered online, and will have limited participants.
Apache Spark, Data Science, IoT, R, Scala, SQL, Sumit Pal
- Upcoming Webcasts on Analytics, Big Data, Data Science – Sep 8 and beyond - Sep 7, 2015.
The Future of Data Science, Ensuring Business Value from Analytics, Apache Ignite, Text Analytics, Best Practices of Data Science, Forecasting With Predictive Analytics, and more.
Business Value, Forecasting, Hadoop, IIA, In-Memory Computing, SQL, Text Analytics
- Spark SQL for Real-Time Analytics - Sep 4, 2015.
Apache Spark is the hottest topic in Big Data. This tutorial discusses why Spark SQL is becoming the preferred method for Real Time Analytics and for the next frontier, IoT (Internet of Things).
Ajit Jaokar, Apache Spark, Real-time, SQL, Sumit Pal
- 60+ Free Books on Big Data, Data Science, Data Mining, Machine Learning, Python, R, and more - Sep 4, 2015.
Here is a great collection of eBooks written on the topics of Data Science, Business Analytics, Data Mining, Big Data, Machine Learning, Algorithms, Data Science Tools, and Programming Languages for Data Science.
Book, Brendan Martin, Data Mining, Data Science, Free ebook, Machine Learning, Python, R, SQL
- How to become a Data Scientist for Free - Aug 28, 2015.
Here are the most required skills for a data scientist position based on ReSkill’s analyses of thousands of job posts and free resources to learn each skill.
Data Science Education, Data Scientist, Java, Online Education, Python, R, SQL, Statistics
- A Beginner’s Guide to SQL - Aug 27, 2015.
SQL is one of the core skills of a data engineer and data scientist. This mini-tutorial explains the four fundamental SQL functions: Create, Read, Update, and Delete, using a fun example of a movie quotes database.
Data Processing, SQL, Udemy
- Apache Drill Makes Big Data Analysis Easier for Everyone - Aug 18, 2015.
Apache Drill is an open source query engine that provides interactive and secure SQL analytics at the scale of petabytes. It provides data querying and exploration capabilities across varied NoSQL databases and file formats.
Apache Drill, Kaushik Pal, SQL
- To Code or Not to Code with KNIME - Jul 22, 2015.
Find out how KNIME allows us to integrate analytical languages, such as R and Python, and visual design of SQL code. Also, learn to integrate your Hadoop, visualization and ETL systems with KNIME.
Hadoop, Javascript, Knime, Michael Berthold, Python, R, SQL
- Emacs for Data Science - Jul 10, 2015.
Data science nowadays demands a polyglot developer, and choosing the right code editor is a worthy investment. Here we cover important features of Emacs and its advantages over other editors.
Data Science Tools, Emacs, R, SQL
- Which Big Data, Data Mining, and Data Science Tools go together? - Jun 11, 2015.
We analyze the associations between the top Big Data, Data Mining, and Data Science tools based on the results of the 2015 KDnuggets Software Poll. Download anonymized data and analyze it yourself.
Apache Spark, Data Mining Software, Excel, Hadoop, Knime, Poll, Python, R, RapidMiner, SQL
- R leads RapidMiner, Python catches up, Big Data tools grow, Spark ignites - May 25, 2015.
R is the most popular overall tool among data miners, although Python usage is growing faster. RapidMiner continues to be the most popular suite for data mining/data science. Hadoop/Big Data tools usage grew to 29%, propelled by 3x growth in Spark. Other tools with strong growth include H2O (0xdata), Actian, MLlib, and Alteryx.
Actian, Apache Spark, Data Mining Software, H2O, Knime, Poll, Python, R, RapidMiner, SQL
- Top KDnuggets tweets, Apr 14-20: Modern Methods for Sentiment Analysis; Basics of SQL, RDBMS – must have skills - Apr 21, 2015.
Great overview: Modern Methods for Sentiment Analysis #word2vec; Basics of SQL and RDBMS - must have skills for data science; The 7 Most Unusual Applications of Big Data; Extensive, but a little confusing site: Understanding Data Visualization.
About Gregory Piatetsky, Data Visualization, Sentiment Analysis, SQL, word2vec
- KDnuggets™ News 15:n09, Mar 25: Deep Learning from Scratch; 10 steps to Kaggle Success; US CDS DJ Patil Cartoon - Mar 25, 2015.
Deep Learning for Text Understanding from Scratch; New Poll: Computing platform; 10 Steps to Success in Kaggle Data; Cartoon: US Chief Data Scientist Most Difficult Challenge; SQL-like Query Language for Real-time Streaming Analytics.
Deep Learning, Kaggle, SQL, Streaming Analytics, UK, Yann LeCun
- Interview: Dave McCrory, Basho on Distributed Database Needs of a Future Enterprise - Mar 16, 2015.
We discuss the future of distributed storage for enterprise, Scale-up vs. Scale-out, software design patterns in the Cloud era, the microservices model, and the place for legacy databases in modern enterprise IT.
Basho, Cloud Computing, Databases, Dave McCrory, Distributed Systems, Integration, Interview, SQL
- SQL-like Query Language for Real-time Streaming Analytics - Mar 12, 2015.
We need an SQL-like query language for Realtime Streaming Analytics that is expressive, short, and fast, defines core operations that cover 90% of problems, and is easy to follow and learn.
Real-time, Realtime Analytics, SQL, Stream Mining, Streaming Analytics
- Upcoming Webcasts on Analytics, Big Data, Data Science – Mar 10 and beyond - Mar 9, 2015.
Data Wrangling and the Art of Big Data Discovery, Data Mining: Failure to Launch, The State of Hadoop Adoption, Addressing the Challenges of Data Variety, and more.
Data Visualization, Data Wrangling, Hadoop, Kafka, Security, SQL
- Top KDnuggets tweets, Feb 23-25: Microsoft is building fast, low-power Deep Learning networks; Lucrative tech careers: Data Scientist, Data Engineer - Feb 26, 2015.
5 lucrative tech careers in 2015: Data Scientist ($150K), Data Engineer ($148K); Which SQL on Hadoop? Gartner Poll Still Says "Whatever" But DBMS Providers Gain; 10 Most-Funded #BigData #Startups; DataRPM 8 runs in #Hadoop, uses #MachineLearning to find insights.
Big Data, Data Engineer, Data Scientist, DataRPM, Hadoop, Salary, SQL, Startups, Trevor Hastie
- Analyzing Analysts to Build Better Analysis Software - Feb 10, 2015.
Our study of how analysts used Mode led to major updates designed to fit how data analysts and business analysts actually use data - there's no one-size-fits-all tool and analysis doesn't end with the analyst.
Business Analyst, Data Analyst, Mode Analytics, SQL
- Most Demanded Data Science and Data Mining Skills - Dec 15, 2014.
Our analysis of the most demanded data scientist skills shows that Data Science is a team effort focused on business analytics, with the top 5 platform skills being SQL, Python, R, SAS, and Hadoop.
Data Science Skills, Data Scientist, Hadoop, New York-NY, Python, R, SAS, Skills, SQL
- If programming languages were vehicles, what would be R, Python, SAS, and SQL? - Dec 6, 2014.
We expand on the idea "If programming languages were vehicles" and examine what would be the main languages for data science: R, Python, SAS, and SQL?
Programming Languages, Python, R, SAS, SQL
- Mode Playbook for Open Source Analytics - Dec 5, 2014.
Mode Analytics is open-sourcing their internal analysis and data visualizations which can be tailored to common data structures in SQL databases.
Churn, Mode Analytics, Open Data, Open Source, SQL
- SlamData Open Source Analytics Tool for MongoDB - Dec 4, 2014.
SlamData is an open source SQL-based tool designed to make accessing data in MongoDB easy for developers and non-developers alike with the goal of making application intelligence easier.
MongoDB, NoSQL, Open Source, SlamData, SQL
- SQL School tackles the data analyst shortage - Nov 17, 2014.
SQL School is a free, interactive tutorial from Mode Analytics, written by analysts for aspiring analysts. Check it out!
Mode Analytics, Online Education, SQL
- Top KDnuggets tweets, Oct 24-26: Why Deep Learning is likely to make other Machine Learning algorithms obsolete - Oct 27, 2014.
Why Deep Learning is likely to make other Machine Learning algorithms obsolete; Open Source Distributed Analytics Engine with SQL interface; Data Mining Reveals How News Coverage Varies Around the World; 3 Great (and Free) Data Science Books You Can Read Now.
Data Mining Books, Deep Learning, Free ebook, Hadoop, SQL
- Four main languages for Analytics, Data Mining, Data Science - Aug 18, 2014.
New KDnuggets Poll shows the growing dominance of four main languages for Analytics, Data Mining, and Data Science: R, SAS, Python, and SQL - used by 91% of data scientists - and decline in popularity of other languages, except for Julia and Scala.
Analytics Languages, Data Mining, Data Science, Julia, Poll, Python, R, SAS, Scala, SQL
- Top KDnuggets tweets, Aug 6-7: Becoming a Data Scientist: MS Program, Bootcamp, or MOOCs? - Aug 8, 2014.
Becoming a Data Scientist: MS Program, Bootcamp, or MOOCs?; Statistics is the *least* important part of data science; New Poll: What languages you used for analytics / data mining in 2014; If you love Pizza and #DataScience, here is a unique job for you.
Bootcamp, Data Scientist, MOOC, Pizza, Poll, SQL, Statistics
- KDnuggets 15th Annual Analytics, Data Mining, Data Science Software Poll: RapidMiner Continues To Lead - Jun 7, 2014.
With over 3,000 data miners taking part in KDnuggets 15th Annual Software Poll, RapidMiner continues to lead. Free software is used much more outside US, and Hadoop usage grows fastest in Asia.
Data Mining Software, Excel, Hadoop, Knime, Poll, Python, R, RapidMiner, SAS, SQL, SQL Server, Weka
- Upcoming Webcasts on Analytics, Big Data, Data Science – Jun 2 and beyond - Jun 2, 2014.
SQL-on-Hadoop, BigML, ClearStory, Analytic Maturity with Dean Abbott and TIBCO, Just Enough Math, Analytically Speaking with Dan Ariely, Data Mining FTL, and more.
Analytically Speaking, ClearStory, Hadoop, SQL
- Top KDnuggets tweets, May 23-25: Data Science vs. Statistics: one big difference; A SQL query walks into a bar - May 26, 2014.
Data Science vs. Statistics: one big difference in Data Science focus; TGIF: A SQL query walks into a bar, approaches two girls at two tables ...; Amazing demo - IBM #Watson analyzes topic, presents a speech, can debate opponents; Microsoft #Kinect as Inexpensive #BigData Tool.
Data Science, Deep Learning, Humor, Kinect, SQL, Statistics, Watson
- Uppd8: An Engine for the Wisdom of Crowds - May 15, 2014.
What people think matters. Uppd8 focuses on crowd sentiment analysis and provides tag-scored data based on different user types. Basic services will be provided for free.
NoSQL, Quality Score, Sentiment Analysis, SQL, Startup, Uppd8
- Top KDnuggets tweets, May 12-13: Guide to Data Science Cheat Sheets; How to analyze Facebook Networks using R - May 14, 2014.
Guide to Data Science Cheat Sheets; Clever hack: How to analyze Facebook Networks using R; Very useful - Introduction to #SQL for Data Scientists; Planning a late career shift to Analytics / Data Science? Be prepared.
Career, Cheat Sheet, Facebook, R, SQL
- Guide to Data Science Cheat Sheets - May 12, 2014.
Selection of the most useful Data Science cheat sheets, covering SQL, Python (including NumPy, SciPy and Pandas), R (including Regression, Time Series, Data Mining), MATLAB, and more.
Cheat Sheet, Data Science, Python, R, SQL
- 3 Key Trends in the DBMS Market - May 3, 2014.
The top 3 trends in DBMS include market consolidation, moving beyond OLTP, and distributed computing - we examine them in detail.
DBMS, Distributed, Gartner, Michael Waclawiczek, NuoDB, OLTP, SQL, Trends
- Top KDnuggets tweets, Apr 28-29 - Apr 30, 2014.
9 Free Books for Learning Data Mining; Cartoon: Data Scientist Salary Negotiation; statsTeachR - great free resource; What every Data Scientist needs to know about SQL.
Cartoon, Data Scientist, Free ebook, R, SQL
- Online Data Science Certificates: Analytics and Programming for Data Science - Mar 1, 2014.
Statistics.com, a leading provider of online education in statistics and analytics, announces two new online certificates for Data Science - "Analytics for Data Science" and "Programming for Data Science".
Certificate, Data Science, Hadoop, Python, Risk Modeling, SQL, Statistical Modeling, Statistics.com
- Method3: Experienced Big Data Software Engineer - Feb 27, 2014.
Method3, a leader in human capital, RPO, and technology solutions, is seeking an experienced Big Data Software Engineer for a large Big4 client in the Irvine, CA area.
Hadoop, Solr, SQL, Storm
- Top stories for Jan 5-11: MADlib: Big Data Machine Learning in SQL; Rock Stars of Big Data - Jan 12, 2014.
MADlib: Big Data Machine Learning in SQL for Data Scientists; IEEE Rock Stars of Big Data Presentations; Hadoop Elephants in the Cloud.
Hadoop, MADlib, SQL