- Data Analysis Using Scala - Sep 24, 2021.
It is very important to choose the right tool for data analysis. On the Kaggle forums, where international Data Science competitions are held, people often ask which tool is better. R and Python are at the top of the list. In this article we will tell you about an alternative stack of data analysis technologies, based on Scala.
Data Science, Machine Learning, Scala, Spark, YARN
Netflix’s Polynote is a New Open Source Framework to Build Better Data Science Notebooks - Aug 5, 2020.
The new notebook environment provides substantial improvements to streamline experimentation in machine learning workflows.
IDE, Jupyter, Netflix, Open Source, Scala
Data Science for Managers: Programming Languages - Nov 19, 2019.
In this article, we are going to talk about popular languages for Data Science and briefly describe each of them.
Data Science, Manager, MATLAB, Octave, Programming Languages, Python, R, Scala
Which Data Science Skills are core and which are hot/emerging ones? - Sep 17, 2019.
We identify two main groups of Data Science skills: A: 13 core, stable skills that most respondents have and B: a group of hot, emerging skills that most do not have (yet) but want to add. See our detailed analysis.
Career, Data Science Skills, Data Visualization, Deep Learning, Excel, Machine Learning, Poll, Python, PyTorch, Scala, Skills, Statistics, TensorFlow
The 6 components of Open-Source Data Science/ Machine Learning Ecosystem; Did Python declare victory over R? - Jun 6, 2018.
We find 6 tools form the modern open source Data Science / Machine Learning ecosystem; examine whether Python declared victory over R; and review which tools are most associated with Deep Learning and Big Data.
Anaconda, Apache Spark, Data Science, Keras, Machine Learning, Open Source, Poll, Python, R, RapidMiner, Scala, scikit-learn, TensorFlow
Apache Spark : Python vs. Scala - May 4, 2018.
When it comes to using the Apache Spark framework, the data science community is divided in two camps; one which prefers Scala whereas the other preferring Python. This article compares the two, listing their pros and cons.
Apache Spark, Java, Python, Scala
- KDnuggets™ News 18:n07, Feb 14: 5 Machine Learning Projects You Should Not Overlook; Intro to Python Ensembles - Feb 14, 2018.
5 Machine Learning Projects You Should Not Overlook; Introduction to Python Ensembles; Which Machine Learning Algorithm be used in year 2118?; Fast.ai Lesson 1 on Google Colab (Free GPU)
Algorithms, Data Science, Ensemble Methods, fast.ai, Feature Engineering, Google Colab, Machine Learning, Python, Scala
- Top 15 Scala Libraries for Data Science in 2018 - Feb 9, 2018.
For your convenience, we have prepared a comprehensive overview of the most important libraries used to perform machine learning and Data Science tasks in Scala.
Apache Spark, Data Analysis, Data Science, Data Visualization, Machine Learning, NLP, Scala
- Spark – The Definitive Guide – exclusive preview - Sep 25, 2017.
Get an exclusive preview of "Spark: The Definitive Guide" from Databricks! Learn how Spark runs on a cluster, see examples in SQL, Python and Scala, Learn about Structured Streaming and Machine Learning and more.
Apache Spark, Databricks, Free ebook, Python, Scala, SQL
- Spark with Scala – ACM Professional Development Seminar, Santa Clara, Aug 5 - Jun 22, 2017.
This class will introduce Apache Spark 2, focusing on using it for data analysis Taught by Sujee Maniyam on behalf of the local ACM chapter, SFbayACM.
Apache Spark, CA, Santa Clara, Scala, SFbayACM
- Data Science for Newbies: An Introductory Tutorial Series for Software Engineers - May 31, 2017.
This post summarizes and links to the individual tutorials which make up this introductory look at data science for newbies, mainly focusing on the tools, with a practical bent, written by a software engineer from the perspective of a software engineering approach.
Apache Spark, Data Science, Jupyter, Machine Learning, Pandas, Python, Reddit, Scala, SQL
5 Machine Learning Projects You Can No Longer Overlook, April - Apr 13, 2017.
It's about that time again... 5 more machine learning or machine learning-related projects you may not yet have heard of, but may want to consider checking out. Find tools for data exploration, topic modeling, high-level APIs, and feature selection herein.
Data Exploration, Deep Learning, Java, Machine Learning, Neural Networks, Overlook, Python, Scala, scikit-learn, Topic Modeling
The Most Popular Language For Machine Learning and Data Science Is … - Jan 11, 2017.
When it comes to choosing programming language for Data Analytics projects or job prospects, people have different opinions depending on their career backgrounds and domains they worked in. Here is the analysis of data from indeed.com with respect to choice of programming language for machine learning and data science.
Data Science, Machine Learning, Programming Languages, Python, R, Scala
- Dataiku DSS 3.1 – Now with 5 ML Backends & Scala! - Aug 1, 2016.
Introducing Dataiku DSS 3.1, with new visual machine learning engines that allow users to create incredibly powerful predictive applications within a code-free interface.
Data Science, Dataiku, Machine Learning, Scala
- Top KDnuggets tweets, Jun 15-21: Predicting UEFA Euro2016; Visual Explanation of Backprop for Neural Nets - Jun 22, 2016.
Building statistical model to predict UEFA #Euro2016; A Visual Explanation of Back Propagation Algorithm for #NeuralNetworks; Scala is the new golden child for coding and #DataScience.
Backpropagation, Football, Scala, Soccer, Top tweets, Yahoo
- Apache Spark: RDD, DataFrame or Dataset? - Feb 3, 2016.
There are now 3 Apache Spark APIs. Here’s how to choose the right one.
Pages: 1 2
Apache Spark, API, Dataset, Java, RDD, Scala
- MassiveAnalytic: Scala/Python Data Scientist - Jan 18, 2016.
Oscar AP is the world first precognitive analytics platform. Be involved in critical research and innovation projects, customer prototyping and proofs-of-concept.
Data Scientist, London, Massive Analytic, Python, Scala, UK
- Introduction to Spark with Python - Nov 11, 2015.
Get a handle on using Python with Spark with this hands-on data processing tutorial.
Pages: 1 2 3
Apache Spark, Dataquest, Python, Scala
- Top KDnuggets tweets, Oct 13-19: R vs Python: head to head; Machine Learning for Developers tutorial - Oct 20, 2015.
Machine Learning for Developers - very nice tutorial; R vs #Python: head to head #Data Analytics; Data Science Skills and the Improbable Unicorn Data Scientist; How Tesla AutoPilot learns.
Machine Learning, Pedro Domingos, Python vs R, Scala, Tesla, Tutorials
- Data Science for Internet of Things – practitioner course - Sep 14, 2015.
Created by Data Science and IoT professionals, the course covers infrastructure (Hadoop – Spark), Programming / Modelling(R/Time series) and ioT. Course starts Nov 2015, delivered online, and will have limited participants.
Apache Spark, Data Science, IoT, R, Scala, SQL, Sumit Pal
- Scala By the Bay (Aug 13-16) + Big Data Scala (Aug 16-18), Bay Area - Jun 12, 2015.
77 best talks from the leading companies using Scala, Spark, and other Scala-based projects in production, including Twitter, Salesforce, Cloudera, Verizon, with innovative end-to-end pipeline training on Aug 16.
Alexy Khrabrov, Big Data, CA, Oakland, Scala, Twitter
- Top Big Data influencers of 2014, according to HadoopSphere - Mar 13, 2015.
Top big data influencers of 2014 include analysts Mike Gualtieri and Curt Monash, IBM and TDWI media, Spark and Scala products, Ben Lorica @bigdata and Gregory Piatetsky @kdnuggets on social media, Data Collective and AngelList co-founder.
About Gregory Piatetsky, Apache Spark, Big Data Influencers, Hadoop, IBM, Influencers, Kafka, Kirk D. Borne, Mike Gualtieri, Scala, TDWI
- Machine Learning Table of Elements Decoded - Mar 11, 2015.
Machine learning packages for Python, Java, Big Data, Lua/JS/Clojure, Scala, C/C++, CV/NLP, and R/Julia are represented using a cute but ill-fitting metaphor of a periodic table. We extract the useful links.
Big Data Software, Java, Julia, Machine Learning, NLP, Python, R, Scala, scikit-learn, Weka
- PredictionIO: Machine Learning Engineer (Evangelist) - Feb 26, 2015.
Are you passionate about machine learning and open source? Do you have the ability to engage other developers and data scientists? If yes, read on ...
API, CA, Machine Learning, Open Source, PredictionIO, San Francisco, Scala, USA
- PredictionIO: Machine Learning Evangelist - Feb 4, 2015.
Are you passionate about machine learning and open source? Do you have the ability to engage other developers and data scientists? If yes, read on ...
API, CA, Machine Learning, Open Source, PredictionIO, San Francisco, Scala, USA
- BigData TechCon San Francisco Report: Focus on Spark - Nov 1, 2014.
BigData TechCon SF 2014 covered a number of data technologies from the open source ecosystem through tutorials and classes. Spark and its libraries were a significant focus of the talks.
Apache Spark, Arun Swami, Big Data, Hadoop, Machine Learning, San Francisco-CA, Scala, Techcon
- H2O World, Open Source Machine Learning Meeting, Nov 18-19, Mountain View - Oct 27, 2014.
H2O World (Nov 18-19, Mountain View) is where the users of the very popular Open Source Machine Learning Engine H2O gather to share their knowledge and know-how to build Smart Applications.
Deep Learning, H2O, Machine Learning, Mountain View-CA, Open Source, Python, R, Scala
- Four main languages for Analytics, Data Mining, Data Science - Aug 18, 2014.
New KDnuggets Poll shows the growing dominance of four main languages for Analytics, Data Mining, and Data Science: R, SAS, Python, and SQL - used by 91% of data scientists - and decline in popularity of other languages, except for Julia and Scala.
Analytics Languages, Data Mining, Data Science, Julia, Poll, Python, R, SAS, Scala, SQL