Topics: AI | Data Science | Data Visualization | Deep Learning | Machine Learning | NLP | Python | R | Statistics

Search results for mahout

    Found 100 documents, 12389 searched:

  • Interview: Ted Dunning, MapR on Apache Mahout & Technology Landscape in ML

    ...content and users based on external properties or relations that allow some pretty amazing capabilities. AR: Q8. What motivated you to work on Apache Mahout? How do you compare Mahout with Spark and H2O? TD: Well, some good friends asked me to answer some questions. From there it was a down-hill...

    https://www.kdnuggets.com/2015/03/interview-ted-dunning-apache-mahout-machine-learning.html

  • ebook: Learning Apache Mahout, Big Data Analytics

    ...t bit.ly/1Gnqdxn Chandramani Tiwary March 2015, Packt Publishing. Acquire practical skills in Big Data Analytics and explore data science with Apache Mahout   About This Book Learn to use Apache Mahout for Big Data Analytics Understand machine learning concepts and algorithms and their...

    https://www.kdnuggets.com/2015/05/packt-ebook-learning-apache-mahout.html

  • ebook: Learning Apache Mahout Classification

    ...ing Apache Mahout Classification bit.ly/1FVAL4f Ashish Gupta February 2015, Packt Publishing. Build and personalize your own classifiers using Apache Mahout About This Book Explore the different types of classification algorithms available in Apache Mahout Create and evaluate your own ready-to-use...

    https://www.kdnuggets.com/2015/05/packt-ebook-learning-apache-mahout-classification.html

  • Top KDnuggets tweets, Jan 31 – Feb 2: Free books on statistical learning; Intro: Machine Learning and Apache Mahout

    ...ks on statistical learning: ESL by Hastie et al (Classic), Intro to Stat Learning, Stat foundations of ML buff.ly/1bJ9RNU Machine Learning and Apache Mahout : very good Introduction #Hadoop #BigData #DataScience shrd.by/AbpLKa 16 Top #BigData Analytics Platforms, from InformationWeek...

    https://www.kdnuggets.com/2014/02/top-tweets-jan31-feb2.html

  • Book Review: Data Just Right

    ...low through. Part 5 deals with a single chapter on Machine Learning using Mahout which is clearly a shame given the plethora of approaches aside from Mahout to deal with it. The author's apparently computer scientist like enthusiasm for machine learning bubbles over leaving much lesser room for...

    https://www.kdnuggets.com/2014/04/book-review-data-just-right.html

  • Top KDnuggets tweets, Aug 5-6: R jobs up, SAS jobs down; Mahout Machine Learning; $1M Data Science salary?

    …Most Retweeted: R jobs are increasing, SAS jobs decline, SPSS jobs are flat (amazingly COBOL still in demand) #rstats bit.ly/194jlHx Most Favorited: Mahout: Machine Learning for Enterprise Data Science, for recommendations, clustering, classification bit.ly/1b9HqtV Top 10 Tweets R jobs are…

    https://www.kdnuggets.com/2013/08/top-tweets-aug5-6.html

  • Top KDnuggets tweets, Aug 4-5: Ensemble Methods, a brief history; Data Scientist role shifting

    ...ntrepreneurs, CEOs, engineers, researchers - 35% women t.co/tbeJzd4zl5 Most Favorited: To add #MachineLearning: for Python, scikit-learn; for Hadoop: Mahout; for Java: Weka; for JavaScript: ConvNetJS t.co/17l1R82KOA Most viewed: Data Scientist role shifting, with companies focusing on Developers...

    https://www.kdnuggets.com/2014/08/top-tweets-aug04-05.html

  • How Big Data is used in Recommendation Systems to change our lives

    …s, Hadoop can be used. To reduce the manual work needed to code, identify right algorithms, similarity methods and other tasks, Mahout could be used. Mahout is a library that comprises machine learning algorithms. It provides a set of options to choose recommendation algorithm, choosing n-nearest…

    https://www.kdnuggets.com/2015/10/big-data-recommendation-systems-change-lives.html

  • Citizen Data Scientist, Jumbo Shrimp, and Other Descriptions That Make No Sense

    ...y building predictive models. Quantifying correlations coefficients, statistical errors and residuals using tools such as SAS, R, Python, MADlib, and Mahout is required to ascertain if the model being built is more predictive or not. Model validation. The data scientist then needs to determine the...

    https://www.kdnuggets.com/2016/12/citizen-data-scientist-jumbo-shrimp-make-no-sense.html

  • How Big Data Pieces, Technology, and Animals fit together

    ...the tools described above. For example, you can write an Oozie script that will scrape your production HBase data to a Hive warehouse nightly, then a Mahout script will train with this data. At the same time, you might use pig to pull in the test set into another file and when Mahout is done...

    https://www.kdnuggets.com/2015/02/how-big-data-pieces-technology-fit-together.html

  • Too slow or out of memory problems in Machine Learning/Data Mining?

    ...igh memory usage] MovieLens dataset with 10M ratings from 70K for 10K movies. SlopeOne recommends new movies based on Collaborative Filtering. Apache Mahout's "Taste" non-distributed recommender would fail for less than 6GB memory. To benchmark the out-of-core performance, we restricted our version...

    https://www.kdnuggets.com/2013/03/too-slow-or-out-memory-problems-machine-learning-data-mining.html

  • 3 Generations of Machine Learning and Data Mining Tools

    ...ta sets by implementing the algorithms over Hadoop, the open source Map-Reduce implementation. These tools are maturing fast and are open source. ... Mahout has a set of algorithms for clustering and classification, as well as a very good recommendation algorithm. ... Mahout implements only a...

    https://www.kdnuggets.com/2013/02/3-generations-machine-learning-data-mining-tools.html

  • KDnuggets Analytics, Data Mining, Data Science Software Poll – Analyzed

    ...ho , Mahout, Mathematica, and IBM Cognos. The second group also correlated with tools that were part of larger platforms. Users of Pig, Mathematica , Mahout , Perl , Other Hadoop/HDFS-based tools , and Orange have used at least 8 tools (vs avg of 3.7). The largest number of commercial tool used was...

    https://www.kdnuggets.com/2014/06/analytics-data-mining-data-science-software-poll-analyzed.html

  • R, Python Duel As Top Analytics, Data Science software – KDnuggets 2016 Software Poll Results

    ...10.2% +21.3% MLlib 11.6% 3.3% +253% SQL on Hadoop tools 7.3% 7.2% +1.6% H2O 6.7% 2.0% +234% HBase 5.5% 4.6% +18.6% Apache Pig 4.6% 5.4% -16.1% Apache Mahout 2.6% 2.8% -7.2% Dato 2.4% 0.5% +338% Datameer 0.4% 0.9% -52.3% Other Hadoop/HDFS-based tools 4.9% 4.5% +7.5% Deep Learning Tools For the...

    https://www.kdnuggets.com/2016/06/r-python-top-analytics-data-mining-data-science-software.html

  • Hadoop Key Terms, Explained

    ...eco-system. Its main purpose is to provide better user experience. It provides drag and drop facilities and editors for Spark, Hive, HBase, etc. 15. Mahout Mahout is open source software for building scalable machine learning and data mining applications quickly. 16. Ambari   Ambari is...

    https://www.kdnuggets.com/2016/05/hadoop-key-terms-explained.html

  • SanDisk: Senior Big Data Engineer/Hadoop Developer

    ...Big Data" Stack and platform. Skills Required. Extensive knowledge about Hadoop Architectures and HDFS. Java/C++, Map Reduce HBase, Hive, PIG, Oozie, Mahout, Zookeeper, Flume, Solr, ElasticSearch, Storm/Spark Leading the learning/understanding and knowledge of very complex semi-conductor data...

    https://www.kdnuggets.com/jobs/16/02-03-sandisk-big-data-engineer-hadoop.html

  • Top 10 Data Science Videos on Youtube">Gold BlogTop 10 Data Science Videos on Youtube

    …ience? | Data Analytics Tools | Edureka – (views: 90K) Category: Tutorial The third place is hold by another tutorial on Data Science using R, Apache Mahout and Hadoop framework. This first part of the series of tutorials hold by Edureka!, gives mainly a more speculative introduction to Data…

    https://www.kdnuggets.com/2016/10/top-10-data-science-videos-youtube.html

  • Apache Spark Introduction for Beginners">Silver BlogApache Spark Introduction for Beginners

    ...y the MLlib engineers against the Alternating Least Squares (ALS) executions. Spark MLlib is nine times as rapid as the Hadoop disk version of Apache Mahout (before Mahout picked up a Spark interface). 5. GraphX GraphX is a distributed Graph-Processing framework of Spark. It gives an API for...

    https://www.kdnuggets.com/2018/10/apache-spark-introduction-beginners.html

  • Top 10 Books on NLP and Text Analysis">Silver BlogTop 10 Books on NLP and Text Analysis

    ...homas Morton and Drew Farris. This book provides an introduction to several NLP tools and problems, including Apache Solr, Apache OpenNLP, and Apache Mahout with code samples in Java. Target readers: Software developers who want to familiarize themselves with enterprise-grade NLP tools for work...

    https://www.kdnuggets.com/2019/01/top-10-books-nlp-text-analysis.html

  • 150 Most Influential People in Big Data & Hadoop

    ...@Metamarkets. Investor @DCVC. I heart data, analytics, & visualization. 12. Ted Dunning @ted_dunning, Score: 111 Description: Committer on Apache Mahout, Apache Drill, PMC on Mahout, Zookeeper and Drill, Product Architect at MapR, ex-Chief Scientist at Veoh, Musicmatch. 13. Krish Krishnan...

    https://www.kdnuggets.com/2015/05/greycampus-150-most-influential-people-big-data-hadoop.html

  • 75 Big Data Terms to Know to Make your Dad Proud

    ...-tolerant way and supposedly ‘wicked fast’. Given that social network environment deals with streams of data, Kafka is currently very popular. Apache Mahout: Mahout provides a library of pre-made algorithms for machine learning and data mining and also an environment to create more algorithms. In...

    https://www.kdnuggets.com/2017/06/75-big-data-terms.html

  • SanDisk: Senior Staff Hadoop Developer

    ...Big Data" Stack and platform. Skills Required. Extensive knowledge about Hadoop Architectures and HDFS. Java/C++, Map Reduce HBase, Hive, PIG, Oozie, Mahout, Zookeeper, Flume, Solr, ElasticSearch, Storm/Spark Leading the learning/understanding and knowledge of very complex semi-conductor data...

    https://www.kdnuggets.com/jobs/16/01-20-sandisk-senior-staff-hadoop-developer.html

  • KDnuggets 15th Annual Analytics, Data Mining, Data Science Software Poll: RapidMiner Continues To Lead

    ...ools (not counting languages like Perl or SQL) that received at least 1% share in 2014 were Pig 3.5% Alpine Data Labs, 2.7% Pentaho, 2.6% Spark, 2.6% Mahout, 2.5% MLlib, 1.0%   Among tools with at least 2% share, the largest decline in 2014 was for StatSoft Statistica (now part of Dell), down...

    https://www.kdnuggets.com/2014/06/kdnuggets-annual-software-poll-rapidminer-continues-lead.html

  • Software Developer, Machine Learning

    …ality by designing and implementing cutting-edge machine learning algorithms and writing code to access other data analytics libraries and platforms (Mahout, Pentaho, etc); Design, Develop and Automate test cases as needed to validate machine learning algorithms, and demos for customers; Meet…

    https://www.kdnuggets.com/jobs/13/08-12-sgi-software-developer-machine-learning.html

  • R leads RapidMiner, Python catches up, Big Data tools grow, Spark ignites

    ...(311) Hive, 10.2% (282) SQL on Hadoop tools, 7.2% (198) Pig, 5.4% (150) HBase, 4.6% (127) Other Hadoop/HDFS-based tools, 4.5% (125) MLlib, 3.3% (91) Mahout, 2.8% (76) Datameer, 0.8% (23)   Deep Learning Tools New this year was a category of Deep Learning Tools, with most popular tools being:...

    https://www.kdnuggets.com/2015/05/poll-r-rapidminer-python-big-data-spark.html

  • Open Source Tools for Machine Learning

    ...interact in a stand-alone fashion with HDFS stores, on top of YARN, in MapReduce, or directly in an Amazon EC2 instance. Github: github.com/h2oai/h2o Mahout The Mahout framework has long been tied to Hadoop, but many of the algorithms under its umbrella can also run as-is outside Hadoop. They're...

    https://www.kdnuggets.com/2014/12/open-source-tools-machine-learning.html

  • Data Mining / Analytic Publications News, Aug 2013

    ...age. This chapter provides a clear explanation of the most popular machine learning methods. Top KDnuggets tweets, Aug 5-6: R jobs up, SAS jobs down; Mahout Machine Learning; $1M Data Science salary? - Aug 7, 2013.R jobs are increasing, SAS jobs decline, SPSS jobs are flat, COBOL still around;...

    https://www.kdnuggets.com/2013/08/publications-news.html

  • Simplilearn Big Data and Analytics Online Courses

    ...»  60 Hrs of Real Time Industry based Projects »  Packed with Latest & Advanced modules like YARN, Flume, Oozie, Mahout & Chukwa »  30 PDUs Offered Courses included »   Certified Big Data and Hadoop Developer Training...

    https://www.kdnuggets.com/2015/02/simplilearn-big-data-analytics-online-courses.html

  • KDnuggets™ News 13:n20, Aug 16

    .../out smart questions); The Science behind Netflix Algorithms that decide what you watch next Top KDnuggets tweets, Aug 5-6: R jobs up, SAS jobs down; Mahout Machine Learning; $1M Data Science salary? - Aug 7, 2013.R jobs are increasing, SAS jobs decline, SPSS jobs are flat, COBOL still around;...

    https://www.kdnuggets.com/2013/n20.html

  • KDnuggets™ News 15:n16, May 20: 7 Techniques for Dimensionality Reduction; Who are the real Data Scientists?

    ...isco, June 29-30    Jobs Geisinger Health System: Associate VP, Enterprise Data Management    Publications ebook: Learning Apache Mahout, Big Data Analytics ebook: Learning Apache Mahout Classification Data Science for Workforce Optimization: Reducing Employee Attrition...

    https://www.kdnuggets.com/2015/n16.html

  • KDnuggets™ News 15:n07, Mar 4: Analytics/Data Science Salaries; Machine Learning Flaws; Strata Highlights

    ...ing and auctioning personal data.    Interviews  (see also All Interviews for this month ) Interview: Ted Dunning, MapR on Apache Mahout & Technology Landscape in ML - Mar 3, 2015. We discuss Apache Mahout, its comparison with Spark and H2O, trends, advice, desired...

    https://www.kdnuggets.com/2015/n07.html

  • Thomson Reuters: Data Scientist

    ...king with scripting languages like Python, Experience with one or more of the following: Big data analytics (Hadoop, Hive, NoSQL, Spark, Shark, Hive, Mahout, Impala, Solr, HBase, Pig, Cascading) Information extraction, data mining, or machine learning Data Visualization   Comfortable in a fast...

    https://www.kdnuggets.com/jobs/14/10-02-thomsonreuters-data-scientist.html

  • Webinar: Learn Analytics Best Practices for Hadoop

    ...s in-Hadoop and when to do data extracts Take advantage of the variety of tools available from the Hadoop ecosystem, including Hive, Impala, Pig, and Mahout Deploy your big data analytics in production with managed workflows Unable to attend? Please still register and you will receive the webinar...

    https://www.kdnuggets.com/2014/09/rapidminer-webinar-learn-analytics-best-practices-hadoop.html

  • Customer Analytics Summit 2014 Chicago: Day 2 Highlights

    ...brings infinite scalability, extremely large storage capability and fast data processing. Discussing about Big Data Analytics in retail, he mentioned Mahout – an Apache Foundation software project using scalable machine learning algorithms. He briefly discussed three primary algorithms: Clustering,...

    https://www.kdnuggets.com/2014/09/customer-analytics-summit-2014-chicago-day2.html

  • U. Miami School of Business Administration: Tenure-Track Faculty in Management Science (Big Data Analytics)

    ...challenges of dealing with large data sets. Expertise in, or experience with, one or more of the following is particularly welcome: MapReduce/Hadoop, Mahout, Cassandra, cloud computing, mobile/wearable technologies, social media analytics, recommendation systems, data mining and machine learning,...

    https://www.kdnuggets.com/academic/14/10-08-miami-faculty-management-science-big-data-analytics.html

  • ACM Data Science Camp, San Jose, Oct 25

    ...els of generality, as relevant) Regression, svm, neural_network, deep_learning, cluster_analysis, text_mining, recommender_systems Python, R, Matlab, Mahout (language, tool or system name) Hadoop, spark, storm, mongodb, hive, pig, aws, apache_contribution   TO REGISTER FOR THE EVENT: Use...

    https://www.kdnuggets.com/2014/08/acm-data-science-camp-san-jose-oct-25.html

  • Big Data & Analytics for Retail Summit 2014 Chicago: Day 1 Highlights

    ...ls.   Finally, he talked about the Advanced Analytics team at Macy's. The team uses a wide range of tools including Hadoop, SAS, SAP/KXEN, R and Mahout. Phani Nagarjuna, Founder and CEO, Nuevora delivered a thought-provoking talk on "Analytics – Predictive, Prescriptive, Closed-Loop,...

    https://www.kdnuggets.com/2014/10/big-data-analytics-retail-summit-chicago-day1.html

  • Cray: Senior Data Scientist

    ...SAS EGuide, MapReduce, PIG, HIVE, Python, SAS, SAS HPA, SAS Visual Analytics, SAS EMiner, Salford TreeNet, R, and machine learning approaches such as Mahout Strong theoretical and practical knowledge of analytical techniques including: Graph analytics, segmentation creation, mix and time series...

    https://www.kdnuggets.com/jobs/14/10-07-cray-senior-data-scientist.html

  • Bosch Research and Technology Center: Data Mining Engineer – Big Data Infrastructure

    ...ce identifying performance bottlenecks w/network, I/O, OS, DBMS configuration. Experience with two or more of the following: Java, C++ (STL), Python, Perl, MATLAB, R, SPSS, SAS. HBase, Hive, Pig, Cassandra, or similar technologies - Mahout a plus....

    https://www.kdnuggets.com/jobs/14/07-25-bosch-data-mining-engineer-big-data-infrastructure.html

  • Interview: Cliff Lyon, Stubhub on Mastering Recommendation & Personalization Analytics Part 2

    ...ommendation, too. This is pretty great, since it gives you some useful tools for free; I am used to DIY technology for recommendation. Ted Dunning, a Mahout contributor now at MapR, has some nice presentations online on this topic. On the presentation side, given the huge growth of mobile, and the...

    https://www.kdnuggets.com/2014/07/interview-cliff-lyon-stubhub-recommendation-personalization-analytics-part2.html

  • Interview: Cliff Lyon, Stubhub on Mastering the Art of Recommendation and Personalization Analytics

    ...ining: confidence, lift, leverage, and conviction. I would also recommend trying the root log-likelihood ratio (LLR) similarity implemented in Apache Mahout. The history of recommendation as we’re discussing it here is relatively short; it is easy enough to just go through the timeline, and look at...

    https://www.kdnuggets.com/2014/07/interview-cliff-lyon-stubhub-recommendation-personalization-analytics.html

  • Big Data and Hadoop, Big Data Boot Camp LA

    ...reaming – MapReduce in languages other than Java Flume – data ingestion into HDFS Sqoop – Import data from SQL databases Oozie – Hadoop job scheduler Mahout – Recommendation, clustering, classification   Technologies involved in Hadoop ETL can be classified as shown in following figure:...

    https://www.kdnuggets.com/2014/10/big-data-hadoop-boot-camp-los-angeles.html

  • Top KDnuggets tweets, Jul 23-24: 81% of retail firms gather #BigData, only 34% use analytics

    ...ly a search company. It's a #MachineLearning (on #BigData) company t.co/lZK36A6BCe The Journal of Big Data has published its first articles - Hadoop, Mahout, Data Mining, Health Informatics, and more t.co/s08NZbPml2 MLlib: Apache Spark component for machine learning http://t.co/QUaKlLlrld Course:...

    https://www.kdnuggets.com/2014/07/top-tweets-jul23-24.html

  • Kreditech: Data Scientist, Web Analytics

    ...gating and querying them on your own. R (Rstudio). Prior experience with big data use cases a big plus. Hadoop/MapReduce (both setup and usage), with Mahout being a strong plus.   Other skills that would be interesting include: A track record of methodological innovation (incl. Publications)...

    https://www.kdnuggets.com/jobs/14/07-30-kreditech-data-scientist-web-analytics.html

  • Top Analytics and Big Data trends ahead of Strata Hadoop NYC Conference

    ...mantic analysis The challenge of communicating complex analyses to non-technical clients/partners   SH: Deep Learning In Memory Databases Apache Mahout   GG: emergence of graph databases and graph-based tools such as Neo4j, GraphX, GraphLab, as a standard way to process and store...

    https://www.kdnuggets.com/2014/08/strata-hadoop-nyc-conference-top-analytics-big-data-trends.html

  • 18 essential Hadoop tools

    ...e that breaks from traditional relational database management systems using SQL. Popular NoSQL databases include Cassandra, Riak, and MongoDB. Apache Mahout, a machine learning library designed to run on data stored in Hadoop. Apache Lucene/Apache Solr, a tool for indexing text data that integrates...

    https://www.kdnuggets.com/2014/08/18-essential-hadoop-tools.html

  • Interview: Arpit Gupta, CEO, Actionable Analytics on Enterprise Challenges in Big Data and Cloud

    ...AG: Study stats, data structures, economics, DBMS and maths. Take online courses on machine learning and experiment on pen source tools such as Weka, Mahout, Hadoop etc. Be part of industry groups. Apply in big data companies to start the career there. AR: Q7. Which book (or article) did you read...

    https://www.kdnuggets.com/2014/08/interview-arpit-gupta-enterprise-big-data-cloud.html

  • Deep Learning to Fight Crime

    ...s and Machine Learning experts at H2O.ai. Related: Interview: Arno Candel, H2O.ai on the Journey from Physics to Machine Learning Interview: Arno Candel, H20.ai on How to Quick Start Deep Learning with H2O Interview: Ted Dunning, MapR on Apache Mahout & Technology Landscape in ML...

    https://www.kdnuggets.com/2015/04/deep-learning-fight-crime.html

  • Top stories for Mar 1-7: All Machine Learning Models Have Flaws; Analytics, Data Mining, Data Science professionals salary

    ...nsated - Mar 3, 2015. Interview: Ted Dunning, MapR on The Real Meaning of Real-Time in Big Data - Mar 2, 2015. Interview: Ted Dunning, MapR on Apache Mahout & Technology Landscape in ML - Mar 3, 2015. Interview: Kaiser Fung, NYU on Why Ignoring Data Integrity is a Recipe for Disaster - Mar 4,...

    https://www.kdnuggets.com/2015/03/top-news-week-mar-1.html

  • Interview: Ted Dunning, MapR on The Real Meaning of Real-Time in Big Data

    Ted Dunning is Chief Applications Architect at MapR Technologies and committer and PMC member of the Apache Mahout, Apache ZooKeeper, and Apache Drill projects and mentor for Apache Storm, DataFu, Flink and Optiq projects. Ted was the chief architect behind the MusicMatch (now Yahoo Music) and...

    https://www.kdnuggets.com/2015/03/interview-ted-dunning-mapr-real-time-big-data.html

  • Collective: Data Scientist

    ...n and analysis using SQL, NoSQL, Java, C, SAS, MapReduce, PIG, HIVE, Python (NumPy, SciPy, scikit-learn), SAS, R, and machine learning suites such as Mahout, Weka, and RapidMiner Experience with third-party API integration Strong communication skills to utilize while interacting with the product...

    https://www.kdnuggets.com/jobs/15/02-18-collectivei-data-scientist.html

  • Machine Learning Table of Elements Decoded

    ...earn mlpy Machine Learning Packages in Java (Green): Weka Mallet Knime RapidMiner Encog ELKI DL4J Machine Learning Packages for Big Data (Dark Blue): Mahout Conjecture SAMOA Oryx MLLib MLbase Machine Learning Packages in Lua/JS/Clojure (Red): Torch April ConvNetJS jsLDA Machine learning library for...

    https://www.kdnuggets.com/2015/03/machine-learning-table-elements.html

  • Simplilearn Big Data and Analytics Courses – CAREER30

    ...r, SAS Base Programmer, R Language) »  60 Hrs of Real-time Industry based ProjectsM »  Modules on YARN, Flume, Oozie, Mahout & Chukwa   Starting @ $ 280   Know More   All-in-One Big Data and Cloud Computing Suite »  70+ Hrs of High...

    https://www.kdnuggets.com/2015/03/simplilearn-big-data-analytics-courses-career30.html

  • Top stories for May 10-16: Poll: Analytics, Data Mining software used; 3 things about Data Science not in books

    ..., 2015. Interview: Mark Weiner, Temple University Health System on Maturity Assessment of Healthcare Analytics - May 11, 2015. ebook: Learning Apache Mahout, Big Data Analytics - May 15, 2015. Top stories for May 3-9: Data Scientists Automated by 2025? The Inconvenient Truth About Data Science -...

    https://www.kdnuggets.com/2015/05/top-news-week-may-10.html

  • BehaviorMatrix: Sr. Big Data Engineer /Architect

    ...and troubleshooting large scale distributed systems Familiarity / some working experience with Twitter Storm / Apache Spark, Redis, Kafka, ZooKeeper, Mahout, and Celery a plus Ability to solve complex problems in a fast paced environment with limited guidance. An eye for quality and a willingness...

    https://www.kdnuggets.com/jobs/14/06-30-behaviormatrix-sr-big-data-engineer-architect.html

  • Collective[i]: Data Scientist

    ...n and analysis using SQL, NoSQL, Java, C, SAS, MapReduce, PIG, HIVE, Python (NumPy, SciPy, scikit-learn), SAS, R, and machine learning suites such as Mahout, Weka, and RapidMiner Experience with third-party API integration Strong communication skills to utilize while interacting with the product...

    https://www.kdnuggets.com/jobs/15/01-28-collectivei-data-scientist.html

  • simplilearn Big Data & Analytics Certification Courses Online, 30% off till Jan 31

    ...nt    40 Hrs of Lab Exercises with proprietary VM    Packed with Latest & Advanced modules like YARN, Flume, Oozie, Mahout & Chukwa    Excellence in Hadoop Certificate   $259     $181 Enroll Now Know More   Certified SAS...

    https://www.kdnuggets.com/2015/01/simplilearn-big-data-analytics-certification-courses-online.html

  • R and Hadoop make Machine Learning Possible for Everyone

    ...etc.), to datastores (Hbase, Cassandra, Redis, Voldermort, etc.), to schedulers (Oozie, Cascading, Scalding, etc.), and finally to Machine Learning (Mahout, MLlib, H2O, etc.) among many other applications. Unfortunately, there is not a simple way to see all of these technologies and easily install...

    https://www.kdnuggets.com/2014/11/r-hadoop-make-machine-learning-possible-everyone.html

  • Most Popular Slideshare Presentations on Data Mining

    ...a Machine Learning API 2012-02-14 32812 98 16 Machine Learning and Data Mining: 19 Mining Text And Web Data 2007-06-03 33329 2003 43 Introduction to Mahout and Machine Learning 2013-07-27 29261 575 49 Log Mining: Beyond Log Analysis 2007-09-27 27497 0 28 Social Data Mining 2013-11-02 28944 0 4...

    https://www.kdnuggets.com/2014/11/most-popular-slideshare-presentations-data-mining.html

  • Apple: Data Scientist

    ...e with accessing data stored Cassandra, HBase, or other NoSQL database is a plus. Experience with machine learning tools and libraries such as Apache Mahout and Weka. Working knowledge of SQL, Hive, Pig, and other query languages. Knowledge of statistical methods, mathematical modeling and business...

    https://www.kdnuggets.com/jobs/14/11-05-apple-data-scientist.html

  • Apple: Senior Data Scientist, Retail – Online

    ...have working knowledge of SAS - Enterprise Guide/Miner. experience with one or more of the following is desirable: Hadoop, Hive, NoSQL, Spark, Hive, Mahout, Impala, Pig, Cascading, Theano Data Visualization with Tableau     To be successful in this position, you should ... have a strong...

    https://www.kdnuggets.com/jobs/14/12-07-apple-senior-data-scientist-retail-online.html

  • Simplilearn Big Data and Analytics courses, 30% off

    ...sp;  $ 181      Packed with Latest & Advanced modules like        YARN, Flume, Oozie, Mahout & Chukwa    Excellence in Hadoop Certificate   Enroll Now Know More Business Analytics Foundation - R Language Key...

    https://www.kdnuggets.com/2014/12/simplilearn-big-data-analytics-courses-30pct-off.html

  • BIME Business Intelligence Predictions for 2015

    ...e time to push out enhancements and updates for all. 2. Data mining is the new black Whether or not you understand the R programming language, Apache Mahout or concepts such as Holt-Winters multiplicative exponential smoothing: data mining will be sprinkled into a lot of modern applications in...

    https://www.kdnuggets.com/2014/12/bime-business-intelligence-predictions-2015.html

  • Interview: Daqing Zhao, Macys.com on Building Effective Data Models for Marketing

    ...ly and cumulate expertise in house. We have Hadoop, and Hbase, and are in efforts to test solutions using Spark and H2O. We also have SAS, and use R, Mahout, SAP/KXEN, and Tableau for visualization, etc. We also work with research groups at SAS, IBM and others on some solutions. We evaluate...

    https://www.kdnuggets.com/2014/12/interview-daqing-zhao-macys-data-models-marketing.html

  • STRATA + Hadoop World 2014 NYC Report

    ...s new book, Time Series Databases, which he co-authored with Ellen Friedman. Apart from MapR, Ted is also involved with such great projects as Apache Mahout, Drill, and Zookeeper, and also a mentor for the Storm and Spark projects. Unfortunately work commitments brought me back to Boston early...

    https://www.kdnuggets.com/2014/11/strata-hadoop-world-2014-nyc-report.html

  • NPR: Data Scientist

    ...analytics Proficient in at least two programming/scripting languages: Python, Ruby, Pig, Java, Hive, PHP, JS Specific experience using tools such as Mahout, Hadoop, Cassandra, Splunk Hands on knowledge of database manipulation through environments such as MySQL, NoSQL Experience working with...

    https://www.kdnuggets.com/jobs/13/12-18-npr-data-scientist.html

  • Big Data Engineer

    ...ience with Hadoop, Map/Reduce, Solr / ElasticSearch, Hbase and CouchDB are a must Familiarity / some working experience with Storm, Redis, ZooKeeper, Mahout, and Celery a plus Ability to solve complex problems in a fast paced environment with limited guidance. An eye for quality and a willingness...

    https://www.kdnuggets.com/jobs/13/04-27-behaviormatrix-big-data-engineer.html

  • Data Mining Engineer – Big Data/HPC

    ...ration. Exp. w/2+ of the following: Java, C++ (STL), Python, Perl, MATLAB, R, SPSS, SAS. Propensity to work with stakeholders from a variety of business units & educational backgrounds. HBase, Hive, Pig, Cassandra, or similar technologies - Mahout (a plus) _Contact_: Apply online...

    https://www.kdnuggets.com/jobs/13/04-03-bosch-data-mining-engineer-big-data-hpc.html

  • Data Scientist

    ...t of all we are looking for passionate, pro-active and solution driven Software Developers with: Experience with Hadoop - HDFS, MapReduce, Hive, Pig, Mahout, Sqoop and related technologies Java expertise combined with Python or other scripting languages Deep knowledge of networking communication...

    https://www.kdnuggets.com/jobs/13/05-22-swisscom-senior-software-engineer-big-data.html

  • Amazon Instant Video Content Discovery
    – Software Development Engineer

    ...communication skills Excellent analytical skills Keywords: machine learning, data mining, personalization, recommender systems, similarities, hadoop, mahout, information extraction, sentiment analysis, unsupervised learning, supervised learning, instance based learning, dimensionality reduction,...

    https://www.kdnuggets.com/jobs/13/04-25-amazon-video-content-discovery-sde.html

  • Strata Conference Reports and Highlights

    ...hine Learning Algorithms. The speaker classified first generation as desktop (or single server), such as R, and second generation as Map Reduce (e.g. Mahout), and third generation as post-Map Reduce. He actually called Spark a third-generation machine learning tool. "Excel Big Data" demo from...

    https://www.kdnuggets.com/2013/03/strata-conference-reports-highlights.html

  • Yahoo SAMOA, Open Source Platform for Mining Big Data Streams

    ...tributed algorithms for the most common machine learning tasks such as classification and clustering. For a simple analogy, you can think of SAMOA as Mahout for streaming. SAMOA is both a platform and a library. As a platform, it allows the algorithm developer to abstract from the underlying...

    https://www.kdnuggets.com/2013/11/yahoo-samoa-open-source-platform-mining-big-data-streams.html

  • Data Scientist

    ...deep understanding of machine learning algorithms (Random Forest, SVD, etc). Extensive experience with statistical and machine learning packages (R, Mahout, SAS,SciPy) Experience building and evaluating complex statistical predictive models Clear thinker and effective communicator Ability to...

    https://www.kdnuggets.com/jobs/13/03-07-m6d-data-scientist.html

  • Data Scientist (13010974)

    …xperience within digital and offline channels. Prior experience working with very large datasets using Big Data tools and platforms (Hadoop, PIG/HIVE/Mahout) required. Strong working knowledge of data mining techniques, including regression analysis, clustering, decision trees, neural networks, SVM…

    https://www.kdnuggets.com/jobs/13/08-29-americanexpress-data-scientist.html

  • Top news for Aug 4-10: BBC on Age of Big Data; 10 Predictive Analytics Platforms compared

    …Portals – Aug 4, 2013. LIONbook Chapter 6: Rules, decision trees, and forests – Aug 8, 2013. Top KDnuggets tweets, Aug 5-6: R jobs up, SAS jobs down; Mahout Machine Learning; $1M Data Science salary? – Aug 7, 2013. Top KDnuggets tweets, Aug 2-4: The Age of Big Data – BBC Documentary; 10 Enterprise…

    https://www.kdnuggets.com/2013/08/top-news-week-Aug-4.html

  • 2013 Nov News: Analytics, Big Data, Data Mining and Data Science Features, News, Software

    ...d Massive Online Analysis) is a framework for mining big data streams and applying distributed machine learning algorithms. You can think of SAMOA as Mahout for streaming. Project Tycho digitized 125 years of Public Health and Disease Data - Nov 29, 2013.Project Tycho: UPitt researchers have...

    https://www.kdnuggets.com/2013/11/news-software.html

  • 2013 Nov: Analytics, Big Data, Data Mining and Data Science Posts

    ...d Massive Online Analysis) is a framework for mining big data streams and applying distributed machine learning algorithms. You can think of SAMOA as Mahout for streaming. Project Tycho digitized 125 years of Public Health and Disease Data - Nov 29, 2013.Project Tycho: UPitt researchers have...

    https://www.kdnuggets.com/2013/11/index.html

  • KDnuggets™ News 13:n30, Dec 11

    ...d Massive Online Analysis) is a framework for mining big data streams and applying distributed machine learning algorithms. You can think of SAMOA as Mahout for streaming. Webcasts Webinar: Data Mining: Failure to Launch [Dec 18] - Dec 10, 2013.Learn how to get started with predictive modeling and...

    https://www.kdnuggets.com/2013/n30.html

  • Libraries and Development Kits for Data Mining

    ...is in SQL databases. XELOPES, an open platform-independent and data-source-independent library for Embedded Data Mining. free and open-source: Apache Mahout, a suite of machine learning libraries designed to be scalable and robust Data Mining Template Library (DMTL), an open-source collection of...

    https://www.kdnuggets.com/software/libraries.html

  • KDnuggets™ News 14:n03, Feb 5

    ...enn Diagram v2.0: "unicorns"; Great map of #DataScience skills Jan 31 - Feb 2: Free books on statistical learning; Intro: Machine Learning and Apache Mahout Jan 29-30: Visual.ly Data Visualization Catalog; 100 numpy exercises, from Novice to Expert Data Scientists Jan 27-28: Dilbert takes on...

    https://www.kdnuggets.com/2014/n03.html

  • KDnuggets™ News 14:n21, Aug 13

    ...- a brief history; Data Scientist role shifting, with companies focusing on Developers; To add #MachineLearning for Python: scikit-learn, for Hadoop: Mahout; Meet Fortune 2014 #BigData All-Stars: data scientists, entrepreneurs, CEOs. Quote "only a small fraction of all problems are Big Data...

    https://www.kdnuggets.com/2014/n21.html

  • KDnuggets™ News 14:n19, Jul 30

    ...e pricing optimization Google Brain project: Google is not really a search company The Journal of Big Data has published its first articles - Hadoop, Mahout, Data MLlib: Apache Spark component for machine learning. Top KDnuggets tweets, Jul 21-22 - Jul 23, 2014. Microsoft: Data ScientistHaskell...

    https://www.kdnuggets.com/2014/n19.html

  • KDnuggets Review of Analytics Marketplaces: The Next Big Thing for Big Data

    ...models, not tied to any specific tool, language or technology. It currently accepts models implemented in R, Python, C++, Java, SAS, PMML, Octave and Mahout/Hadoop. The marketplace also includes an execution platform where potential users can try the models before buying, and allows deployment on...

    https://www.kdnuggets.com/2013/11/kdnuggets-review-analytics-marketplaces-next-big-thing-for-big-data.html

  • Sr. Big Data Engineer / Architect

    ...with Hadoop, MapReduce, Solr / ElasticSearch, Hbase / CouchDB are a must Familiarity / some working experience with Twitter Storm, Redis, ZooKeeper, Mahout, and Celery a plus Ability to solve complex problems in a fast paced environment with limited guidance. An eye for quality and a willingness...

    https://www.kdnuggets.com/jobs/13/11-18-behaviormatrix-big-data-engineer-architect.html

  • Big Data TechCon – Great How-To Conference

    ...r SQL pros, and so forth. For advanced developers, there were all manner of entrees: Getting Started with R and Hadoop, Hadoop architectures, H2O and Mahout, topological data analysis and more. Best were the vetted instructor/presenters, all of whom were under strict orders not to “sell” themselves...

    https://www.kdnuggets.com/2014/04/big-data-techcon-great-how-to-conference.html

  • Bosch: Data Mining Engineer – Big Data Infrastructure

    ...ce identifying performance bottlenecks w/network, I/O, OS, DBMS configuration. Experience with two or more of the following: Java, C++ (STL), Python, Perl, MATLAB, R, SPSS, SAS. HBase, Hive, Pig, Cassandra, or similar technologies - Mahout a plus....

    https://www.kdnuggets.com/jobs/14/04-17-bosch-data-mining-engineer-big-data-infrastructure.html

  • Prediction.io open source machine learning server

    ...PredictionIO importing data to PredictionIO adding engines getting prediction results   PredictionIO v0.7 now supports GraphChi - a Big Data Graph Engine, and developers can now evaluate and deploy algorithms in both GraphChi and Apache Mahout on one platform. Read more....

    https://www.kdnuggets.com/2014/04/prediction-io-open-source-machine-learning-server.html

  • RichRelevance: Marketing and Merchandising Analyst

    ...ng event-level data on hundreds of millions of shoppers and millions of products Required: B.S. in a quantitative field, good SQL skills Big plus: R, Mahout, predictive analytics, clustering, customer segmentation Also a plus: Pig, design of experiment About RichRelevance: RichRelevance is the...

    https://www.kdnuggets.com/jobs/14/05-17-richrelevance-marketing-merchandising-analyst.html

  • Goldman Sachs: VP/Data Scientist, Surveillance Analytics Group

    ...iness understanding Strong analytical and problem solving skills Extensive knowledge of big data technologies/architectures (e.g., Hadoop, Pig, Hive, Mahout, etc.) Experience with key analytics methods (e.g., machine learning, link analysis, predictive modeling, natural language processing, text...

    https://www.kdnuggets.com/jobs/14/05-21-goldmansachs-vp-data-scientist-surveillance-analytics-group.html

  • Kreditech: Software Developer for Data Science Team

    ...and C/C++: Absolutely non-negotiable Relational and non-relational databases (PostgreSQL and MongoDB), Hadoop/MapReduce (both setup and usage), with Mahout being a strong plus. Scripting languages (Python preferred), Unix systems and version control tools (git) Exposure to Rserve a big plus  ...

    https://www.kdnuggets.com/jobs/14/05-27-kreditech-software-developer-data-science.html

  • Big Data for Executives 2014: Day 1 Highlights

    ...e then displayed the leading OSS tools available for Data Science tasks, found through a survey: Statistical Analysis: R Data Mining: Pandas, Impala, Mahout Machine Learning: Scikit-learn Machine Learning + NLP: Mallet Natural Language Processing: NLTK, Stanford CoreNLP, NLP + Geospatial Analysis:...

    https://www.kdnuggets.com/2014/05/big-data-executives-highlights-day-1.html

  • Top KDnuggets tweets, Mar 31 – Apr 1: Experfy marketplace for Data Science projects; Event Recommendation in Python

    ...uff.ly/1lA1QnL 11 Analytics and Data Science Salary Surveys, including KDnuggets, @analyticbridge, O'Reilly, Burtch, Infoworld buff.ly/1ojinyH Apache Mahout #MachineLearning project moves beyond MapReduce, will support Apache Spark, H20 data engine #BigData buff.ly/1hZyNU0 Experfy, a Harvard-Backed...

    https://www.kdnuggets.com/2014/04/top-tweets-mar31-apr1.html

  • Kreditech: Data Scientist – Web Analytics

    ...ting and querying them on your own. -R (Rstudio). Prior experience with big data use cases a big plus. -Hadoop/MapReduce (both setup and usage), with Mahout being a strong plus. Other skills that would be interesting include: -A track record of methodological innovation (incl. publications)...

    https://www.kdnuggets.com/jobs/14/03-20-kreditech-data-scientist-web-analytics.html

  • Best KDnuggets tweets in January: Data Science Venn Diagram v2.0: “unicorns”; Great map of #DataScience skills

    ...s buff.ly/1cJoUq9 Reference Texts on Data Mining and Machine Learning from a top UCI Prof. Padhraic Smyth buff.ly/1ewPU11 Machine Learning and Apache Mahout : very good Introduction #Hadoop #BigData #DataScience shrd.by/AbpLKa Online Courses in Predictive Analytics, Machine Learning, Data Science...

    https://www.kdnuggets.com/2014/02/best-tweets-in-january.html

  • WhitePages: Software Engineer – Machine Learning, Data Science

    ...lysis. Our ideal engineer will also have worked at scale with technologies like: MapReduce (Hadoop / EMR), Pig, Hive, NoSQL (HBase / Riak / MongoDB), Mahout, Lucene, Solr. Extra points for an advanced degree or experience with statistical modeling, inference or implementing machine-learning /...

    https://www.kdnuggets.com/jobs/13/12-19-whitepages-software-engineer-machine-learning-data-science.html

  • Top stories for Feb 2-8: Predicting Sochi Olympics Medals; 3 Ways to Test Predictive Models

    ...ies and startups activity - Feb 3, 2014. Top KDnuggets tweets, Jan 31 - Feb 2: Free books on statistical learning; Intro: Machine Learning and Apache Mahout - Feb 3, 2014. Additions to KDnuggets Directory in January - Feb 1, 2014. Cartoon: Watson and Artificial vs Natural Intelligence - Jan 24,...

    https://www.kdnuggets.com/2014/02/top-news-week-Feb-2.html

  • Why Predictive Analytics Marketplaces are not taking off, and how to fix it

    ...e,Snap Analytx, a new startup, is addressing this issue by being open: Snap Analytx supports a number of tools and technologies (including SAS, SPSS, Mahout, Java, C++, Matlab, PMML, R and Python) that together add up to a sizable market share. The current set of models in the Snap Analytx catalog...

    https://www.kdnuggets.com/2014/03/predictive-analytics-marketplaces-not-taking-off-how-to-fix.html

  • AT&T: Lead Product Development Mgr, Big Data Algorithms and Insights

    ...ssion and drive to conceive the inconceivable Motivation to collaborate with a diverse, innovative team Working with Big Data technologies MAPR, PIG, MAHOUT, CHUKWA, FLUME, HBASE/HDFS/Cassandra, SQIVE, HOOP (for semi-structured, unstructured content) Hadoop stack-modeling, collection, development,...

    https://www.kdnuggets.com/jobs/14/03-19-att-lead-product-development-mgr-big-data-algorithms-insights.html

  • AT&T: Lead Product Development Engineer Big Data CIP IT Systems

    ...management experience in product development, QA, testing, and product deployment in big data platforms Working with Big Data technologies MAPR, PIG, MAHOUT, CHUKWA, FLUME, HBASE/HDFS/Cassandra, SQIVE, HOOP (for semi-structured, unstructured content) Hadoop stack-modeling, collection, development,...

    https://www.kdnuggets.com/jobs/14/03-19-att-lead-product-development-engineer-big-data-cip-it-systems.html

  • Thomson Reuters: Data Scientist (Data Innovation Lab)

    ...lustering and collaborative filtering Work experience with one or more of the following: Big data analytics (Hadoop, Hive, NoSQL, Spark, Shark, Hive, Mahout, Impala, Solr, HBase, Pig, Cascading) Information extraction, data mining, or machine learning   Comfortable in a fast paced environment...

    https://www.kdnuggets.com/jobs/14/06-13-thomsonreuters-data-scientist-data-innovation-lab.html

Refine your search here:

Sign Up

By subscribing you accept KDnuggets Privacy Policy