Search results for flume

    Found 43 documents, 10418 searched:

  • The top 5 Big Data courses to help you break into the industry

    ...la Lesson 5: Working with Hive and Impala Lesson 6: Types of Data Formats Lesson 7: Advanced Hive concept and data file partitioning Lesson 8: Apache flume and HBase Lesson 9: Pig Lesson 10: Basics of Apache Spark Lesson 11: RDDS in Spark Lesson 12: Implementation of Spark Applications Lesson 13:...

    https://www.kdnuggets.com/2016/08/simplilearn-5-big-data-courses.html

  • Top KDnuggets tweets, May 3-5: Social network analysis of Boston Marathon Bomber; Hadoop Toolbox: When to use what

    ...zhokhar Tsarnaev and his friends bit.ly/18lBbCM Most Favorited: Hadoop Toolbox: When to use what - a guide to Hadoop, Hbase, Hive, Pig, Sqoop, Oozie, Flume, Avro, ... bit.ly/18khThf Top 10 Tweets What social network analysis says about Boston Marathon Bomber Dzhokhar Tsarnaev and his friends...

    https://www.kdnuggets.com/2013/05/top-tweets-may03-may05.html

  • IBM: Big Data Architect

    ...nce in a consulting environment. At least 2 years of experience in the following components of the Hadoop ecosystem: Hive, HBase, Spark, Storm, YARN, Flume, and/or Oozie. Preferred Technical and Professional Experience: At least 5 years of experience in the Hadoop platform (such as Cloudera,...

    https://www.kdnuggets.com/jobs/16/05-27-ibm-big-data-architect.html

  • Hadoop Key Terms, Explained

    ...s. It is a centralized service for maintaining configuration information, naming registry, distributed synchronization and group services. 13. Apache Flume   Apache Flume is a distributed service, mainly used for data collection, aggregation and movement. It works very efficiently with large...

    https://www.kdnuggets.com/2016/05/hadoop-key-terms-explained.html

  • Hadoop and Big Data: The Top 6 Questions Answered

    ...of HDFS or stand-alone. As an in-memory engine, Spark is much faster than the traditional MapReduce approach. Spark can process data from HDFS, Hive, Flume and other data sources extremely fast, allowing Hadoop to be an effective streaming or real-time analytics platform. Spark can replace...

    https://www.kdnuggets.com/2016/01/hadoop-and-big-data-questions.html

  • Simplilearn Big Data and Analytics Online Courses

    ...proprietary VM »  60 Hrs of Real Time Industry based Projects »  Packed with Latest & Advanced modules like YARN, Flume, Oozie, Mahout & Chukwa »  30 PDUs Offered Courses included »   Certified Big Data and Hadoop Developer...

    https://www.kdnuggets.com/2015/02/simplilearn-big-data-analytics-online-courses.html

  • How Big Data Pieces, Technology, and Animals fit together

    ...store. Zookeeper is a coordination and synchronization service that a distributed set of computer make decisions by consensus, handles failure, etc. Flume and Scribe are logging services, Flume is an Apache project and Scribe is an open-source Facebook project. Both aim to make it easy to collect...

    https://www.kdnuggets.com/2015/02/how-big-data-pieces-technology-fit-together.html

  • Scotiabank (Toronto): Data Scientist

    ...enting visualization tools like D3, Tableau or Qlikview Assets: - Experience with streaming (or real time) analytics and ingestion frameworks (Kafka, Flume, Storm, Flink, Samza) - Expertise in Software Design (code efficiency, security, design patterns, system or web architecture) - Expertise in...

    https://www.kdnuggets.com/jobs/16/05-03-scotiabank-data-scientist.html

  • Top 12 Interesting Careers to Explore in Big Data

    ...SUGGESTED CERTIFICATIONS Hadoop SAS Excel R MongoDB Python Pandas Apache Spark & Scala Apache Storm Apache Cassandra MapReduce Cloudera HBase Pig Flume Hive Zookeeper Related: The top 5 Big Data courses to help you break into the industry 5 EBooks to Read Before Getting into A Data Science or...

    https://www.kdnuggets.com/2016/10/top-12-interesting-careers-explore-big-data.html

  • Top Big Data Processing Frameworks

    ...synonymous with Big Data. But you already know about Hadoop, and MapReduce, and its ecosystem of tools and technologies including Pig, and Hive, and Flume, and HDFS. And all the others. Hadoop was first out of the gate, and enjoyed (and still does enjoy) widespread adoption in industry. So why...

    https://www.kdnuggets.com/2016/03/top-big-data-processing-frameworks.html

  • Apache Spark Key Terms, Explained

    ...he core Spark API that allows data engineers and data scientists to process real-time data from various sources including (but not limited to) Kafka, Flume, and Amazon Kinesis. This processed data can be pushed out to filesystems, databases, and live dashboards. Its key abstraction is a Discretized...

    https://www.kdnuggets.com/2016/06/spark-key-terms-explained.html

  • SanDisk: Senior Staff Hadoop Developer

    ...platform. Skills Required. Extensive knowledge about Hadoop Architectures and HDFS. Java/C++, Map Reduce HBase, Hive, PIG, Oozie, Mahout, Zookeeper, Flume, Solr, ElasticSearch, Storm/Spark Leading the learning/understanding and knowledge of very complex semi-conductor data leveraging existing...

    https://www.kdnuggets.com/jobs/16/01-20-sandisk-senior-staff-hadoop-developer.html

  • Top 10 Amazon Books in Data Mining, 2016 Edition">Silver BlogTop 10 Amazon Books in Data Mining, 2016 Edition

    ...ws) Paperback, $36.24 Using Hadoop 2 exclusively, author Tom White presents new chapters on YARN and several Hadoop-related projects such as Parquet, Flume, Crunch, and Spark. You’ll learn about recent changes to Hadoop, and explore new case studies on Hadoop’s role in healthcare systems and...

    https://www.kdnuggets.com/2016/11/top-10-amazon-books-data-mining.html

  • SanDisk: Senior Big Data Engineer/Hadoop Developer

    ...platform. Skills Required. Extensive knowledge about Hadoop Architectures and HDFS. Java/C++, Map Reduce HBase, Hive, PIG, Oozie, Mahout, Zookeeper, Flume, Solr, ElasticSearch, Storm/Spark Leading the learning/understanding and knowledge of very complex semi-conductor data leveraging existing...

    https://www.kdnuggets.com/jobs/16/02-03-sandisk-big-data-engineer-hadoop.html

  • Data Lake Plumbers: Operationalizing the Data Lake

    ...and Sqoop to accelerate data ingest. In the big data world, you might also want to ingest unstructured data sets as well, introducing new tools like Flume. Finally, you may want to trigger complex events based on this data stream and you might do so via Spark, Gemfire, or other in-memory grids....

    https://www.kdnuggets.com/2016/02/data-lakes-plumbers-operationalizing.html

  • Foot Locker: Sr Solutions Architect (Personalization/Adobe Technologies)

    ...ge Computing. Expertise in various data integration/ETL tools, application integration, business process and data science tools (Hadoop, Hive, Spark, Flume) Strong understanding of CICD principles and how they translate into the Azure platform Experience with streaming architectures and...

    https://www.kdnuggets.com/jobs/18/03-30-foot-locker-solutions-architect-personalization.html

  • UnitedHealth Group: Big Data Engineering Lead (Eden Prairie, MN)

    ...onal teams Preferred Qualifications: Java development experience highly preferred Experience with Big Data technologies like Hbase, MapReduce, Storm, Flume, Sqoop, Pig, Apache Drill, Oozie, Zeppelin Experience building Big Data solutions on public cloud (AWS EMR) Experience building data pipelines...

    https://www.kdnuggets.com/jobs/18/08-17-unitedhealth-group-big-data-engineering-lead.html

  • UnitedHealth Group: Health Care Data Analytics Consultant [Minnetonka, MN]

    ...ics/Economics/Computer Science/Mathematics/Business Analytics Healthcare domain experience Experience with Hadoop stack (MapReduce, Sqoop, Pig, Hive, Flume) Experience of Business intelligence tools like Tableau/Spotfire/DOMO, etc. Experience with statistical software (SAS, SPSS, R, Python, etc.)...

    https://www.kdnuggets.com/jobs/18/08-27-unitedhealth-group-health-care-data-analytics-consultant.html

  • Practical Apache Spark in 10 Minutes

    ...can be loaded from the different sources. As we don’t have the real streaming data source, we should simulate it. For this purpose, we can use Kafka, Flume, and Kinesis, but the simplest streaming data simulator is Netcat.   Part 6 - GraphX In our last post, we explained the basics of...

    https://www.kdnuggets.com/2019/01/practical-apache-spark-10-minutes.html

  • Foot Locker: Sr Architect – Data Engineering

    ...ment systems Experience in various data integration/ETL tools, application integration, business process and data science tools (Hadoop, Hive, Spark, Flume) Strong conceptual, logical and physical architecture design experience. Excellent analytical and problem solving skills with great...

    https://www.kdnuggets.com/jobs/18/04-03-foot-locker-architect-data-engineering.html

  • Foot Locker: Sr Solutions Architect – Machine Learning and AI Technologies

    ...ge Computing. Expertise in various data integration/ETL tools, application integration, business process and data science tools (Hadoop, Hive, Spark, Flume). Strong understanding of CICD principles and how they translate into the Azure platform Experience with streaming architectures and...

    https://www.kdnuggets.com/jobs/18/03-30-foot-locker-solutions-architect-ml-ai.html

  • Celgene: Director, Big Data Ops Lead

    ...oSQL and Graph databases) Data quality Management Metadata Management (e.g. ASG Rochade, Hive Metastore, Navigator, etc) Data integration (e.g.Sqoop, Flume, Talend, etc.) Data Warehousing (e.g. Netezza) Knowledge of IT Service Management framework e.g. ITIL Excellent interpersonal skills in areas...

    https://www.kdnuggets.com/jobs/17/07-18-celgene-director-big-data-ops-lead.html

  • UnitedHealth: Sr Director, Data Science – Advanced Research & Analytics

    ...) and commercial platforms (SAS, SPSS, Azure, etc.) Experience delivering analytic solutions on the Hadoop stack (MapReduce, Sqoop, Pig, Hive, Hbase, Flume) Experience integrating analytic models within business applications for real-time scoring Demonstrated experience consulting with executive...

    https://www.kdnuggets.com/jobs/18/03-14-unitedhealth-director-data-science.html

  • Hackerday – Stay Updated in your Career through Hands-On Projects

    ...ig data room - but to find an ideal solution with real-time business intelligence it needs to be combined with Apache Spark, Apache Storm, Kafka, and Flume in order to bring evolutionary changes in big data processing environments. This is not enough, having achieved blazingly fast real time...

    https://www.kdnuggets.com/2015/11/dezyre-hackerday-platform-hands-on-projects.html

  • Why the Data Scientist and Data Engineer Need to Understand Virtualization in the Cloud

    ...T.pdf Toward an Elastic Elephant – Enabling Hadoop for the Cloud labs.vmware.com/vmtj/toward-an-elastic-elephant-enabling-hadoop-for-the-cloud Apache Flume and Apache Scoop Data Ingestion to Apache Hadoop Clusters on VMware vSphere...

    https://www.kdnuggets.com/2017/01/data-scientist-engineer-understand-virtualization-cloud.html

  • Simplilearn Big Data and Analytics Courses – CAREER30

    ...adoop Developer, SAS Base Programmer, R Language) »  60 Hrs of Real-time Industry based ProjectsM »  Modules on YARN, Flume, Oozie, Mahout & Chukwa   Starting @ $ 280   Know More   All-in-One Big Data and Cloud Computing Suite...

    https://www.kdnuggets.com/2015/03/simplilearn-big-data-analytics-courses-career30.html

  • AT&T: Lead Product Development Mgr, Big Data Algorithms and Insights

    ...to conceive the inconceivable Motivation to collaborate with a diverse, innovative team Working with Big Data technologies MAPR, PIG, MAHOUT, CHUKWA, FLUME, HBASE/HDFS/Cassandra, SQIVE, HOOP (for semi-structured, unstructured content) Hadoop stack-modeling, collection, development, file structure,...

    https://www.kdnuggets.com/jobs/14/03-19-att-lead-product-development-mgr-big-data-algorithms-insights.html

  • Apple: iAd – Senior Software Engineer

    ...xperience with Oracle 10g, 11g databases. Python and Bash Scripting Experience is a plus. Experience with Analytical Tools is a plus. Experience with Flume and Oozie is a plus.   Description: Apple advertising provides an opportunity to redefine the advertising experience on mobile devices....

    https://www.kdnuggets.com/jobs/14/04-20-apple-iad-senior-software-engineer.html

  • 18 essential Hadoop tools

    ...ion system. Oozie, a workflow manager for the Apache toolchain. GIS Tools, a set of tools to help manage geographical components of your data. Apache Flume, a system for collecting log data using HDFS. SQL on Hadoop, some of the most popular options include: Apache Hive, Cloudera Impala, Presto...

    https://www.kdnuggets.com/2014/08/18-essential-hadoop-tools.html

  • AT&T: Lead Product Development Engineer Big Data CIP IT Systems

    ...ience in product development, QA, testing, and product deployment in big data platforms Working with Big Data technologies MAPR, PIG, MAHOUT, CHUKWA, FLUME, HBASE/HDFS/Cassandra, SQIVE, HOOP (for semi-structured, unstructured content) Hadoop stack-modeling, collection, development, file structure,...

    https://www.kdnuggets.com/jobs/14/03-19-att-lead-product-development-engineer-big-data-cip-it-systems.html

  • Big Data ETL Developer

    ...ars of experience in Java and scripting languages Experience with Hadoop workflows and MapReduce is a plus Experience in Python, Pig, Scala, Sqoop or Flume is desirable Experience in working with manufacturing or machine data is a plus Experience in working with software developers and data...

    https://www.kdnuggets.com/jobs/13/11-19-bosch-big-data-etl-developer.html

  • Lead Analytics Engineer

    ...code At least 2 years of dedicated Java experience At least 1 year of production experience with big data technologies: MapReduce, HBASE, Cassandra, Flume, Hive, PIG, MongoDB, etc. Strong system design skills Excellent communication skills A passion for transforming education Highly desired:...

    https://www.kdnuggets.com/jobs/13/01-17-knewton-lead-analytics-engineer.html

  • Big Data Analyst

    ...a visualization Extra Credit: Experience with big data solutions utilizing advanced technologies and frameworks such as MapReduce, Hadoop, Pig, Hive, Flume, and NoSQL Experience and use of ETL (Extract-Transform-Load) tools (e.g Informatica) Perform predictive modeling using tool sets such as "R"...

    https://www.kdnuggets.com/jobs/13/10-18-rightcaresolutions-big-data-analyst.html

  • Interview: Taylor Phillips, Square on Why Finance Needs Machine Learning and Data Science

    ...tlab and Python. Data Engineer - Focuses on obtaining and maintaining the data in a variety of usable forms. They own the data pipelines (e.g. Kafka, Flume) and data storage (e.g. HDFS, MySQL). Data Science Engineer - Implements the features and models and makes them go live in production. These...

    https://www.kdnuggets.com/2014/08/interview-taylor-phillips-square-finance-machine-learning.html

  • Big Data and Hadoop, Big Data Boot Camp LA

    ...– NoSQL storage for real-time queries   Extended Hadoop Ecosystem includes following: Hadoop streaming – MapReduce in languages other than Java Flume – data ingestion into HDFS Sqoop – Import data from SQL databases Oozie – Hadoop job scheduler Mahout – Recommendation, clustering,...

    https://www.kdnuggets.com/2014/10/big-data-hadoop-boot-camp-los-angeles.html

  • KDnuggets™ News 15:n05, Feb 11: Annual Salary Poll; 10 things statistics teaches about Big Data; Data Science Jargon

    ...Technology, and Animals fit together - Feb 5, 2015. How Big Data Pieces and animals fit together: MapReduce, HDFS, Apache Spark,, Pregel, Zookeeper, Flume, Hive, Pig, and more, explained by a Quora (and past Facebook) Data Scientist.    Opinions  (see also All Opinions for this...

    https://www.kdnuggets.com/2015/n05.html

  • Big Data Bootcamp, Austin: Day 3 Highlights

    ...torage. Sqoop is a tool to import/export any JDBC -supported database into Hadoop and it transfers data between Hadoop and external databases or EDW. Flume is log file collector. Storm is used for real-time streaming and is made up of topologies of spouts (accepts stream) and bolts (in-stream...

    https://www.kdnuggets.com/2015/04/big-data-bootcamp-austin-highlights-day3.html

  • Simplilearn: Big Data, Analytics online courses discount, Free ebook

    ...9 Know More   Big-Data and Hadoop Developer Certification Training »   Packed with Latest & Advanced modules like YARN, Flume, Oozie, Mahout & Chukwa »   30 PDUs Offered »   Industry Specific Projects on Top 3 Sectors - Retail,...

    https://www.kdnuggets.com/2015/05/simplilearn-big-data-analytics-online-courses-free-ebook.html

  • simplilearn Big Data & Analytics Certification Courses Online, 30% off till Jan 31

    ...Learning content    40 Hrs of Lab Exercises with proprietary VM    Packed with Latest & Advanced modules like YARN, Flume, Oozie, Mahout & Chukwa    Excellence in Hadoop Certificate   $259     $181 Enroll Now Know More  ...

    https://www.kdnuggets.com/2015/01/simplilearn-big-data-analytics-certification-courses-online.html

  • Simplilearn Big Data and Analytics courses, 30% off

    ...nbsp;    $ 181      Packed with Latest & Advanced modules like        YARN, Flume, Oozie, Mahout & Chukwa    Excellence in Hadoop Certificate   Enroll Now Know More Business Analytics Foundation - R...

    https://www.kdnuggets.com/2014/12/simplilearn-big-data-analytics-courses-30pct-off.html

  • MassMutual: Data Engineer

    ...of data modeling and administration of NoSQL and SQL databases. 3+ years of experience with at least one these: Hadoop, MapReduce, HDFS, HBase, Hive, Flume, Sqoop, Spark, Vertica, SQL, data warehouses. (Certifications in one or more of the above tools preferred) Familiarity with web programming and...

    https://www.kdnuggets.com/jobs/14/10-30-massmutual-data-engineer.html

  • R and Hadoop make Machine Learning Possible for Everyone

    ...orm it ahead of time. As with R, many open source projects were created to re-imagine the data platform. Starting with getting data into HDFS (sqoop, flume, kafka, etc.) to compute and streaming (Spark, YARN, MapReduce, Storm, etc.), to querying data (Hive, Pig, Stinger / Tez, Drill, Presto, etc.),...

    https://www.kdnuggets.com/2014/11/r-hadoop-make-machine-learning-possible-everyone.html

  • Spark SQL for Real-Time Analytics

    …on called DStream (discrete streams) which is a continuous stream of data. DStreams are created from input data stream or from sources such as Kafka, Flume or by applying operations on other DStreams. A DStream is essentially a sequence of RDDs. RDDs generated by DStreams can be converted to…

    https://www.kdnuggets.com/2015/09/spark-sql-real-time-analytics.html

Refine your search here:

Sign Up

By subscribing you accept KDnuggets Privacy Policy