KDnuggets Home » Jobs » Virginia Tech: Data Engineer ( 15:n19 )

Virginia Tech: Data Engineer


The Discovery Analytics Center applies cutting edge machine learning to forecast worldwide events using Twitter, Facebook, Google News, and blogs. Data engineer will help harness and curate the ever-growing influx of data.



Virginia Tech Company: Virginia Tech
Location: Arlington, VA
Web: www.cs.vt.edu


Intrigued by big data? Does social media inspire you? The Discovery Analytics Center applies cutting edge machine learning to forecast worldwide events using Twitter, Facebook, Google News, and blogs. Its ability to forecast tomorrow's events has been the subject of articles in Newsweek, Wall Street Journal, Slate, Popular Science, and PBS's NOVA Next. DAC's research garners best paper awards-most recently at SIAM's 2015 International Conference on Data Mining.

DAC, part of the Department of Computer Science at Virginia Tech, seeks a data engineer to harness and curate the ever-growing influx of data created across five continents. The primary responsibilities of the successful candidate will be the development and maintenance of rich, big data systems. This role will support the real-time storage of this data in appropriate distributed storage systems. Finally, he or she will integrate natural language processing techniques to transform raw feeds into rich, structured datasets.

The hybrid software engineering / research team within DAC uses agile development methodology to build productions systems running 24/7. DAC offers opportunities for research publication and career advancement, as well as access to the rich portfolio of data holdings from DAC's projects.

Required Qualifications:
  • M.S. in Computer Science, IT, or similar concentration; an exception can be made for a BS in Computer Science, IT, or similar concentration with relevant experience.
  • Demonstrated experience in the software engineering of big data systems.
  • Deep knowledge of operating systems, including Linux and Unix.
  • Demonstrated proficiency in object oriented programming languages, including Python and Java.
  • Demonstrated knowledge of SQL and SQL-like languages, including the ability to optimize queries involving complex logic.
  • Demonstrated working knowledge of distributed file systems, distributed storage systems (HDFS, Hive).

 
Preferred Qualifications:
  1. Excellent communication skills.
  2. Experience with a major cloud computing platform such as AWS.
  3. Experience with and knowledge of RESTful APIs.
  4. Demonstrated knowledge of schema design.
  5. Knowledge of software architecture design methodology.
  6. Knowledge of product development methodology.
  7. Demonstrated experience in an agile development environment.


Pay Range/Salary: Commensurate with experience.

_Contact_:
Apply online

For more information contact

Prof. Naren Ramakrishnan
Thomas L. Phillips Professor of Engineering
Director, Discovery Analytics Center
Department of Computer Science
Virginia Tech
Email: naren@cs.vt.edu