Pearson, a leading international publishing company, is looking for a full-stack developer with an understanding of modern data mining techniques: machine learning, information retrieval, collaborative filtering, map-reduce algorithms, and visualization.
Company: Pearson
Location: London, UK
Web: www.pearson.com
Company:
Pearson is the international publishing and media company behind global brands such as Financial Times, Penguin, Dorling Kindersley, Rough Guides, Edexcel, and Mergermarket.
Within Pearson's Enabling Technologies Group, the Data Analytics Team will be responsible for providing a toolset of sophisticated statistical techniques, data mining algorithms and predictive analytics, as well as expertise in how and when to use them.
We are looking for a full-stack developer with an understanding of modern data mining techniques: machine learning, information retrieval, collaborative filtering, map-reduce algorithms, and visualization.
Key Activities
- Design, develop and maintain production-ready data analytics services and APIs.
- Turn state-of-the-art algorithms into robust, supportable and reusable libraries.
- Build user-facing tools for querying and visualizing large data sets.
- Administer development servers (upgrades, backups, schema changes etc.).
- Work with the Enterprise Architecture department to understand and agree key data models required to support analytics.
- Work with internal customers and collaborators to identify analytics needs and data sources, and produce end-user documentation.
- Evangelize for the adoption of analytic methods and data-driven business practices throughout Pearson.
Required Skills
- Proven ability to develop fast, scalable and robust data-centric services and applications.
- Strong software engineering skills: unit testing, source control, repeatable builds, continuous integration, load testing and performance monitoring.
- Fluency with both relational and non-relational databases and data models.
- Understanding of data-mining concepts (classification, regression, recommendation, ranking, etc.) and the ability to choose the right approach for the job.
- Ability to test hypotheses and prototype methodologies interactively, e.g. in R, Matlab, Octave, or NumPy/SciPy.
Helpful Skills
- Comfortable working in both Linux and Windows-based server environments.
- Hands-on experience with Hadoop, Pig, Hive etc.
- Familiarity with one or more data visualization packages.
- Natural language processing experience.
- Experience working with web-scale data and public-facing websites.
Competencies
- Strong communication and listening skills.
- Self-directed, adaptable and flexible.
- Good verbal, written and presentation skills.
- Strong analytical and critical thinking skills.
- Excellent judgment.
Education:
At least Master's level in Computer Science, Mathematics, Statistics, Bioinformatics or similar.
_Contact_:
Job has been filled.