exceptional SW engineers to develop algorithms and build systems to automatically solve a variety of Information Retrieval and Data Mining problems related to the Amazon Product Catalog
Company: Amazon.com
Location: Seattle, WA
Web: www.amazon.com
_Contact_:
nkrishna@amazon.com
Job reference: 134744
Amazon's Fact Extraction team is looking for exceptional software engineers to develop algorithms and build systems to automatically solve a variety of Information Retrieval and Data Mining problems related to the Amazon Product Catalog - one of the company's biggest assets. Our charter is to enrich the structured information present in the catalog and greatly improve the product search and detail product page viewing experience for the Amazon customers.
Structured information extraction is in general a very hard problem. We make this more tractable by using the large amount of information present in Amazon and solving targeted problems. Incorrect or missing information for sometimes key attributes such as brand name or size can lead to a lot of issues in merchandising a product. In a lot of these cases, this information is present in other unstructured attributes like title and product description. The Fact Extraction team extracts this data using a mix of information retrieval, data mining and text analysis techniques resulting in a much richer product catalog and end user experience.
As part of this team you will encounter:
Algorithms and Systems:
We use information retrieval and knowledge representation techniques to solve complex extraction problems. We also own the normalization service for the Amazon product catalog.
Data analysis:
We constantly dig into the data available at Amazon to analyze, inform and develop solutions to the right problems. If looking at data, slicing and dicing it in multiple ways and coming up with interesting insights is something you thrive upon, you will fit right in.
Scalability:
We process 10s of millions of product submissions daily. Making sure the systems we built are able to handle these large amounts of data while meeting the time budget constraints is a given for any work we take on! Along with building a system to achieve this, also comes the responsibility of making it maintainable.
As a member of this team, you will have the opportunity to:
- Design and implement of complex systems, which process a large amount of data
- Identify areas of improvement in our frameworks, tools, processes and strive to make them better. Evaluate our success metrics and evolve our reporting systems.
- Dive deep into the catalog data, understand different functional areas, and use your creativity to come up with extraction techniques that improve the quality of Amazon's product catalog.
- Work with the business teams within Amazon to understand data quality issues and generate the features which need to be extracted.
Basic Qualifications:
Qualifications:
- Bachelors in Computer Science
- 4+ years of experience in software development
- Strong coding skills in Java/C++ coupled with strong base in object-oriented design and development.
Desired Qualifications:
- PhD./MS in Computer science
- Results oriented person with a delivery focus.
- Excellent verbal and written communication skills
- 6+ years of experience in software development and full product life-cycles
- Strong coding skills in Java coupled with strong base in object-oriented design and development.
- Results oriented person with a delivery focus.
- Excellent verbal and written communication skills
- Ability to handle multiple competing priorities in a fast-paced environment.
- Experience in Machine Learning, information retrieval, knowledge representation or computational linguistics is a plus.
- Fluency in written Chinese/Japanese/German is a plus.
|