Date:
This posting is from Atbrox, a startup company providing technology and services for Search and Mapreduce/Hadoop.
Here are selected Mapreduce & Hadoop papers in the area of data mining
Ads Analysis
- Improving ad relevance in sponsored search
- Predicting the Click-Through Rate for Rare/New Ads
- Learning Influence Probabilities in Social Networks
- Mining advertiser-specific user behavior using adfactors
- Extracting user profiles from large scale data
- LogMaster: Mining Event Correlations in Logs of Large Scale Cluster Systems
- Stateful Bulk Processing for Incremental Analytics
- Mining dependency in distributed systems through unstructured logs analysis
- Beyond online aggregation: parallel and incremental data mining with online mapreduce
- Stochastic gradient boosted distributed decision trees
- Distributed Algorithms for Topic Models
- Cloud Computing Boosts Business Intelligence of Telecommunication Industry
- Parallel K-Means Clustering Based on MapReduce
The most read papers were:
- MapReduce-Based Pattern Finding Algorithm Applied in Motif Detection for Prescription Compatibility Network
- Data-intensive text processing with Mapreduce
- Large-Scale Behavioral Targeting
- Improving Ad Relevance in Sponsored Search
- Experiences on Processing Spatial Data with MapReduce
Here are more Statistics on Hadoop and Mapreduce Algorithm Papers