PAKDD 2016 Data Science Contest: Sarcasm detection on Reddit comments

The contest task is to design an effective algorithm for sarcasm detection in the domain of opinion mining. Submissions due Feb 15, 2016.

Pakdd 2016Sarcasm detection on Reddit comments

PAKDD 2016 Contest/ Competition


Welcome to PAKDD 2016 contest competition. This year we would like to challenge participants with a task of designing an effective algorithm for sarcasm detection in the domain of opinion mining. Sarcasm detection is a subtask of opinion mining. It aims at correctly identifying the user opinions expressed in the written text. Sarcasm detection plays a critical role in sentiment analysis by correctly identifying sarcastic sentences which can incorrectly flip the polarity of the sentence otherwise.

Sarcasm Understanding sarcasm, which is often a difficult task even for humans, is a challenging task for machines. Common approaches for sarcasm detection are based on machine learning classifiers trained on simple lexical or dictionary based features. To date, some research in sarcasm detection has been done on collections of tweets from Twitter, and reviews on For this task, we are interested in looking at a more conversational medium - comments on Reddit - in order to develop an algorithm that can use the context of the surrounding text to help determine whether a specific comment is sarcastic or not.

Here is the detailed PAKDD-2016 contest task description and the related datasets.


Following are the prizes for the top two winners.
  • First Prize: Apple Mac Book + one free PAKDD'16 conference registration
  • Second Prize: New high end Smartphone (to be defined - either Samsung or Apple) + one free PAKDD'16 conference registration

Contest Dates
  • Start date: 25th Nov 2015
  • End date: 15th Feb 2016

Terms and Conditions
  • The competition is open for all interested researchers, specialists and students. Only members of the Contest Organizing Committee cannot participate.
  • Participants may submit solutions as teams made up of one or more persons.
  • Each team needs to designate a leader responsible for communication with the Organizers. A single person can be a leader of only one team.
  • One person may be incorporated in maximally 3 teams.
  • Each team needs to be composed of a different set of persons.
  • The total number of submissions for any single team is limited to 100 solutions.
  • A winner of the competition is chosen on the basis of the final evaluation results. In a case of draws in the evaluation scores, time of the submission will be taken into account.
  • Each team is obliged to provide a short report describing their final solution. Reports must contain information such as the name of a team, names of all team members, the last preliminary evaluation score and a brief overview of the used approach. The report length should not exceed 2000 words and it should be submitted in the PDF format by the 15th of February, 2016. Only submissions made by teams that provided the reports will qualify for the final evaluation.
  • By enrolling to this competition you grant the organizers rights to process your submissions for the purpose of evaluation and post-competition research.

Organizing Committee
  • Muhammad Asif Naeem, Auckland University of Technology, New Zealand
  • Arturas Vedrickas, Parrot Analytics, New Zealand

Contest Sponsors