KDD Cup 2016 Call for Proposals

ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2016 is being held in San Francisco, California, from August 13-17, 2016. KDD is currently inviting KDD Cup competition proposals at this time.

We invite organization proposals for KDD Cup 2016. Starting from 1997, KDD Cup has been the most prestigious annual data mining competition held in conjunction with the ACM SIGKDD conference on Knowledge Discovery and Data Mining.

KDD 2016 KDD 2016 will be held in San Francisco, Aug 24-27, 2016. The competition is anticipated to last for 2-4 months, and the winner is supposed to be notified by mid-June. The winners will be announced during the KDD conference and present their solutions in the KDD Cup workshop during the conference.

This call for proposal solicits industry or academic institutes to submit their proposal as the potential organizer of the 2016 KDD Cup competition. We are looking for a strong proposal that contains most of the following features from a proposed task: a novel and motivated goal, a rigid and fair setup, challenging yet manageable task, and accessibility to the public.

A novel and motivated goal: Of particular interests are machine learning tasks that are different from a traditional (competition) setup, in which in the end a supervised learner is on demand given a set of training data with the goal to optimize typical prediction quality in the testing. We encourage organizers to ponder on a novel yet practical challenge with broad real-world application scenarios. Examples such as learning with incrementally arrival data and evaluation on the accumulated error; prediction given limited amount of resources; learning with mostly unlabeled data; or addressing cold-start issues in learning, etc. are highly recommended.

A rigid and fair setup: The organizers should guarantee the accessibility of the data and the confidentiality of the ground truth. The evaluation metrics should be both meaningful for the application and statistically sound for objective performance comparison. The baseline should be established with evidence to show that non-trivial performance can be achieved, and an estimate of what constitutes a significant difference in performance is preferable.

A challenging yet manageable task: The task should be challenging in the sense that there is decent room for improvement from the basic solutions, and novel ideas are required to succeed in the competition. The task should be manageable in about 3 months as the competitors can mainly focus on solving the core challenges.

Accessibility: The competition should be accessible to the majority of the machine learners and data miners without excessive domain knowledge or powerful computational infrastructure.

In your proposal, please highlight how the proposed challenge meetings those requirements. Besides, please answer the following questions in the proposal.

  1. Which competition infrastructure do you plan to use (e.g. Kaggle, or building your own)?
  2. How many resources (including people, time, award money) do you plan to invest?
  3. What is the time table for the competition?
  4. Is there any concern of the privacy about the released data? Have you obtained the right to release the data for competition from the legal department of your institute?
  5. Do you require the winners to submit the source code of their winning solutions?
  6. Names, affiliations, postal addresses, phone numbers, and short biographies of the organizers.
  7. (Optional) An endorsement letter for the higher-level management of the organization.
Important Dates:

Oct 6, 2015 CFP Release
Dec 1, 2015 Proposal Submission Deadline
Dec 31, 2015 Decision Notification
Feb-March, 2016 Cup starts

Please send your proposals to kddcup2016cfp@gmail.com by Dec 6, 2015.