This compilation of Kaggle competitions on NLP topics can save you time on your own use cases.


Integer Sequence Learning

Thu 2 Jun 2016 – Fri 30 Sep 2016

The On-Line Encyclopedia of Integer Sequences is a 50+ year effort by mathematicians the world over to catalog sequences of integers.


The Winton Stock Market Challenge

Tue 27 Oct 2015 – Tue 26 Jan 2016

In this recruiting competition, Winton challenges you to take on the very difficult task of predicting the future (stock returns). Given historical stock performance and a host of masked features, can you predict intraday and end-of-day returns without being deceived by all the noise?


Denoising Dirty Documents

Mon 1 Jun 2015 – Mon 5 Oct 2015

Remove noise from printed text


Grasp-and-Lift EEG Detection

Mon 29 Jun 2015 – Mon 31 Aug 2015

Identify hand motions from EEG recordings


Bag of Words Meets Bags of Popcorn

Tue 9 Dec 2014 – Tue 30 Jun 2015

Use Google’s Word2Vec for movie reviews


Billion Word Imputation

Thu 8 May 2014 – Fri 1 May 2015

Find and impute missing words in the billion word corpus


Sentiment Analysis on Movie Reviews

Fri 28 Feb 2014 – Sat 28 Feb 2015

Classify the sentiment of sentences from the Rotten Tomatoes dataset


Predict seizures in intracranial EEG recordings

Mon 25 Aug 2014 – Mon 17 Nov 2014

Predict seizures in intracranial EEG recordings


Tradeshift Text Classification

Thu 2 Oct 2014 – Mon 10 Nov 2014

Classify text blocks in documents


The Hunt for Prohibited Content

Tue 24 Jun 2014 – Sun 31 Aug 2014

Predict which ads contain illicit content


Large Scale Hierarchical Text Classification

Wed 22 Jan 2014 – Tue 22 Apr 2014

Classify Wikipedia documents into one of 325,056 categories


Personalized Web Search Challenge

Fri 11 Oct 2013 – Fri 10 Jan 2014

Re-rank web documents using personal preferences


Facebook Recruiting III - Keyword Extraction

Fri 30 Aug 2013 – Fri 20 Dec 2013

This competition tests your text skills on a large dataset from the Stack Exchange sites. The task is to predict the tags (a.k.a. keywords, topics, summaries), given only the question text and its title. The dataset contains content from disparate Stack Exchange sites, with a mix of both technical and non-technical questions.


Partly Sunny with a Chance of Hashtags

Fri 27 Sep 2013 – Sun 1 Dec 2013

What can a #machine learn from tweets about the #weather?


Multi-label Bird Species Classification - NIPS 2013

Wed 16 Oct 2013 – Sun 24 Nov 2013

Identify which of 87 classes of birds and amphibians are present in 1,000 continuous wild sound recordings


Belkin Energy Disaggregation Competition

Tue 2 Jul 2013 – Wed 30 Oct 2013

Disaggregate household energy consumption into individual appliances


MLSP 2013 Bird Classification Challenge

Mon 17 Jun 2013 – Mon 19 Aug 2013

Predict the set of bird species present in an audio recording, collected in field conditions.


The ICML 2013 Whale Challenge - Right Whale Redux

Fri 10 May 2013 – Mon 17 Jun 2013

Develop recognition solutions to detect and classify right whales for BIG data mining and exploration studies


The ICML 2013 Bird Challenge

Wed 8 May 2013 – Mon 17 Jun 2013

Identify bird species from continuous audio recordings


CPROD1: Consumer PRODucts contest #1

Mon 2 Jul 2012 – Mon 24 Sep 2012

Identify product mentions within a largely user-generated web-based corpus and disambiguate the mentions against a large product catalog.


Detecting Insults in Social Commentary

Tue 18 Sep 2012 – Fri 21 Sep 2012

The challenge is to detect when a comment from a conversation would be considered insulting to another participant in the conversation.


GigaOM WordPress Challenge: Splunk Innovation Prospect

Wed 20 Jun 2012 – Fri 7 Sep 2012

Predict which blog posts someone will like.


The Hewlett Foundation: Short Answer Scoring

Mon 25 Jun 2012 – Wed 5 Sep 2012

Develop a scoring algorithm for student-written short-answer responses.


EMC Israel Data Science Challenge

Mon 18 Jun 2012 – Sat 1 Sep 2012

Match source code files to the open source code project


The Hewlett Foundation: Automated Essay Scoring

Fri 10 Feb 2012 – Mon 30 Apr 2012

Develop an automated scoring algorithm for student-written essays.


The Marinexplore and Cornell University Whale Detection Challenge

Fri 8 Feb 2013 – Mon 8 Apr 2013

Create an algorithm to detect North Atlantic right whale calls from audio recordings and help prevent collisions with shipping traffic


ICFHR 2012 - Arabic Writer Identification

Tue 21 Feb 2012 – Sun 15 Apr 2012

Identify which writer wrote which documents.


ICDAR 2011 - Arabic Writer Identification

Mon 28 Feb 2011 – Sun 10 Apr 2011

This competition requires participants to develop an algorithm to identify who wrote which documents. The winner will be honored at a special session of the ICDAR 2011 conference.