About

This site is the companion challenge platform for the capstone courses of the Data Science and Artificial Intelligence tracks of the Microsoft Professional Program.

Here learners have the opportunity to dig into a real-world data problem with all the information you need to on this session's Capstone Challenge.

Frequently Asked Questions

I signed up on DrivenData but haven't received a confirmation email

E-mails can take a few minutes to process. If you haven't received your confirmation email:

  • Check spam folder.
  • Wait up to 30 minutes to see if email arrives.
  • If e-mail has not arrived in 30 minutes, contact us.

How do I make a submission?

In order to submit to the competition, you need to build a model that makes predictions for the test set in the competition, and write those predictions to a .csv file in exactly the same format as the submission_format.csv file.

However, if you want to just verify that everything works, you can simply submit the submission format file, which has 0s for every prediction. To do that, follow these steps:

  • Step 0: Download the data from the "Data Download" page
  • Step 1: Go to the "Submissions" page
    Screenshot of the tab labeled 'Submissions'
  • Step 2: Click the "Make a new submssion" button, and then choose the submission_format.csv file that you downloaded from the data download page.
    Screenshot of the 'Make a new submission' button
  • Step 3: Click "Submit" score
    Screenshot of the 'Submit' button

You won't have a good score at this point, but you will at least have made one submission! (Note: you are only allowed to make 3 submissions per day.)

At this point you can to submit your token to EdX by following the instructions in the FAQ. Though, you won't have a very good grade yet with this score!

My submission is still pending/shows loading image/hasn't been scored

Occasionally, submissions can "hang" in an unscored status, meaning that you have a submission counted against your daily limit but haven't received a score yet. If this happens, please contact us and a site admin can requeue the submission for scoring.

It says something's wrong with my submission but I can't figure out what

When you are submitting an incorrect format, we can often provide helpful error messages if the error falls into some common categories (e.g. lack of headers, wrong data types, etc.). However, if you are submitting with an unusual error, you will get a generic message. Most often if you ask on the edX Discussion Board for this course, other users can help figure out what is wrong (note: do not share your whole submission file; just a sample will do). If you still can't figure out the problem or feel time pressure, please contact us.

How do I turn my score here into a grade on edX?

You will get credit on edX by pasting an encrypted token from DrivenData into a simple graded assignment form on the edX course. Here are the steps:

  • Step 0: Navigate to the export page.
    Screenshot of the tab labeled 'Export score for edX'
  • Step 1: Copy the encrypted token representing your best private score (your predictions on the holdout data):
    Screenshot of encrypted token
  • Step 2: Navigate to the graded assignment on the edX course and paste in your token:
    Screenshot of graded assignment on edX
  • Step 3: Click the button to submit and see your score.
    Screenshot of a scored token

Why is my grade on EdX lower than I expected?

Remember: the leaderboard scores and leaderboard benchmarks on the competition platform show your public score, while your grade is based on your private score. In other words, you are being graded on how close your predictions were to the holdout data for which you know the inputs but not the outputs. If your grade doesn't match what you expected, make sure you are using appropriate cross validation techniques to avoid overfitting the training data!

The grading cutoffs are explained under the "Get Your Grade" widget on EdX

I'm having issues transferring my score to edX/viewing my score on edX

If there are any issues accessing this score or concerns that the score is not correct for some reason, please contact us and we will make sure everything is working properly.

I have a question about the data or how to solve the problem

In general, most questions about the dataset, methods being used in the competition, or any questions about the underlying challenge setup should be posted to the edX course Discussion Board. This way, all other users can benefit from the answers and the question asker does not receive unfair assistance. Examples of such questions are:

  • I noticed X about the data — what does this mean?
  • I'm having trouble getting model Y to give good predictions for this problem; what do you recommend?
  • I keep trying to submit a file like Z but it's telling me the format isn't correct — what am I doing wrong?

For questions that may indicate any problems with the predictive challenge or otherwise seem worthy of escalation, please contact us.