Challenge has Ended

Spoonshot Internship Hiring AI Challenge
Invite only

The aim of this challenge is to classify the given text into multiple categories.

Difficulty

Data Science
Hiring
Interns
Certificate
₹20,000.00 - ₹25,000.00 per Month
Ended
156 Participants
9,308 Views
Organized By Spoonshot


Welcome to Spoonshot Internship Hiring AI Challenge!

Spoonshot is hiring data science interns and the people solving this challenge will get a chance to be interviewed by them. Please refer to the quickstart guide on how to get started.

AIM: The aim of this challenge is to develop a machine learning model(s) to classify given text data into multiple classes. The text can belong to multiple classes simultaneously.

The dataset contains rows of 6 classes of Research Topics. One row may belong to multiple classes as well. The dataset consists of 3 files "TRAIN.csv", "TEST.csv" and "Sample_Submission.csv". The TRAIN.csv file contains the training data and labels for training. It's very important to note that the "labels" may contain multiple classes (comma(,) separated) and can be in any order:

  • Computer Science
  • Physics
  • Mathematics
  • Statistics
  • Quantitative Biology
  • Quantitative Finance

Please refer to TRAIN.csv labels to see the format taken.

NOTE: If a row belong to three classes, for example "Computer Science", "Physics" and "Quantitative Biology", the answer would be "Computer Science,Physics,Quantitative Biology "

Please refer to the Submission Guidelines for the final format of the submission.

The final submission must include:

output.csv (single file having results for "TEST.csv” which contains 3972 data rows and 1 header row.

The first row should correspond to the column names "Index" and "Labels".

  • "Index" corresponds to row index of TEST.csv.
  • "Labels" correspond to multi-class output with comma-separated values.
  • Final submission has "output.csv" for score evaluation, Ipython notebook is not the solution but has to be submitted alongside output.csv via "My Submissions" section of this challenge.

Sample Output:

...

NOTE: If a row belong to three classes, for example "Computer Science", "Physics" and "Quantitative Biology", the answer would be "Computer Science,Physics,Quantitative Biology "


How to make a submission?

  1. Click on "My Submission"
  2. On the next page, click on "+ New Submission"
  3. Upload your CSV in the next page and click on "Submit for Review"


Please note:

  1. You must submit your CSV file by uploading the CSV in the "My Submissions" section of this challenge.
  2. Your submission will be auto graded and you will be able to see your results instantly.
  3. If there is any error in the submission, your final score will be marked as 0 or a warning prompt of “Invalid Score” will be displayed..

Judgement

  1. F1 Accuracy Score
  2. If the scores are tied, the person reaching the score FIRST will get the better rank.

Rules

  1. It is mandatory to provide your solution by uploading your .ipynb notebook in the My Submissions section of this challenge.
  2. We are using automatic anti-cheat system. Our system flags submission which have low trust scores. Manually modifying your submission CSV (via comparison of any form etc.), using image comparison techniques (pixel matching, file size matching etc,), in any form, can lead to your disqualification without any notice.
  3. The participants must use only the provided dataset for training.
  4. The submission should be in a proper format as described by "Submission Guidelines".
  5. Late submission will not be accepted beyond provided deadline (Indian Standard Time).

NOTE: We may request for the code files if there is any discrepancy in your score. Your score will be marked invalid or you can be disqualified if the request for code is not fulfilled by you.

Rewards:

  1. Internship offering with stipend Rs.20,000 to Rs.25,000.
  2. Participants with at least 1 successful submission will receive participation certificates from "Spoonshot" for their commendable effort.
  3. The Top 3 participants will receive a permanent place in Dockship's Hall of Fame page.
How do I apply for this Challenge?
How do I download the dataset?
Can I make multiple submissions?
Where will the results be declared?
Can we apply as a team?
I've other queries, where can I get support?
Challenge Announced
26-Oct-2020, 6:35 pm IST
28-Oct-2020, 10:00 am IST
Challenge Started
Challenge Ended
29-Oct-2020, 10:00 pm IST