CERVC
HiddenPredicting Cervical Cancer
π΅οΈ Introduction
We have said it earlier and we say it again - 'With Great Power Comes Great Responsibility' And yes we do have the power to do good for the world. Let us be responsible and put that power to use.
This time, we pick up our weapons against cancer
.
Given information of different risk
factors in a woman, predict
as best as possible, the presence
or absence
of cervical cancer
in the woman.
Understand with code! Here is getting started code
for you.π
πΎ Dataset
This dataset contains indicators and risk factors for predicting whether a woman will get cervical cancer
. There are total of 15
attributes out of which first 14
features include demographic data such as age
, lifestyle
, and medical history
. The last attribute called Biopsy
is target variable and it's value is 0
for Healthy
and 1
for Cancer
. The first 14
attributes are as:
- Age [ in years ]
- Number of sexual partners
- First sexual intercourse [ age in years ]
- Number of pregnancies
- Smoking [ yes or no ]
- Smoking [ in years ]
- Hormonal contraceptives [ yes or no ]
- Hormonal contraceptives [ in years ]
- Intrauterine device [ yes or no (IUD) ]
- Number of years with an intrauterine device (IUD)
- Has patient ever had a sexually transmitted disease (STD) [ yes or no ]
- Number of STD diagnoses
- Time since first STD diagnosis
- Time since last STD diagnosis
- The biopsy results - Target outcome.[
0
forHealthy
or1
forCancer
]
π Files
Following files are available in the resources
section:
train.csv
- (686
samples) This csv file contains the attributes describing the risk factors along with its biopsy results.test.csv
- (172
samples) File that will be used for actual evaluation for the leaderboard score but does not have its biopsy result.
π Submission
- Prepare a CSV containing header as
Biopsy
and predicted value as digit0
or1
with name assubmission.csv
. - Sample submission format available at
sample_submission.csv
.
Make your first submission here π !!
π Evaluation Criteria
During evaluation F1 score and Log Loss will be used to test the efficiency of the model where,
π Links
- πͺ Challenge Page: https://www.aicrowd.com/challenges/cervc
- π£οΈ Discussion Forum: https://www.aicrowd.com/challenges/cervc/discussion
- π Leaderboard: https://www.aicrowd.com/challenges/cervc/leaderboards
π± Contact
Notebooks
0
|
0
|