Skip to content

Latest commit

 

History

History
94 lines (64 loc) · 4.6 KB

README.md

File metadata and controls

94 lines (64 loc) · 4.6 KB

Cervical Cancer Screening - Kaggle Challenge Introduction Recently, Intel partnered with MobileODT to challenge Kagglers to develop an algorithm which accurately identifies a woman’s cervix type based on images. Their motivation: doing so will prevent ineffectual treatments and allow healthcare providers to give proper referral for cases that require more advanced treatment.

The following notebook is my solution for the presented task.

In this competition, we had to develop algorithms to correctly classify cervix types based on cervical images. These different types of cervix in our data set are all considered normal (not cancerous), but since the transformation zones aren't always visible, some of the patients require further testing while some don't. This decision is very important for the healthcare provider and critical for the patient. Identifying the transformation zones is not an easy task for the healthcare providers, therefore, an algorithm-aided decision will significantly improve the quality and efficiency of cervical cancer screening for these patients.

Dataset The training dataset comprises of 1481 images belonging to 3 different categories, with the following distribution:

Type 1 - 252 images

Type 2 - 780 images

Type 3 - 449 images

The competition was held in two stages where we were provided 2 test datasets for reporting our results. After stage 1, the output classes of stage 1 test images were released, so as to give kagglers a chance to improve and fine tune their models. The number of images provided for testing ast 2 stages are:

Stage 1 Test: 512 images

Stage 2 Test (Final): 4018 images

The final loss and accuracy were to be reported by tagging 4018 images.

The dataset is available on Kaggle here

Preprocessing Data Data preprocessing comprises of the following steps:

Resizing all images to same size (32 x 32 x 3)

Normalizing pixel values

Applying image deformations (Random Scaling + Rotations) for regularization

Storing data in a loadable numpy format

Model Using a CNN was a default choice given we have to build an image classifier.

We shall be using:

Two 2D-Convolutional layers followed by Max Pooling layers

ReLU activations

Dropout between output of second convolutional block and input of fully connected layer

Two fully connected layers for classification with dropout

Hyperbolic Tan activation for FC-1 layer

Softmax activation for FC-2 layer (Obvious choice, given a multiclass classification problem)

Adamax optimizer - a variant of Adam based on the infinity norm

Model architecture:


Layer (type) Output Shape Param #

conv2d_3 (Conv2D) (None, 4, 30, 30) 112


max_pooling2d_3 (MaxPooling2 (None, 4, 15, 15) 0


conv2d_4 (Conv2D) (None, 8, 13, 13) 296


max_pooling2d_4 (MaxPooling2 (None, 8, 6, 6) 0


dropout_3 (Dropout) (None, 8, 6, 6) 0


flatten_2 (Flatten) (None, 288) 0


dense_3 (Dense) (None, 12) 3468


dropout_4 (Dropout) (None, 12) 0


dense_4 (Dense) (None, 3) 39

Total params: 3,915 Trainable params: 3,915 Non-trainable params: 0


Results The simple convolutional model implemented in this notebook was able to generate a score of 0.96407.

This helped me achieve a rank of #110 on Kaggle leaderboard.

Future considerations: I believe a higher score can be achieved by Transfer Learning. Fine tuning a pretrained model such as Inception-V3, VGG19, ResNet-50 can definitely boost the model accuracy.

Many kagglers reported improved results by using R-CNN like approach i.e generating bounding boxes around regions of interest and generating probability predictions.

I would definitely consider exploring these ideas in future implementations!