Multi-Task Pre-Training of Deep Neural Networks for Digital Pathology

In this work, we investigate multi-task learning as a way of pre-training models for classification tasks in digital pathology. It is motivated by the fact that many small and medium-size datasets have been released by the community over the years whereas there is no large scale dataset similar to ImageNet in the domain. We first assemble and transform many digital pathology datasets into a pool of 22 classification tasks and almost 900k images. Then, we propose a simple architecture and training scheme for creating a transferable model and a robust evaluation and selection protocol in order to evaluate our method. Depending on the target task, we show that our models used as feature extractors either improve significantly over ImageNet pre-trained models or provide comparable performance. Fine-tuning improves performance over feature extraction and is able to recover the lack of specificity of ImageNet features, as both pre-training sources yield comparable performance.

[1]  Heng Huang,et al.  Supervised Intra-embedding of Fisher Vectors for Histopathology Image Classification , 2017, MICCAI.

[2]  Elisa Ficarra,et al.  Dealing with Lack of Training Data for Convolutional Neural Networks: The Case of Digital Pathology , 2019, Electronics.

[3]  Catarina Eloy,et al.  BACH: Grand Challenge on Breast Cancer Histology Images , 2018, Medical Image Anal..

[4]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[5]  Nasir M. Rajpoot,et al.  PanNuke: An Open Pan-Cancer Histology Dataset for Nuclei Instance Segmentation and Classification , 2019, ECDP.

[6]  Timo Heikkinen,et al.  Improving Prostate Cancer Detection with Breast Histopathology Images , 2019, ECDP.

[7]  Yu Zhang,et al.  A Survey on Multi-Task Learning , 2017, IEEE Transactions on Knowledge and Data Engineering.

[8]  Jean-Christophe Olivo-Marin,et al.  An approach for detection of glomeruli in multisite digital pathology , 2016, 2016 IEEE 13th International Symposium on Biomedical Imaging (ISBI).

[9]  Francesco Bianconi,et al.  Multi-class texture analysis in colorectal cancer histology , 2016, Scientific Reports.

[10]  Gilles Louppe,et al.  Collaborative analysis of multi-gigapixel imaging data using Cytomine , 2016, Bioinform..

[11]  S. Levine,et al.  Gradient Surgery for Multi-Task Learning , 2020, NeurIPS.

[12]  Nima Tajbakhsh,et al.  Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning? , 2016, IEEE Transactions on Medical Imaging.

[13]  Luiz Eduardo Soares de Oliveira,et al.  Deep features for breast cancer histopathological image classification , 2017, 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[14]  Marcel Worring,et al.  Many Task Learning With Task Routing , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[15]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[16]  Kevin Smith,et al.  Digital image analysis in breast pathology-from image processing techniques to artificial intelligence. , 2017, Translational research : the journal of laboratory and clinical medicine.

[17]  Bram van Ginneken,et al.  A survey on deep learning in medical image analysis , 2017, Medical Image Anal..

[18]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[19]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[21]  Lassi Paavolainen,et al.  BIAFLOWS: A Collaborative Framework to Reproducibly Deploy and Benchmark Bioimage Analysis Workflows , 2020, Patterns.

[22]  Jiaying Liu,et al.  Adaptive Batch Normalization for practical domain adaptation , 2018, Pattern Recognit..

[23]  Shahryar Rahnamayan,et al.  Classification and Retrieval of Digital Pathology Scans: A New Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[24]  Lassi Paavolainen,et al.  BIAFLOWS: A collaborative framework to benchmark bioimage analysis workflows , 2019, bioRxiv.

[25]  Yolanda T. Chong,et al.  Automated analysis of high‐content microscopy data with deep learning , 2017, Molecular systems biology.

[26]  Karl Rohr,et al.  Predicting breast tumor proliferation from whole‐slide images: The TUPAC16 challenge , 2018, Medical Image Anal..

[27]  Yoshua Bengio,et al.  How transferable are features in deep neural networks? , 2014, NIPS.

[28]  Stefan Carlsson,et al.  CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[29]  Manfred Claassen,et al.  Coupling weak and strong supervision for classification of prostate cancer histopathology images , 2018, ArXiv.

[30]  Bohyung Han,et al.  Domain-Specific Batch Normalization for Unsupervised Domain Adaptation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Liron Pantanowitz,et al.  Artificial Intelligence and Digital Pathology: Challenges and Opportunities , 2018, Journal of pathology informatics.

[32]  Zijian Zhang,et al.  What And How Other Datasets Can Be Leveraged For Medical Imaging Classification , 2019, 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019).

[33]  Andrew H. Beck,et al.  Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer , 2017, JAMA.

[34]  Rich Caruana,et al.  Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.

[35]  Luiz Eduardo Soares de Oliveira,et al.  A Dataset for Breast Cancer Histopathological Image Classification , 2016, IEEE Transactions on Biomedical Engineering.

[36]  Daisuke Komura,et al.  Machine Learning Methods for Histopathological Image Analysis , 2017, Computational and structural biotechnology journal.

[37]  Shang Hong,et al.  What And How Other Datasets Can Be Leveraged For Medical Imaging Classification , 2019 .

[38]  Ronald M. Summers,et al.  Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning , 2016, IEEE Transactions on Medical Imaging.

[39]  Zhenbing Liu,et al.  Multi-task Deep Learning for Fine-Grained Classification/Grading in Breast Cancer Histopathological Images , 2019, Cognitive Internet of Things.

[40]  Andrew Janowczyk,et al.  Deep learning for digital pathology image analysis: A comprehensive tutorial with selected use cases , 2016, Journal of pathology informatics.

[41]  Harald Burgsteiner,et al.  Training echo state networks for rotation-invariant bone marrow cell classification , 2016, Neural Computing and Applications.

[42]  Martial Hebert,et al.  Cross-Stitch Networks for Multi-task Learning , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[43]  Xiang Zhang,et al.  OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.

[44]  Nasir M. Rajpoot,et al.  Locality Sensitive Deep Learning for Detection and Classification of Nuclei in Routine Colon Cancer Histology Images , 2016, IEEE Trans. Medical Imaging.

[45]  Jieping Ye,et al.  Deep Model Based Transfer and Multi-Task Learning for Biological Image Analysis , 2015, IEEE Transactions on Big Data.

[46]  Hamid R. Tizhoosh,et al.  Deep Features for Tissue-Fold Detection in Histopathology Images , 2019, ECDP.

[47]  Raphaël Marée,et al.  Comparison of Deep Transfer Learning Strategies for Digital Pathology , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[48]  Matti Pietikäinen,et al.  Identification of tumor epithelium and stroma in tissue microarrays using texture analysis , 2012, Diagnostic Pathology.

[49]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[50]  Quoc V. Le,et al.  Do Better ImageNet Models Transfer Better? , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[51]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[52]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[53]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[54]  Riccardo Cicchi,et al.  Few Shot Learning in Histopathological Images:Reducing the Need of Labeled Data on Biological Datasets , 2019, 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019).

[55]  Lubomir M. Hadjiiski,et al.  Multi-task transfer learning deep convolutional neural network: application to computer-aided diagnosis of breast cancer on mammograms , 2017, Physics in medicine and biology.