Deep learning computer network developed to accurately identify and classify breast cancer
Washington D.C.,May11: A deep-learning computer network has been developed that can accurately identify and delineate breast cancers on the digital tissue slides.
The network developed through research led by Case Western Reserve University was 100 percent accurate in determining whether invasive forms of breast cancer were present in whole biopsy slides.Looking closer, the network correctly made the same determination in each individual pixel of the slide 97 percent of the time, rendering near-exact delineations of the tumours.
Compared to the analyses of four pathologists, the machine was more consistent and accurate, in many cases improving on their delineations.In a field where time and accuracy can be critical to a patient’s long-term prognosis, the study is a step toward automating part of biopsy analysis and improving the efficiency of the process, the researchers said.
Currently, cancer is present in one in 10 biopsies ordered by physicians, but all must be analyzed by pathologists to identify the extent and volume of the disease, determine if it has spread and whether the patient has an aggressive or indolent cancer and needs chemotherapy or a less drastic treatment.
“If the network can tell which patients have cancer and which do not, this technology can serve as triage for the pathologist, freeing their time to concentrate on the cancer patients,” said co-author Anant Madabushi.
To train the deep-learning network, the researchers downloaded 400 biopsy images from multiple hospitals. Each slide was approximately 50,000 x 50,000 pixels. The computer navigated through or rectified the inconsistencies of different scanners, staining processes and protocols used by each site, to identify features in cancer versus the rest of the tissue.
The researchers then presented the network with 200 images from The Cancer Genome Atlas and University Hospitals Cleveland Medical Center. The network scored 100 percent on determining the presence or absence of cancer on whole slides and nearly as high per pixel.
Network training took about two weeks, and identifying the presence and exact location of cancer in the 200 slides took about 20 to 25 minutes each.That was done two years ago. Madabhushi suspects training now, with new computer architecture, would take less than a day, and cancer identification and delineation could be done in less than a minute per slide.
“To put this in perspective,” Madabhushi said, “the machine could do the analysis during ‘off hours,’ possibly running the analysis during the night and providing the results ready for review by the pathologist when she/he were to come into the office in the morning.”The study is published in Scientific Reports.