Mammography is the most widely used screening modality for the detection of breast cancer. There is evidence that it decreases breast cancer mortality in women aged 50 to 69 years and that it is associated with harms, including the detection of clinically insignificant cancers that pose no threat to life (overdiagnosis). The benefit of mammography for women aged 40 to 49 years is uncertain.   Clinical breast examination (CBE) has not been studied as an independent screening test. Breast self-exam has been shown to have no mortality benefit. Technologies such as ultrasound, magnetic resonance imaging, and molecular breast imaging are being evaluated, usually as adjuncts to mammography, and are not primary screening tools in the average population.
Informed medical decision making is increasingly recommended for individuals who are considering cancer screening. Many different types and formats of decision aids have been studied. (Refer to the PDQ summary on Cancer Screening Overview for more information.)
Randomized controlled trials (RCTs) initiated 50 years ago provide evidence that screening mammography improves breast cancer survival for women aged 60 to 69 years (solid evidence) and women aged 50 to 59 years (fair evidence). Population-based studies done more recently raise questions as to the benefits to screened populations who participate in screening for longer time periods.
The validity of meta-analyses of RCT demonstrating a mortality benefit is limited by improvements in medical imaging and treatment in the decades since their completion. The 25-year follow-up from the Canadian National Breast Screening Study (CNBSS),  completed in 2014, showed no survival benefit associated with screening mammograms.
Based on solid evidence, screening mammography may lead to the following harms:
For all of these conclusions regarding potential harms from screening mammography, internal validity, consistency, and external validity are good.
The CNBSS trial did not study the efficacy of CBE versus no screening. Ongoing randomized trials, two in India and one in Egypt, are designed to assess the efficacy of screening CBE but have not reported mortality data.       Thus, the efficacy of screening CBE cannot be assessed yet.
Screening by CBE may lead to the following harms:
BSE has been compared with no screening and has been shown to have no benefit in reducing breast cancer mortality.
There is solid evidence that formal instruction and encouragement to perform BSE leads to more breast biopsies and more diagnoses of benign breast lesions.
Breast cancer is the most common noncutaneous cancer in U.S. women, with an estimated 268,600 cases of invasive disease, 62,930 cases of in situ disease, and 41,760 deaths expected in 2019.  Women with inherited risk, especially BRCA1 and BRCA2 gene carriers, comprise no more than 10% of breast cancer cases. Males account for 1% of breast cancer cases and breast cancer deaths. 
The biggest risk factor for breast cancer is being female followed by advancing age. Other risk factors include hormonal aspects (such as early menarche, late menopause, nulliparity, late first pregnancy, and postmenopausal hormone therapy), alcohol consumption, and exposure to ionizing radiation.
Breast cancer incidence in white women is higher than in black women, who also have a lower survival rate for every stage when diagnosed. This may reflect differences in screening behavior and access to healthcare. Hispanic and Asian-Pacific islanders have lower incidence and mortality than whites or blacks. 
Breast cancer incidence depends on reproductive issues (such as early vs. late pregnancy, multiparity, and breastfeeding), participation in screening, and postmenopausal hormone usage. The incidence of breast cancer (especially ductal carcinoma in situ [DCIS]) increased dramatically after mammography was widely adopted in the United States and the United Kingdom.  Widespread use of postmenopausal hormone therapy was associated with a dramatic increase in breast cancer incidence, a trend that reversed when its use decreased. 
In any population, the adoption of screening is not followed by a decline in the incidence of advanced-stage cancer.
Women with breast symptoms undergo diagnostic mammography as opposed to screening mammography, which is done in asymptomatic women. In a 10-year study of breast symptoms prompting medical attention, a breast mass led to a cancer diagnosis in 10.7% of cases, whereas pain was associated with cancer in only 1.8% of cases. 
Breast cancer can be diagnosed when breast tissue cells removed during a biopsy are studied microscopically. The breast tissue to be sampled can be identified by an abnormality on an imaging study or because it is palpable. Breast biopsies can be performed with a thin needle attached to a syringe (fine-needle aspirate), a larger needle (core biopsy), or by excision (excisional biopsy). Image guidance can improve accuracy. Needle biopsies sample an abnormal area large enough to make a diagnosis. Excisional biopsies aim to remove the entire region of abnormality.
DCIS is a noninvasive condition that can be associated with, or evolve into, invasive cancer, with variable frequency and time course.  Some authors include DCIS with invasive breast cancer statistics, but others argue that it would be better if the term were replaced with ductal intraepithelial neoplasia, similar to the terminology used for cervical and prostate precursor lesions, and that excluding DCIS from breast cancer statistics should be considered.
DCIS is most often diagnosed by mammography. In the United States, only 4,900 women were diagnosed with DCIS in 1983 before the adoption of mammography screening, compared with approximately 62,930 women who are expected to be diagnosed in 2019.    The Canadian National Breast Screening Study-2, which evaluated women aged 50 to 59 years, found a fourfold increase in DCIS cases in women screened by clinical breast examination (CBE) plus mammography compared with those screened by CBE alone, with no difference in breast cancer mortality.  (Refer to the PDQ summary on Breast Cancer Treatment for more information.)
The natural history of DCIS is poorly understood because nearly all DCIS cases are detected by screening and nearly all are treated. Development of breast cancer after treatment of DCIS depends on the pathologic characteristics of the lesion and on the treatment. In a randomized trial, 13.4% of women whose DCIS was excised by lumpectomy developed ipsilateral invasive breast cancer within 90 months, compared with 3.9% of those treated by both lumpectomy and radiation.  Among women diagnosed and treated for DCIS, the percentage of women who died of breast cancer is lower than that for the age-matched population at large.   This favorable outcome may reflect the benign nature of the condition, the benefits of treatment, or the volunteer effect (i.e., women who undergo breast cancer screening are generally healthier than those who do not do so).
Atypia, which is a risk factor for breast cancer, is found in 4% to 10% of breast biopsies.   Atypia is a diagnostic classification with considerable variation among practicing pathologists. 
The range of pathologists' diagnoses of breast tissue includes benign without atypia, atypia, DCIS, and invasive breast cancer. The incidence of atypia and DCIS breast lesions has increased over the past three decades as a result of widespread mammography screening, although atypia is generally mammographically occult.   Misclassification of breast lesions may contribute to either overtreatment or undertreatment of lesions—with variability especially in the diagnoses of atypia and DCIS.      
The largest study on this topic, the B-Path study, involved 115 practicing U.S. pathologists who interpreted a single-breast biopsy slide per case, and it compared their interpretations with an expert consensus-derived reference diagnosis.  While the overall agreement between the individual pathologists’ interpretations and the expert reference diagnoses was highest for invasive carcinoma, there were markedly lower levels of agreement for DCIS and atypia.  As the B-Path study included higher proportions of cases of atypia and DCIS than typically seen in clinical practice, the authors expanded their work by applying Bayes’ theorem to estimate how diagnostic variability affects accuracy from the perspective of a U.S. woman aged 50 to 59 years having a breast biopsy.  At the U.S. population level, it is estimated that 92.3% (confidence interval [CI], 91.4%–93.1%) of breast biopsy diagnoses would be verified by an expert reference consensus diagnosis, with 4.6% (CI, 3.9%–5.3%) of initial breast biopsies estimated to be overinterpreted and 3.2% (CI, 2.7%–3.6%) under interpreted. Figure 1 shows the predicted outcomes per 100 breast biopsies, overall and by diagnostic category.Figure 1. Predicted outcomes per 100 breast biopsies, overall and by diagnostic category. From Annals of Internal Medicine, Elmore JG, Nelson HD, Pepe MS, Longton GM, Tosteson AN, Geller B, Onega T, Carney PA, Jackson SL, Allison KH, Weaver DL, Variability in Pathologists' Interpretations of Individual Breast Biopsy Slides: A Population Perspective, Volume 164, Issue 10, Pages 649–55, Copyright © 2016 American College of Physicians. All Rights Reserved. Reprinted with the permission of American College of Physicians, Inc.
To address the high rates of discordance in breast tissue diagnosis, laboratory policies that require second opinions are becoming more common. A national survey of 252 breast pathologists participating in the B-Path study found that 65% of respondents reported having a laboratory policy that requires second opinions for all cases initially diagnosed as invasive disease. Additionally, 56% of respondents reported policies that require second opinions for initial diagnoses of DCIS, while 36% of respondents reported mandatory second opinion policies for cases initially diagnosed as atypical ductal hyperplasia.  In this same survey, pathologists overwhelmingly agreed that second opinions improved diagnostic accuracy (96%).
A simulation study that used B-Path study data evaluated 12 strategies for obtaining second opinions to improve interpretation of breast histopathology.  Accuracy improved significantly with all second-opinion strategies, except for the strategy limiting second opinions only to cases of invasive cancer. Accuracy improved regardless of the pathologists’ confidence in their diagnosis or their level of experience. While the second opinions improved accuracy, they did not completely eliminate diagnostic variability, especially in the challenging case of breast atypia.
Women with an increased risk of breast cancer caused by a BRCA1 or BRCA2 genetic mutation might benefit from increased screening. (Refer to the PDQ summary on Genetics of Breast and Gynecologic Cancers for more information.)
Women with Hodgkin and non-Hodgkin lymphoma who were treated with mantle irradiation have an increased risk of breast cancer, starting 10 years after completing therapy and continuing life-long. Therefore, screening mammography has been advocated, even though it may begin at a relatively young age.  
The potential benefits of screening mammography occur well after the examination, often many years later, whereas the harms occur immediately. Therefore, women with limited life expectancy and comorbidities who suffer harms may do so without benefit. Nonetheless, many of these women undergo screening mammography.  In one study, approximately 9% of women with advanced cancer underwent cancer screening tests. 
Screening mammography may yield cancer diagnoses in approximately 1% of elderly women , but most of these cancers are low risk.  The question remains whether the diagnosis and treatment of localized breast cancer in elderly women is beneficial.
There is no evidence of benefit in performing screening mammography in average-risk women younger than 40 years.
Approximately 1% of all breast cancers occur in men.  Most cases are diagnosed during the evaluation of palpable lesions, which are generally easy to detect. Treatment consists of surgery, radiation, and systemic adjuvant hormone therapy or chemotherapy. (Refer to the PDQ summary on Male Breast Cancer Treatment for more information.) Screening is unlikely to be beneficial.
Mammography utilizes ionizing radiation to image breast tissue. The examination is performed by compressing the breast firmly between two plates, which spreads out overlapping tissues and reduces the amount of radiation needed for the image. For routine screening in the United States, examinations are taken in both mediolateral oblique and craniocaudal projections.  Both views will include breast tissue from the nipple to the pectoral muscle. Radiation exposure is 4 to 24 mSv per standard two-view screening examination. Two-view examinations have a lower recall rate than single-view examinations because they reduce concern about abnormalities caused by superimposition of normal breast structures.  Two-view exams have lower interval cancer rates than single-view exams. 
Under the Mammography Quality Standards Act (MQSA) enacted by Congress in 1992, all U.S. facilities that perform mammography must be certified by the U.S. Food and Drug Administration (FDA) to ensure the use of standardized training for personnel and a standardized mammography technique utilizing a low radiation dose.  (Refer to the FDA's web page on Mammography Facility Surveys, Mammography Equipment Evaluations, and Medical Physicist Qualification Requirement under MQSA.) The 1998 MQSA Reauthorization Act requires that patients receive a written lay-language summary of mammography results.
The following Breast Imaging Reporting and Data System (BI-RADS) categories are used for reporting mammographic results: 
Most screening mammograms are interpreted as negative or benign (BI-RADS 1 or 2, respectively); about 10% of women in the United States are asked to return for additional evaluation.  The percentage of women asked to return for additional evaluation varies not only by the inherent characteristics of each woman but also by the mammography facility and radiologist. 
Digital mammography is more expensive than screen-film mammography (SFM) but is more amenable to data storage and sharing. Performance of both SFM and digital mammography for cancer detection rate, sensitivity, specificity, and positive predictive value (PPV) has been compared directly in several trials, with similar results in most patient groups.
The Digital Mammographic Imaging Screening Trial (DMIST) compared the findings of digital and film mammograms in 42,760 women at 33 U.S. centers. Although digital mammography detected more cancers in women younger than 50 years (area under the curve [AUC] of 0.84 +/- 0.03 for digital; AUC of 0.69 +/- 0.05 for film; P = .002), there was no difference in breast cancer detection overall.  A second DMIST report found a trend toward higher AUC for film mammography than for digital mammography in women aged 65 years and older. 
Another large U.S. cohort study  also found slightly better sensitivity for film mammography for women younger than 50 years with similar specificity.
A Dutch study compared the findings of 1.5 million digital versus 4.5 million screen-film screening mammograms performed between 2004 and 2010. A higher recall and cancer detection rate was observed for the digital screens.  A meta-analysis  of 10 studies, including the DMIST   and the U.S. cohort study,  compared digital mammography and film mammography in 82,573 women who underwent both types of the exam. In a random-effects model, there was no statistically significant difference in cancer detection between the two types of mammography (AUC of 0.92 for film and AUC of 0.91 for digital). For women younger than 50 years, all studies found that sensitivity was higher for digital mammography, but specificity was either the same or higher for film mammography.
Computer-aided detection (CAD) systems highlight suspicious regions, such as clustered microcalcifications and masses,  generally increasing sensitivity, decreasing specificity,  and increasing detection of ductal carcinoma in situ (DCIS).  Several CAD systems are in use. One large population-based study that compared recall rates and breast cancer detection rates before and after the introduction of CAD systems, found no change in either rate.   Another large study noted an increase in recall rate and increased DCIS detection but no improvement in invasive cancer detection rate.   Another study, using a large database and digital mammography in women aged 40 to 89 years, found that CAD did not improve sensitivity, specificity, or detection of interval cancers, but it did detect more DCIS. 
The use of new screening mammography modalities by more than 270,000 women aged 65 years and older in two time periods, 2001 to 2002 and 2008 to 2009, was examined, relying on a Surveillance, Epidemiology, and End Results (SEER)–Medicare-linked database. Digital mammography increased from 2% to 30%, CAD increased from 3% to 33%, and spending increased from $660 million to $962 million. CAD was used in 74% of screening mammograms paid for by Medicare in 2008, almost twice as many screening mammograms as in 2004. There was no difference in detection rates of early-stage (DCIS or stage I) or late-stage (stage IV) tumors. 
Tomosynthesis, or 3-dimensional (3-D) mammography, like standard 2-D mammography, compresses the breast and uses x-rays to create the image. Multiple short-exposure x-rays are obtained at different angles. Some cancers are better seen with this method than on mammography or ultrasound. The radiation dose is double that of 2-D mammography.
Tomosynthesis has been evaluated only in limited studies, and some professional groups consider it investigational. It is not universally reimbursed.
Regardless of stage, nodal status, and tumor size, screen-detected cancers have a better prognosis than those diagnosed outside of screening.  This suggests that they are biologically less lethal (perhaps slower growing and less likely to invade locally and metastasize), and are useful for predicting prognosis and for treatment planning.
A 10-year follow-up study of 1,983 Finnish women with invasive breast cancer demonstrated that the method of cancer detection is an independent prognostic variable. When controlled for age, nodal status, and tumor size, screen-detected cancers had a lower risk of relapse and better overall survival. For women whose cancers were detected outside of screening, the hazard ratio (HR) for death was 1.90 (95% confidence interval [CI], 1.15–3.11), even though they were more likely to receive adjuvant systemic therapy. 
Similarly, an examination of the breast cancers found in three randomized screening trials (Health Insurance Plan, National Breast Screening Study [NBSS]-1, and NBSS-2) accounted for stage, nodal status, and tumor size and determined that patients whose cancer was found via screening had a more favorable prognosis. The relative risks (RR) for death were 1.53 (95% CI, 1.17–2.00) for interval and incident cancers, compared with screen-detected cancers; and 1.36 (95% CI, 1.10–1.68) for cancers in the control group, compared with screen-detected cancers. 
A third study compared the outcomes of 5,604 English women with screen-detected cancers to those with symptomatic breast cancers diagnosed between 1998 and 2003. After controlling for tumor size, nodal status, grade, and patient age, researchers found that the women with screen-detected cancers fared better. The HR for survival of the symptomatic women was 0.79 (95% CI, 0.63–0.99).  
The findings of these studies are also consistent with the evidence that some screen-detected cancers are low risk and represent overdiagnosis.
Numerous uncontrolled trials and retrospective series have documented the ability of mammography to diagnose small, early-stage breast cancers, which have a favorable clinical course.  Individuals whose cancer is detected by screening show a higher survival rate than those whose cancers are not detected by screening even when screening has not prolonged any lives. This concept is explained by the following four types of statistical bias:
The impact of these biases is not known. A new randomized controlled trial (RCT) with cause-specific mortality as the endpoint is needed to determine both survival benefit and impact of overdiagnosis, lead time, length time, and healthy volunteer biases. This is not achievable; randomizing patients to screen and nonscreen groups would be unethical, and at least three decades of follow-up would be needed, during which time changes in treatment and imaging technology would invalidate the results. Decisions must therefore be based on available RCTs, despite their limitations, and on ecologic or cohort studies with adequate control groups and adjustment for confounding. (Refer to the PDQ summary on Cancer Screening Overview for more information.)
Performance benchmarks for screening mammography in the United States are described on the Breast Cancer Surveillance Consortium (BCSC) website. (Refer to the PDQ summary on Cancer Screening Overview for more information.)
The sensitivity of mammography is the percentage of women with breast cancers detected by mammographic screening. Sensitivity depends on tumor size, conspicuity, hormone sensitivity, breast tissue density, patient age, timing within the menstrual cycle, overall image quality, and interpretive skill of the radiologist. Overall sensitivity is approximately 79% but is lower in younger women and in those with dense breast tissue (see the BCSC website).    Sensitivity is not the same as benefit because some woman with possible breast cancer are harmed by overdiagnosis. According to the Physician's Insurance Association of America (PIAA), delay in diagnosis of breast cancer and errors in diagnosis are common causes of medical malpractice litigation. PIAA data from 2002 through 2011 note that the largest total indemnity payments for breast cancer claims are for errors in diagnosis, with an average indemnity payment of $444,557. 
The specificity of mammography is the percentage of all women without breast cancer whose mammograms are negative. The false-positive rate is the likelihood of a positive test in women without breast cancer. Low specificity and high rate of false positives result in unnecessary follow-up examinations and procedures. Because specificity includes all women without cancer in the denominator, even a small percentage of false positives turns out to be a large number in absolute terms. Thus—in screening—a good specificity must be very high. Even 95% specificity is quite low for a screening test.
Interval cancers are cancers that are diagnosed in the interval between a normal screening examination and the anticipated date of the next screening mammogram. One study found interval cancers occurred more often in women younger than 50 years, and had mucinous or lobular histology, high histologic grade, high proliferative activity with relatively benign mammographic features, and no calcifications. Conversely, screen-detected cancers often had tubular histology, small size, low stage, hormone sensitivity, and a major component of DCIS.  Overall, interval cancers have characteristics of rapid growth,   are diagnosed at an advanced stage, and carry a poor prognosis. 
The Nova Scotia Breast Screening Program defined missed cancers as those that were false negatives on the previous screening exam, occurring less often than 1 per 1,000 women. It concluded that interval cancers occurred in approximately 1 per 1,000 women aged 40 to 49 years, and 3 per 1,000 women aged 50 to 59 years. 
Conversely, a larger trial found that interval cancers were more prevalent in women aged 40 to 49 years. Those appearing within 12 months of a negative screening mammogram were usually attributable to greater breast density. Those appearing within a 24-month interval were related to decreased mammographic sensitivity caused by greater breast density or to rapid tumor growth. 
The accuracy of mammography has been noted to vary with patient characteristics, such as a woman's age, breast density, whether it is her first or subsequent exam, and the time since her last mammogram. Younger women have lower sensitivity and higher false-positive rates than do older women.
The Million Women Study in the United Kingdom found decreased sensitivity and specificity in women aged 50 to 64 years if they used postmenopausal hormone therapy, had prior breast surgery, or had a body mass index below 25.  Increased time since the last mammogram increases sensitivity, recall rate, and cancer-detection rate and decreases specificity. 
Sensitivity may be improved by scheduling the exam after the initiation of menses or during an interruption from hormone therapy.  Obese women have more than a 20% increased risk of having false-positive mammography, although sensitivity is unchanged. 
Dense breasts may obscure the detection of small masses on mammography, thereby reducing the sensitivity of mammography.  For women of all ages, high breast density is associated with 10% to 29% lower sensitivity.  High breast density is an inherent trait, which can be inherited   or affected by age; endogenous  and exogenous   hormones;  selective estrogen receptor modulators, such as tamoxifen;  and diet.  Hormone therapy is associated with increased breast density, lower mammographic sensitivity, and an increased rate of interval cancers. 
Digital mammography is more accurate than film mammography in examining dense breasts.  Most U.S. states have enacted laws mandating that mammography facilities report breast density, but inconsistent guidelines have generated confusion and anxiety among patients and health care providers. 
Dense breast tissue is not abnormal. Breast density is a description of the proportion of dense versus fatty tissue in a mammographic image.  The American College of Radiology’s BI-RADS classifies breast density as follows:
The latter two categories are considered dense breast tissue, a description affecting 43% of women aged 40 to 74 years.  A radiologist's assignment of breast density is subjective, and in any woman, it may vary over time.  
While breast density is associated with an increased risk of breast cancer,  density is only a modest risk factor for breast cancer and does not confer a higher risk for breast cancer death. The fourfold elevated risk for breast cancer incidence according to breast density is a comparison of density category d versus density category a.
Supplemental imaging with ultrasonography or breast magnetic resonance imaging (MRI) has been suggested by some groups for screening women with dense breasts, but there are no data showing that this strategy results in lower breast cancer mortality. The potential harm of adding these supplemental screening tests is the likelihood of producing more false positives, leading to additional imaging and breast biopsies, with resultant anxiety and cost.  Supplemental screening may also increase overdiagnosis of breast cancer with resultant overtreatment.
Mucinous and lobular cancers are more easily detected by mammography. Rapidly growing cancers can sometimes be mistaken for normal breast tissue (e.g., medullary carcinomas, an uncommon type of invasive ductal breast cancer that is often associated with the BRCA1 mutation and aggressive characteristics, but that may demonstrate comparatively favorable responses to treatment).   Some other cancers associated with BRCA1/2 mutations, which may appear indolent, can also be missed.  
Radiologists’ performance is variable, affected by levels of experience and the volume of mammograms they interpret.  Biopsy recommendations of radiologists in academic settings have a higher positive PPV than do community radiologists.  Fellowship training in breast imaging may improve detection. 
Performance also varies by facility. Mammographic screening accuracy was higher at facilities offering only screening examinations than at those also performing diagnostic tests. Accuracy was also better at facilities with a breast imaging specialist on staff, performing single rather than double readings, and reviewing performance audits two or more times each year. 
False-positive rates are higher at facilities where concern about malpractice is high and at facilities serving vulnerable women (racial or ethnic minorities and women with less education, limited household income, or rural residence).  These populations may have a higher cancer prevalence and a lack of follow-up. 
The recall rate in the United States is twice that of the United Kingdom, with no difference in the rate of cancer detection. 
The likelihood of diagnosing cancer is highest with the prevalent (first) screening examination, ranging from 9 to 26 cancers per 1,000 screens, depending on the woman’s age. The likelihood decreases for follow-up examinations, ranging from 1 to 3 cancers per 1,000 screens. 
The optimal interval between screening mammograms is unknown; there is little variability across the trials despite differences in protocols and screening intervals. A prospective U.K. trial randomly assigned women aged 50 to 62 years to receive mammograms annually or triennially. Although tumor grade and nodal status were similar in the two groups, more cancers of slightly smaller size were detected in the annual screening group than in the triennial screening group. 
A large observational study found a slightly increased risk of late-stage disease at diagnosis for women in their 40s who were adhering to a 2-year versus a 1-year schedule (28% vs. 21%; odds ratio [OR], 1.35; 95% CI, 1.01–1.81), but no difference was seen for women in their 50s or 60s based on schedule difference.  
A Finnish study of 14,765 women aged 40 to 49 years randomly assigned women to receive either annual screens or triennial screens. There were 18 deaths from breast cancer in 100,738 life-years in the triennial screening group and 18 deaths from breast cancer in 88,780 life-years in the annual screening group (HR, 0.88; 95% CI, 0.59–1.27). 
RCTs that studied the effect of screening mammography on breast cancer mortality were performed between 1963 and 2015, with participation by over half-a-million women in four countries. One trial, the Canadian NBSS-2, compared mammography plus clinical breast examination (CBE) to CBE alone; the other trials compared screening mammography with or without CBE to usual care. Refer to the Appendix of Randomized Controlled Trials section of this summary for a detailed description of the trials.
The trials differed in design, recruitment of participants, interventions (both screening and treatment), management of the control group, compliance with assignment to screening and control groups, and analysis of outcomes. Some trials used individual randomization, while others used cluster randomization in which cohorts were identified and then offered screening; one trial used nonrandomized allocation by day of birth in any given month. Cluster randomization sometimes led to imbalances between the intervention and control groups. Age differences have been identified in several trials, although the differences had no major effect on the trial outcome.  In the Edinburgh Trial, socioeconomic status, which correlates with the risk of breast cancer mortality, differed markedly between the intervention and control groups, rendering the results uninterpretable.
Breast cancer mortality was the major outcome parameter for each of these trials, so the attribution of cause of death required scrupulous attention. The use of a blinded monitoring committee (New York) and a linkage to independent data sources, such as national mortality registries (Swedish trials), were incorporated but could not ensure impartial attributions of cancer death for women in the screening or control arms. Possible misclassification of breast cancer deaths in the Two-County Trial biasing the results in favor of screening has been suggested. 
There were also differences in the methodology used to analyze the results of these trials. Four of the five Swedish trials were designed to include a single screening mammogram in the control group and were timed to correspond with the end of the series of screening mammograms in the study group. The initial analysis of these trials used an evaluation analysis, tallying only the breast cancer deaths that occurred in women whose cancer was discovered at or before the last study mammogram. In some of the trials, a delay occurred in the performance of the end-of-study mammogram, resulting in more time for members of the control group to develop or be diagnosed with breast cancer. Other trials used a follow-up analysis, which counts all deaths attributed to breast cancer, regardless of the time of diagnosis. This type of analysis was used in a meta-analysis of four of the five Swedish trials as a response to concerns about the evaluation analyses. 
The accessibility of the data for international audits and verification also varied, with a formal audit having been undertaken only in the Canadian trials. Other trials have been audited to varying degrees, but with less rigor. 
All of these studies were designed to study breast cancer mortality rather than all-cause mortality because breast cancer deaths contribute only a small proportion of total mortality in any given population. When all-cause mortality in these trials was examined retrospectively, only the Edinburgh Trial showed a difference attributable to the previously noted socioeconomic differences in the study groups. The meta-analysis (follow-up methods) of the four Swedish trials also showed a small improvement in all-cause mortality.
The relative improvement in breast cancer mortality attributable to screening is approximately 15% to 20%, and the absolute improvement at the individual level is much less. The potential benefit of breast cancer screening can be expressed as the number of lives extended because of early breast cancer detection.  
There are several problems with using these RCTs that were performed up to 50 years ago to estimate the current benefits of screening on breast cancer mortality. These problems include the following:
For these reasons, estimates of the breast cancer mortality reduction resulting from current screening are based on well-conducted cohort and ecologic studies in addition to the RCTs.
An estimate of screening effectiveness can be obtained from nonrandomized controlled studies of screened versus nonscreened populations, case-control studies of screening in real communities, and modeling studies that examine the impact of screening on large populations. These studies must be designed to minimize or exclude the effects of unrelated trends influencing breast cancer mortality such as improved treatment and heightened awareness of breast cancer in the community.
Three population-based, observational studies from Sweden compared breast cancer mortality in the presence and absence of screening mammography programs. One study compared two adjacent time periods in 7 of the 25 counties in Sweden and found a statistically significant breast cancer mortality reduction of 18% to 32% attributable to screening.  The most important bias in this study is that the advent of screening in these counties occurred over a period during which dramatic improvements in the effectiveness of adjuvant breast cancer therapy were being made, changes that were not addressed by the study authors. The second study considered an 11-year period comparing seven counties with screening programs with five counties without them.  There was a trend in favor of screening, but again, the authors did not consider the effect of adjuvant therapy or differences in geography (urban vs. rural) that might affect treatment practices.
The third study attempted to account for the effects of treatment by using a detailed analysis by county. It found screening had little impact, a conclusion weakened by several flaws in design and analysis. 
In Nijmegen, the Netherlands, where a population-based screening program was undertaken in 1975, a case-cohort study found that screened women had decreased mortality compared with unscreened women (OR, 0.48).  However, a subsequent study comparing Nijmegen breast cancer mortality rates with neighboring Arnhem in the Netherlands, which had no screening program, showed no difference in breast cancer mortality. 
A community-based case-control study of screening in high-quality U.S. health care systems between 1983 and 1998 found no association between previous screening and reduced breast cancer mortality, but the mammography screening rates were generally low. 
A well-conducted ecologic study compared three pairs of neighboring European countries that were matched on similarity in health care systems and population structure, one of which had started a national screening program some years earlier than the others. The investigators found that each country had experienced a reduction in breast cancer mortality, with no difference between matched pairs that could be attributed to screening. The authors suggested that improvements in breast cancer treatment and/or health care organizations were more likely responsible for the reduction in mortality than was screening. 
A systematic review of ecologic and large cohort studies published through March 2011 compared breast cancer mortality in large populations of women, aged 50 to 69 years, who started breast cancer screening at different times. Seventeen studies met inclusion criteria, but all studies had methodological problems, including control group dissimilarities, insufficient adjustment for differences between areas in breast cancer risk and breast cancer treatment, and problems with similarity of measurement of breast cancer mortality between compared areas. There was great variation in results among the studies, with four studies finding a relative reduction in breast cancer mortality of 33% or more (with wide CIs) and five studies finding no reduction in breast cancer mortality. Because only a part of the overall reduction in breast cancer mortality could possibly be attributed to screening, the review concluded that any relative reduction in breast cancer mortality resulting from screening would likely be no more than 10%. 
A U.S. ecologic analysis conducted between 1976 and 2008 examined the incidence of early-stage versus late-stage breast cancer for women aged 40 years and older. To assess a screening effect, the authors compared the magnitude of increase in early-stage cancer with the magnitude of an expected decrease in late-stage cancer. Over the study, the absolute increase in the incidence of early-stage cancer was 122 cancers per 100,000 women, while the absolute decrease in late-stage cancers was 8 cases per 100,000 women. After adjusting for changes in incidence resulting from hormone therapy and other undefined causes, the authors concluded (1) the benefit of screening on breast cancer mortality was small, (2) between 22% and 31% of diagnosed breast cancers represented overdiagnosis, and (3) the observed improvement in breast cancer mortality was probably attributable to improved treatment rather than screening. 
An analytic approach was used to approximate the contributions of screening versus treatment to breast cancer mortality reduction and the magnitude of overdiagnosis.  The shift in the size distribution of breast cancers in the United States (before the introduction of mammography) to 2012 (after its widespread dissemination), was investigated using SEER data in women aged 40 years and older. The rate of clinically meaningful breast cancer was assumed to be stable during this time. The authors documented a lower incidence of larger (≥2 cm) tumors as well as a reduction in breast cancer case fatality. The lower mortality for women with larger tumors was attributed to improvements in therapy. Two-thirds of the decline in size-specific case fatality was ascribed to improved treatment.Figure 2. Screening mammography and increased incidence of invasive breast cancer. Shown are the incidences of overall invasive breast cancer and metastatic breast cancer among women 40 years of age or older at nine sites of the Surveillance, Epidemiology, and End Results (SEER) program, during the period from 1975 through 2012. From New England Journal of Medicine, Welch HG, Prorok PC, O'Malley AJ, Kramer BS, Breast-Cancer Tumor Size, Overdiagnosis, and Mammography Screening Effectiveness, Volume 375, Issue 15, Pages 1438-47, Copyright © 2016 Massachusetts Medical Society. Reprinted with permission from Massachusetts Medical Society.
A prospective cohort study of community-based screening programs in the United States found that annual compared with biennial screening mammography did not reduce the proportion of unfavorable breast cancers detected in women aged 50 to 74 years or in women aged 40 to 49 years without extremely dense breasts. Women aged 40 to 49 years with extremely dense breasts did have a reduction in cancers larger than 2.0 cm with annual screening (OR, 2.39; 95% CI, 1.37–4.18). 
An observational study of women aged 40 to 74 years conducted in 7 of 12 Canadian screening programs compared breast cancer mortality in those participants screened at least once between 1990 and 2009 (85% of the population) with those not screened (15% of the population). The abstract reported a 40% average breast cancer mortality among participants; however, it was likely intended to report a 40% reduction in breast cancer mortality on the basis of language utilized in the Discussion section. 
Limitations of this study included the lack of all-cause mortality data, the extent of screening, screening outside of the study, screening prior to the study, the method used for calculating expected mortality and the referent rates of nonparticipants, nonparticipant survival, province-specific population differences, the extent to which limitations of the database prevented correcting for age and other differences between participants, the generalizability of the substudy data of a single province (British Columbia), and the potentially large impact of selection bias. Overall, the study lacked important data and had limitations in methodology and data analysis.
The optimal screening interval has been addressed by modelers. Modeling makes assumptions that may not be correct; however, the credibility of modeling is greater when the model produces overall results that are consistent with randomized trials and when the model is used to interpolate or extrapolate. For example, if a model’s output agrees with RCT outcomes for annual screening, it has greater credibility to compare the relative effectiveness of biennial versus annual screening.
In 2000, the National Cancer Institute formed a consortium of modeling groups (Cancer Intervention and Surveillance Modeling Network [CISNET]) to address the relative contribution of screening and adjuvant therapy to the observed decline in breast cancer mortality in the United States.  These models predicted reductions in breast cancer mortality similar to those expected in the circumstances of the RCTs but updated to the use of modern adjuvant therapy. In 2009, CISNET modelers addressed several questions related to the harms and benefits of mammography, including comparing annual versus biennial screening.  Women aged 50 to 74 years received most of the mortality benefit of annual screening by having a mammogram every 2 years. The reduction in breast cancer deaths that was maintained because of the move from annual to biennial screening ranged across the six models from 72% to 95%, with a median of 80%.
Data are limited as to how much of the reduction in mortality, seen over time from 1990 onward, is attributable to advances in imaging techniques for screening and as to how much is the result of the improved effectiveness of therapy. In one CISNET study of six simulation models, about one-third of the decrease in breast cancer mortality in 2012 was attributable to screening, with the balance attributed to treatment.  In this CISNET study, the mean estimated reduction in overall breast cancer mortality rate was 49% (model range, 39%–58%), relative to the estimated baseline rate in 2012 if there was no screening or treatment; 37% (model range, 26%–51%) of this reduction was associated with screening, and 63% (model range, 49%–74%) of this reduction was associated with treatment.
The negative effects of screening mammography are overdiagnosis (true positives that will not become clinically significant), false positives (related to the specificity of the test), false negatives (related to the sensitivity of the test), discomfort associated with the test, radiation risk, psychological harm, financial stress, and opportunity costs.
|Age, y||No. of Breast Cancer Deaths Averted With Mammography Screening During the Next 15 y||No. (95% CI) With ≥1 False-Positive Result During the 10 y||No. (95% CI) With ≥1 False Positive Resulting in a Biopsy During the 10 y||No. of Breast Cancers or DCIS Diagnosed During the 10 y That Would Never Become Clinically Important (Overdiagnosis)|
|No. = number; CI = confidence interval; DCIS = ductal carcinoma in situ.|
|aAdapted from Pace and Keating. |
|bNumber of deaths averted are from Welch and Passow.  The lower bound represents breast cancer mortality reduction if the breast cancer mortality relative risk were 0.95 (based on minimal benefit from the Canadian trials   ), and the upper bound represents the breast cancer mortality reduction if the relative risk were 0.64 (based on the Swedish 2-County Trial  ).|
|cFalse positive and biopsy estimates and 95% confidence intervals are 10-year cumulative risks reported in Hubbard et al.  and Braithwaite et al. |
|dThe number of overdiagnosed cases are calculated by Welch and Passow.  The lower bound represents overdiagnosis based on results from the Malmö trial,  whereas the upper bound represents the estimate from Bleyer and Welch. |
|eThe lower-bound estimate for overdiagnosis reported by Welch and Passow  came from the Malmö study.  The study did not enroll women younger than 50 years.|
|40||1–16||6,130 (5,940–6,310)||700 (610–780)|
|50||3–32||6,130 (5,800–6,470)||940 (740–1,150)|
|60||5–49||4,970 (4,780–5,150)||980 (840–1,130)|
Overdiagnosis occurs when screening procedures detect cancers that would never become clinically significant. The magnitude of overdiagnosis is debated, particularly regarding DCIS, a cancer precursor whose natural history is unknown. By reason of this inability to predict confidently the tumor behavior at time of diagnosis, standard treatment for invasive cancers and DCIS can cause overtreatment. The related harms include treatment-related side effects and the number of harms associated with a cancer diagnosis, which are immediate. Conversely, a mortality benefit would occur at an uncertain point in the future.
One approach to understanding overdiagnosis is to examine the prevalence of occult cancer in women who died of noncancer causes. In an overview of seven autopsy studies, the median prevalence of occult invasive breast cancer was 1.3% (range, 0%–1.8%) and of DCIS was 8.9% (range, 0%–14.7%).  
Overdiagnosis can be indirectly measured by comparing breast cancer incidence in screened versus unscreened populations. These comparisons can be confounded by differences in the populations, such as time, geography, health behaviors, and hormone usage. The calculations of overdiagnosis can vary in their adjustment for lead-time bias.   An overview of 29 studies found calculated rates of overdiagnosis to be 0%–54%, with rates from randomized studies between 11% and 22%.  In Denmark, where screened and unscreened populations existed concurrently, the rate of overdiagnosis of invasive cancer was calculated to be 14% and 39%, using two different methodologies. If DCIS cases were included, the overdiagnosis rates were 24% and 48%. The second methodology accounts for regional differences in women younger than the screening age and is likely more accurate. 
Theoretically, in a given population, the detection of more breast cancers at an early stage would result in a subsequent reduction in the incidence of advanced-stage cancers. This has not occurred in any of the populations studied to date. Thus, the detection of more early stage cancers likely represents overdiagnosis. A population-based study in the Netherlands showed that about one-half of all screen-detected breast cancers, including DCIS, would represent overdiagnosis and is consistent with other studies, which showed substantial rates of overdiagnosis associated with screening. 
A cohort study in Norway compared the increase in cancer incidence in women who were eligible for screening with the cancer incidence in younger women who were not eligible for screening, eligibility was based on age and residence. Eligible women experienced a 60% increase in incidence of localized cancers (RR, 1.60; 95% CI, 1.42–1.79), while the incidence of advanced cancers remained similar in the two groups (RR, 1.08; 95% CI, 0.86–1.35). 
A population study that compared different counties in the United States showed that higher rates of screening mammography use were associated with higher rates of breast cancer diagnoses, yet there was no corresponding decrease in 10-year breast cancer mortality.  The strengths of this study include its very large size (16 million women) and the strength and consistency of correlation observed across counties. The limitations of this study include the self-reporting of mammograms, the use of a 2-year window to estimate screening prevalence, and the period of analysis (when menopausal hormone use was present). 
The extent of overdiagnosis has been estimated in the Canadian NBSS, a randomized clinical trial. At the end of the five screening rounds, 142 more invasive breast cancer cases were diagnosed in the mammography arm, compared with the control arm.  At 15 years, the excess number of cancer cases in the mammography arm versus the control arm was 106, representing an overdiagnosis rate of 22% for the 484 screen-detected invasive cancers. 
As a consequence of screening mammography, greater numbers of breast cancers with indolent behavior are now identified, resulting in potential overtreatment. In a secondary analysis of a randomized trial of tamoxifen versus no systemic therapy in patients with early breast cancer, the authors utilized the 70-gene MammaPrint assay and identified 15% of patients at ultra-low risk, with 20-year disease-specific survival rates of 97% in the tamoxifen group and 94% in the control group. Thus, these patients would likely have extremely good outcomes with surgery alone. The frequency of such ultra-low risk cancers in the screened population is likely around 25%. Tools such as the 70-gene MammaPrint assay might be utilized in the future to identify these cancers, and thereby, reduce the risk of overtreatment. However, additional studies are needed to confirm these findings. 
In 2016, the Canadian NBSS, a randomized screening trial with 25-year follow-up, re-estimated overdiagnosis of breast cancer from mammography screening by age group and concluded that approximately 30% of invasive screen-detected cancers in women aged 40 to 49 years and up to 20% of those detected in women aged 50 to 59 years were overdiagnosed. When in situ cancers are included, the estimated risks of overdiagnosis are 40% aged 40 to 49 years and 30% in women aged 50 to 59 years. Overdiagnosis was calculated as the persistent excess incidence in the screened arm versus the control arm divided by the number of screen-detected cases (excess incidence method). Requirements for adequate estimation of overdiagnosis utilizing this method included the following:
These conditions were largely met in the CNBSS because population-based screening did not become available throughout Canada until a minimum of 2 years later and in most instances 5 to 10 years later (thereby, allowing for cessation of screening after the trial screening period and follow-up longer than most estimates of lead time), because contamination is documented to have been minimal, and because individual randomization resulted in 44 almost identically distributed demographic factors and risk factors between the two trial arms.
Since the conclusion of the trial screening period in 1988, differences in screening quality, intensity, invited age range, and biopsy thresholds decrease the generalizability of these results. These factors and improved imaging technique/quality and low threshold for biopsy, likely contribute to lower estimates of overdiagnosis of in situ cancer than that of invasive cancer. 
Table 1, above, shows results from a 10-year period of screening 10,000 women, estimating the number of women with breast cancer or DCIS that would never become clinically important (overdiagnosis). There was likely no overdiagnosis in the Health Insurance Plan study, which used old-technology mammography and CBE. Overdiagnosis has become more prominent in the era of improved-technology mammography. The improved technology has not, however, been shown to make further reductions in mortality than the original technology. In summary, breast cancer overdiagnosis is a complex topic. Studies that used many different methods reported a wide range of estimates, and there is currently no way to assess whether new cancer cases are overdiagnosed or are of real harm to patients. 
Because fewer than 5 per 1,000 women screened have breast cancer, most abnormal mammograms are false positives, even given the 90% specificity of mammography (i.e., 90% of all women without breast cancer will have a negative mammogram). 
This high false positive rate of mammography is underestimated and can seem counterintuitive because of a statistically based cognitive bias known as the base rate fallacy. Because the base rate of breast cancer is low, (5/1000), the false-positive rate vastly exceeds the true-positive rate, even when utilizing a very accurate test.
Mammography’s true-positive rate of approximately 90% means that, of women with breast cancer, approximately 90% will test positive. The true-negative rate of 90% means that, of women without breast cancer, 90% will test negative. A 10% false-positive rate over 1,000 people means that there will be 100 false positives in 1,000 people. If 5 in 1,000 women have breast cancer, then 4.5 women with breast cancer will have a positive test. In other words, there will approximately 100 false positive for every 4.5 true positives.
Further, abnormal results from screening mammograms prompt additional tests and procedures, such as mammographic views of the region of concern, ultrasound, MRI, and tissue sampling (by fine-needle aspiration, core biopsy, or excisional biopsy). Overall, the harm from unnecessary tests and treatments must be weighed against the benefit of early detection.
A study of breast cancer screening in 2,400 women enrolled in a health maintenance organization found that over a decade, 88 cancers were diagnosed, 58 of which were identified by mammography. One-third of the women had an abnormal mammogram result that required additional testing: 539 additional mammograms, 186 ultrasound examinations, and 188 biopsies. The cumulative biopsy rate (the rate of true positives) resulting from mammographic findings was approximately 1 in 4 (23.6%). The PPV of an abnormal screening mammogram in this population was 6.3% for women aged 40 to 49 years, 6.6% for women aged 50 to 59 years, and 7.8% for women aged 60 to 69 years.  A subsequent analysis and modeling of data from the same cohort of women, estimated that the risk of having at least one false-positive mammogram was 7.4% (95% CI, 6.4%–8.5%) at the first mammogram, 26.0% (95% CI, 24.0%–28.2%) by the fifth mammogram, and 43.1% (95% CI, 36.6%–53.6%) by the ninth mammogram.  Cumulative risk of at least one false-positive result depended on four patient variables (younger age, higher number of previous breast biopsies, family history of breast cancer, and current estrogen use) and three radiologic variables (longer time between screenings, failure to compare the current and previous mammograms, and the individual radiologist’s tendency to interpret mammograms as abnormal). Overall, the factor most responsible for a false-positive mammogram was the individual radiologist’s tendency to read mammograms as abnormal.
A prospective cohort study of community-based screening found that a greater proportion of women undergoing annual screening had at least one false-positive screen after 10 years than did women undergoing biennial screening, regardless of breast density. For women with scattered fibroglandular densities, the difference was 68.9% (annual) versus 46.3% (biennial) for women in their 40s. For women aged 50 to 74 years, the difference for this density group was 49.8% (annual) versus 30.7% (biennial). 
As shown in Table 1, the estimated number of women out of 10,000 who underwent annual screening mammography during a 10-year period with at least one false-positive test result is 6,130 for women aged 40 to 50 years and 4,970 for women aged 60 years. The number of women with a false-positive test that results in a biopsy is estimated to range from 700 to 980, depending on age. 
The sensitivity of mammography ranges from 70% to 90%, depending on characteristics of the interpreting radiologist (level of experience) and characteristics of the woman (age, breast density, hormone status, and diet). Assuming an average sensitivity of 80%, mammograms will miss approximately 20% of the breast cancers that are present at the time of screening (false negatives). Many of these missed cancers are high risk, with adverse biologic characteristics. If a normal mammogram dissuades or postpones a woman or her doctor from evaluating breast symptoms, she may suffer adverse consequences. Thus, a negative mammogram should never dissuade a woman or her physician from additional evaluation of breast symptoms.
Positioning of the woman and breast compression reduce motion artifact and improve mammogram image quality. Pain and/or discomfort was reported by 90% of women undergoing mammography, with 12% of women rating the sensation as intense or intolerable.  A systematic review of 22 studies investigating mammography-associated pain and discomfort found wide variations, some of which were associated with menstrual cycle stage, anxiety, and premammography anticipation of pain. 
The major risk factors for radiation-associated breast cancer are young age at exposure and dose; however, rarely there are women with an inherited susceptibility to radiation-induced damage who must avoid radiation exposure at any age.   For many women older than 40 years, the likely benefits of screening mammography outweigh the risks.    Standard two-view screening mammography exposes the breasts to a mean dose of 4 mSv, and the whole body to 0.29 mSv.   Thus, up to one breast cancer may be induced per 1,000 women undergoing annual mammograms from ages 40 to 80 years. Such risk is doubled in women with large breasts who require increased radiation doses and in women with breast augmentation who require additional views. Radiation-induced breast cancers may be reduced fivefold for women who begin biennial screening at age 50 years rather than annually at age 40 years. 
A telephone survey of 308 women performed 3 months after screening mammography revealed that about one-fourth of the 68 women recalled for additional testing were still experiencing worry that affected their mood or functioning, even though that testing had ruled out cancer.  Research into whether the psychological impact of a false-positive test is long-standing yields mixed results. A cohort study in Spain in 2002 found immediate psychological impact to a woman after receiving a false-positive mammogram, but these results dissipated within a few months.  A cohort study in Denmark in 2013 that measured the psychological effects of a false-positive test result several years after the event found long-term negative psychological consequences.  Several studies have shown that the anxiety after evaluation of a false-positive test leads to increased participation in future screening examinations.    
These potential harms of screening have not been well researched, but it is clear that they exist.
Ultrasound is used for the diagnostic evaluation of palpable or mammographically identified masses, rather than serving as a primary screening modality. A review of the literature and expert opinion by the European Group for Breast Cancer Screening concluded that “there is little evidence to support the use of ultrasound in population breast cancer screening at any age.” 
Breast MRI is used in women for diagnostic evaluation, including evaluating the integrity of silicone breast implants, assessing palpable masses after surgery or radiation therapy, detecting mammographically and sonographically occult breast cancer in patients with axillary nodal metastasis, and preoperative planning for some patients with known breast cancer. There is no ionizing radiation exposure with this procedure. MRI has been promoted as a screening test for breast cancer among women at elevated risk of breast cancer based on BRCA1/2 mutation carriers, a strong family history of breast cancer, or several genetic syndromes, such as Li-Fraumeni syndrome or Cowden disease.    Breast MRI is more sensitive but less specific than screening mammography   and is up to 35 times as expensive.     
Using infrared imaging techniques, thermography of the breast identifies temperature changes in the skin as a possible indicator of an underlying tumor, displaying these changes in color patterns. Thermographic devices have been approved by the U.S. Food and Drug Administration under the 510(k) process, but no randomized trials have compared thermography to other screening modalities. Small cohort studies do not suggest any additional benefit for the use of thermography as an adjunct modality.  
The effect of screening clinical breast examination (CBE) on breast cancer mortality has not been fully established. The Canadian National Breast Screening Study (CNBSS) compared high-quality CBE plus mammography with CBE alone in women aged 50 to 59 years. CBE, lasting 5 to 10 minutes per breast, was conducted by trained health professionals, with periodic evaluations of performance quality. The frequency of cancer diagnosis, stage, interval cancers, and breast cancer mortality were similar in the two groups and similar to outcomes with mammography alone.  With a mean follow-up of 13 years, breast cancer mortality was similar in the two groups (mortality rate ratio, 1.02 [95% confidence interval [CI], 0.78–1.33]).  The investigators estimated the operating characteristics for CBE alone; for 19,965 women aged 50 to 59 years, sensitivity was 83%, 71%, 57%, 83%, and 77% for years 1, 2, 3, 4, and 5 of the trial, respectively; specificity ranged between 88% and 96%. Positive predictive value (PPV), which is the proportion of cancers detected per abnormal examination, was estimated to be 3% to 4%. For 25,620 women aged 40 to 49 years who were examined only at entry, the estimated sensitivity was 71%, specificity was 84%, and PPV was 1.5%. 
In clinical trials involving community clinicians, CBE-type screening had higher specificity (97%–99%)  and lower sensitivity (22%–36%) than that experienced by examiners.     A study of screening in women with a positive family history of breast cancer showed that, after a normal initial evaluation, the patient herself, or her clinician performing a CBE, identified more cancers than did mammography. 
Another study examined the usefulness of adding CBE to screening mammography; among 61,688 women older than 40 years and screened by mammography and CBE, sensitivity for mammography was 78%, and combined mammography-CBE sensitivity was 82%. Specificity was lower for women undergoing both screening modalities than it was for women undergoing mammography alone (97% vs. 99%).  Other international trials of CBE are under way, two in India and one in Egypt.
Monthly BSE has been promoted, but there is no evidence that it reduces breast cancer mortality.   The only large, randomized clinical trial of BSE assigned 266,064 female Shanghai factory workers to either BSE instruction with reinforcement and encouragement, or instruction on the prevention of lower back pain. Neither group underwent any other breast cancer screening. After 10 to 11 years of follow-up, 135 breast cancer deaths occurred in the instruction group, and 131 cancer deaths occurred in the control group (relative risk [RR], 1.04; 95% CI, 0.82–1.33). Although the number of invasive breast cancers diagnosed in the two groups was about the same, women in the instruction group had more breast biopsies and more benign lesions diagnosed than did women in the control group. 
Other research results on BSE come from three trials. First, more than 100,000 Leningrad women were assigned to BSE training or control by cluster randomization; the BSE group training had more breast biopsies without improved breast cancer mortality.  Second, in the United Kingdom Trial of Early Detection of Breast Cancer, more than 63,500 women aged 45 to 64 years were invited to educational sessions about BSE. After 10 years of follow-up, breast cancer mortality rates were similar to the rates in centers without organized BSE education (RR, 1.07; 95% CI, 0.93–1.22).  Thirdly, in contrast, a case-control study nested within the CNBSS compared self-reported BSE frequency before enrollment with breast cancer mortality. Women who examined their breasts visually, used their finger pads for palpation, and used their three middle fingers had a lower breast cancer mortality rate. 
Various methods to analyze breast tissue for malignancy have been proposed to screen for breast cancer, but none have been shown to be associated with mortality reduction.
The study design and conduct make these results difficult to assess or combine with the results of other trials.
The reduction in breast cancer mortality at a median follow-up of 17.7 years corresponds to an absolute risk reduction of 0.1 of 1,000 (or 1 of 10,000) fewer deaths.
The evidence is inadequate to support the conclusion of a clinically significant breast cancer mortality reduction attributable to initiation of screening mammography among women aged 39 to 49 years. The reported mortality reduction is a very small, transient reduction in breast cancer mortality based on a nonstandard imaging schedule, nonstandard imaging protocol, and nonstandard threshold for biopsy; therefore, it is of uncertain relevance to the general population. In absolute terms, it corresponds to an absolute risk reduction of 0.1 of 1,000 (or 1 of 10,000) fewer deaths. Additionally, the mortality reduction is based on a re-analysis of the original data set, which was not statistically significant, and the recalculation of breast cancer mortality in a subgroup restricted to 10 years of follow-up. At 20 years of follow-up, there was no statistically significant decrease in risk of breast cancer or all-cause mortality. 
The evidence is inadequate to make a clear determination of the magnitude of overdiagnosis. Because the evidence is based on subgroup analysis and nonstandard imaging schedule, nonstandard imaging protocol, and a nonstandard threshold for biopsy with uncertain relevance to the general population, it does not support the investigators' conclusion of “at worst a small amount of overdiagnosis." 
The PDQ cancer information summaries are reviewed regularly and updated as new information becomes available. This section describes the latest changes made to this summary as of the date above.
The Patient characteristics subsection was extensively revised.
Added text to state that most U.S. states have enacted laws mandating that mammography facilities report breast density, but inconsistent guidelines have generated confusion and anxiety among patients and health care providers (cited DenseBreast-info.org as reference 46).
Added text about the American College of Radiology's Breast Imaging Reporting and Data System (BI-RADS), which classifies breast density as almost entirely fatty, scattered fibroglandular density, heterogeneously dense, or extremely dense (cited Melnikow et al. as reference 47).
Added text to state that the BI-RADS categories of heterogeneously dense and extremely dense breast density are considered dense breast tissue, a description affecting 43% of women aged 40 to 74 years; a radiologist's assignment of breast density is subjective, and in any woman, it may vary over time (cited Sprague et al. and Ho et al. as references 48 and 49, respectively).
Added text to state that while breast density is associated with an increased risk of breast cancer, density is only a modest risk factor for breast cancer and does not confer a higher risk for breast cancer death; the fourfold elevated risk for breast cancer incidence according to breast density is a comparison of the density category extremely dense versus the density category almost entirely fatty (cited Smetana et al. as reference 50).
Revised text to state that rapidly growing cancers can sometimes be mistaken for normal breast tissue; e.g., medullary carcinomas, an uncommon type of invasive ductal breast cancer that is often associated with the BRCA1 mutation and aggressive characteristics, but that may demonstrate comparatively favorable responses to treatment. Some other cancers associated with BRCA1/2 mutations, which appear indolent, can also be missed.
Added text about the Canadian National Breast Screening Study (CNBSS), a randomized screening trial with 25-year follow-up, that re-estimated overdiagnosis of breast cancer from mammography screening by age group and concluded that approximately 30% of invasive screen-detected cancers in women aged 40 to 49 years and up to 20% of those detected in women aged 50 to 59 years were overdiagnosed; when in situ cancers are included, the estimated risks are 40% and 30%, respectively.
Added text to state that the conditions required for adequate estimation of overdiagnosis were largely met in the CNBSS because population-based screening did not become available throughout Canada until a minimum of 2 years later and in most instances 5 to 10 years later, because contamination is documented to have been minimal, and because individual randomization resulted in 44 almost identically distributed demographic factors and risk factors between the two trial arms.
Added text to state that since the conclusion of the trial screening period in 1988, differences in screening quality, intensity, invited age range, and biopsy thresholds decrease the generalizability of these results; these factors and improved imaging technique/quality and low threshold for biopsy, likely contribute to lower estimates of overdiagnosis of in situ cancer than that of invasive cancer (cited Baines et al. as reference 105).
Added text to state that breast cancer overdiagnosis is a complex topic; studies that used many different methods reported a wide range of estimates, and there is currently no way to assess whether new cancer cases are overdiagnosed or are of real harm to patients.
This summary is written and maintained by the PDQ Screening and Prevention Editorial Board, which is editorially independent of NCI. The summary reflects an independent review of the literature and does not represent a policy statement of NCI or NIH. More information about summary policies and the role of the PDQ Editorial Boards in maintaining the PDQ summaries can be found on the About This PDQ Summary and PDQ® - NCI's Comprehensive Cancer Database pages.
This PDQ cancer information summary for health professionals provides comprehensive, peer-reviewed, evidence-based information about breast cancer screening. It is intended as a resource to inform and assist clinicians who care for cancer patients. It does not provide formal guidelines or recommendations for making health care decisions.
This summary is reviewed regularly and updated as necessary by the PDQ Screening and Prevention Editorial Board, which is editorially independent of the National Cancer Institute (NCI). The summary reflects an independent review of the literature and does not represent a policy statement of NCI or the National Institutes of Health (NIH).
Board members review recently published articles each month to determine whether an article should:
Changes to the summaries are made through a consensus process in which Board members evaluate the strength of the evidence in the published articles and determine how the article should be included in the summary.
Any comments or questions about the summary content should be submitted to Cancer.gov through the NCI website's Email Us. Do not contact the individual Board Members with questions or comments about the summaries. Board members will not respond to individual inquiries.
Some of the reference citations in this summary are accompanied by a level-of-evidence designation. These designations are intended to help readers assess the strength of the evidence supporting the use of specific interventions or approaches. The PDQ Screening and Prevention Editorial Board uses a formal evidence ranking system in developing its level-of-evidence designations.
PDQ is a registered trademark. Although the content of PDQ documents can be used freely as text, it cannot be identified as an NCI PDQ cancer information summary unless it is presented in its entirety and is regularly updated. However, an author would be permitted to write a sentence such as “NCI’s PDQ cancer information summary about breast cancer prevention states the risks succinctly: [include excerpt from the summary].”
The preferred citation for this PDQ summary is:
PDQ® Screening and Prevention Editorial Board. PDQ Breast Cancer Screening. Bethesda, MD: National Cancer Institute. Updated <MM/DD/YYYY>. Available at: https://www.cancer.gov/types/breast/hp/breast-screening-pdq. Accessed <MM/DD/YYYY>. [PMID: 26389344]
Images in this summary are used with permission of the author(s), artist, and/or publisher for use within the PDQ summaries only. Permission to use images outside the context of PDQ information must be obtained from the owner(s) and cannot be granted by the National Cancer Institute. Information about using the illustrations in this summary, along with many other cancer-related images, is available in Visuals Online, a collection of over 2,000 scientific images.
The information in these summaries should not be used as a basis for insurance reimbursement determinations. More information on insurance coverage is available on Cancer.gov on the Managing Cancer Care page.
More information about contacting us or receiving help with the Cancer.gov website can be found on our Contact Us for Help page. Questions can also be submitted to Cancer.gov through the website’s Email Us.