Accuracy-based proficiency testing for estradiol measurements

madman · Dec 16, 2023

ABSTRACT

Objectives

Accuracy of estradiol measurements is important but conventional proficiency testing (PT) cannot assess accuracy when possibly non-commutable samples are used and method peer-group means are the targets. Accuracy-based assessment of estradiol measurements is needed.

Design and Methods

Five serum samples were prepared from single- donors, frozen, and distributed overnight to 76 New York State Department of Health (NYSDOH)-certified laboratories. Participants analyzed samples for estradiol. The biases of group means were assessed against the Centers for Disease Control and Prevention (CDC)-defined targets, and evaluated using the Hormones Standardization Program (HoSt) E2 performance criteria of±12.5%. Each laboratory’s performance was evaluated using total allowable error (acceptance limits) of target ±25% or ±15 pg/mL (55.1 pmol/L) (whichever was greater, NYSDOH), target±30% (Clinical Laboratory Improvement Amendments [CLIA]), and target ±26% (minimal limit based on biological variation [BV]).

Results

The biases (range) were 34% (-17% to 175%), 40% (-33% to 386%), 16% (-45% to193%), 5% (-27% to 117%), and -4% (-31% to 21%), for samples at estradiol of 24.1, 28.4, 61.7,94.1 and 127 pg/mL, or 88.5, 104.3, 226.5, 245.4 and 466.2 pmol/L, respectively. Large positive method/analytical systematic biases were revealed for 9 commonly used method/analytical systems in the United States at low estradiol concentrations. Of the 9 analytical systems, 0, 0, 3,7, and 6 met the HoSt criterion for the samples with estradiol at the five respective concentrations. PT evaluation showed that 59%, 69%, and 87% of laboratories would receive a PT event pass score when the CDC-defined target and a criterion of NYSDOH, CLIA, or BV were used, respectively. However, >95% of laboratories would obtain PT pass scores if method peer group means were used as targets regardless of the criterion used.

Conclusions

Improvement in the accuracy of estradiol measurements is needed, particularly at low estradiol concentrations. Accuracy-based PT provides unambiguous information about the accuracy of analytical/method methods/analytical systems.

INTRODUCTION and OBJECTIVES

Estrogens are responsible for the development of the secondary female sex characteristics and play an important role in female reproductive processes. Estradiol measurements have a wide range of clinical utilities, e.g., diagnosis of fertility disorders, gynecomastia in males, estrogen-producing ovarian and testicular tumors, disorders of sex steroid metabolism, monitoring low-dose female hormone replacement therapy in postmenopausal women, and antiestrogen therapy(1, 2). To meet those clinical needs, accurate measurement of estradiol in patient care at all clinically relevant concentrations is needed; however, information on measurement accuracy is limited. Furthermore, its accurate measurement provides information that can be used to improve the quality of manufacturers’ products, assess the effectiveness of manufacturer standardization, and advance current evaluations performed as part of activities related to meeting regulatory requirements.

Although proficiency testing (PT) is an effective tool for monitoring the quality performance of clinical laboratories and analytical systems, it has limitations (3). Conventional PT often uses non-commutable samples or modified samples whose commutability is unknown, and therefore only evaluates participants’ results using method peer-group mean values as targets (4). Therefore, due to the presumed existence of matrix effects, conventional PT can only assess whether a laboratory’s analysis can meet acceptance limits relative to its peers using the same method. Miller et al. (4) demonstrated that by using commutable materials and a bona fide reference method, it is possible to differentiate calibration bias from artifactual “matrix bias”. However, as it is commonly performed, conventional PT typically cannot differentiate between calibration bias and matrix bias; therefore, it cannot assess whether the results obtained are sufficiently accurate to meet clinical needs (5). In contrast, accuracy-based PT uses authentic, unaltered samples and target values determined by a reference method measurement procedure. Thus, it can assess the proficiency of a laboratory analysis using an analytical system as intended: the accuracy, and reliability of measurement results obtained with the instrument in the context of clinical needs. Because, for many reasons, accuracy-based PT is relatively expensive to perform, it has seen somewhat limited use by external quality assessment programs. New economical approaches using commutable samples are needed.

In 2022, the US Centers for Medicare & Medicaid Services (CMS) finalized changes to the Clinical Laboratory Improvement Amendments of 1988 (CLIA) regulations for proficiency testing PT, including an acceptance limit of ± 30% for scoring estradiol PT results (6). A more stringent evaluation criterion, of target ± 25% or 15 pg/mL (55.1 pmol/L) whichever is greater, was being used by the New York State Department of Health (NYSDOH) PT program at the time this study was done. This is consistent with the estimated “minimal” analytical performance(total error) (26%) based upon measurements of biological variation (BV) for estradiol (7, 8) and using an approach similar to that used by Miller et al. (4). For assessing the accuracy of a method or analytical system, the Centers for Disease Control and Prevention (CDC) Hormone Standardization Program for estradiol (CDC HoSt E2) uses performance criteria derived from epidemiological studies (9) of ±12.5% bias for samples with estradiol of >20 pg/mL (73.4 pmol/L), and ±2.5 pg/mL (9.2 pmol/L) absolute bias for estradiol <20 pg/mL (73.4 pmol/L); in this study all specimens exceeded the 20 pg/mL (73.4 pmol/L) threshold concentration.

Objectives of this study were to assess the accuracy of measurement procedures for total estradiol using an accuracy-based PT and to explore the effect of different possible acceptance limits using either the CDC-defined target or method peer-group mean.

LIMITATIONS

Our findings were limited to one event of five samples, so we cannot make conclusions about accuracy over time. A relatively small number of laboratories participated, but these are representative of the available estradiol assays in use. We were not able to test the five samples for possible endogenous or exogenous steroids or drugs that could have interfered with some assays.

CONCLUSIONS

In summary, we showed that at high (> 60 pg/mL or 220.3 pmol/L) estradiol concentrations in a majority of laboratories were able to meet minimal requirements for accuracy based upon BV. However, we concluded that the immunoassay measurements for estradiol are inaccurate at low concentrations. Applying desirable or optimal criteria would result in more misses and a different overall assessment. Our results are consistent with reports of inaccuracy and variability in estradiol measurements, which could impact patient care, particularly for samples with low concentrations (16, 20 - 21). Efforts to standardize estradiol measurements were made over two decades ago by multiple institutions (2, 20, 22 – 24). This snapshot assessment of the accuracy of estradiol measurements would benefit by following up with similar accuracy-based assessments to determine whether these efforts are continuing to increase accuracy. Only accuracy-based PT, like the approach we describe here, can assess absolute accuracy. Future efforts would be best focused on lower concentrations, where improvements are most needed.

madman · Dec 16, 2023

Table 1. A statistical summary of the reported results

¥ 1 pg/mL = 3.671 pmol/L

madman · Dec 16, 2023

Table 2. Mean bias (%) of instruments’ and laboratories' results against CDC-defined targets

¥ 1 pg/mL = 3.671 pmol/L

* Results were reported as <20 pg/mL

# High-lighted is the method means within the CDC HoStcriterion (±12.5%)

madman · Dec 16, 2023

Table 3. Evaluation of participant laboratories' results using a single CDC-defined target and three different criteria

¥ 1 pg/mL = 3.671 pmol/L

*NY indicates an NYSDOH criterion of Target ± 25% or 15 pg/mL whichever is greater.

#CL indicates the CLIA criterion of Target ± 30%

$BV indicates a criterion of Target ± 26% derived from biological variability.

**Pass-PT score is obtained if a laboratory has 4 out of 5 sample results within the allowable limits.

madman · Dec 16, 2023

Table 4. Evaluation of participant laboratories' results using peer-group mean as target and three different criteria

¥ 1 pg/mL = 3.671 pmol/L

*NY indicates an NYSDOH criterion of Target ± 25% or 15 pg/mL whichever is greater.

#CL indicates the CLIA criterion of Target ± 30%

$BV indicates a criterion of Target ± 26%, derived from biological variability

madman · Dec 16, 2023

HIGHLIGHTS

● Traditional proficiency testing (PT) cannot identify inaccurate estradiol methods

● Estradiol results from commutable PT samples were compared to CDC target values

● Biases were 34% (-17% to 175%) and 40% (-33% to 386%) for the two lowest samples

● Scores were affected by the acceptance limit criteria and use of peer-grouping

● Improvement in estradiol measurement is needed, particularly at low concentrations

madman · Dec 16, 2023

CDC’s Clinical Standardization Programs - Excel Male TRT Forum

High-quality disease biomarker tests are critical for the correct diagnosis and treatment of patients, accurate interpretation of research data, and effective use of research findings in health care. CDC’s Clinical Standardization Program (CDC CSP) assists researchers and laboratories with...

www.excelmale.com

madman · Dec 16, 2023

Very low E2 levels during menopause complicate assay use, measurement - Excel Male TRT Forum

https://www.healio.com/news/endocrinology/20201210/very-low-estrogen-levels-during-menopause-complicate-assay-use-measurement Clinicians using mass spectrometry to measure hormone levels should exercise caution when assessing data from postmenopausal women when very low estradiol levels are...

www.excelmale.com

“Mass spectrometry is not a turnkey method,” Auchus said. “It is real people in the lab, running experiments on a day-to-day basis, getting funny results, and trying to repeat them again. Maybe we need to change the column; maybe we need to clean the instrument or replace a part or run some solvent through it over the weekend to clean it up. That is why you cannot necessarily get results one day. It is not the sensitivity that limits you. It is the noise that is the problem.”