TY - JOUR
T1 - The PCOS Phenotypes in Unselected Populations (P-PUP) study
T2 - participant clinical features and data harmonization on analysis of individual participant data
AU - Bizuneh, Asmamaw Demis
AU - Kiconco, Sylvia
AU - Earnest, Arul
AU - Khomami, Mahnaz Bahri
AU - Dhungana, Raja Ram
AU - Azziz, Ricardo
AU - Suturina, Larisa V.
AU - Zhao, Xiaomiao
AU - Gambineri, Alessandra
AU - Tehrani, Fahimeh Ramezani
AU - Yildiz, Bulent O.
AU - Kim, Jin Ju
AU - Xu, Liangzhi
AU - Makwe, Christian Chigozie
AU - Teede, Helena J.
AU - Joham, Anju E.
AU - Tay, Chau Thien
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - Background: Polycystic ovary syndrome (PCOS) is a multifaceted condition with diagnostic challenges and clinical heterogeneity across populations. Research priorities include enhanced accuracy in defining cut-offs for diagnostic features. Here, we aim to describe participant clinical features and data harmonization in the international PCOS Phenotype in Unselected Populations (P-PUP) study. Methods: We searched EMBASE and Medline (Ovid) from 1990 to October 2, 2020, in population-based, medically unbiased study cohorts. Included studies had ≥ 300 participants, directly assessed PCOS-related features, and provided Individual Participant Data (IPD). Risk of bias was assessed using the AXIS tool. Data integrity was ensured via cross-referencing, identifying outliers/implausible data, and variable harmonization. Reporting follows PRISMA-IPD guidelines, summarizing findings with frequencies and proportions. Results: The study included 9979 reproductive-age women from 12 studies across eight countries (China, Iran, Italy, Nigeria, Russia, South Korea, Turkey, and the USA), representing 11 ethnicities. Ovulatory dysfunction was variably recorded, from mean menstrual cycle length, minimum or maximum cycle length, number of cycles per year, or urinary progesterone measurements. Clinical hyperandrogenism was assessed via modified Ferriman–Gallwey (mFG) scores, with a few also including acne and alopecia. Biochemical hyperandrogenism thresholds varied (95th, 97.5th, or 98th percentile of healthy controls). Polycystic ovary morphology was assessed via transvaginal, transabdominal, or transrectal approaches. Harmonization adhered to International PCOS Guidelines for ovulatory dysfunction, ethnicity-specific cut-offs for hirsutism (via k-means clustering), and 95th percentile thresholds for biochemical hyperandrogenism. PCOS prevalence ranged from 3.3 to 19.8% in the original studies and was 11.0% overall after harmonization. Conclusions: The P-PUP study offers an unprecedented, ethnically diverse, medically unbiased population-based cohort, an extraordinarily valuable tool to enhance knowledge and research in PCOS. However, variability in data collection methods and definitions of PCOS diagnostic features across studies limited the ability to fully integrate data for analysis. Despite these limitations, we optimized harmonization in this IPD, and the findings provided valuable insights into the challenges of data harmonization and established a foundation for future collaborative research. Future research should focus on standardizing data collection, establishing normative cut-offs based on true natural groupings, and linking diagnostic clusters to outcomes in diverse populations. Protocol registration: CRD42021267847.
AB - Background: Polycystic ovary syndrome (PCOS) is a multifaceted condition with diagnostic challenges and clinical heterogeneity across populations. Research priorities include enhanced accuracy in defining cut-offs for diagnostic features. Here, we aim to describe participant clinical features and data harmonization in the international PCOS Phenotype in Unselected Populations (P-PUP) study. Methods: We searched EMBASE and Medline (Ovid) from 1990 to October 2, 2020, in population-based, medically unbiased study cohorts. Included studies had ≥ 300 participants, directly assessed PCOS-related features, and provided Individual Participant Data (IPD). Risk of bias was assessed using the AXIS tool. Data integrity was ensured via cross-referencing, identifying outliers/implausible data, and variable harmonization. Reporting follows PRISMA-IPD guidelines, summarizing findings with frequencies and proportions. Results: The study included 9979 reproductive-age women from 12 studies across eight countries (China, Iran, Italy, Nigeria, Russia, South Korea, Turkey, and the USA), representing 11 ethnicities. Ovulatory dysfunction was variably recorded, from mean menstrual cycle length, minimum or maximum cycle length, number of cycles per year, or urinary progesterone measurements. Clinical hyperandrogenism was assessed via modified Ferriman–Gallwey (mFG) scores, with a few also including acne and alopecia. Biochemical hyperandrogenism thresholds varied (95th, 97.5th, or 98th percentile of healthy controls). Polycystic ovary morphology was assessed via transvaginal, transabdominal, or transrectal approaches. Harmonization adhered to International PCOS Guidelines for ovulatory dysfunction, ethnicity-specific cut-offs for hirsutism (via k-means clustering), and 95th percentile thresholds for biochemical hyperandrogenism. PCOS prevalence ranged from 3.3 to 19.8% in the original studies and was 11.0% overall after harmonization. Conclusions: The P-PUP study offers an unprecedented, ethnically diverse, medically unbiased population-based cohort, an extraordinarily valuable tool to enhance knowledge and research in PCOS. However, variability in data collection methods and definitions of PCOS diagnostic features across studies limited the ability to fully integrate data for analysis. Despite these limitations, we optimized harmonization in this IPD, and the findings provided valuable insights into the challenges of data harmonization and established a foundation for future collaborative research. Future research should focus on standardizing data collection, establishing normative cut-offs based on true natural groupings, and linking diagnostic clusters to outcomes in diverse populations. Protocol registration: CRD42021267847.
KW - Acne
KW - Androgens
KW - Data harmonization
KW - Female pattern hair loss
KW - Hirsutism
KW - IPD
KW - Individual participant data
KW - PCOS
KW - Phenotype
KW - Polycystic ovary
KW - Polycystic ovary syndrome
UR - https://www.scopus.com/pages/publications/105010658539
U2 - 10.1186/s12916-025-04221-9
DO - 10.1186/s12916-025-04221-9
M3 - Article
C2 - 40660246
AN - SCOPUS:105010658539
SN - 1741-7015
VL - 23
JO - BMC Medicine
JF - BMC Medicine
IS - 1
M1 - 420
ER -