Does size really matter? A sensitivity analysis of number of seeds in a respondent-driven sampling study of gay, bisexual and other men who have sex with men in Vancouver, Canada Lachowsky, Nathan J; Sorge, Justin T; Raymond, Henry F; Cui, Zishan; Sereda, Paul; Rich, Ashleigh; Roth, Eric A; Hogg, Robert S; Moore, David M
Background: Respondent-driven sampling (RDS) is an increasingly used peer chain-recruitment method to sample “hard-to-reach” populations for whom there are no reliable sampling frames. Implementation success of RDS varies; one potential negative factor being the number of seeds used. Methods: We conducted a sensitivity analysis on estimates produced using data from an RDS study of gay, bisexual and other men who have sex with men (GBMSM) aged ≥16 years living in Vancouver, Canada. Participants completed a questionnaire on demographics, sexual behavior and substance use. For analysis, we used increasing seed exclusion criteria, starting with all participants and subsequently removing unproductive seeds, chains of ≤1 recruitment waves, and chains of ≤2 recruitment waves. We calculated estimates for three different outcomes (HIV serostatus, condomless anal intercourse with HIV discordant/unknown status partner, and injecting drugs) using three different RDS weighting procedures: RDS-I, RDS-II, and RDS-SS. We also assessed seed dependence with bottleneck analyses and convergence plots. Statistical differences between RDS estimators were assessed through simulation analysis. Results: Overall, 719 participants were recruited, which included 119 seeds and a maximum of 16 recruitment waves (mean chain length = 1.7). The sample of >0 recruitment waves removed unproductive seeds (n = 50/119, 42.0%), resulting in 69 chains (mean length = 3.0). The sample of >1 recruitment waves removed 125 seeds or recruits (17.4% of overall sample), resulting in 37 chains (mean length = 4.8). The final sample of >2 recruitment waves removed a further 182 seeds or recruits (25.3% of overall sample), resulting in 25 chains (mean length = 6.1). Convergence plots and bottleneck analyses of condomless anal intercourse with HIV discordant/unknown status partner and injecting drugs outcomes were satisfactory. For these two outcomes, regardless of seed exclusion criteria used, the crude proportions fell within 95% confidence intervals of all RDS-weighted estimates. Significant differences between the three RDS estimators were not observed. Conclusions: Within a sample of GBMSM in Vancouver, Canada, this RDS study suggests that when equilibrium and homophily are met, although potentially costly and time consuming, analysis is not negatively affected by large numbers of unproductive or lowly productive seeds.
Item Citations and Data
Attribution 4.0 International (CC BY 4.0)