Discover
Visualize rapid, validated insights through real-world data.
Datavant Completes Acquisition of Aetion. Learn more →
 
        Nataile Schibell, MPH and Pippa Hodgkins
In real-world settings, treatment decisions aren’t randomized—they reflect clinical judgment, patient characteristics, and systemic factors. These underlying differences introduce confounding, making it difficult to isolate true treatment effects.
Propensity score (PS) methods — such as matching, weighting, and high-dimensional PS—help address this bias by estimating the probability of treatment based on baseline covariates. When applied correctly, these techniques can balance treatment groups and support more credible comparisons in non-randomized studies. High-dimensional PS, as described in recent peer-reviewed research by Aetion co-founders Jeremy Rassen, Sc.D., and Sebastian Schneeweiss, M.D., Sc.D., offers a scalable, data-driven approach to covariate selection in complex datasets such as claims and EHRs.¹
Aetion® Substantiate operationalizes these methods within a structured, transparent workflow, enabling teams to adjust for confounding with scientific rigor and day-to-day efficiency. From study design through reproducible outputs, Substantiate helps ensure that PS implementation is consistent, scalable, and aligned with regulatory expectations.
Propensity score methods are foundational to confounding adjustment in real-world evidence. But their value depends on consistent, transparent execution. Substantiate enables teams to implement these methods—matching, weighting, and high-dimensional PS—within a defined, audit-ready workflow that aligns with study protocols.
Rather than stitching together manual steps or relying on custom code, users can apply PS adjustments end-to-end within the platform’s Comparative Effectiveness Plan. Every step is linked—from cohort construction through covariate selection, model configuration, diagnostics, and output—ensuring alignment across studies and teams.
 Image 1: Getting started with the Comparative Effectiveness Analysis Plan
Image 1: Getting started with the Comparative Effectiveness Analysis Plan
Substantiate allows users to:
This framework supports reproducibility and operational consistency while giving teams the flexibility to tailor methods to the complexity of their data. Substantiate brings structure to real-world analytics—so that scientifically sound methods scale reliably across studies, datasets, and therapeutic areas.
Propensity score matching is a widely used method for reducing confounding in real-world comparative studies. The goal is to create two groups with similar observed characteristics, so any outcome differences are more likely due to treatment than baseline differences.
In Substantiate, propensity score matching is built into the core study workflow. Analysts define the covariates, specify the time window for baseline measurement, and the platform generates propensity scores. Based on the configuration, Substantiate then matches patients across treatment arms using one of several matching algorithms, including 1:1 and variable-ratio methods.
Matching is particularly useful when constructing trial-like cohorts—such as in external control arms, trial emulation studies, or regulatory-aligned analyses. Substantiate tracks and retains all produced study outputs, providing traceability of matching parameters and patient selection logic throughout the study lifecycle.
 Image 2: Propensity score method selection options in the Comparative Effectiveness Analysis Plan
Image 2: Propensity score method selection options in the Comparative Effectiveness Analysis Plan
Propensity score weighting adjusts for baseline differences by scaling how much each patient contributes to the analysis. Unlike matching, all patients are retained; their influence is weighted based on the probability of receiving treatment, rebalancing the population to improve comparability. This approach is particularly useful when preserving sample size is important or when treatment arms have sufficient overlap but full matching isn’t feasible. For a detailed explanation of these methods, see Understanding Propensity Score Weighting Methods.
In Substantiate, weighting is integrated directly into the comparative effectiveness workflow. Users can select from several weighting strategies and configure method-specific parameters—all within a structured, transparent interface that supports consistent application across studies.
 Image 3: Propensity score overlap diagram showing patient density across treatment arms
Image 3: Propensity score overlap diagram showing patient density across treatment arms
Truncation and trimming can be applied to any of the above methods to reduce the impact of extreme weights. This is particularly important in studies with low overlap or high-dimensional covariate sets.
In real-world datasets—especially claims and EHRs—the number of potential covariates can be extensive. Manually selecting covariates can be time-intensive and may require iterative clinical and methodological input. While investigator-defined models remain standard, they can be limited when working with unfamiliar therapeutic areas or large, exploratory datasets.
High-dimensional propensity score (hdPS) methods were developed to help address this complexity. Instead of relying solely on predefined covariate lists, hdPS uses structured, data-driven algorithms to systematically identify and rank covariates most likely to influence both treatment and outcome. This is especially useful when confounders are not well-characterized or when covariate definitions vary across datasets. Published validation has shown that hdPS can perform comparably to—or better than—investigator-defined models in high-dimensional settings.
 Image 4: Patient Characteristics for patients with propensity scores generated by the high-dimensional propensity score model
Image 4: Patient Characteristics for patients with propensity scores generated by the high-dimensional propensity score model
In Substantiate, high-dimensional propensity score (hdPS) modeling is integrated directly into the workflow. The platform enables users to:
The process is fully transparent and easy to adjust. Users can control which data attributes are included, how far back in time to look for baseline information, and how covariates are prioritized—while relying on automation to identify those most likely to influence treatment and outcome. This helps ensure that studies remain consistent and repeatable across teams and datasets.
Different study designs call for different adjustment strategies. Some require tightly matched cohorts for interpretability; others prioritize preserving the whole sample or minimizing variance. Substantiate supports multiple propensity score methods within a unified platform, giving researchers the flexibility to choose the right approach based on the question, the data, and the analytic constraints.
| Method | Purpose | What It Does | Best Used When | 
| Matching | Create balanced comparison groups | Matches patients with similar propensity scores; supports both 1:1 and variable-ratio configurations | Used by RWE teams conducting comparative effectiveness studies, regulatory-aligned analyses, and trial emulations | 
| IPTW (Weighting) | Estimate the treatment effect across the full population | Applies weights to all patients to balance covariates between groups | Preferred by HEOR teams conducting population-level analyses where generalizability and full sample retention matter | 
| ATT/SMR Weighting | Estimate the effect among those treated | Reweights the comparator arm to resemble the treated group | Common in safety or outcomes studies comparing real-world cohorts to trial populations or registry-based controls | 
| Overlap Weighting | Focus on the most comparable subset of patients | Prioritizes patients with similar treatment probabilities; minimizes the influence of outliers | Ideal when treatment groups differ substantially—often used by methods teams and comparative safety researchers | 
| High-Dimensional PS | Empirically identify key confounders in large datasets | Algorithmically selects and ranks covariates based on bias or prevalence to build the PS model | Used by data science teams working with large claims or EHR data when covariate selection is complex or uncertain | 
Substantiate equips research teams to implement propensity score methods—matching, weighting, and high-dimensional PS—within a consistent, transparent framework. The platform guides users from cohort construction through covariate selection, model configuration, and diagnostics, supporting both methodological rigor and day-to-day workflow efficiency.
All study inputs and outputs are version-controlled and fully documented, supporting internal reproducibility and external defensibility. Whether designing early feasibility analyses or generating comparative evidence for regulatory or payer decision-making, teams can rely on Substantiate to deliver consistency across studies and datasets.
Explore the Evidence Hub or contact our team to see how Substantiate powers reliable, scalable implementation of PS methods.