Real World Evidence (RWE) 101 – Pregnancy Registries
A pregnancy registry is a type of real-world evidence collection system that collects data from pregnant women who have been exposed to a particular medication, vaccine, or medical intervention during pregnancy. The purpose of a pregnancy registry is to gather information about the safety and effectiveness of these exposures in pregnant women and their offspring.
Pregnancy registries are important because pregnant women are often excluded from clinical trials due to safety concerns, which can limit the amount of data available about the safety and effectiveness of medications, vaccines, and medical inyterventions during pregnancy. By collecting data from pregnant women who have been exposed to these agents, pregnancy registries can provide valuable information to healthcare providers and patients to help guide treatment decisions during pregnancy.
A pregnancy registry typically collects information about the mother’s health status, the medication or medical intervention being studied, and pregnancy outcomes such as miscarriage, stillbirth, preterm birth, and birth defects. The data collected can be used to identify potential safety signals or adverse effects associated with the medication or medical intervention, and to evaluate the overall safety and effectiveness of the treatment in pregnant women.
In summary, pregnancy registries are an important tool in real-world evidence collection for understanding the safety and effectiveness of medications, vaccines, and medical interventions during pregnancy. By gathering data from pregnant women who have been exposed to these agents, pregnancy registries can provide valuable information to healthcare providers and patients to help guide treatment decisions and improve the health outcomes of both mother and child.
Real World Evidence (RWE) 101 – ICH GCP (R3) – Real World Evidence Context
stuart.mccully2023-08-09T15:27:28+00:00August 9, 2023|2023, RWE 101|
RWE 101 - ICH GCP (R3) - Real World Evidence Context Revision 2 of ICH GCP caused confusion to those of us who work with non-interventional studies. The glossary [...]
Real World Evidence (RWE) 101 – Non-Interventional Studies vs Market Health Research
stuart.mccully2023-08-09T15:02:58+00:00August 9, 2023|2023, RWE 101|
RWE 101 - Non-Interventional Studies vs Market Health Research Key differences between a non-interventional study (NIS) and market health research include:1. Research Objective: NIS are conducted to examine real-world [...]
Real World Evidence (RWE) 101 – Real World Evidence (RWE) 101 – Audits vs Inspections
stuart.mccully2023-08-09T14:51:12+00:00August 9, 2023|2023, RWE 101|
RWE 101 - Real World Evidence (RWE) 101 - Audits vs Inspections In the context of regulatory compliance for Real-World Evidence (RWE), both audits and inspections play crucial roles, [...]
Real World Evidence (RWE) 101 – A Career of Many Pathways
stuart.mccully2023-08-07T22:28:44+00:00August 7, 2023|2023, RWE 101|
RWE 101 - A Career of Many Pathways Real-world evidence (RWE) refers to the information on health care that is derived from analysis of real-world data (RWD). RWE [...]
Real World Evidence (RWE) 101 – Evolution of Regulatory Affairs
stuart.mccully2023-08-07T22:02:40+00:00August 7, 2023|2023, RWE 101|
RWE 101 - Evolution of Regulatory Affairs Real-world evidence (RWE) and real-world data (RWD) are increasingly influencing regulatory affairs in the biopharmaceutical and healthcare industry. This change has been [...]
Real World Evidence (RWE) 101 – Project Managers
stuart.mccully2023-08-07T21:50:20+00:00August 7, 2023|2023, RWE 101|
RWE 101 - Project Managers Real-World Evidence (RWE) observational studies and clinical trials are both key elements of medical research, but they involve very different methodologies, aims, and requirements. [...]
Quality Considerations when Using RWD from Registries to Support Regulatory Decisions in the EU
RWR CONTEXT
EMA has published a comprehensive guideline, which provides recommendations on key methodological aspects that are specific to the use of patient registries when planning to conduct registry-based studies to support regulatory decision making on medicinal products within the European Union (EU).
In October 2021, the EMA published its “Guideline on Registry-Based Studies” [Link] [1].
According to the EMA:
-
- The purpose of the guideline is to improve the use of patient registries to support regulatory decision-making on medicinal products within the European Union (EU) (as per Section 1 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
- The objective of the Guideline is to provide recommendations on key methodological aspects that are specific to the use of patient registries when planning to conduct registry-based studies (as per Section 2 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
- To support these recommendations…considerations and aspects of patient registries that national competent authorities (NCAs) and EMA view important as good regulatory practice in registry-based studies are included (as per Section 2 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
In this article we will explore:
-
- The differences between a registry and a registry-based study
- Use of registry-based studies for evidence generation
- Considerations when planning a registry-based study – Feasibility analysis
- Legal obligations and regulatory requirements for registry-based studies
- Good Registry Practice (GRP) – Quality considerations for patient registries
- Examples of agreed key performance indicators (KPIs) of data quality
- Data sharing outside the context of registry-based studies – Contractual considerations
- Checklist for evaluating the suitability of registries for registry-based studies
>>>DOWNLOAD A PDF OF THE ARTICLE: https://rwr-regs.com/wp-content/uploads/2022/04/2022-04-01_Quality-Standards-for-Registry-Based-Studies-EU-converted.pdf
Patient Registries as an Important Data Source for Registry-Based Studies
Patient registries may have several purposes, such as to monitor the clinical status, quality of life, comorbidities and treatments of patients over time or to monitor and improve overall quality of care. They are a source of data on the presence or occurrence of a particular disease or health-related individual characteristic(s), such as a set of signs or symptoms, or a specific condition, such as pregnancy, breast-feeding, a birth defect or a molecular or genomic feature. They are therefore an important source of data for registry-based studies on healthcare practices, utilisation of medicines and medical devices, and outcomes of treatments. They may, in particular, represent an important source of data on rare diseases and patients treated with advanced therapy medicinal products (ATMP), including gene therapy (as per Section 2 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Differences Between a Registry-Based Study and a Patient Registry
(Source: Section 3.1 of the EMA – Guideline on Registry-Based Studies, October 2021 [1])
Use of Registry-Based Studies for Evidence Generation
The acceptability of registry-based studies as a source of evidence for regulatory purposes depends on several factors related to the specific regulatory assessment procedure for the concerned medicinal product, the characteristics of the concerned registry (see Annex) and the objectives, design and analytical plan of the proposed study. Early consultation with national competent authorities (NCAs), where applicable, and with EMA (e.g., the procedure for Scientific Advice and Protocol Assistance) is recommended when a registry-based study is proposed to be used and study protocols should be published (as per Section 3.2 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Examples where registry-based studies have been used for evidence include (as per Section 3.2 of the EMA – Guideline on Registry-Based Studies, October 2021) [1]:
-
- To complement the evidence generated in the pre-authorisation phase
-
-
-
- Examples of such evidence may include information on standards or real-world practice of care for the disease, incidence, prevalence and determinants of disease outcomes in clinical practice, or the characteristics of the registry population.
- Studies based on patient registries may also contextualise the results of uncontrolled trials, and patient registries have been used to support registry-based randomised controlled trials (RRCTs) for patient recruitment (e.g., to identify patients meeting inclusion/exclusion criteria), randomisation allocation, sample size calculation, endpoints identification, data collection and study follow-up. Open questions remain regarding the validity and relevance of RRCTs. It is therefore recommended to obtain Scientific Advice from EMA and, where applicable, from the concerned NCAs, health technology assessment (HTA) bodies and health insurance schemes as payers on the acceptability of the chosen approach for evidence generation in case deviations from a traditional randomised clinical trial (RCT) design are considered.
-
-
-
- To provide evidence in the post-authorisation phase
-
-
-
- Patient registries can be the basis for recruitment and randomisation for RCTs and non-interventional studies, post-authorisation efficacy studies (PAES) and post-authorisation safety studies (PASS) performed after marketing authorisation.
- Patient registries may allow linkage of patient records with other data sources such as biobank data, census data, or demographic data.
- In the context of medicinal products with efficacy previously demonstrated in RCTs, registry-based studies may help, for example, to assess the effectiveness of adapted dosing schemes applied in clinical practice and understand effectiveness and safety of products in a broader clinical disease-related context and a more heterogenous patient population.
- Products intended for rare diseases are often studied in uncontrolled trials and the size of the safety and efficacy datasets at time of marketing authorisation application is small. In these cases, follow-up for efficacy and safety may be needed, and PAES and PASS are often imposed for post-authorisation evidence generation. These are frequently and preferentially performed on the basis of existing patient registries.
-
-
-
- To evaluate the effects of medicinal products used during pregnancy and breast feeding
-
-
-
- Pregnancy registries include pregnant women exposed or not to different treatments and followed up to collect information on outcomes of pregnancy and in the offspring for a given medicinal product. Despite the challenges of such studies related to the completeness of information on pregnancy outcomes, the ascertainment of the exposure window/ trimester, teratology information services or electronic healthcare records where mother-child linkage is possible, pregnancy registries may also provide valuable data on the benefit-risk balance of medicinal products in breastfeeding.
-
-
Considerations When Planning a Registry-Based Study – Feasibility Analysis
MAAs/MAHs proposing a registry-based study should provide adequate information regarding the availability of data, the quality management procedures applied and the need and feasibility of introducing any study-specific additional data collection and quality control measures (as per Section 3.3 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
A feasibility analysis should be considered by the MAA/MAH or research organisation initiating the study prior to writing the study protocol, to guide its development and facilitate the discussion with NCAs, EMA, HTA bodies and other parties. The feasibility analysis should be performed in collaboration with registry holders and include the following information, as applicable (as per Section 3.3 of the EMA – Guideline on Registry-Based Studies, October 2021) [1]:
-
- General Description – General description of the registry(ies) or network of registries; the Checklist for evaluating the suitability of registries for registry-based studies can be used to prepare this description; the epidemiology of the disease, this is more precise, medicines use and standards of care applied in the country or registry setting should be described if relevant for the specific study.
- Availability of Core Data Elements – Analysis of the availability in the registry of the core data elements needed for the planned study period (as availability of data elements may vary over time), including relevant confounding and effect-modifying variables, whether they are mapped to any standard terminologies (e.g., MedDRA, OMOP common data model), the frequency of their recording and the capacity to collect any additional data elements or introduce additional data collection methods if necessary .
- Quality and Completeness of the Data Elements – Analysis of the quality, completeness and timeliness of the available data elements needed for the study, including information on missing data and possible data imputations, risk of duplicate data for the same patient, results of any verification or validation performed (e.g., through an audit), analysis of the differences between several registries available in the network and their possible impact on data integration, description of the methods applied for data linkage as applicable, and possible interoperability measures that can be adopted.
- Adverse Event Reporting Processes – Description of processes in place for the identification of adverse events and prompt reporting of suspected adverse reactions occurring in the course of treatments, and capacity to introduce additional processes for their collection and reporting if needed.
- Study Size and Patient Recruitment – Study size estimation and analysis of the time needed to complete patient recruitment for the clinical study by providing available data on the number of centres involved in the registry(ies), numbers of registered patients and active patients, number of new patients enrolled per month/year, number of patients exposed to the medicinal product(s) of interest, duration of follow-up, missing data and losses to follow-up, need and possibility to obtain informed consent.
- Bias – Evaluation of any potential information bias, selection bias due to the inclusion/exclusion criteria of centres (e.g., primary, secondary or tertiary care) and patients, potential time-related bias between and within registry(ies), and potential bias due to loss to follow-up.
- Confounding – Evaluation of any potential confounding that may arise, especially if some data elements cannot be collected or measured.
- Analytical Issues – Analytical issues that may arise based on the data characteristics and the study design.
- Data Privacy – Any data privacy issues, possible limitations in relation to informed consent and governance related issues such as data access, data sharing and funding source.
- Suitability of the Registry – Overall evaluation of the suitability of the registry for the specific study, taking into account any missing information on the above-mentioned aspects.
The final report of the feasibility analysis may be submitted either separately or as part of the proposed protocol for a registry-based study. In order to inform the feasibility of other studies in the same registry and reduce duplication of work, the feasibility analysis should be published with the study protocol in the EU PAS Register in agreement with the registry holder. Any confidential information may be redacted if needed (as per Section 3.3 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Joint Registry-Based studies
For regulatory studies addressing a class of products where several MAHs have the same obligation to perform a study, MAHs are encouraged to design a joint registry-based study or to join an already existing study on the same topic (as per Section 3.3 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Study Protocol
The study protocol should describe how the registry infrastructure and population will be used to address the research question of interest, how the study will be conducted and how the validity (both internal and external) of the results will be ensured (as per Section 3.4 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Protocols for non-interventional studies should follow the guidance on the format and content of the protocol for PASS or the Scientific Guidance on PAES. They should apply the best methodological standards, including if applicable those described by the ENCePP Guide on Methodological Standards in Pharmacoepidemiology. The ENCePP Checklist for Study Protocols identifies important points to be addressed when designing a non-interventional study and writing the study protocol (as per Section 3.4 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Where the registry-based study entails secondary use of data, the study protocol should specify the events of interest that are already collected in the registry and discuss the risks of bias and unmeasured confounding. Dedicated and complete search strategies, coding lists or adjudication should be used to accurately define the outcomes of interest (as per Section 3.4 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
The protocol should specify agreements made with the registry holder on the additional variables that can be collected, with timelines for data availability (as per Section 3.4 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
If a registry-based study is to be conducted across multiple registries, a common study protocol should be developed based on core data elements available in the registry and a common design, even if some aspects of the study may vary according to the characteristics of each registry and not all outcomes may be assessed in all registries. Nevertheless, the protocol should also describe differences between registries, assess the resulting heterogeneity of data and critically discuss its potential impact on study results. The protocol or statistical analysis plan (SAP) should propose sensitivity analyses addressing this heterogeneity (as per Section 3.4 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Where several registries are suitable for a study but not all of them are intended to be involved, the study protocol should provide the justification of the choice, i.e., inclusion and exclusion criteria, and discuss the potential impact of selection and interpretability of datasets and findings (as per Section 3.4 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Choice of Study Population – Procedures for Primary Data Collection
The registry population serves as the source population for the registry-based study. The choice of the study population should be driven by the study objectives and may represent the totality of the registry population or only a subset with pre-defined characteristics. For example, when studying a medicine of interest, the potential study population may include various groups of patients: newly diagnosed patients entering the registry and receiving a first prescription of the medicine of interest, and registry patients already diagnosed with the disease and who are switched from another treatment, receive the medicine of interest as add-on therapy or have received the medicine of interest only in the past. In such situations, it is useful to collect the data needed to describe all patients receiving the medicine of interest and assess the heterogeneity between subsets of these patients (as per Section 3.5.1 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
In case of study-specific primary data collection within an existing registry, it is critical that procedures are in place to support complete data collection on all eligible patients enrolled in the registry (as per Section 3.5.1 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Additional study-specific primary data collection may add complexity to the registry-based study. The data collection method applied should clearly be described in the study protocol as it has implications with regards to potential sources of bias and confounding, adequate retrieval of missing data and safety reporting requirements (as per Section 3.6 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Additional study-specific data collection may also affect the ongoing registries’ data collection and maintenance and require audit and validation (as per Section 3.6 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Informed Consent
Informed consent serves as an ethical standard and procedural obligation. It provides the fundamental condition under which a person can be included into a study. It is not conceived as a legal basis but should be seen as a safeguard for data processing
compliance. Therefore, it is important to distinguish between the requirement for consent for a subject to participate in a study and the requirements for a lawful processing of personal data under the GDPR (as per Section 3.5.2 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
In the context of a registry-based study, the ethical and procedural obligations require that informed consent be obtained from patients to participate in the study in addition to the consent already given for participating in the registry, as applicable. It should clearly outline areas such as an explanation of the purposes of the study, the expected duration, intended use of their data and cover all data to be accessed and processed as specified in the study protocol (including but not limited to the access for monitoring, auditing or inspections by competent authorities). It should also provide information about what will happen to the results of the study (as per Section 3.5.2 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Data Protection
The conduct of registry-based studies needs to respect the following applicable Union data protection rules at each step of the processing of personal data, including the option for data sharing/pooling between registries and other stakeholders like competent authorities and MAAs/MAHs:
(as per Section 3.5.3 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
When conducting registry-based studies, the legal basis of the personal data processing needs to be established. Specific considerations may be required in case of processing of special categories of personal data such as sensitive (health) information. It should be noted that Member States are allowed to maintain or introduce further conditions, including limitations with regard to the processing of genetic data, biometric data or data concerning health (as per Section 3.5.3 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
According to the principle of accountability, it is the obligation of the data controller (e.g. a registry holder, MAA/MAH, investigator) to implement appropriate technical and organisational measures to ensure and be able to demonstrate that the personal data are processed in accordance with data protection requirements (as per Section 3.5.3 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Data Quality Management
Data quality management for a registry-based study depends on various factors, including the planned use of the study results and whether the study makes use of primary data collection or secondary use of registry data. While data quality management of the registry is the responsibility of the registry holder, it is the MAA/MAH’s responsibility to manage the data quality of the registry-based study and interpret the results based on findings on data quality. Specific details on level of data verification and actions to be taken if there are relevant findings, including possible internal or external audits, should be described in a specific data management plan. This plan should be discussed and agreed upon by the MAA/MAH and the registry holder (as per Section 3.7 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Methods and specific measures should be guided by the feasibility analysis and be selected with a view to minimise risk of invalid study results (as per Section 3.7 of the EMA – Guideline on Registry-Based Studies, October 2021) [1]:
-
- The validity of any data cleaning, extraction and transformation processes should be verified and monitored. This may be specifically relevant in studies using a network of registries where the transformation is performed locally. A risk-based approach requires the identification of data that are critical for data protection and the reliability of the study results.
- Quality checks of the data used in the study should be performed to alert on erroneous, missing or out-of-range values and logical inconsistencies, and trigger prompt data verification and remedial measures if needed.
- In studies with primary data collection, the various factors (e.g. limited human or material resources or inadequate training) influencing quality should be identified and addressed to preserve the integrity of the study. Possible measures include random source data verification, onsite review of processes and computerised systems used for data collection and management. The collected information per time interval for the main outcome parameters can be compared to the amount expected.
The European Commission’s risk-proportionate approaches in clinical trials, the EMA Reflection Paper on risk-based quality management in clinical trials, the GVP Module III on pharmacovigilance inspections and national regulations should be consulted on these aspects (as per Section 3.7 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Data Analysis
The analytical approach to the outcomes of interest should be pre-specified in the registry-based study protocol and the SAP as applicable. Changes to the pre-specified statistical analysis should be reflected by an amendment to the study protocol and/or by an amendment to the SAP. All changes should be presented in the study report (as per Section 3.8 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
For non-interventional studies, the ENCePP Guide on Methodological Standards in Pharmacoepidemiology presents methods to address bias and adjust for confounding (as per Section 3.8 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Depending on the objectives of the registry-based study, the data analysis may need to include an evaluation of the representativeness of the study population in relation to the source population, as it may influence the external validity of the registry-based study. In case of primary data collection, a comparison of available data between eligible registry patients who are recruited, who decline recruitment or who withdraw from the study and between patients randomised and not randomised in the study, should be performed. If possible, this should be supplemented by a comparison of the study population with a similar population identified from scientific literature data, available electronic healthcare databases, other registries deemed suitable for the study but not used for data collection as justified in the study protocol, or other population-based data sources (as per Section 3.8 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Missing data may lead to bias and confounding, and their handling should be carefully described in the study protocol and the SAP. A thorough justification should be provided for the assumptions about their distribution, causes and timing. The ENCePP Guide on Methodological Standards in Pharmacoepidemiology provides guidance on how to handle missing data (as per Section 3.8 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
In the absence of randomised treatment allocation in registry-based non-interventional studies, some common analytical issues should be addressed (as per Section 3.8 of the EMA – Guideline on Registry-Based Studies, October 2021) [1]:
-
- The characteristics of patient groups prescribed different treatments are likely to differ. Treatment decisions may be influenced by various factors that may also be associated with the risk of occurrence of the outcome of interest, such as disease severity or the monitoring practice of patients. While methods for addressing this underlying problem have been proposed, these do not provide a unique solution and several sensitivity analyses using different approaches should be performed. In addition, ascertainment of marginal treatment effects over time and factors underlying treatment trajectories may require complete collection of information over the course of the study.
- Registries and registry-based studies may involve different time points for patient inclusion and follow-up, initiation of treatments of interest and ascertainment of events and other variables. The probability of occurrence of events of interest may also be time-dependent. These time points are important to consider as they affect the comparability between treatment groups. Graphical representation of the analysis plan should be used to help understand the various time components of the study and the registry. When investigating a treatment effect, immortal time bias can occur when the follow-up period for the study starts before initiation of the treatment under study and the period between start of follow-up and start of treatment is misclassified as exposed.
- Selection bias, information bias and time-related bias may also occur in comparisons to historical control groups. The clinical context may have changed with regard to e.g., treatment options, diagnosis, medical practice in choice of treatments according to severity of disease, patient care, secular trends in the occurrence of important events, completeness of data collection or other uncollected or unknown factors. These sources of bias should be identified and the impact on the validity of the results assessed.
- A comparative non-exposed control group may be selected from outside the registry, for example from another registry or electronic healthcare records in a country/region where the medicine has not yet been marketed. In this situation, one should ensure that underlying differences between the two populations influencing the risk of outcome occurrence are adequately measured and accounted for in the analysis. Since it may not be possible to identify all underlying differences between populations and completeness of data collection may differ, such comparisons need to be interpreted cautiously.
- Registries offer the opportunity to compare patients prescribed a treatment of interest with patients who are untreated or who have received a different treatment(s) over a long period of time. Inclusion of prevalent medicine users (i.e., patients already treated for some time before study follow-up begins) can introduce two types of bias. Firstly, prevalent medicine users are “survivors” of the early period of treatment, which can introduce substantial (selection) bias if the risk for adverse reactions varies with time (e.g., if treatments carry a risk of hypersensitivity reactions or affect cardiovascular risk). Secondly, covariates influencing medicine prescription at study entry (e.g., disease severity) may be affected by previous medicine use, or patients may differ regarding health-related behaviours (e.g. healthy user effect). A new user design reduces these biases by restricting the analysis to incident medicine users, i.e., patients who enter the study cohort only at the start of the first course of the treatment(s) of interest during the study period. The disadvantages of a new-user design may be a lower sample size and a lower number of patients with long-term exposure, which may then require to extend the duration of the study.
- In the context of the new user design, use of an active comparator may reduce confounding by indication or disease severity as a comparison is made between patients with the same indication initiating different treatments. With newly marketed medicines, however, an active comparator with ideal comparability of patients’ characteristics may often be unavailable because newly marketed medicines are often strictly prescribed according to patients’ prognostic characteristics and reimbursement considerations, which leads to channelling bias.
Data Reporting
National and EU obligations and reporting requirements for non-interventional studies should be followed (as per Section 3.9 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
The methods used in the study should be published with sufficient details, while protecting patient privacy, to allow for replication using the same registry database or using a database derived from another registry collecting similar data. Relevant guidelines on reporting of results from non-interventional studies are presented in the Good Pharmacovigilance Practices Module VIII and the ENCePP Guide on Methodological Standards in Pharmacoepidemiology (as per Section 3.9 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Post-authorisation registry-based non-interventional studies should be registered in the EU PAS Register with the study protocol, the SAP if applicable and the final study report. The final report must contain all study results derived from the analyses prespecified in the study protocol and SAP, whether favourable or unfavourable. The analytical code as well as any prior feasibility analyses are ideally also made available. A summary in lay language of the main results and conclusions of the final study report should be prepared and distributed to the registry participants in collaboration with the registry holder (as per Section 3.9 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
For non-interventional studies, the principles of scientific independence and transparency for reporting study results described in the ENCePP Code of Conduct and the ADVANCE Code of Conduct for vaccines should be followed. The responsibility for preparing the final study report lies at the appropriate level of study governance, e.g., medical/scientific advisory board, principal investigator and local registry investigators in studies based on multiple registries. For studies funded by a MAA/MAH and requested by a regulatory authority, all parties involved should be responsible for ensuring that the study meets the regulatory requirements of the competent authority and the MAA/MAH should be
able to comment on the study results and their interpretation as well as on the format of the report. Requests by the MAA/MAH that interpretation of the results or their presentation be changed should be based on sound scientific reasons or documented regulatory requirements (as per Section 3.9 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Following the submission of the final study report, the competent authority may request additional information and clarifications from the MAA/MAH or may initiate an inspection. Therefore, if a research contract is signed between the MAA/MAH and the registry holder, the contract should include a requirement for the registry holder to address the scientific aspects of the request, with the possibility for the MAA/MAH to provide comments, as well as a requirement to allow a possible regulatory inspection of the registry-based study (as per Section 3.9 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Legal Obligations and Regulatory Requirements for Registry-Based Studies
The following table summarises the legal basis and regulatory requirements applicable to MAAs/MAHs for different activities related to registry-based studies (as per Section 4 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Good Registry Practice (GRP) – Considerations for Patient Registries
Quality Management – Framework for Quality Management
Uncertainties about the quality of the data collected in registries may undermine the confidence in the validity and reliability of the evidence generated from registry data in registry-based studies. The Commission Implementing Regulation (EU) No 520/2012 and GVP Module I provide a quality framework for MAHs, competent authorities of Member States and the EMA. Measurable quality requirements can be achieved by (as per Section A.4.1 of the Annex of the EMA – Guideline on Registry-Based Studies, October 2021) [1]:
-
- Quality planning: establishing structures (including validated computerised systems) and planning integrated and consistent processes
- Quality assurance and control: monitoring and evaluating how effectively the structures and processes have been established and how effectively the processes are being carried out
- Quality improvement: correcting and improving the structures and processes where necessary.
These quality management activities (“plan, do, check, act”) should be done in a continuous manner throughout the lifetime of the registry and be regularly assessed. They should be made available to patients, health care professionals and potential users of the registry data to provide confidence that quality management is adequately performed. Responsibilities should be clearly defined to enable sustainability of the quality management system (as per Section A.4.1 of the Annex of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Data security should be part of quality management. Use of an existing patient registry for a new purpose, such as a registry-based study, may require availability of predefined data elements for specific users (e.g. users who perform data entry, management, quality control, extraction or analysis) but not necessarily all registry data. Specific measures (e.g., fire walls, log-in codes or access rights) may therefore need to be in place or introduced in the registry system when needed for some categories of users. Traceability (i.e., the possibility to trace changes made to patient data in the registry and who made these changes) should be part of the data security measures (as per Section A.4.1 of the Annex of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Quality Management – Requirements for Data Quality
In this context, data quality includes four main components (as per Section A.4.2 of the Annex of the EMA – Guideline on Registry-Based Studies, October 2021) [1]:
-
- Consistency: the formats and definitions of the variables are consistent over time, across all centres within a registry and across all registries within a network of registries
- Completeness: patient enrolment is maximised, patient attrition is minimised and complete information on a core data set is recorded for all eligible patients with minimisation of missing data
- Accuracy: the data available in the registry is a correct representation of patient information available to the health care professional, e.g., data available in medical charts or laboratory test results; where the registry data are a compilation or duplication of electronic medical records at the point of care, accuracy should rely on a check of the extraction and uploading procedure
- Timeliness: there is a timely recording and reporting of data and data updates, based on their intended use in compliance with an agreed procedure.
Requirements of data quality may be difficult to achieve concomitantly in all centres within a registry or within all registries of a network of registries; implementation of the same data elements, terminologies, data entry procedures and data control software may not be feasible simultaneously in all centres. Intermediate solutions may be adopted focussing on a core data set and mapping procedures. Centres may progressively implement components of data quality and be included in the registry or network of registries once they have achieved an adequate level of data quality as agreed between the concerned parties according to the data needs (as per Section A.4.2 of the Annex of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Quality Management – Key Performance Indicators of Data Quality
Registries should use performance indicators to assess and drive improvement of data quality. Such indicators should be measurable and associated with remedial measures if acceptable levels of quality are not found. Their definition depends on the disease, governance, infrastructure, local health system and processes in place within the registry or network of registries. They should therefore be defined in a multi-disciplinary approach with all concerned parties. Examples of agreed key performance indicators of data quality are presented in the reports of the EMA workshops on cystic fibrosis registries, multiple sclerosis registries and CAR T-cell Therapy Registries (as per Section A.4.3 of the Annex of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Examples of Agreed Key Performance Indicators of Data Quality
Table References: EBMT and CIBMTR Registries: https://www.ema.europa.eu/en/documents/report/report-car-t-cell-therapy-registries-workshop_en.pdf ; Haemophilia Registries: https://www.ema.europa.eu/en/documents/report/report-haemophilia-registries-workshop_en.pdf ; Multiple Sclerosis Registries: https://www.ema.europa.eu/en/documents/report/report-multiple-sclerosis-registries_en.pdf ; Cystic Fibrosis Registries: https://www.ema.europa.eu/en/documents/report/report-cystic-fibrosis-registries_en.pdf
Quality Management – Data Quality Management Activities
Quality management can be supported by the activities described below. These activities should take into account appropriate technical and organisational measures to be implemented to ensure a sufficient level of security when personal data and more specifically health data is processed. Such measures should at least consist of pseudonymisation, encryption, non-disclosure agreements, strict access role distribution, access role restrictions as well as access logs. National provisions, which may stipulate specific technical requirements or other safeguards such as adherence to professional secrecy rules should be also taken into account (as per Section A.4.4 of the Annex of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Given the variety in the organisation and infrastructure of registries, these recommendations should be adapted to each situation (as per Section A.4.4 of the Annex of the EMA – Guideline on Registry-Based Studies, October 2021) [1]:
-
- KPIs and SOPs – Data quality management activities should be documented, communicated, maintained and updated as necessary, and all relevant source documents should be kept, managed and made available for auditing purposes in a timely manner, including:
-
-
-
- Standard Operating Procedures (SOPs), steps of data quality management from data planning to reporting, with data management responsibilities
- Key Performance Indicators (KPIs) of data quality, planned and performed data checks (manual or automated) and cleaning processes including query management and on-site monitoring.
-
-
-
- Support Tools – Should be developed and provided, e.g., data collection and reporting software, support function (helpdesk), training material and training sessions. A centralised remote electronic quality control could be set-up to limit on-site visits to be done according to a predefined risk approach.
- Appropriately Qualified and Trained Staff – Appropriate qualification and training of data managers and other persons involved in the data collection process should be ensured, with knowledge about the disease, exposures and outcomes captured in the registry.
- Routine Data Quality Checks – In case of a local data extraction process or manual data entry, routine data quality checks should be performed to alert on erroneous, missing or out-of-range values and logical inconsistencies, and trigger prompt data verification and remedial measure if needed. The validity of any data cleaning, extraction and transformation processes should be documented, especially if it involves mapping of data to a common terminology.
- Internal or External Audits – Internal or external audits with on-site review of processes and data audits should be performed according to a risk-based approach; remote quality control measures, targeted visits and targeted source data verification should be triggered by pre-defined thresholds of data quality measures.
- Data Verification – The minimum amount of data verification required may depend on the amount of data collected and should ideally take into account critical aspects of data collection where differences may occur, e.g., between individual centres or between persons within individual centres.
- External Comparisons of Aggregated Registry Data – Aggregated registry data should ideally be compared to literature data or data from external data sources such as electronic health records or insurance claims databases as regards the distribution of categories of important variables such as age, gender, factors associated with disease occurrence or severity, or drug exposure.
- Feedback on Data Quality Issues – Feedback on findings on data quality issues should be given systematically to data providers so that escalation and remedial action can be taken at the level of the data source.
- Corrective and Preventative Activities (CAPAs) – When considering implementation of corrective and preventive activities, additional workload for data collection and data entry should be addressed, as a cumbersome data entry process may increase the amount of missing data and decrease data quality.
Governance
Registries generally operate under governance principles influenced by their purpose, operating procedures, legal environment or funding sources (55). Different parties may potentially also have divergent priorities, such as scientific independence, fulfilment of regulatory commitments, transparency or intellectual property rights. Clear governance principles supporting effective collaborations between all parties for regulatory use of registries, including data sharing between stakeholders, are therefore useful (as per Section A.5 of the Annex of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Registry holders should consider the following aspects to ensure transparency, best use and
sustainability of their registry(as per Section A.5 of the Annex of the EMA – Guideline on Registry-Based Studies, October 2021) [1]:
-
- To publish documentation of key registry characteristics, such as purpose of the registry, inclusion and exclusion criteria for participating centres and enrolment of patients, core and optional data sets collected (with timelines and frequency of data uploads), quality management process and experience of previous collaborations; the registry should be registered in the ENCePP Resources Databases.
- To establish a governance structure for the management of the registry and registry-based studies, with a steering committee, ethics committee and scientific advisory board.
- To establish a single contact point within the registry or network of registries for requesting information on available data and data access conditions.
- To publish a policy for collaborations with external organisations, including information on the scope and decision-making process for participating in collaborations, policy for data sharing and data analysis (explaining possible options for data transfer and analysis based on data privacy rules in place), possible involvement of a third-party, publication policy, and principles for private and public funding.
- To provide a supportive scientific and technical function for collaborations, which may include support for the development of the study protocol, interoperability between registries, amendments to the scope, schedule or methods of data collection or extraction, data management and analysis; the support provided may vary according to the approach of collaboration for using multiple data sources (see the ENCePP Guide on Methodological Standards in Pharmacoepidemiology), resources available in the registry and the contractual agreements proposed.
- To develop a template for research contracts between the registry and external organisations, in line with those recommended by the ENCePP Code of Conduct or the ADVANCE Code of Conduct.
Data Sharing Outside the Context of Registry-Based Studies – Contractual Considerations
There may be situations where registry data could be shared outside the context of formal registry-based studies in the format of counts, aggregated data or statistical reports with NCAs, EMA, MAAs/MAHs, HTA bodies, payer organisations or other parties for clinical development planning or the evaluation or monitoring of medicinal products. These data may concern, for example (as per Section A.6 of the Annex of the EMA – Guideline on Registry-Based Studies, October 2021) [1]:
-
- Disease epidemiology in terms of prevalence, incidence, outcomes, prognostic factors, potential confounding variables for defined outcomes
- Size and distribution of the population with a specific disease, condition or exposure for a planned clinical trial or non-interventional study according to demographics, co-morbidities or medication use
- Drug utilisation, with number of prescriptions for specific medicinal products (or other indicator of intensity of exposure), indications, dose, route of administration, schedule, duration of use, co-medications or use in specific population groups such as extent of paediatric use
- Medical device utilisation, with number, types, indications and dates for specific implanted products
- Surgical procedures with numbers, types, indications, dates and any other relevant details
- Safety information on medicinal products, for example summary tables of adverse events recorded for specific medicinal products, aggregated data or anonymised line listings of patients presenting AESIs, or outcomes of exposed pregnancies
- Utilisation of health care resources such as number of visits, hospitalisations, or laboratory tests performed.
This information may require capacity for sound analysis within the registry or, if allowed by the registry governance and patient consent, transfer of an anonymised dataset with selected variables to the requester or a third-party performing the analysis on behalf of the registry or the requester. Data sharing may require a contractual agreement between the registry or network of registries and the other concerned parties (as per Section A.6 of the Annex of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Checklist for Evaluating the Suitability of Registries for Registry-Based Studies
(Source: Appendix 1 of the EMA – Guideline on Registry-Based Studies, October 2021; List adapted from the REQuEST tool published by EUnetHTA)
Definitions
Patient Registry (synonym: registry)
Organised system that collects uniform data (clinical and other) to identify specified outcomes for a population defined by a particular disease, condition or exposure. The term ‘patient’ highlights the focus of the registry on health information. It is broadly defined and may include patients with a certain disease, pregnant or lactating women or individuals presenting with another condition such as a birth defect or a molecular or genomic feature (as per the Glossary of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Disease Registry
Patient registry whose members are defined by a particular disease or disease-related patient characteristic regardless of exposure to any medicinal product, other treatment or particular health service (as per the Glossary of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Product Registry
The term product registry is sometimes used to indicate a system of data collection by marketing authorisation applicants and holders (MAAs/MAHs) targeting patients exposed to a specific medicinal product or substance. From a regulatory perspective, recruitment and follow-up of these patients with the aim to evaluate the use, safety, effectiveness or another outcome of this exposure typically falls outside of normal routine follow-up of patients and therefore corresponds to a clinical trial or non-interventional study in the targeted population. It is therefore preferable to avoid using the term “product registry” in this situation and directly refer to the appropriate terminology instead (clinical trial or non-interventional study) (as per Section 2 of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Registry-Based Study
Investigation of a research question using the data collection infrastructure or patient population of one or several patient registries. A registry-based study is either a clinical trial or a non-interventional study. A registry-based study may apply primary data collection in addition to secondary use of the existing data in the registry (as per the Glossary of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Registry Database (synonym: register)
Database derived from one or several registries (as per the Glossary of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Primary Data Collection
Collection of data directly from patients, caregivers, healthcare
professionals or other persons involved in patient care (as per the Glossary of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
Secondary Use of Data
Use of existing data for a different purpose than the one for which it was originally collected (as per the Glossary of the EMA – Guideline on Registry-Based Studies, October 2021) [1].
References
1. EMA – Guideline on Registry-Based Studies (October 2021)
Link: https://www.ema.europa.eu/en/guideline-registry-based-studies-0
2. Draft FDA Guidance – Real-World Data: Assessing Registries to Support Regulatory Decision-Making for Drug and Biological Products Guidance for Industry (November 2021)
3. EUnetHTA – REQueST Tool and its vision paper (September 2019)
Link: https://www.eunethta.eu/request-tool-and-its-vision-paper/
RWR Insights | Quality Considerations when Using RWD from Registries to Support Regulatory Decisions – USA
RWR CONTEXT
Study sponsors should ensure they have documented policies and procedures in place that enable them to address these FDA recommendations, so that they can systematically assess and use (appropriate quality) registry data as a source of real world data (RWD) to support their drug development strategies, new drug applications (NDAs)/marketing authorisation applications (MAAs), label extensions and post-marketing commitments (PMCs)/post-marketing requirements (PMRs).
In November 2021, the FDA’s published its draft Guidance on “Real-World Data: Assessing Registries to Support Regulatory Decision-Making for Drug and Biological Products” [Link] [1].
This FDA guidance aligns with the EMA’s Guideline on Registry-Based Studies, which was published October 2021. We’ll discuss the EMA Guideline in detail in the March 2022 RWR Regulatory Updates Report [2].
According to the FDA, whether registry data are fit-for-use in regulatory decision-making depends on the attributes that support the collection of relevant and reliable data as well as additional scientific considerations related to study design and study conduct (as per Section I of the Draft FDA Guidance) [1].
In this article we explore the scientific aspects (e.g., strengths and limitations) and quality aspects of registries (e.g., policies and procedures) that registry owners and study sponsors should consider when addressing these FDA recommendations.
What is a Registry?
Definitions are important because they provide the parameters around which the guidelines and legislation are built. Definitions help us understand what is applicable/relevant and therefore what we need to comply with.
In the US, the term ‘registry’ is often used to describe the data collection system (registry) and the clinical study that uses the data from the data collection system (registry-based study e.g., non-interventional study).
So, what is a registry in the context of this latest FDA draft guidance?
Registry: A registry is defined as an organized system that collects clinical and other data in a standardized format for a population defined by a particular disease, condition, or exposure. Establishing registries involves enrolling a predefined population and collecting pre-specified health-related data for each patient in that population (patient-level data) (as per Section II of the Draft FDA Guidance) [1].
This context and definition are important because they help us understand why the FDA specifically draws out the uses of registry data in a regulatory context e.g., to inform the design and support the conduct of either interventional studies (clinical trials) or non-interventional (observational) studies.
Meaning? Non-interventional (observational) studies are not registries…they are clinical studies that use registry data…they are registry-based studies. So, when we talk about using registry data to support regulatory decisions, think of this in the context of registry data being used as a source of real world data (RWD) for non-interventional studies (and/or clinical trials) which generate the real world evidence (RWE) that is submitted to the FDA as part of (for example) a new drug application (NDA).
Uses of Registry Data
Registries have the potential to support medical product development, and registry data can ultimately be used, when appropriate, to inform the design and support the conduct of either interventional studies (clinical trials) or non-interventional (observational) studies (as per Section II of the Draft FDA Guidance) [1].
Examples of such uses include, but are not limited to:
- Characterizing the natural history of a disease
- Providing information that can help determine sample size, selection criteria, and study endpoints when planning an interventional study
- Selecting suitable study participants—based on factors such as demographic characteristics, disease duration or severity, and past history or response to prior therapy—to include in an interventional study (e.g., randomized trial) that will assign a drug to assess that drug’s safety or effectiveness
- Identifying biomarkers or clinical characteristics that are associated with important clinical outcomes of relevance to the planning of interventional and non-interventional studies
- Supporting, in appropriate clinical circumstances, inferences about safety and effectiveness in the context of:
-
-
- A non-interventional study evaluating a drug received during routine medical practice and captured by the registry
- An externally controlled trial including registry data as an external control arm
-
The data collected in a given registry and the procedures for data collection are relevant when considering how registry data can be used. For example, registries used for quality assurance purposes related to the delivery of care for a particular health care institution or health care system tend to collect limited data related to the provision of care. Registries designed to address specific research questions tend to systematically collect longitudinal data in a defined population, on factors characterizing patients’ clinical status, treatments received, and subsequent clinical events (as per Section II of the Draft FDA Guidance) [1].
Using Registry Data to Support Regulatory Decisions
[Garbage in = Garbage out]
Image Source: •https://xkcd.com/2295/
Before using any RWD (including registry data) for regulatory decision-making, sponsors should consider whether the data are fit-for-use by assessing the data’s relevance and reliability. The term relevance includes the availability of key data elements (patient characteristics, exposures, outcomes) and a sufficient number of representative patients for the study, and the term reliability includes data accuracy, completeness, provenance, and traceability (as per Section III.A of the Draft FDA Guidance) [1].
Data Accuracy = Correctness of collection, transmission, and processing of data (as per the Glossary of the Draft FDA Guidance) [1].
Completeness = The presence of the necessary data to address the study question, design, and analysis (as per the Glossary of the Draft FDA Guidance) [1].
Provenance = An audit trail that “accounts for the origin of a piece of data (in a database, document or repository) together with an explanation of how and why it got to the present place” (as per the Glossary of the Draft FDA Guidance) [1].
Traceability = Permits an understanding of the relationships between the analysis results (tables, listings, and figures in the study report), analysis datasets, tabulation datasets, and source data (as per the Glossary of the Draft FDA Guidance) [1].
Registry data can have varying degrees of suitability within a regulatory context, depending on several factors, including how the data are intended to be used for regulatory purposes; the patient population enrolled; the data collected; and how registry datasets are created, maintained, curated, and analyzed. Registry data collected initially for one purpose (e.g., to obtain comprehensive clinical information on patients with a particular disease) may or may not be fit for-use for another purpose (e.g., to examine a drug-outcome association in a subset of these patients) (as per Section III.A of the Draft FDA Guidance) [1].
According to the FDA, sponsors should consider both the strength and limitations of using registries as a source of data to generate evidence for regulatory decision-making (as per Section III.A of the Draft FDA Guidance) [1].
Registry strengths:
-
- Registries may have advantages over other RWD sources, given that registries collect structured and predetermined data elements and can offer longitudinal, curated data about a defined population of patients and their corresponding disease course, complications, and medical care.
- Registries can systematically collect patient-reported data that medical claims datasets or EHR datasets may lack.
Registry limitations:
- Existing registries may focus on one disease, with limited information on comorbid conditions, even after linkage to other data sources.
- Enrolled patients may not be representative of the target population of interest due to challenges related to patient recruitment and retention.
-
-
- For example, patients with more severe disease may be more likely to be enrolled in a registry compared to patients with milder disease; or enrolled patients might have different self-care practices, socioeconomic backgrounds, or levels of supportive care versus the entire population of interest. These issues can potentially introduce bias into analyses that make use of registry data.
-
- Additional potential limitations of registries involve issues with data heterogeneity (e.g., different clinical characteristics across various populations) and variation in approaches used to address data quality.
Relevance of Registry Data
When considering whether to use an existing registry for regulatory purposes, a sponsor’s overall assessment of the relevance of registry data should consider whether the registry is adequate for evaluating the scientific objectives (as per Section III.B of the Draft FDA Guidance) [1].
For example, the EMA recommends conducting a feasibility analysis prior to writing the study protocol, to guide its development and facilitate the discussion with national competent authorities (e.g., FDA, EMA), health technology assessors (HTAs) and other parties. The feasibility analysis should be performed in collaboration with registry holders and include the following information, as applicable (as per Section 3.3 of the EMA – Guideline on Registry-Based Studies, October 2021) [2]:
- General Description – General description of the registry or network of registries; the Checklist for evaluating the suitability of registries for registry-based studies can be used to prepare this description; the epidemiology of the disease, this is more precise, medicines use and standards of care applied in the country or registry setting should be described if relevant for the specific study.
- Availability of Core Data Elements – Analysis of the availability in the registry of the core data elements needed for the planned study period (as availability of data elements may vary over time), including relevant confounding and effect-modifying variables, whether they are mapped to any standard terminologies (e.g., MedDRA, OMOP common data model), the frequency of their recording and the capacity to collect any additional data elements or introduce additional data collection methods if necessary .
- Quality and Completeness of the Data Elements – Analysis of the quality, completeness and timeliness of the available data elements needed for the study, including information on missing data and possible data imputations, risk of duplicate data for the same patient, results of any verification or validation performed (e.g., through an audit), analysis of the differences between several registries available in the network and their possible impact on data integration, description of the methods applied for data linkage as applicable, and possible interoperability measures that can be adopted.
- Adverse Event Reporting Processes – Description of processes in place for the identification of adverse events and prompt reporting of suspected adverse reactions occurring in the course of treatments, and capacity to introduce additional processes for their collection and reporting if needed.
- Study Size and Patient Recruitment – Study size estimation and analysis of the time needed to complete patient recruitment for the clinical study by providing available data on the number of centres involved in the registry(ies), numbers of registered patients and active patients, number of new patients enrolled per month/year, number of patients exposed to the medicinal product(s) of interest, duration of follow-up, missing data and losses to follow-up, need and possibility to obtain informed consent.
- Bias – Evaluation of any potential information bias, selection bias due to the inclusion/exclusion criteria of centres (e.g., primary, secondary or tertiary care) and patients, potential time-related bias between and within registry(ies), and potential bias due to loss to follow-up.
- Confounding – Evaluation of any potential confounding that may arise, especially if some data elements cannot be collected or measured.
- Analytical Issues – Analytical issues that may arise based on the data characteristics and the study design.
- Data Privacy – Any data privacy issues, possible limitations in relation to informed consent and governance related issues such as data access, data sharing and funding source.
- Suitability of the Registry – Overall evaluation of the suitability of the registry for the specific study, taking into account any missing information on the above-mentioned aspects.
Reliability of Registry Data
When considering using an existing registry or establishing a new registry, sponsors should ensure there are processes and procedures to govern (as per Section III.C of the Draft FDA Guidance) [1]:
- Registry operation
- Education and training of registry staff
- Resource planning
- General practices that help ensure the quality of the registry data.
Such governance attributes help ensure that the registry can achieve its objectives and should include, but not be limited to:
- An established data dictionary and rules for the validation of queries and edit checks of registry data (as applicable), to be made available for those who intend to use the registry data to perform analyses
-
-
- To support the collection of reliable data within a registry, a registry’s data dictionary should include:
- Data elements and how the data elements are defined
- Ranges and allowable values for the data elements
- Reference to the source data for the data elements
- To support the collection of reliable data within a registry, a registry’s data dictionary should include:
-
- Defined processes and procedures for the registry, such as:
-
-
- Data collection, curation, management, and storage, including processes in place to help ensure that data within a registry can be confirmed by source data (as applicable) for that registry
- Plans for how patients, researchers, and clinicians will access and interact with the registry data and the registry’s data collection systems
- Terms and conditions for use of the registry data by parties other than the registry creator (e.g., terms and conditions a sponsor should satisfy to permit combining the registry data with data from another source)
-
- Conformance with 21 CFR part 11, as applicable, including maintenance of access controls and audit trails to demonstrate the provenance of the registry data and to support traceability of the data
Factors that FDA considers when assessing the reliability of registry data include (as per Section III.C of the Draft FDA Guidance):
- How the data were collected (data accrual)
- Whether the registry personnel and processes in place during data collection and analysis provide adequate assurance that errors are minimized and that data integrity is sufficient.
- Whether the registry has privacy and security controls in place to ensure that the confidentiality and security of data are preserved.
-
-
- For recommendations on controls to ensure confidence in the reliability, quality, and integrity of electronic source data in FDA-regulated clinical investigations, see the FDA Guidance – Electronic Source Data in Clinical Investigations (September 2013) [4]
-
Quality Considerations when Using RWD from Registries to Support Regulatory Decisions
Based on the draft guidance provided by the FDA in their November 2021 publication, what quality aspects of registries (e.g., policies and procedures) should registry owners and study sponsors should consider when addressing these FDA recommendations?
Quality Consideration #1: Policies and Procedures to Support FDA Review of Submissions that Include Registry Data (as per Section III.E of the Draft FDA Guidance) [1].
Sponsors interested in using a specific registry as a data source to support a regulatory decision should meet with the relevant FDA review division before conducting a study that will include registry data (as per Section III.E of the Draft FDA Guidance) [1].
Sponsors should:
- Confer with FDA regarding:
-
-
- The ability to accurately define and evaluate the target population based on the planned inclusion and exclusion criteria
- Which data elements will come from the registry (versus other data sources) and their adequacy, as well as the frequency and timing of data collection
- The planned approach for linking the registry to another registry or other data system, when linking is anticipated
- The planned methods to ascertain and validate outcomes, including diagnostic requirements and the level of validation or adjudication of outcomes FDA agrees is needed
- The planned methods to validate the diagnosis of the disease being studied.
-
- Submit protocols and statistical analysis plans for FDA review and comment before conducting an interventional or a non-interventional study when including data from registries.
- Predefine all essential elements of a registry study’s design, analysis, and conduct in the protocol and describe how that element will be ascertained from the selected RWD source or sources.
- Ensure that patient-level data are provided to FDA in accordance with applicable legal and regulatory requirements.
- Ensure that source records necessary to verify the RWD are made available for inspection as applicable.
Quality Consideration #2: Conduct a feasibility analysis of the registry to guide protocol development and facilitate discussions with regulators (as per Section III.B of the Draft FDA Guidance) [1].
- Conduct a feasibility analysis prior to writing the study protocol, to guide its development and facilitate the discussion with national competent authorities (e.g., FDA, EMA), health technology assessors (HTAs) and other parties. The feasibility analysis should be performed in collaboration with registry holders (as per Section 3.3 of the EMA – Guideline on Registry-Based Studies, October 2021) [2].
Quality Consideration #3: Policies and procedures should be in place to support the reliability of the registry data, including (as per Section III.C of the Draft FDA Guidance) [1]:
- Pre-specifying data validation rules for queries and edit checks of registry data
- Validating the electronic systems used to collect registry data
-
-
- Validation of electronic systems may include, but is not limited to, demonstrating correct installation of the electronic system and testing of the system to ensure that it functions in the manner intended.
- This topic is also discussed in the Draft FDA Guidance – Use of Electronic Records and Electronic Signatures in Clinical Investigations Under 21 CFR Part 11 — Questions and Answers (June 2017) [3]
-
- Enabling FDA and persons interested in using the registry’s data to assess the quality of the data, including to help address issues such as errors in coding or interpretation of the source document or documents, as well as data entry, transfer, or transformation errors.
- Plans for how patients, researchers, and clinicians will access and interact with the registry data and the registry’s data collection systems
- Terms and conditions for use of the registry data by parties other than the registry creator (e.g., terms and conditions a sponsor should satisfy to permit combining the registry data with data from another source)
Quality Consideration #4: Policies and Procedures for Linking a Registry to Another Registry or Another Data System (as per Section III.D of the Draft FDA Guidance) [1].
If a registry is to be populated with data from another data system, sponsors should:
- Consider the potential impact of the additional data on overall integrity of the registry data.
- Use strategies to correct for redundant data, to resolve any inconsistencies in the data, and to address other potential problems, such as the ability to protect patient privacy while transferring data securely.
- Have a plan for addressing the adequacy of patient-level linkages (i.e., that the same patient is being matched).
- Consider any jurisdictional requirements (e.g., country-specific laws) when seeking to link patient-level data to another registry or data system.
- Consider whether the data sources to be linked are interoperable and support appropriate informatics strategies to ensure data integration.
- Ensure that:
-
- Sufficient testing is conducted to demonstrate interoperability of the linked data systems,
- The automated electronic transmission of data elements to the registry functions in a consistent and repeatable fashion, and
- Data are accurately, consistently, and completely transmitted.
- Use predefined rules to check for logical consistency and value ranges to confirm that data within a registry were retrieved accurately from a linked data source and that the operational definitions for the linked variables are aligned.
Quality Consideration #5: Documentation of the Process Used to Validate the Transfer of Data (as per Section III.D of the Draft FDA Guidance) [1].
Documentation of the process sponsors used to validate the transfer of data should be available for FDA to review during sponsor inspections. Sponsors should also ensure that software updates to the registry database or additional data sources do not affect the integrity, interoperability, and security of data transmitted to the registry. For example, issues such as the correct temporal alignment of linked data and registry data should be considered (as per Section III.D of the Draft FDA Guidance) [1].
The appropriateness of using additional data sources also depends on how the sponsor intends to use the linked data and the ability to obtain similar data for all patients. For example, for each potential data source, the sponsor should consider whether:
- The linkage is appropriate for the proposed research question (e.g., the additional data source provides relevant clinical detail and/or long-term follow-up information)
- The data can be accurately matched to patients in the registry and whether linking records between the two (or more) databases can be performed accurately
- The variables of interest in the registry and additional data sources have consistent definitions and reliable ascertainment approaches
- The data have been captured with sufficient accuracy, consistency, and completeness to meet registry objectives
After a sponsor decides to use an additional data source or sources to supplement the registry, the sponsor should:
- Develop the approach and algorithms needed to link such data to a registry.
- Determine how data integrity will be evaluated, including how assessments of any inaccuracies introduced by the linkage (e.g., overcounts of a particular data measure) will be made.
- Use appropriate methods for data entry, coding, cleaning, and transformation for each linked data source.
Quality Consideration #6: Policies and Procedures to Support Data Management Strategies, including (as per Section III.C of the Draft FDA Guidance) [1]:
- Standard Operation Procedures (SOPs) for Data Aggregation and Data Curation: Trained staff should follow standard operating procedures to aggregate data for a registry and carry out data curation
- Implement and maintain version control by documenting the date, time, and originator of data entered in the registry; performing preventative and/or corrective actions to address changes to the data (including flagging erroneous data without deleting the erroneous data, while inserting the corrected data for subsequent use); and describing reasons for any changes to data without obscuring previous entries.
-
-
- Source data originators include persons, systems, devices, and instruments.
- For additional information, see: FDA Guidance – Electronic Source Data in Clinical Investigations (September 2013) [4]
-
- Ensure data transferred from another data format or system are not altered in the migration process
- Seek to integrate data in the registry that were previously collected using data formats or technology (e.g., operating systems, hardware, software) that are now outdated
- Account for changes in clinical information over time (e.g., criteria for disease diagnosis, cancer staging)
- Explain the auditing rules and methods used and the mitigation strategies used to reduce errors
- Describe the types of errors that were identified based on audit findings and how the data were corrected
Quality Consideration #7: Periodic Assessment of Data Consistency, Accuracy and Completeness (as per Section III.C of the Draft FDA Guidance) [1].
- Adequate controls should be in place to ensure confidence in the reliability, quality, and integrity of the electronic source data [4]
- Indicators of data consistency, accuracy, and completeness should be assessed periodically, with the frequency dependent on the purposes of the registry data (e.g., for the sole purpose of facilitating recruitment in a randomized controlled trial versus using the registry data in an interventional or non-interventional study analysis).
- Routine descriptive statistical analyses should be performed to detect the extent of any missing data, inconsistent data, outliers, and losses to follow-up
Conclusion
Whether registry data are fit-for-use in regulatory decision-making (e.g., as a data source for non-interventional studies) depends on the attributes that support the collection of relevant and reliable data as well as additional scientific considerations related to study design and study conduct (as per Section I of the Draft FDA Guidance) [1].
What does this mean for sponsors who are looking to utilise existing disease registries and their associated real world data (RWD) to support their drug development and life cycle management activities?
Study sponsors should ensure they have documented policies and procedures in place that enable them to address these FDA recommendations, so that they can systematically assess and use (appropriate quality) registry data as a source of real world data (RWD) to support their drug development strategies, new drug applications (NDAs)/marketing authorisation applications (MAAs), label extensions and post-marketing commitments (PMCs)/post-marketing requirements (PMRs).
Examples of the policies, procedures and documentation recommended in the draft FDA guidance [1], include:
-
- Policies and procedures to support FDA review of submissions that Include registry data (Study Sponsor).
- Conducting a feasibility analysis of the registry to guide protocol development and facilitate discussions with regulators (Sponsor).
- Policies and procedures to support the reliability of the registry data (Registry Owner).
- Policies and procedures for linking a registry to another registry or another data system (Registry Owner).
- Documentation of the process(es) used to validate the transfer of data (Registry Owner and Study Sponsor).
- Policies and procedures to support data management strategies (Registry Owner and Study Sponsor).
- Periodic assessment of data consistency, accuracy, and completeness (Registry Owner and Study Sponsor).
References
1. Draft FDA Guidance – Real-World Data: Assessing Registries to Support Regulatory Decision-Making for Drug and Biological Products Guidance for Industry (November 2021)
2. EMA – Guideline on Registry-Based Studies (October 2021)
Link: https://www.ema.europa.eu/en/guideline-registry-based-studies-0
3. Draft FDA Guidance – Use of Electronic Records and Electronic Signatures in Clinical Investigations Under 21 CFR Part 11 — Questions and Answers (June 2017)
4. FDA Guidance – Electronic Source Data in Clinical Investigations (September 2013)
Useful Links
21 CFR 11 – Electronic Records; Electronic Signatures
Link: https://www.ecfr.gov/current/title-21/chapter-I/subchapter-A/part-11
EUnetHTA – REQueST Tool and its vision paper (September 2019)
Link: https://www.eunethta.eu/request-tool-and-its-vision-paper/
USA | Draft FDA Guidance on Registries
Please login to view this page.