Clinical trial analysis

I2E is invaluable in clinical trial analysis, assisting with trial design, optimization, site selection and competitive intelligence.

Clinical trials are used to gather safety and efficacy data on new drugs in development, or existing drugs being tested for new indications. Although some information in clinical trial reports is well structured and searchable using keywords, much of the information lies buried in unstructured text.

I2E is an essential tool for extracting and synthesizing the high value information that is found in this unstructured text. This can then be used in future study design and site selection, or to gain actionable information about competitors' worldwide clinical development activities.

Customer report that using I2E, the time for site selection can be reduced by over 80%. For patient recruitment, time spent can be reduced by at least 25%.


There are currently nearly 200,000 study records in, testing over 70,000 unique pharmacotherapies in approximately 190 countries. Other cancer registries, both public and commercial, also provide a rich source of clinical trial data. The case for using advanced text mining over clinical trials is particularly compelling as the industry looks to cut the costs and time required for trials.

Text mining with I2E is used widely by our pharma and biotech customers to aid in clinical trial site selection and study design. The outcome is significant time and cost savings: for example, as patient recruitment in the mature markets becomes increasingly difficult, I2E enables sponsors to locate clinical trial sites abroad. 

Customers are able to run queries over the detailed unstructured textual record fields in databases such as, Cortellis Clinical Trials Intelligence, WHO ICTRP, or Citeline's TrialTrove to rapidly identify, extract, synthesize and analyze relevant information such as clinical trial site, selection criteria, study characteristics, patient numbers and characteristics that would not be possible using other approaches. These data can be used to answer key questions such as:

  • What clinical endpoints would be appropriate to measure for xyz diseases?
  • Which investigators are expert in running clinical trials for diseases xyz?
  • Who else has drugs in clinical trials for indication xyz?
  • Which trials (in a given disease area) use drug xyz in combination with another drug?
  • What potential due diligence information can I find for in-licensing opportunities for disease area xyz?

Case studies

Mining clinical trials reports with I2E at AstraZeneca
Use of I2E for knowledge discovery from clinical trials reports. This case study outlines two investigations using I2E to provide answers for clinical decision makers; the first identifies the blind status of trials with differing intravenous (IV) drug doses, and the second examines the dose durations of follow-on clinical trials.

Network graph view of Phase 1 MAD (multiple ascending dose) studies and follow-on Phase 2 three-month dosing studies in the infectious disease area, linked via text-mining for sponsor, disease area and intervention.