Solve your scientific search problems

Solve your Scientific Search Problems: Easy-to-Use Web Access to Deep NLP Text Mining Capabilities

November 29, 2017

In a world where the volume and variety of textual data keep increasing, there is a growing range of tools and technologies for handling these data and getting value from them. We hear about a potentially bewildering barrage of AI technologies, including natural language processing (NLP), machine learning, and other textual data science applications. A blog I read recently highlighted this with a Venn diagram covering over a dozen different disciplines (see figure below). These techniques all bring benefits, but often we just need straightforward, simple access to our unstructured text data.

Empower a wide variety of users to find relevant data with high recall and precision

Linguamatics I2E brings a combination of powerful text mining tools to many pharma, biotech and healthcare users. We recognize that users’ demands vary, and so we have created I2E Web Portals. I2E Web Portals aim to engage users who want rapid, easy access to scientific knowledge from both public domain knowledge bases (e.g. MEDLINE, ClinicalTrials.gov) and internal data silos, such as regulatory dossiers, preclinical safety data, patient/customer call transcripts, and many more.

An example I2E Web Portal. Simple Search provides a Google-type search bar. With Advanced Search (illustrated), the user can easily build effective, Boolean-type queries for text-mining searches over one or more data sources. The Smart Search form provides access to powerful bespoke queries written by NLP experts, which generate result sets that answer specific user challenges.


Searching Clinical Investigator Brochures for safety assessments at a top-10 pharma

I2E Web Portals are an out-of-the-box framework that can be customized to create a search interface that fits the needs of a specific business focus. One top-10 pharma has provided access to their silo of Clinical Investigator Brochures using an I2E Web Portal. This means the safety assessment teams can, with just a couple of clicks, get answers to questions such as:

  • Which compounds have we ever studied that have shown kidney effects in any species?
  • Which compounds in our pipeline have toxicology studies with >1 non-rodent species?
  • Which compounds cause liver enzyme elevations in both preclinical and clinical studies?

Behind the scenes, the documents have been processed and indexed, ontologies have been applied, appropriate document regions have been identified, NLP queries have been run, key concepts such as chemicals and diseases have been standardized, and mutations and dosages have been normalized. And for the users? They just get their answers.
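To give a flavor of what "standardized" and "normalized" mean here, the following is a minimal Python sketch. It is not how I2E works internally: the synonym table, the unit list, and the function names (standardize_concept, normalize_dosages) are hypothetical and chosen purely for illustration. It maps different ways of writing the same concept to one preferred term, and converts dosages written in different units to milligrams.

```python
import re

# Hypothetical synonym table standing in for an ontology:
# surface form (lowercase) -> preferred term.
PREFERRED_TERMS = {
    "acetaminophen": "paracetamol",
    "paracetamol": "paracetamol",
    "apap": "paracetamol",
    "renal toxicity": "nephrotoxicity",
    "kidney toxicity": "nephrotoxicity",
    "nephrotoxicity": "nephrotoxicity",
}

# Conversion factors to milligrams for the few units this sketch understands.
UNIT_TO_MG = {"g": 1000.0, "mg": 1.0, "mcg": 0.001, "ug": 0.001}

# A number followed by one of the known units, e.g. "0.5 g" or "20mg".
DOSE_PATTERN = re.compile(r"(\d+(?:\.\d+)?)\s*(mcg|mg|ug|g)\b", re.IGNORECASE)


def standardize_concept(mention):
    """Return the preferred term for a mention, or None if it is unknown."""
    return PREFERRED_TERMS.get(mention.strip().lower())


def normalize_dosages(text):
    """Return every dosage found in the text, expressed in milligrams."""
    doses_mg = []
    for value, unit in DOSE_PATTERN.findall(text):
        doses_mg.append(float(value) * UNIT_TO_MG[unit.lower()])
    return doses_mg


if __name__ == "__main__":
    print(standardize_concept("Kidney toxicity"))
    print(normalize_dosages("0.5 g once daily, then 20 mg"))
```

Once every document uses the same term for the same concept and the same unit for every dose, a question such as "which compounds have shown kidney effects?" becomes a simple lookup rather than a search over every possible spelling.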

For more information, please download our web portal datasheet

 

 

A representation of the overlap of disciplines involved in textual data science; from: Text Mining - Predictive Methods for Analyzing Unstructured Information
