Posts from May 2014

A recent customer project highlighted to me the importance of being able to apply NLP to cohort selection to support medical research, clinical trials recruitment and outcomes analysis.

A new customer of ours was setting up a study into patients with HIV and Hepatitis C and needed to identify potential subjects from their AllScripts EHR. As many organizations do, they had five medical students spend four months trawling through patient records to identify 700 potential study candidates.

The process was particularly painful because simply looking for the ICD-9 codes for HIV and Hepatitis C in structured fields was missing significant numbers of potential subjects. This was caused by variations in where the data was recorded; sometimes it was coded in structured fields; sometimes it was written in the patient narrative that he or she was positive for HIV or Hepatitis C; sometimes it was both.

Assessing the narrative is always a problem with variations in patient history vs family history and “tested for HIV, negative result” and “positive for HIV” requiring careful reading.

Our customer had recently installed our I2E NLP platform and had indexed a large collection of patient records by extracting documents from AllScripts via their analytical data warehouse.

The data sets were indexed with the usual domain ontologies covering diseases, medications, procedures etc. to support rapid searching in I2E.


I2E 4.2 introduces integrated visualization for improved analysis and semantic enrichment capabilities for integration with enterprise search engines.

(Cambridge, UK and Boston, USA – 19 May 2014) Linguamatics announces the latest release of its award-winning natural language processing (NLP)-based text mining and analytics platform I2E. I2E 4.2 further enhances users' experience by introducing integrated charting and graphing to provide visual analytics for results extracted from large volumes of unstructured data.

I2E 4.2 includes new capabilities to support the integration of semantically enriched data into enterprise search engines to enhance the search experience for a wider audience.

The integrated visualization capabilities in I2E 4.2 will allow users to gain a comprehensive view of large or complex data sets with the ability to filter down to the information of most interest, making it easier to access the most important information faster and share results throughout the organization thus enabling more rapid decision support and increased speed to insight.

The new semantic enrichment functionality enables I2E to automatically identify and mark up concepts and relationships within data already used by enterprise search engines and link these to I2E's powerful domain knowledge to provide dramatically improved search results.

David Milward, Linguamatics CTO, commented "We are now working to a shorter release cycle, allowing us to respond in a more agile way to customer requirements.

We’re excited about the increasing demand for I2E in semantic enrichment for enterprise search platforms such as Microsoft SharePoint, and have delivered dedicated functionality to support that".