I’m thrilled to see that Linguamatics I2E 4.3 has been named a KMWorld 2015 Trend-Setting Product. Linguamatics I2E has a proven track record of delivering best-of-breed text mining capabilities across a broad range of application areas. Its agile nature allows query strategies to be tuned to deliver the precision and recall needed for specific tasks, at enterprise scale.

According to customers, I2E gets to actionable results at least 10 times faster than a traditional keyword search. In many cases, I2E will produce successful results for projects that would otherwise be impossible or intractable.

Actionable information extracted using I2E can be presented in a variety of ways depending on your needs. NLP-based text mining makes it possible to search unstructured text, typically large sets of documents such as scientific reports, patents, electronic healthcare records, and pathology and radiology reports, and to use sophisticated queries that automatically identify and extract structured data (concepts and associations), so that the system can interpret the meaning of the text.


Linguamatics I2E natural language processing technology to automatically extract clinical attributes from pathology reports across eight hospital groups in the Stratified Medicine Programme.

LONDON and CAMBRIDGE, UK, September 1st, 2015 – Cancer Research UK and Linguamatics announced today they will work on a joint project to apply Linguamatics’ natural language processing (NLP) text analytics platform, I2E, to automatically extract clinical attributes from cancer pathology reports and improve annotation of clinical samples relating to Cancer Research UK’s Stratified Medicine Programme (SMP). This project will allow the analysis of detailed patient characteristics alongside large volumes of genetic data, enabling more effective research into the causes and personalised treatment of cancer.

Dr Ian Walker, Director of Clinical Research and Strategic Partnerships at Cancer Research UK, said: “Pathology reports tell us a range of important information about a patient’s cancer, but the way this data is recorded can vary widely, which makes it harder to spot trends or other significant information that could have a bearing on treatment decisions or prognosis. This collaboration should help translate these reports into more meaningful data, which should help our researchers better understand the disease and accelerate advances in personalised medicine.”

On July 16, delegates from across the life sciences, biotech, healthcare and other knowledge-driven industries gathered in Princeton for Linguamatics’ one-day seminar, “From bench to bedside, unlocking key insights in your data”.

We heard from Regeneron Pharmaceuticals, Johnson & Johnson, Copyright Clearance Center (CCC) and Linguamatics on how NLP technology is moving into new application areas to improve patient outcomes and unlock key insights across the drug discovery, development and delivery continuum. Delegates were very engaged, and many stayed long after the talks had finished to continue the day’s discussions.

Jim Dixon, Senior Application Specialist, gave us an introduction to I2E NLP text mining, the new features in the latest I2E release and the industry’s first federated text mining platform. Whatever the content, I2E can mine and extract with precision and at scale. You can use Linguamatics I2E to provide valuable intelligence from text, getting you to the answers faster so you can make smarter, better-informed decisions.

Dr. Peng Zhang’s presentation showed us a real-life use case of I2E’s potential at Regeneron. Eliminating or modifying a single gene in the mouse genome can provide insight into the role that gene plays in normal physiology and disease pathogenesis, but keeping up to date with novel information is time-consuming. Dr. Zhang uses I2E to systematically mine the scientific literature for any reported gene knockout in mice and any associated autoimmune phenotype.

Life sciences and healthcare professionals gathered at the UCSF Mission Bay campus for the West Coast Natural Language Processing (NLP) & Big Data Symposium on June 18th. The symposium, co-hosted by UCSF, featured presenters from UCSF, Merck, City of Hope, Copyright Clearance Center and Linguamatics and delegates from a diverse range of organizations.

The central theme of this year’s symposium was “From bench-to-bedside, unlocking key insights from your data”. Healthcare delegates were keen to find new ways to address meaningful use and accountable care by leveraging NLP text mining of electronic health records. Life sciences delegates were keen to increase the efficiency and effectiveness of their business operations by mining real-world data. There was also strong interest in forging partnership opportunities between pharma/biotech and hospitals/cancer centers.

Sorena Nadaf, the CIO and Director of Translational Informatics at UCSF Helen Diller Family Comprehensive Cancer Center, delivered the welcome address and highlighted the foundation of clinical NLP and its common uses for extracting and transforming narrative information in EMRs to support and accelerate clinical research.

Sorena Nadaf at the NLP & Big Data Symposium in San Francisco.

Linguamatics I2E: the first text mining platform to integrate with Copyright Clearance Center's RightFind XML for Mining, to allow access to full-text journal articles

(Cambridge, UK and Boston, USA - 24 June 2015) - Linguamatics is expanding its natural language processing (NLP)-based text mining platform I2E to include easier access to full-text articles, with the integration of Copyright Clearance Center's (CCC) new text mining solution, RightFind™ XML for Mining.

Commercial life science researchers can now create sets of full-text XML articles from more than 4,000 peer-reviewed journals produced by over 25 scientific, technical, and medical (STM) publishers, and automatically make them available for text mining in I2E.

The solution enables researchers to make discoveries and connections that can only be found in full-text articles. All of the content is stored securely by CCC and is pre-authorized by publishers for commercial text mining. Users access the content using Linguamatics’ unique federated text mining architecture, which allows researchers to find the key information to support business-critical decisions. The integrated solution is available now, and enables users to save time, reduce costs and mitigate their organization’s copyright infringement risk.