Blog

I2E 4.0 delivers the next generation in text mining and knowledge discovery for existing and new users

(Cambridge, England and Boston, USA – December 13th, 2012) Linguamatics is proud to announce the launch of a dramatically enhanced version of its well-established text mining software platform, I2E. The latest release, Version 4.0, adds significant enhancements to Web Services, Virtual Data Integration and Chemistry, among other areas. These improvements are part of Linguamatics’ continuing investment in research and technology development.

Web Services API: The new Linguamatics I2E WSAPI (Web Services API) will open up the power of state-of-the-art natural language processing (NLP)-based text mining to a broader range of users within the organization by allowing easier and more focussed interfaces for different audiences. Organizations will therefore find it much easier to embed text mining into core systems and workflows, deployed via intuitive, accessible, customized interfaces.

Virtual Data Integration: Regardless of how many disparate data sources need to be mined, I2E can now analyze and extract information and knowledge from all of them simultaneously. This dramatically speeds up knowledge extraction, because users no longer need to run a series of separate searches over multiple data sources. It also makes it easy to create queries that connect the different data sources, so that information from different silos can be brought together to create new knowledge.


European Community research project to enrich multilingual terminologies in biomedicine

(Cambridge, England and Boston, USA – December 11th, 2012) A key factor in successful text mining is the use of comprehensive terminologies which capture the different ways concepts can be expressed. However, although extensive terminologies exist for English, they are less common for other languages.

According to Wikipedia, a mantra is “a group of sounds, syllables or words capable of creating transformation”. MANTRA is also the highly apposite acronym of a new, European Community-funded research project: Multilingual Annotation of Named entities and Terminological Resource Acquisition. Linguamatics is pleased to announce its participation as a commercial partner.

The object of MANTRA is to enrich multilingual terminologies in the biomedical domain by exploiting parallel corpora in several different languages. For example, from the knowledge that an English patent (claims 4, 5 and 6) refers to Branching Enzyme, it should be possible to discover the previously unknown German synonym Verzweigungsenzym from claims 4, 5 and 6 of the German translation. The new synonym can then be used in analysing other document sets. Terminologies in one language and the same documents in other languages can be mined simultaneously to provide enriched terminologies in those other languages.
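As a rough illustration of the idea only (this is not MANTRA's or I2E's actual method), the Python sketch below ranks target-language words by how consistently they appear in the translated segments aligned with an English term. The function name, the scoring formula and the toy claim texts are all invented for this example.

```python
import re
from collections import Counter


def candidate_translations(aligned_segments, source_term):
    """Rank target-language tokens by co-occurrence with source_term.

    aligned_segments: iterable of (source_text, target_text) pairs.
    Hypothetical helper for illustration; not part of any real product API.
    """
    term = source_term.lower()
    overall = Counter()    # number of segments containing each target token
    with_term = Counter()  # same count, restricted to segments whose source has the term
    n_with = 0
    for source_text, target_text in aligned_segments:
        tokens = set(re.findall(r"\w+", target_text.lower()))
        overall.update(tokens)
        if term in source_text.lower():
            n_with += 1
            with_term.update(tokens)
    # Score = (share of the token's segments that contain the term)
    #       * (share of the term's segments that contain the token).
    scores = {
        token: (hits / overall[token]) * (hits / n_with)
        for token, hits in with_term.items()
    }
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Toy, made-up parallel claims (English, German); not taken from a real patent.
segments = [
    ("4. The method of claim 1 wherein the branching enzyme is derived from potato",
     "4. Verfahren nach Anspruch 1 wobei das Verzweigungsenzym aus Kartoffel stammt"),
    ("5. The method of claim 4 wherein the branching enzyme is expressed in maize",
     "5. Verfahren nach Anspruch 4 wobei das Verzweigungsenzym in Mais exprimiert wird"),
    ("6. A plant cell comprising the branching enzyme",
     "6. Pflanzenzelle umfassend das Verzweigungsenzym"),
    ("7. The method of claim 1 wherein the starch granule is isolated",
     "7. Verfahren nach Anspruch 1 wobei das Stärkekorn isoliert wird"),
]

ranked = candidate_translations(segments, "branching enzyme")
print(ranked[0])  # top-ranked candidate and its score
```

On this toy data the highest-scoring candidate is the word that appears in every German segment aligned with "branching enzyme" and in no other segment. A real system would need proper alignment, multi-word term handling and filtering of common words.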


I2E Patents Solution delivers new intellectual property insights from deeper natural language processing based analysis

(Cambridge, England and Boston, USA – September 13th, 2012) Linguamatics is pleased to announce the launch of its new service for mining the world’s patents. This service builds on the core capabilities of its market-leading text mining engine, I2E. Linguamatics’ Patents Solution provides large-scale text mining of full-text patents from the world’s major patent organizations, including the USPTO, EPO and WIPO. The solution enables powerful NLP-based analysis over tens of millions of full-text documents in real time.

Linguamatics is not new to the world of big data. The I2E text mining platform is already recognized as the text mining product of choice in a number of sectors, including pharmaceuticals, biotech, chemistry, healthcare, and government, due to its sophisticated use of proprietary NLP-based algorithms. With I2E, knowledge extracted from patents can be linked to other data sources such as scientific literature or structured databases.


CDER to use Linguamatics I2E in support of drug safety research activities

(Cambridge, England and Boston, USA – August 1st, 2012)

Linguamatics, the leader in natural language processing (NLP)-based text mining, announced today that the FDA’s Center for Drug Evaluation and Research (CDER) has licensed its I2E text mining platform to support laboratory research efforts on drug safety. Financial details of the agreement were not disclosed.

During the term of the license, CDER will use I2E to mine published literature and drug product labels to answer questions relating to a range of biomedical topics, including drug toxicity mechanisms and disease processes.

I2E’s NLP-based querying capabilities, coupled with its scalability and flexibility, mean it is ideally suited to answering many challenging, high value questions in life sciences and healthcare by unlocking knowledge buried in the scientific literature and other textual information. Rather than just retrieving documents, I2E can rapidly identify, extract, synthesize and analyze specific, relevant facts and relationships, such as those between genes and diseases or compounds and side effects. Customers include nine of the top ten global pharmaceutical companies.

“We are very pleased to welcome CDER/FDA as a customer”, commented John Brimacombe, Executive Chairman at Linguamatics. “Their adoption of the I2E text mining platform reflects the continued growth in demand for NLP-based text mining as a key knowledge discovery and decision support tool in the pharmaceutical, biotechnology and healthcare sectors.”


At Linguamatics, we are probably best known for our applications in the pharmaceutical industry, and perhaps for our Twitter mining project during the May 2010 election.

Applications of I2E have expanded significantly over the last 18 months, so we’ve highlighted a couple of interesting examples below.

Healthcare

Linguamatics’ high-performance text mining platform, I2E, allows healthcare providers to identify, extract, synthesize and analyze relevant facts, connections and correlations from unstructured or semi-structured textual information with precision and at scale, radically improving speed to insight.

Text Mining within a Biotech Setting

Access to advanced text mining capabilities for making target selection decisions was identified as a key informatics technology to integrate into Syntaxin’s selection process.

This case study outlines how combined text mining queries via I2E allowed an informed disease selection process to be implemented.