Linguamatics I2E 5.1 focuses on further increasing the power and scale of querying, while optimizing the users’ experiences of query building.

I2E 5.1 enriches and expands on the capabilities introduced in I2E 5.0, which made a big splash in NLP text mining technology.  

I2E 5.1 addresses the increasing variety of representations of the same concept in big data by finding more matches for terms in a document: variations in accented characters, spelling errors, and OCR artefacts are taken into consideration when matching. This ‘fuzzy matching’ returns greater search results and increases recall and accuracy.

One customer commented: ‘I am really looking forward to I2E 5.1’s spelling correction…you don’t realize how much you can miss in your search results because of typos and spelling mistakes.’

Data normalization in I2E, a key feature for tackling big data’s increasing variety, is now easier to use. Regardless of how the original document is written, you can define your numeric ranges in a different unit; for example, you can filter in pounds (as an upper or lower threshold or as a range) and display the results in kilograms.

I2E 5.1 introduces an integrated view of your query and a way of dragging queries around the editor, making it easier to design, tune and maintain your searches.


The 2017 Text Mining Summit (New Castle, New Hampshire, October 2-4) will be your first opportunity to take part in our new I2E Certificate Program.  The Level 1 Query User Certificate will be open to those who have just taken the “Introduction to I2E” hands-on workshops provided at the TMS, as well as more established users, who have taken the “Introduction to I2E” training on previous occasions. See the TMS Workshop Selection Guide for more details. It’s free to join in as part of your TMS registration.

Completing the different levels of the Certificate Program will allow you to validate, extend and improve your I2E skills. The Query User Certificate will focus on using and editing basic queries and Resource queries to:

  • Create simple queries with different constraints, morphological variants, preferred terms and alternative lists

  • Use classes to improve recall and precision of queries with linguistic classes, ontologies, and pattern ontologies

  • Work with results by using limits, output formats and displays

  • Use Resource queries to answer common questions

Those taking the Query User Certificate at the TMS will have access to:

  • In-class instruction

  • Practical, hands-on experience with I2E

  • Open question sessions with I2E Experts

  • A set of learning objectives

  • Learning materials, including

    • Tutorial booklets


I2E makes natural language processing-based text mining intuitive and interactive

SANTA CLARA, Calif. — July 20, 2017 — Based on its recent analysis of the Big Data text analytics market for the healthcare industry, Frost & Sullivan recognizes Linguamatics with the 2017 Global Frost & Sullivan Award for Enabling Technology Leadership. Linguamatics stands out in the natural language processing (NLP) market for its technology expertise and commitment to delivering exceptional value to clients in the US healthcare industry. The highly flexible and scalable Linguamatics Health platform, powered by I2E, is helping healthcare providers and payers to transition to value-based care.

Within the last year, Linguamatics introduced its fifth iteration of I2E, which includes cutting-edge capabilities such as the normalization of concepts and relationships for quick and comprehensive data retrieval regardless of format; advanced range research; and an extraction and search query language (EASL). The EASL can be generated external to the platform to support custom interfaces, queries in a human-readable format, and superior workflow automation.


I2E Natural Language Processing advances research and care delivery by mining clinical insights from unstructured patient data

Cambridge, UK & Boston, USA – June 22nd, 2017 – Market leading Natural Language Processing (NLP) text analytics provider Linguamatics today announced the implementation of the Linguamatics Health enterprise NLP platform, powered by I2E, at the University of Pennsylvania Health System for the extraction of actionable insights from unstructured patient data.

“We look forward to working with Penn Medicine to help them unlock valuable insights from clinical notes in order to advance research initiatives and enhance the delivery of care,” said Simon Beaulah, senior director of healthcare at Linguamatics. “Our growing community of academic medical centers across the country have deployed the Linguamatics Health platform, and are taking advantage of its ease of use, powerful NLP capabilities, rapid query development and successful integration with enterprise systems. Our platform is particularly well-suited for this environment because it empowers organizations to work independently, and get the data they want without requiring extensive services.”


Text Mining Platform I2E features in Best Practices Final and as a Best of Show Award Contender; Linguamatics CTO David Milward a Featured Speaker

Cambridge, UK & Boston, USA – May 22, 2017 – Leading Natural Language Processing (NLP) text analytics provider Linguamatics today announced plans to highlight the latest version of its text mining platform at this week’s Bio-IT World Conference & Expo in Boston. Bio-IT World has named Linguamatics I2E 5.0 a contender for the Best of Show Award, and Linguamatics’ customer Pentavere Research Group a Best Practices finalist.

The Best of Show Awards showcase exceptional innovation in technologies used by life science professionals. As a Best of Show Award contender, Linguamatics is also eligible for the Bio-IT World People’s Choice Award, chosen by votes from the Bio-IT World Community. Voting for the People’s Choice Award is open from 5 pm ET Tuesday May 23 through 1 pm ET on Wednesday May 24.

Bio-IT World also chose Linguamatics' customer Pentavere Research Group as a Best Practices finalist, based on their work using I2E to mine unstructured data for real-world evidence to improve health outcomes. Best Practices finalists are recognized for their outstanding examples of technology innovation, from basic R&D to translational medicine. Pentavere deployed I2E to effectively mine unstructured EHR data, expediting delivery of their product daRWEn™ to the Real World Evidence market.