Pharma and healthcare companies are data-rich organizations; however, a large proportion of this data isn't as accessible as desired because the information is locked in unstructured text. Linguamatics I2E is used by our customers across drug discovery, development and delivery of therapeutics to tackle this problem, and our biannual user conferences are always a great way to hear updates on applications of text mining and best practice to help solve these data access challenges.

Last month, we had speakers presenting on where they are getting value from I2E, from bench to bedside. Attendees from the pharmaceutical industry, biotech, healthcare, academia, and partner vendor companies came to our Spring Text Mining Conference for training sessions, networking, discussions, and, of course, excellent presentations.

Structure Activity Relationships from Patents, for Medicinal Chemistry

Starting in early discovery, Ortrud Steinfuehr (Information Manager, Bayer AG) presented on “An Approach to provide SAR Data from Patents”. Structure Activity Relationships (SAR) provide information about how the 3D structure of a chemical compound impacts biological activity, such as effective dose or inhibitory concentration values for specific targets. SAR is very valuable for medicinal chemistry research around optimisation and modification of lead compounds. These data are often found in patents, which are typically written in a way that makes automated extraction of SAR very tricky.


AbbVie, Bayer, Merck KGaA, Mundipharma, and Novo Nordisk to share text mining insights at Cambridge, UK meeting

Cambridge, England and Boston, USA – April 17, 2018 – Linguamatics, the leading natural language processing (NLP) text analytics provider, today announced its Spring Text Mining Conference 2018 will feature presentations from several top-tier biomedical organizations. The conference, taking place April 23 to April 25 in Cambridge, England, will highlight the wide range of ways that organizations are leveraging I2E, Linguamatics’ powerful NLP-based AI technology, to extract actionable insights from the huge amount of unstructured data available in healthcare and the life sciences.

In addition to presentations from AbbVie, Bayer, Merck KGaA, Mundipharma, and Novo Nordisk, the conference will offer hands-on training for users, opportunities for exchanging ideas and networking, sessions on industry trends and best practices, and demonstrations of the latest Linguamatics technology updates.

“We are seeing ever broader use of NLP for research, intellectual property and real-world evidence,” said David Milward, chief technology officer for Linguamatics. “We are looking forward to learning more about the innovative ways our customers are taking advantage of our text mining technology, and to share details on the latest enhancements to our technology stack, including NLP and machine learning updates.”


At this year’s Spring Text Mining Conference, you will get the opportunity to take part in our new I2E Certificate Program, which we launched at our 2017 Text Mining Summit.

This exciting opportunity will allow you to certify your I2E natural language processing (NLP) text-mining knowledge and skills. The Level 1 Query User Certificate will be open to those who attend the “Introduction to I2E” hands-on workshops that will take place at the STMC this April, as well as more established users, who have already attended the “Introduction to I2E” training. See the Spring Text Mining Conference Workshop Selection Guide for more details. It’s free to join in as part of your registration.

Completing the different levels of the Certificate Program will allow you to validate, extend and improve your I2E skills. The Query User Certificate will focus on using and editing basic queries and Resource queries to:

  • Create simple queries with different constraints, morphological variants, preferred terms and alternative lists
  • Use classes to improve recall and precision of queries with linguistic classes, ontologies, and pattern ontologies
  • Work with results by using limits, output formats and displays
  • Use Resource queries to answer common questions

Those taking the Level 1 Query User Certificate at the Spring Text Mining Conference will have access to:


I2E Natural Language Processing improves Medicare Advantage reimbursement, streamlines ACO reporting and reduces care gaps

Boston, USA – February 1, 2018 – Natural Language Processing (NLP) text analytics provider Linguamatics today announced the implementation of the Linguamatics Health enterprise NLP platform, powered by I2E, at Atrius Health to identify and extract critical clinical information hidden within unstructured patient data.

Atrius Health is a non-profit healthcare leader providing primary care and specialty care to more than 740,000 adult and pediatric patients across eastern and central Massachusetts. As a long-term Accountable Care Organization (ACO) for Medicare, commercial, and Medicaid patients, Atrius Health requires ready access to clinical notes and data to address reporting requirements and advance quality care initiatives, including programs that require the proper identification of at-risk patients to minimize care gaps.

“A vast amount of critical clinical data exists as unstructured text which is difficult to access and analyze. We are leveraging the power of NLP to replace the manual, inefficient data extraction processes that many healthcare organizations struggle with, in order to advance our quality care initiatives more rapidly,” said Joe Kimura, MD, Atrius Health’s chief medical officer. “Linguamatics NLP allows us to close gaps in care, enhance clinical documentation for chronic disorders, reduce litigation risks, and streamline Medicare ACO quality reporting.”


New innovations increase democratic and programmatic access to text analytics, expanding and speeding the transformation of unstructured data for knowledge discovery

Cambridge, UK & Boston, USA – December 7th, 2017 – Natural Language Processing (NLP) text analytics provider Linguamatics today announced that the company has extended its position as a leading provider of NLP text-mining solutions for healthcare and the life sciences with the introduction of multiple innovations in 2017.

New capabilities include custom search interfaces using I2E web portals, providing much larger, wide-ranging user communities with web access to the benefits of deep NLP capabilities. Programmatic use of I2E is enhanced by I2E AMP, which delivers high-throughput, fault-tolerant workflow management for real-time document processing. AMP is deployed at multiple customer sites in a variety of workflows, including safety and regulatory affairs.

“Over the last year more customers have taken advantage of the capabilities of I2E with AMP for ETL (Extract, Transform, and Load) use cases,” said Linguamatics Chief Business Development Officer Phil Hastings. “It’s exciting to see the numerous ways customers are benefitting from the ability to transform their unstructured data to a structured format that can be loaded into data warehouses or data lakes, or consumed by downstream applications for broader access and decision support.”