Professional Services

Linguamatics provides industry-leading professional services and consulting in text mining and Natural Language Processing (NLP) to support your information extraction and knowledge discovery projects.


Our world-class team has experience in projects and solutions across the drug discovery and development cycle, real-world data, electronic health records and more. We work with many formats and types of data, such as internal documents, scientific literature, patent documents, call center feeds, pathology reports, nurses' notes and radiology reports. In addition, we have in-house multi-lingual capabilities to tackle documents for our global clients.

Our team includes natural language processing experts and life sciences and healthcare specialists, who work with customers to solve their knowledge discovery challenges. Typical consulting services include training, work on specific text mining projects (including algorithm and terminology development), and software deployment and integration. Our consulting services are designed to help our customers reach (and go beyond) their objectives over the lifetime of their relationship with Linguamatics.

Over the years, we have worked on hundreds of customer projects across both pharma and healthcare organisations. For example:

IDMP Mundipharma

Mundipharma Research Limited implemented a project using the Linguamatics NLP solution to find, highlight and extract data elements for Iteration 1 from unstructured documents such as EMA Summary of Product Characteristics (SmPC) documents.

Our experts developed algorithms to extract the individual data elements using standard and customized ontologies, as well as linguistic features and region structures of SmPCs. Accuracy was evaluated against a ‘gold standard’ data set that had been manually extracted by an independent expert.

“We were really impressed when we saw the accuracy with which Linguamatics NLP had been able to extract data elements from the documents.”

Jon Sanford, Head of Regulatory Information Management and Operations at Mundipharma Research


Pfizer Automated Quality Review

Checking documents for errors before FDA submission is a complex problem, and is currently a manual, costly process. Automating it could make the review faster and more efficient.

Pfizer and Linguamatics worked together on a project to create a solution that allows reviewers to submit document packets and use Linguamatics NLP to generate reports summarizing the detected errors. The solution was implemented on premises at Pfizer, protecting the confidentiality of these sensitive documents.



Merck SALAR KAT (Knowledge Access Tool)

Linguamatics Professional Services worked on a project with the Safety Assessment and Laboratory Animals Resources (SALAR) division at Merck MSD. This division helps advance high-quality drug candidates into development by defining the non-clinical safety and selectivity of lead compounds.

Together, we developed an automated workflow to extract unstructured conclusions and interpretations from final study reports, ante-mortem reports, post-mortem reports and protocols stored in a Documentum-based electronic official file repository. The Linguamatics NLP algorithms developed were able to identify, extract, and normalize study annotation metadata and organ pathology findings. The results are combined with structured output, loaded into a SALAR knowledgebase, and visualized via dashboards for the safety assessment teams.



For more information, visit our use cases page, or see our partners and affiliations page to learn more about our extended network and partnerships in the industry. To find out more, please get in touch.

Phone: +1 617-674-3256 (North America) or +44 (0)1223 651910 (Europe and ROW)