
Posts from October 2016

Linguamatics are delighted once more to sponsor the Findacure Student Voice Essay Competition. Findacure is a UK charity that is building the rare disease community to drive research and develop treatments.   

The winning essay will be published in the Orphanet Journal of Rare Diseases, and the essay topics are:

  1. The impact of a rare disease is much more widespread than its direct symptoms. Discuss how, with particular reference to the patient experience.
  2. How can rare diseases lead the way in medical research and clinical innovation?
  3. How can clinicians and researchers, including students, help to deliver the UK Strategy for Rare Diseases?

One of the big challenges in developing treatments for rare diseases is the need for a thorough understanding of the natural history of each of the 7000 currently known rare diseases. It’s critical to have detailed, systematic information on both the genotypic aspect (the genes and mutations) and the phenotypic aspect (pathways involved or disrupted, symptom severities, etc.).

BOSTON, MA and NEW YORK, NY--(Marketwired - October 18, 2016) - Linguamatics, a world leader in NLP Text Mining, and Sinequa, a leader in Cognitive Search and Analytics, today announced a partnership based on a tight integration between I2E and Sinequa ES. This integration will provide life sciences and healthcare organizations with deeper insights from their ever-increasing volumes of unstructured text across the entire enterprise.

Linguamatics' I2E text mining platform enhances the Sinequa Cognitive Search & Analytics platform with its advanced text mining capabilities, providing an unparalleled foundation to build upon in life sciences. The combined strength of both platforms helps users get more precise, actionable and contextual information in their field. They can ask questions such as "what treatments are used for breast cancer?" or "what diseases are treated by drug X?"

"The integration of I2E with Sinequa ES allows users to surface more relevant results and increase speed to insight. The semantically enriched data that I2E provides complements Sinequa ES, using key biomedical concepts such as drugs, diseases, anatomy, genes and genetic mutations, chemical structures, numerical data and many others. We are looking forward to delivering significant added value to life sciences and healthcare organizations through this partnership," said Phil Hastings, chief business development officer, Linguamatics.

I recently attended a talk by Linguamatics CTO David Milward on Structured Queries for Unstructured Data, delivered to the Data Insights Cambridge Meetup group.

The data science community wants to know:

  • How can we deliver insights from big data?

  • What are the optimal approaches to ‘handle’ (store, capture) and analyze (query, structure, repurpose) big data?

We can now generate and store many times more data than we could just 10 years ago. SQL database technology handles structured data well and has not changed significantly since the 1980s. For basic queries, it’s easier to deliver insights from structured data than from unstructured data in free-text sources.
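
To make the contrast concrete, here is a minimal sketch in Python. It is not taken from the talk and does not use I2E; the table name, column names, example sentences and regular expression are all illustrative assumptions. It asks the same question ("what treatments are used for breast cancer?") of a small structured table and of two free-text sentences.

```python
import re
import sqlite3

# Structured data: a hypothetical table with one drug/disease fact per row.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE treatments (drug TEXT, disease TEXT)")
conn.executemany(
    "INSERT INTO treatments VALUES (?, ?)",
    [("tamoxifen", "breast cancer"), ("metformin", "type 2 diabetes")],
)

# A basic SQL query answers the question directly.
structured = [
    row[0]
    for row in conn.execute(
        "SELECT drug FROM treatments WHERE disease = ? ORDER BY drug",
        ("breast cancer",),
    )
]

# Unstructured data: the same fact, phrased two different ways (illustrative).
notes = [
    "Tamoxifen is used to treat breast cancer.",
    "Patients with breast cancer were given tamoxifen.",
]

# A naive pattern only recognises one phrasing, so it misses the second note.
pattern = re.compile(r"(\w+) is used to treat breast cancer", re.IGNORECASE)
unstructured = []
for note in notes:
    match = pattern.search(note)
    if match:
        unstructured.append(match.group(1).lower())

print(structured)    # answer from the structured table
print(unstructured)  # answer from the free text, found in only one of two notes
```

The SQL query is complete as written, while the pattern would need a new rule for every way a sentence can express the same relationship. That gap is what text mining aims to close.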

Unstructured data is the new frontier for data science

What drew so many people to David’s talk was the promise of the ‘data insights’ that are locked away in unstructured data. The audience spanned various industries, from astronomy to finance, and included many people working with unstructured data in health and life science. Many industries rely heavily on data to inform their day-to-day business decisions. For healthcare and life science, where Linguamatics is the text mining leader, transforming how we understand and improve population health and patient outcomes will primarily entail extracting insights from unstructured data sources.