Case study: National Cancer Institute (NCI) analyzes biomarker values at scale


NCI needed a solution for automating the extraction of cancer-related data from pathology reports for its SEER cancer surveillance program, which assesses patient data from 15 US states.


Linguamatics I2E first identified documents containing biomarkers of interest, then extracted corresponding test values from those reports.


Linguamatics demonstrated the flexibility of adopting I2E to perform biomarker-value extraction from pathology reports. I2E takes a data-driven, rule-based NLP approach and I2E’s interactive manner allows rules to be created and refined transparently by users.