Text2Table

Named Entity Recognition (NER) and Relation Extraction (RE) are two common ways of summarizing clinical documents (e.g., discharge summaries). While deep learning methods have been received a lot of attention lately, it is not practical to run these methods on every single machine. Besides, the restriction of dataset makes the fine-tuning biased toward the trained corpus.

With the availability of UMLS, Snomed and other NER/RE datasets we were able to create a system to include the new and old NLP techniques to improve the speed and performance of NER/RE models.

Text2Table is a two part package for fast and reliable clinical named entities and relation extraction. First part, LinearNER, includes a very quick and easy to deploy approach to extract named entites using a combination of database lookup (LevelDB), Conditional Random Field (CRF) classification, and Inverted Index (Apache Solr) approaches. The second part, AINER, detect named entities and relation extraction between the extracted named entites using deep learning transformers.

Public codes of the Text2table project will be released soon…

A paper will be submitted for this project soon…

ยซ
ยป