fenominal: Phenomenal text mining for disease and phenotype concepts

fenominal

Fenominal is a Java 17 library for text-mining Human Phenotype Ontology (HPO) terms from text. Fenominal is a multimodule project with a core module with the text-mining logic and a cli module for use from the command line. A graphical user interface (GUI) is available in a separate project called Fenominal-GUI.

Fenominal implements the T-BLAT algorithm, which is inspired by the BLAST algorithm for biosequence alignment. T-BLAT screens texts for potential matches on the basis of matching k-mer counts and scores candidates based on conformance to typical patterns of spelling errors derived from 2.9 million clinical notes. Fenominal also implements exact matching but matches also on multitoken HPO labels or synonyms that are permuted.

Fenominal does not rely on external APIs and can be used in settings in which a firewall does not permit applications to access the internet. Fenominal is intended for use as a software library and the CLI module only contains simple demo applications.