BIROn - Birkbeck Institutional Research Online

    A text-mining system for extracting metabolic reactions from full-text articles

    Czarnecki, Jan M. and Nobeli, Irene and Smith, Adrian M.L. and Shepherd, Adrian J. (2012) A text-mining system for extracting metabolic reactions from full-text articles. BMC Bioinformatics 13 (172), ISSN 1471-2105.

    1471-2105-13-172.pdf - Published Version of Record
    Available under License Creative Commons Attribution.

    Abstract

    Background: Biological text-mining research increasingly focuses on extracting complex relationships relevant to the construction and curation of biological networks and pathways. However, one important category of pathway, the metabolic pathway, has been largely neglected. Here we present a relatively simple method for extracting metabolic reaction information from free text. The method scores different permutations of assigned entities (enzymes and metabolites) within a given sentence based on the presence and location of stemmed keywords. It extends an approach that has proved effective for extracting protein–protein interactions.

    Results: When evaluated on a set of manually curated metabolic pathways using standard performance criteria, our method performs surprisingly well. Precision and recall rates are comparable to those previously achieved for the well-known protein–protein interaction extraction task.

    Conclusions: We conclude that automated metabolic pathway construction is more tractable than has often been assumed, and that (as in the case of protein–protein interaction extraction) relatively simple text-mining approaches can prove surprisingly effective. It is hoped that these results will provide an impetus to further research and act as a useful benchmark for judging the performance of more sophisticated methods that are yet to be developed.
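
    The idea of scoring entity permutations by keyword presence and location can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the keyword stems, the prefix-based stem matching, the weights, and the function names `is_keyword`, `score_assignment`, and `best_assignment` are all hypothetical. Entities are assumed to be already tagged and are passed in as token indices.

```python
from itertools import permutations

# Hypothetical keyword stems; the paper's actual keyword list is not given here.
KEYWORD_STEMS = ("catalys", "catalyz", "conver", "produc")


def is_keyword(token):
    """Crude stand-in for stemming: match a token by keyword-stem prefix."""
    return token.lower().startswith(KEYWORD_STEMS)


def score_assignment(tokens, enzyme, substrate, product):
    """Score one (enzyme, substrate, product) role assignment.

    All weights below are illustrative assumptions, not values from the paper.
    """
    keyword_positions = [i for i, tok in enumerate(tokens) if is_keyword(tok)]
    score = 0
    if keyword_positions:
        score += 1  # presence: the sentence contains a reaction keyword
    lo, hi = sorted((enzyme, substrate))
    # location: keywords lying between the enzyme and its proposed substrate
    score += sum(1 for i in keyword_positions if lo < i < hi)
    if substrate < product:
        score += 1  # assumed preference: substrate mentioned before product
    return score


def best_assignment(tokens, enzymes, metabolites):
    """Try every enzyme with every ordered metabolite pair; keep the best."""
    candidates = [
        (score_assignment(tokens, e, s, p), e, s, p)
        for e in enzymes
        for s, p in permutations(metabolites, 2)
    ]
    return max(candidates, key=lambda c: c[0])


# Example sentence with pre-tagged entity positions (token indices).
tokens = "Hexokinase catalyses the conversion of glucose to glucose-6-phosphate".split()
result = best_assignment(tokens, enzymes=[0], metabolites=[5, 7])
```

    In this sketch, `result` holds the top-scoring tuple of (score, enzyme index, substrate index, product index). A real system would replace the prefix match with a proper stemmer and use keyword lists and weights derived from annotated data.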

    Metadata

    Item Type: Article
    School: Birkbeck Schools and Departments > School of Science > Biological Sciences
    Research Centre: Bioinformatics, Bloomsbury Centre for, Structural Molecular Biology, Institute of (ISMB)
    Depositing User: Adrian Shepherd
    Date Deposited: 16 Jan 2013 10:32
    Last Modified: 15 Dec 2019 06:34
    URI: http://eprints.bbk.ac.uk/id/eprint/5959
