



A curated Awesome list of resources, tools, and research on extracting information from unstructured biomedical data and text.
Loading more......
A curated Awesome List focused on methods, tools, and resources for extracting structured information from unstructured biomedical data and text (e.g., clinical notes, scientific articles, biological data reports). It emphasizes freely available, publicly accessible, and actively maintained resources.
BioIE (Biomedical Information Extraction) covers techniques and resources for transforming unstructured or inconsistently structured biomedical, clinical, or biological data into structured information and, ultimately, knowledge. Typical sources include:
The field has evolved rapidly with the advent of language models such as BERT and modern LLMs (e.g., GPT‑3/4, LLaMA 2/3, Gemini), requiring adaptations of general NLP methods to biomedical-specific data and vocabularies.
The list is part of the broader Awesome ecosystem and is related to:
The repository organizes content into clearly defined sections:
Research Overviews
High-level and survey-style references that map the BioIE landscape, especially around LLMs in medicine.
Groups Active in the Field
Research labs, academic groups, and industry teams focusing on biomedical information extraction, text mining, and related NLP.
Organizations
Professional bodies, consortia, and initiatives relevant to biomedical text mining, clinical NLP, and structured knowledge extraction.
Journals and Events
Tutorials
Code Libraries
Tools, Platforms, and Services
Techniques and Models
Datasets A broad collection of datasets used to train and evaluate BioIE systems, including:
Ontologies and Controlled Vocabularies
Data Models
Credits