Research
- Research Topics
- Cell Biology and Tumor Biology
- Stem Cells and Cancer
- Inflammatory Stress in Stem Cells
- Experimental Hematology
- Molecular Embryology
- Signal Transduction and Growth Control
- Epigenetics
- Redox Regulation
- Vascular Oncology and Metastasis
- Clinical Neurobiology
- Molecular Neurogenetics
- Molecular Neurobiology
- Mechanisms Regulating Gene Expression
- Molecular Biology of Centrosomes and Cilia
- Dermato-Oncology
- Pediatric Leukemia
- Tumour Metabolism and Microenvironment
- Personalized Medical Oncology
- Molecular Hematology - Oncology
- Cancer Progression and Metastasis
- Translational Surgical Oncology
- Neuronal Signaling and Morphogenesis
- Cell Signaling and Metabolism
- Cell Fate Engineering and Disease Modeling
- Cancer Drug Development
- Cell Morphogenesis and Signal Transduction
- Functional and Structural Genomics
- Molecular Genome Analysis
- Molecular Genetics
- Pediatric Neurooncology
- Cancer Genome Research
- Chromatin Networks
- Functional Genome Analysis
- Theoretical Systems Biology
- Neuroblastoma Genomics
- Signaling and Functional Genomics
- Signal Transduction in Cancer and Metabolism
- RNA-Protein Complexes and Cell Proliferation
- Systems Biology of Signal Transduction
- Areas of Interest
- Advancement of clinical proteomics for systems medicine
- Bridging from the single cell to the cell population – Epo-induced cellular responses and erythroleukemia
- Deciphering tumor microenvironment interactions determining lung cancer development
- Mechanisms controlling the compensation of liver injury and towards model-based biomarkers for early detection of liver cancer
- Application of dynamic pathway modelling for personalized medicine
- Group Members
- Publications
- Open Positions
- Funding
- Teaching
- Areas of Interest
- Molecular thoracic Oncology
- Proteomics of Stem Cells and Cancer
- Computational Genomics and System Genetics
- Applied Functional Genomics
- Applied Bioinformatics
- Translational Medical Oncology
- Metabolic crosstalk in cancer
- Pediatric Glioma Research
- Cancer Epigenomics
- Translational Pediatric Sarcoma Research
- Artificial Intelligence in Oncology
- Mechanisms of Genomic Variation and Data Science
- Neuropathology
- Pediatric Oncology
- Neurooncology
- Somatic Evolution and Early Detection
- Translational Control and Metabolism
- Soft-Tissue Sarcoma
- Precision Sarcoma Research
- Brain Mosaicism and Tumorigenesis
- Mechanisms of Genome Control
- Translational Gastrointestinal Oncology and Preclinical Models
- Translational Lymphoma Research
- Mechanisms of Leukemogenesis
- Genome Instability in Tumors
- Developmental Origins of Pediatric Cancer
- Brain Tumor Translational Targets
- Translational Functional Cancer Genomics
- Regulatory Genomics and Cancer Evolution
- SPRINT
- Cancer Risk Factors and Prevention
- Cancer Epidemiology
- Biostatistics
- Clinical Epidemiology and Aging Research
- Health Economics
- Physical Activity, Prevention and Cancer
- Primary Cancer Prevention
- Personalized Early Detection of Prostate Cancer
- Digital prevention, diagnostics and therapy guidance
- Policy and Implementation Research for Cancer Prevention
- Tumorigenesis and molecular cancer prevention
- Genomic Epidemiology
- Cancer Survivorship
- Immunology, Infection and Cancer
- Structural Biology of Infection and Immunity
- Cellular Immunology
- B Cell Immunology
- Immune Diversity
- Immunoproteomics
- Personalized Immunotherapy
- mRNA Cancer Immunotherapies
- Tumor Immunology and Tumor Immunotherapy
- Infections and Cancer Epidemiology
- Pathogenesis of Virus-Associated Tumors
- Immunotherapy and Immunoprevention
- Virus-associated Carcinogenesis
- Chronic Inflammation and Cancer
- Microbiome and Cancer
- Molecular Oncology of Gastrointestinal Tumors
- Applied Tumor Immunity
- Neuroimmunology and Brain Tumor Immunology
- Applied Tumor Biology
- Virotherapy
- Adaptive Immunity and Lymphoma
- Dermal Oncoimmunology
- Immune Regulation in Cancer
- Systems Immunology and Single Cell Biology
- Pediatric Immuno-Oncology
- Epithelium Microbiome lnteractions
- Experimental Hepatology, Inflammation and Cancer
- GMP & T Cell Therapy
- Tumorvirus-specific Vaccination Strategies
- Mammalian Cell Cycle Control Mechanisms
- Molecular Therapy of Virus-Associated Cancers
- DNA Vectors
- Episomal-Persistent DNA in Cancer- and Chronic Diseases
- Immune Monitoring
- News
- Imaging and Radiooncology
- Radiology
- Research
- Computational Radiology Research Group
- Contrast Agents In Radiology Research Group
- Neuro-Oncologic Imaging Research Group
- Radiological Early Response Assessment Of Modern Cancer Therapies
- Imaging In Monoclonal Plasma Cell Disorders
- 7 Tesla MRI - Novel Imaging Biomarkers
- Functional Imaging
- Visualization And Forensic Imaging
- PET/MRI
- Dual- and Multienergy CT
- Radiomics Research Group
- Prostate Research Group
- Bone marrow
- Musculoskeletal Imaging
- Microstructural Imaging Research Group
- Staff
- Patients
- Research
- Medical Physics in Radiology
- X-Ray Imaging and Computed Tomography
- Federated Information Systems
- Translational Molecular Imaging
- Medical Physics in Radiation Oncology
- Biomedical Physics in Radiation Oncology
- Intelligent Medical Systems
- Medical Image Computing
- Radiooncology - Radiobiology
- Smart Technologies for Tumor Therapy
- Team
- Research
- Microrobots and Miniaturize Devices for Minimally-invasive Surgery
- Magnetic localization and sensing for biomedical devices
- Nanorobots for Targeted Delivery in Deep Biological Tissues
- 3D Additive Manufacturing of Soft Materials as In Vitro Tumor Models
- Surgical Simulation on Cyber-physical Organ Models
- News
- Vacancies
- Radiation Oncology
- Molecular Radiooncology
- Nuclear Medicine
- Translational Radiation Oncology
- Translational Radiotheranostics
- Interactive Machine Learning
- Intelligent Systems and Robotics in Urology
- Multiparametric methods for early detection of prostate cancer
- Translational Molecular Imaging in Oncologic Therapy Monitoring
- Radiology
- Cell Biology and Tumor Biology
- Research Groups A-Z
- Junior Research Groups
- Core Facilities
- News
- List of Core Facilities
- Antibodies
- Cellular Tools
- Center for Preclinical Research
- Central Library
- Chemical Biology
- Dieter Morszeck Biorepository
- Electron Microscopy
- Flow Cytometry
- Information Technology ITCF
- Light Microscopy
- Metabolomics
- Microarray
- Microbiological Diagnostics
- Next Generation Sequencing
- Omics IT and Data Management
- Proteomics
- Radiopharmaceuticals and Preclinical Trials
- Single-cell Open Lab
- Small Animal Imaging
- Transgenic Service
- Tumor Models
- OMERO@DKFZ
- List of Technologies
- DKFZ Core Facilities Publication Policy
- Enabling Technology
- Data Science @ DKFZ
- INFORM
- Baden-Württemberg Cancer Registry
- Cooperations & Networks
- National Cooperations
- International Cooperations
- Cooperational Research Program with Israel: DKFZ - MOST in Cancer Research
- Program
- Members of the Program Committee
- Call
- Publication Database
- German-Israeli Cancer Research Schools
- Archive
- Heidelberg - Israel, Science and Culture
- Symposium 40 Years of German-Israeli Cooperation
- 35th Anniversary Symposium
- 34th Meeting of the DKFZ-MOST Program
- 40th Anniversary Publication
- 30th Anniversary Publication
- 20th Anniversary Publication
- Flyer - The Cancer Cooperation Program
- List Publications 1976-2004
- Highlight-Projects
- Cooperational Research Program with Israel: DKFZ - MOST in Cancer Research
- Cooperations with industrial companies
- DKFZ PostDoc Network
- Cross Program Topic RNA@DKFZ
- Cross Program Topic Epigenetics@dkfz
- Cross Program Topic Single Cell Sequencing
- WHO Collaborating Centers
- DKFZ Site Dresden
- Health + Life Science Alliance Heidelberg Mannheim
Large Language Models Encode Clinical Knowledge
Register for the online event here.
The talk will not be recorded.
Large language models (LLMs) have demonstrated impressive capabilities in natural language understanding and generation, but the quality bar for medical and clinical applications is high. Today, attempts to assess models' clinical knowledge typically rely on automated evaluations on limited benchmarks. There is no standard to evaluate model predictions and reasoning across a breadth of tasks. To address this, we present MultiMedQA, a benchmark combining six existing open question answering datasets spanning professional medical exams, research, and consumer queries; and HealthSearchQA, a new free-response dataset of medical questions searched online. We propose a framework for human evaluation of model answers along multiple axes including factuality, precision, possible harm, and bias. In addition, we evaluate PaLM (a 540-billion parameter LLM) and its instruction-tuned variant, Flan-PaLM, on MultiMedQA. Using a combination of prompting strategies, Flan-PaLM achieves state-of-the-art accuracy on every MultiMedQA multiple-choice dataset (MedQA, MedMCQA, PubMedQA, MMLU clinical topics), including 67.6% accuracy on MedQA (US Medical License Exam questions), surpassing prior state-of-the-art by over 17%. However, human evaluation reveals key gaps in Flan-PaLM responses. To resolve this we introduce instruction prompt tuning, a parameter-efficient approach for aligning LLMs to new domains using a few exemplars. The resulting model, Med-PaLM, performs encouragingly, but remains inferior to clinicians. We show that comprehension, recall of knowledge, and medical reasoning improve with model scale and instruction prompt tuning, suggesting the potential utility of LLMs in medicine. Our human evaluations reveal important limitations of today's models, reinforcing the importance of both evaluation frameworks and method development in creating safe, helpful LLMs for clinical applications. [Link to preprint]
Biosketch Shekoofeh Azizi
Shekoofeh Azizi is a senior research scientist at Google Research, Brain Team and she completed her Ph.D. at the University of British Columbia (UBC), Vancouver, Canada in 2018. Her research concentrated on developing simple and efficient machine learning algorithms that are broadly applicable to a range of computer vision applications. Specifically, over the past few years, she has been focused on developing methods to accelerate the translation of AI solutions to clinical impact. Her work has been covered in various media outlets and recognized by multiple awards including the Governor General's Canada Academic Gold Medal for her contribution in improving diagnostic ultrasound. [Read more]