Official repository of "Visual-RFT: Visual Reinforcement Fine-Tuning"
Qwen2.5-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.
Solve Visual Understanding with Reinforced VLMs
[ICLR2025] Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Official repo for the paper "HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation"
Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis"
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI.
[ECCV2024] I-MedSAM: Implicit Medical Image Segmentation with Segment Anything
The official repository for "One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts"
The official codes for "AutoRG-Brain: Grounded Report Generation for Brain MRI".
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation (ICLR 2025)
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o's performance.
PyTorch implementation of "V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs"
Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding
[ICLR 2025] The official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine"
MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks
Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
[T-PAMI] A curated list of self-supervised multimodal learning resources.