


Synthesize and segment tumors at scale.

Python 23 2 Updated Feb 28, 2025

Official repository of "Visual-RFT: Visual Reinforcement Fine-Tuning"

Python 834 30 Updated Mar 6, 2025
Python 993 131 Updated Oct 3, 2022
Python 61 4 Updated Feb 3, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,415 591 Updated Mar 4, 2025
Jupyter Notebook 64 2 Updated Jan 27, 2025

Solve Visual Understanding with Reinforced VLMs

Python 3,878 238 Updated Mar 6, 2025
7 Updated Feb 16, 2025

[ICLR2025] Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want

Python 66 2 Updated Jan 27, 2025

Official repo for the paper "HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation"

Python 311 44 Updated Feb 26, 2025

Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis"

Jupyter Notebook 14 Updated Feb 17, 2025

Knowledge Incorporated Explainable AI

Python 7 Updated Feb 17, 2025

GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI.

41 1 Updated Dec 17, 2024

[ECCV2024] I-MedSAM: Implicit Medical Image Segmentation with Segment Anything

Python 46 Updated Aug 1, 2024

The official repository for "One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts"

Python 177 13 Updated Jan 24, 2025

The official code for "AutoRG-Brain: Grounded Report Generation for Brain MRI".

Python 25 Updated Nov 18, 2024

VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models

Python 47 1 Updated Jul 13, 2024

LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation (ICLR 2025)

Python 28 1 Updated Feb 5, 2025
Python 6 1 Updated Jan 28, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 43,230 5,291 Updated Mar 6, 2025

Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

Python 38 2 Updated Jan 15, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

Python 7,179 553 Updated Feb 26, 2025

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Python 570 39 Updated Jan 7, 2024

Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

Python 160 8 Updated Jan 24, 2025

[ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine"

Python 264 18 Updated Feb 26, 2025

MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks

Python 21 1 Updated Dec 16, 2024

Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision

Python 32 1 Updated Oct 21, 2024

[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding

Python 147 10 Updated Jul 13, 2024

[T-PAMI] A curated list of self-supervised multimodal learning resources.

246 7 Updated Aug 16, 2024