VisionXLab

VisionXLab at Shanghai Jiao Tong University, led by Prof. Xue Yang.

Pinned

  1. h2rbox-mmrotate Public

    [ICLR'23] PyTorch Implementation for H2RBox

    Python 106 11

  2. mllm-mmrotate Public

    [IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.

    Jupyter Notebook 91 6

  3. point2rbox-v2 Public

    [CVPR'25] Official repo of "Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances"

    Python 40 4

  4. whollywood Public

    [TPAMI] Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection

    Jupyter Notebook 11

  5. LRS-VQA Public

    [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning

    Python 47 1

  6. CrossEarth Public

    [TPAMI 2025] CrossEarth: Geospatial Vision Foundation Model for Cross-Domain Generalization in Remote Sensing Semantic Segmentation

    Python 175 9

Repositories

Showing 10 of 31 repositories
  • Awesome-RS-VL-Data Public

    Awesome Remote Sensing Vision-Language Datasets

    38 MIT 1 125 0 Updated Feb 9, 2026
  • Rise-Video Public
    Python 21 0 2 0 Updated Feb 7, 2026
  • OF-Diff Public

    [ICLR'26] OF-Diff: Object Fidelity Diffusion for Remote Sensing Image Generation

    Python 17 0 2 0 Updated Feb 6, 2026
  • SPWOOD Public

    SPWOOD: Sparse Partial Weakly-Supervised Oriented Object Detection

    0 0 1 0 Updated Feb 3, 2026
  • VisionXLab_LaTeX_Template Public
    TeX 7 0 0 0 Updated Feb 3, 2026
  • SpaCE-10 Public

    [ICLR 2026] SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial Intelligence

    Python 16 2 1 0 Updated Jan 26, 2026
  • mllm-mmrotate Public

    [IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.

    Jupyter Notebook 91 6 0 1 Updated Jan 21, 2026
  • RSCoVLM Public

    [Remote Sensing 2026] Co-Training Vision Language Models for Remote Sensing Multi-task Learning

    Python 21 0 0 0 Updated Jan 21, 2026
  • DVGBench Public

    [ISPRS2026] DVGBench: Implicit-to-Explicit Visual Grounding Benchmark in UAV Imagery with Large Vision-Language Models

    12 0 1 0 Updated Jan 14, 2026
  • AirSpatialBot Public

    [TGRS'25] AirSpatialBot: A Spatially-Aware Aerial Agent for Fine-Grained Vehicle Attribute Recognization and Retrieval

    Python 29 1 1 0 Updated Jan 6, 2026