Skip to content
@VisionXLab

VisionXLab

VisionXLab at Shanghai Jiao Tong University, led by Prof. Xue Yang.

Pinned Loading

  1. h2rbox-mmrotate h2rbox-mmrotate Public

    [ICLR'23] PyTorch Implementation for H2RBox

    Python 106 11

  2. mllm-mmrotate mllm-mmrotate Public

    [IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.

    Jupyter Notebook 88 6

  3. point2rbox-v2 point2rbox-v2 Public

    [CVPR'25] Official repo of "Point2RBox-v2:Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances"

    Python 40 3

  4. whollywood whollywood Public

    [TPAMI] Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection

    Jupyter Notebook 11

  5. LRS-VQA LRS-VQA Public

    [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning

    Python 44 1

  6. CrossEarth CrossEarth Public

    [TPAMI 2025] CrossEarth: Geospatial Vision Foundation Model for Cross-Domain Generalization in Remote Sensing Semantic Segmentation

    Python 172 9

Repositories

Showing 10 of 28 repositories
  • RSCoVLM Public

    [Remote Sensing 2026] Co-Training Vision Language Models for Remote Sensing Multi-task Learning

    VisionXLab/RSCoVLM’s past year of commit activity
    Python 17 0 0 0 Updated Jan 9, 2026
  • DVGBench Public

    [ISPRS2026] DVGBench: Implicit-to-Explicit Visual Grounding Benchmark in UAV Imagery with Large Vision-Language Models

    VisionXLab/DVGBench’s past year of commit activity
    6 0 1 0 Updated Jan 8, 2026
  • AirSpatialBot Public

    [TGRS'25] AirSpatialBot: A Spatially-Aware Aerial Agent for Fine-Grained Vehicle Attribute Recognization and Retrieval

    VisionXLab/AirSpatialBot’s past year of commit activity
    Python 26 1 1 0 Updated Jan 6, 2026
  • avi-math Public

    [ISPRS'25] Multimodal Mathematical Reasoning Embedded in Aerial Vehicle Imagery: Benchmarking, Analysis, and Exploration

    VisionXLab/avi-math’s past year of commit activity
    Python 13 1 0 0 Updated Jan 4, 2026
  • CastDet Public

    [ECCV'24/IJCV'26] Code repo for "Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning"

    VisionXLab/CastDet’s past year of commit activity
    Python 68 Apache-2.0 4 4 0 Updated Jan 1, 2026
  • Awesome-RS-VL-Data Public

    Awesome Remote Sensing Vision-Language Datasets

    VisionXLab/Awesome-RS-VL-Data’s past year of commit activity
    20 MIT 0 125 0 Updated Jan 1, 2026
  • CrossEarth Public

    [TPAMI 2025] CrossEarth: Geospatial Vision Foundation Model for Cross-Domain Generalization in Remote Sensing Semantic Segmentation

    VisionXLab/CrossEarth’s past year of commit activity
    Python 172 MIT 9 5 0 Updated Dec 22, 2025
  • ProCLIP Public

    Official PyTorch implementation of ProCLIP: Progressive Vision-Language Alignment via LLM-based Embedder

    VisionXLab/ProCLIP’s past year of commit activity
    Python 21 2 1 0 Updated Dec 4, 2025
  • Point2RBox-v3 Public

    Point2RBox-v3: Self-Bootstrapping from Point Annotations via Integrated Pseudo-Label Refinement and Utilization

    VisionXLab/Point2RBox-v3’s past year of commit activity
    Python 12 0 0 0 Updated Nov 19, 2025
  • OF-Diff Public
    VisionXLab/OF-Diff’s past year of commit activity
    Python 12 0 1 0 Updated Sep 26, 2025