ReLaGS: Relational Language Gaussian Splatting

CVPR 2025

Yaxu Xie* · Abdalla Arafa* · Alireza Javanmardi · Christen Millerdurai · Jia Cheng Hu · Shaoxiang Wang · Alain Pagani · Didier Stricker

^{*Equal contribution}

¹ German Research Center for Artificial Intelligence (DFKI) · ² RPTU University Kaiserslautern-Landau · ³ University of Modena and Reggio Emilia

😊 TL;DR

ReLaGS enables hierarchical 3D scene understanding with language guidance for Gaussian-splatted scenes. It provides open-vocabulary semantic segmentation, multi-level object hierarchies, and optional 3D scene graph generation—all without scene-specific training.

Note: Evaluation scripts will be published soon.

Relational Language Gaussian Splatting. We build a platform with multi-hierarchical language Gaussian field and open-vocabulary 3D scene graph, to support various tasks such as object selection via click, open vocabulary 3D object segmentation across semantic granularity, spatial relationship reasoning between objects and querying object with relation-guidance.

🧭 Framework Overview

ReLaGS combines three core components for unified 3D scene understanding:

Language-Distilled Gaussian Fields: Multi-level hierarchical semantic representations on 2D Gaussian splatting with language embeddings (scenes → objects → parts)
Multi-View Language Alignment: Robust aggregation of Vision Language Model features (CLIP/DINO) from multiple views into accurate 3D semantic embeddings via ray-tracing
Graph-Based Relationago wigggggl Reasoning: Two complementary approaches for understanding inter-object relationships:
- GNN-based (default): Graph Neural Network trained on Scan3R predicts 27 relationship types, scene-agnostic and no API keys needed
- VLM+SoM (alternative): ChatGPT-powered Set-of-Marks prompting for flexible, custom relationship definitions

For technical details, see the paper.

Installation

Requirements: Python 3.10.13, PyTorch 2.2.0, CUDA 11.8, GPU with >20GB VRAM

Step 1: Clone and create conda environment

git clone https://github.com/dfki-av/ReLaGS.git
cd ReLaGS
conda env create -f environment.yml
conda activate ReLaGS

# install additional dependencies
pip install pyg_lib torch_scatter torch_cluster -f https://data.pyg.org/whl/torch-2.2.0+cu118.html
python scripts/setup_dependencies.py build_ext

Step 3: Verify installation

python -c "from gaussian_renderer import render; print('✓ ReLaGS installed successfully')"

Datasets

Download Example Data

We provide LERF example scenes with pre-trained checkpoints and extracted semantics on HuggingFace:

Download from HuggingFace

Original datasets:

Prepare Your Own Data (COLMAP Format)

For custom datasets, ReLaGS requires multi-view RGB images and camera poses via COLMAP:

mkdir -p my_scene/images
cp /path/to/your/images/*.jpg my_scene/images/

# Preprocess with COLMAP
python convert.py --source_path my_scene --camera OPENCV --resize

Output structure:

my_scene/
├── images/
├── distorted/sparse/0/  # COLMAP reconstruction
│   ├── cameras.bin
│   ├── images.bin
│   └── points3D.bin
└── images_resized/

Extract 2D Semantic Features

To use language-guided semantics (same preprocessing as LangSplat), extract 2D CLIP features and SAM segmentation masks:

python scripts/image_encoding.py --source_path my_scene

Optional arguments:

--save_dir – output directory (default: language_features/)
--resolution – rescale images to specific width (default: auto-downscale if >1080p)
--sam_ckpt_path – SAM checkpoint path (default: ckpts/sam_vit_h_4b8939.pth)

Outputs:

my_scene/language_features/
├── {image_name}_f.npy  # CLIP embeddings (300 features × 512-dim)
└── {image_name}_s.npy  # SAM segmentation maps

Note: Download SAM checkpoint from Meta Research if not present.

Optional - Extract 2D Scene Graphs with SoM

To extract relational structure from images before lifting to 3D, optionally generate 2D scene graphs using ChatGPT Set-of-Marks prompting:

Generate SoM annotations with GPT
Lift 2D annotations to 3D scene graph

See scene_graph/som/README.md for complete setup, usage instructions, and advanced options including custom relationship types and embedding conversion.

Training

Step 1: Prepare 2D Gaussian Splatting Scene

Follow the official 2D Gaussian Splatting implementation to train a scene:

cd /path/to/2d-gaussian-splatting
python train.py \
  --source_path /path/to/colmap/preprocessed/scene \
  --model_path /path/to/output \
  --iterations 30000

Step 2: Run ReLaGS Pipeline

The easiest way is to use the provided shell scripts:

# For geometry-only reconstruction
bash scripts/run_lerf.sh configs/lerf.yml /path/to/scene

# For full pipeline with scene graphs
bash scripts/run_scannet.sh configs/scannet.yml /path/to/scene

Or run key steps manually:

# Initial render and pruning
python max_weight_pruning.py -m output_dir

# Hierarchical partitioning (2 passes)
python sp_partition.py -m output_dir -a
python merge_proj.py -m output_dir
python sp_partition.py -m output_dir -k

# Edge reweighting and final mesh
python graph_weight.py -m output_dir --config configs/scannet.yml

Relational Reasoning

ReLaGS supports two approaches for 3D scene graph generation:

GNN-Based (Default)

Predicts 27 relationship types using a Graph Neural Network trained on Scan3R. Scene-agnostic and requires no API keys.

python predict_3d_scene_graph.py --model_path output_dir --root_pred output_dir

Outputs: jina_edges_predicted.pt, combined_edges_predicted.pt

Requires: Download graph_transformer_512_80.pth from HuggingFace and place in ckpts/

(Optional) To train GNN on 3DSSG dataset, see scene_graph/data_anno/README.md for training data preprocessing, then run:

python scene_graph/train_gnn.py --output_dir ckpts

VLM+Set-of-Marks (Alternative)

ChatGPT-powered approach for flexible, custom relationship definitions. Requires OpenAI API key.

See scene_graph/som/README.md for setup and usage.

Citation

If you find ReLaGS useful for your research, please cite:

@inproceedings{xiearafa2026relags,
  title     = {ReLaGS: Relational Language Gaussian Splatting},
  author    = {Xie, Yaxu and Arafa, Abdalla and Javanmardi, Alireza and Millerdurai, Christen and Hu, Jia Cheng and Wang, Shaoxiang and Pagani, Alain and Stricker, Didier},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year      = {2026}
}

Acknowledgements

This work has been partially funded by the EU projects dAIEDGE (GA Nr 101120726) and LUMINOUS (GA Nr 101135724).

We thank the contributors to THGS, 3D Gaussian Splatting, 2D Gaussian Splatting, and Segment Anything for their foundational work.

License

This project is licensed under the AGPL-3.0 License. See LICENSE for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ReLaGS: Relational Language Gaussian Splatting

😊 TL;DR

🧭 Framework Overview

Installation

Datasets

Download Example Data

Prepare Your Own Data (COLMAP Format)

Extract 2D Semantic Features

Optional - Extract 2D Scene Graphs with SoM

Training

Step 1: Prepare 2D Gaussian Splatting Scene

Step 2: Run ReLaGS Pipeline

Relational Reasoning

GNN-Based (Default)

VLM+Set-of-Marks (Alternative)

Citation

Acknowledgements

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
arguments		arguments
assets/images		assets/images
configs		configs
ext/spt		ext/spt
gaussian_renderer		gaussian_renderer
lpipsPyTorch		lpipsPyTorch
scene		scene
scene_graph		scene_graph
scripts		scripts
submodules		submodules
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
convert.py		convert.py
environment.yml		environment.yml
environment_base.yml		environment_base.yml
graph_weight.py		graph_weight.py
max_weight_pruning.py		max_weight_pruning.py
merge_proj.py		merge_proj.py
metrics.py		metrics.py
nag_data.py		nag_data.py
predict_3d_scene_graph.py		predict_3d_scene_graph.py
render.py		render.py
sp_partition.py		sp_partition.py

Folders and files

Latest commit

History

Repository files navigation

ReLaGS: Relational Language Gaussian Splatting

😊 TL;DR

🧭 Framework Overview

Installation

Datasets

Download Example Data

Prepare Your Own Data (COLMAP Format)

Extract 2D Semantic Features

Optional - Extract 2D Scene Graphs with SoM

Training

Step 1: Prepare 2D Gaussian Splatting Scene

Step 2: Run ReLaGS Pipeline

Relational Reasoning

GNN-Based (Default)

VLM+Set-of-Marks (Alternative)

Citation

Acknowledgements

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages