zero-shot accuracy on scannet200

Hi, have you ever tested the zero-shot accuracy on ScanNet200? That is, instead of the class-agnostic mask proposals predicted by Mask3D, feed the ground-truth instances into your mask feature computation module to get the mask features, take the dot product with the text embeddings from the CLIP text encoder, and select the index of the maximum score as the predicted label for each ground-truth instance. I just wonder how accurate CLIP is.
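The evaluation described above could be sketched roughly as follows. This is a minimal NumPy sketch, not the repository's code: `zero_shot_labels` and `accuracy` are hypothetical helper names, the input arrays stand in for the per-instance mask features and the CLIP text embeddings of the class names, and the L2 normalization before the dot product is an assumption (it makes the score a cosine similarity).

```python
import numpy as np


def zero_shot_labels(mask_features: np.ndarray, text_embeddings: np.ndarray) -> np.ndarray:
    """Return, for each instance, the index of the best-matching class.

    mask_features:   (num_instances, dim) features of ground-truth instances
    text_embeddings: (num_classes, dim) text embeddings of the class names
    """
    # Assumption: normalize both sides so the dot product is a cosine similarity.
    m = mask_features / np.linalg.norm(mask_features, axis=1, keepdims=True)
    t = text_embeddings / np.linalg.norm(text_embeddings, axis=1, keepdims=True)
    similarity = m @ t.T  # shape: (num_instances, num_classes)
    return similarity.argmax(axis=1)


def accuracy(predicted: np.ndarray, ground_truth: np.ndarray) -> float:
    """Fraction of instances whose predicted label equals the ground-truth label."""
    return float((predicted == ground_truth).mean())


# Toy stand-in data: 3 classes, 3 instances, 3-dimensional features.
toy_text = np.eye(3)
toy_masks = np.array([[0.9, 0.1, 0.0],
                      [0.0, 0.2, 0.8],
                      [0.1, 0.7, 0.2]])
toy_pred = zero_shot_labels(toy_masks, toy_text)
toy_acc = accuracy(toy_pred, np.array([0, 2, 2]))
```

With real data, `mask_features` would come from running the mask feature module on the ground-truth instance masks, and the ground-truth labels would be the ScanNet200 class indices of those instances.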

zero-shot accuracy on scannet200 #27
