Choosing labels based on their names for classification models by jcchr · Pull Request #705 · open-edge-platform/geti-sdk

jcchr · 2025-12-05T10:09:16Z

Summary

Model API - used by geti-sdk to perform inference, returns results based on a list of labels which is sorted alphabetically - whereas geti-sdk works on a list ordered like it is in a model itself. In some cases order of labels might be different in both libraries - which leads to problems like the one described here : open-edge-platform/geti#1578

Correction is to establish labels returned by model based on their names instead of using their location on a list. Change should be done for classification models only - as a change related to sorting labels was introduced only for this kind of models.

How to test

Execute inference on a model - checking first if labels in a model (check the .xml file with models - list of labels is located at the end of it) are not sorted alphabetically. Check if results of inference are ok.

Checklist

I have tested my changes manually.
I have added tests to cover my changes.

License

I submit my code changes under the same Apache License that covers the project.
Feel free to contact the maintainers if that's a concern.
I have updated the license header for each file (see an example below).

# Copyright (C) 2025 Intel Corporation
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions
# and limitations under the License.

Copilot

Pull request overview

This PR fixes a label ordering mismatch between geti-sdk and ModelAPI. ModelAPI returns inference results with labels sorted alphabetically, while geti-sdk uses the model's original label order. This inconsistency caused incorrect inference results when the orders differed.

Key changes:

Modified label processing in ResultsToPredictionConverter to sort labels alphabetically before mapping inference results

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

leoll2

Model API - used by geti-sdk to perform inference, returns results based on a list of labels which is sorted alphabetically

If confirmed, this is a rather strange behavior. Can you provide a reference to the ModelAPI code that sorts the labels alphabetically?

leoll2 · 2025-12-16T08:54:28Z

tests/pre-merge/unit/deployment/test_prediction_converter.py

-                ["foo bar", "foo_bar"],
-                ["foo_bar", "foo_bar"],
-                {"label_ids": ["1", "2"], "labels": ["foo_bar", "foo_bar"]},
+                ["foo bar1", "foo_bar2"],
+                ["foo_bar1", "foo_bar2"],
+                {"label_ids": ["1", "2"], "labels": ["foo_bar1", "foo_bar2"]},


The purpose of this test case was to check an edge case where the label names are very similar and conflict after escaping (both foo bar and foo_bar become foo_bar). The new label names foo bar1 and foo_bar2 do not trigger the intended edge case.

leoll2 · 2025-12-16T08:56:29Z

...k/deployment/predictions_postprocessing/results_converter/results_to_prediction_converter.py

        for label in inference_results.top_labels:
            label_idx, label_name, label_prob = label
-            scored_label = ScoredLabel.from_label(label=self.get_label_by_idx(label_idx), probability=label_prob)
+            scored_label = ScoredLabel.from_label(label=self.get_label_by_str(label_name), probability=label_prob)


What's wrong with get_label_by_idx?

I remember that MAPI does not handle hierarchical classification in a consistent way, so we could not rely on the label indices. See related issue: open-edge-platform/geti#402.

Therefore, it is correct to use the label name for classification tasks. However, this means that we might get ambigous results if label are poorly named. For example, foo bar and foo_bar, spaces are not supported by either MAPI or OV AFAIK, resulting in a name collision.

maxxgx · 2026-01-02T08:12:57Z

...k/deployment/predictions_postprocessing/results_converter/results_to_prediction_converter.py

        for label in inference_results.top_labels:
            label_idx, label_name, label_prob = label
-            scored_label = ScoredLabel.from_label(label=self.get_label_by_idx(label_idx), probability=label_prob)
+            scored_label = ScoredLabel.from_label(label=self.get_label_by_str(label_name), probability=label_prob)


I remember that MAPI does not handle hierarchical classification in a consistent way, so we could not rely on the label indices. See related issue: open-edge-platform/geti#402.

Therefore, it is correct to use the label name for classification tasks. However, this means that we might get ambigous results if label are poorly named. For example, foo bar and foo_bar, spaces are not supported by either MAPI or OV AFAIK, resulting in a name collision.

sorting labels alphabetically to preserve compatibility with modelapi

45977ea

Copilot AI review requested due to automatic review settings December 5, 2025 10:09

Copilot AI reviewed Dec 5, 2025

View reviewed changes

jcchr mentioned this pull request Dec 5, 2025

Hierarchical classification model incorrect labels open-edge-platform/geti#1578

Closed

jcchr added 4 commits December 5, 2025 13:31

unit tests correction

e35ea3c

handle two scenarios of sorting labels for classification models

7edbab2

readability of a code

fb048cc

styles correction

99c4896

jcchr changed the title ~~Sorting labels alphabetically to preserve compatibility with modelapi~~ Choosing labels based on their names for classification models Dec 16, 2025

leoll2 reviewed Dec 16, 2025

View reviewed changes

leoll2 requested a review from maxxgx December 16, 2025 08:58

maxxgx approved these changes Jan 2, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Choosing labels based on their names for classification models#705

Choosing labels based on their names for classification models#705
jcchr wants to merge 5 commits intoreleases/v2.13.xfrom
jchrapko/sorting_labels

jcchr commented Dec 5, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

leoll2 left a comment

Uh oh!

leoll2 Dec 16, 2025

Uh oh!

leoll2 Dec 16, 2025

Uh oh!

maxxgx Jan 2, 2026

Uh oh!

maxxgx Jan 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

jcchr commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

How to test

Checklist

License

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

leoll2 left a comment

Choose a reason for hiding this comment

Uh oh!

leoll2 Dec 16, 2025

Choose a reason for hiding this comment

Uh oh!

leoll2 Dec 16, 2025

Choose a reason for hiding this comment

Uh oh!

maxxgx Jan 2, 2026

Choose a reason for hiding this comment

Uh oh!

maxxgx Jan 2, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

jcchr commented Dec 5, 2025 •

edited

Loading