We study how feature design affects binary floor-vs-non-floor segmentation when using a single classifier (SVM) on a COCO-annotated dataset. The experiments move from minimal features (raw color) to richer representations (spatial layout, region summaries, texture), so we can see what each layer of structure contributes.
Rather than jumping to deep learning, we wanted to see how far classical CV + SVM can go when we systematically add structure to the input: first color only, then color + position, then region-level color, and finally texture (HOG). The same pipeline and dataset are used across experiments so comparisons are fair.
We use the Floor Segmentation dataset (Roboflow):
- Roboflow: Floor Segmentation (universe.roboflow.com)
- Google Drive (short): Drive link
Details:
- Format: COCO segmentation
- Labels: Binary — floor (1) vs non-floor (0), with pixel-level masks
- Content: Indoor scenes, mixed lighting and textures
Download and point the notebooks to the train split (paths are set in each notebook).
We run four strategies, each in its own notebook. They share the same train/test protocol and evaluation; only the feature extraction and unit of prediction change.
Pixels are sampled at random; each sample is just (R, G, B). No position or context.
- Features: (R, G, B)
- Unit: Pixel
- Classifier: SVM
Result: weak and noisy; color alone is not enough and is sensitive to lighting.
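A minimal sketch of Strategy A, assuming scikit-learn; the random image and bottom-half mask here are toy stand-ins for a real image and its COCO-derived floor mask, and the sample count and SVM settings are illustrative, not the notebook's exact values:

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)

# Toy stand-ins for one image and its binary floor mask
# (in the notebooks these come from the COCO annotations).
H, W = 64, 64
image = rng.integers(0, 256, size=(H, W, 3), dtype=np.uint8)
mask = np.zeros((H, W), dtype=np.uint8)
mask[H // 2:, :] = 1  # pretend the bottom half is floor

# Sample pixels at random; each sample is just (R, G, B), no position
n_samples = 2000
ys = rng.integers(0, H, size=n_samples)
xs = rng.integers(0, W, size=n_samples)
X = image[ys, xs].astype(np.float32) / 255.0  # shape (n_samples, 3)
y = mask[ys, xs]

clf = SVC(kernel="rbf", C=1.0).fit(X, y)
```

Prediction is then run per pixel, which is what makes the output noisy: nearby pixels with similar colors but different labels get confused.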
Same pixel sampling, but we append normalized (x, y) coordinates so the model can use the pixel's position in the image.
- Features: (R, G, B, x, y)
- Unit: Pixel
- Classifier: SVM
Result: large gain over A; spatial layout is very informative (e.g. the floor tends to appear near the bottom of the frame). Still no texture.
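The feature change from A to B is small: append the pixel's normalized coordinates to its color. A sketch, again on toy data, with the [0, 1] normalization an assumption on our part (it keeps position on the same scale as the normalized color channels):

```python
import numpy as np

rng = np.random.default_rng(0)
H, W = 64, 64
image = rng.integers(0, 256, size=(H, W, 3), dtype=np.uint8)

n_samples = 2000
ys = rng.integers(0, H, size=n_samples)
xs = rng.integers(0, W, size=n_samples)

rgb = image[ys, xs].astype(np.float32) / 255.0
# Normalize coordinates to [0, 1] so position and color share a scale
xy = np.stack([xs / (W - 1), ys / (H - 1)], axis=1).astype(np.float32)
X = np.concatenate([rgb, xy], axis=1)  # shape (n_samples, 5)
```

The SVM training step is unchanged from Strategy A; only the feature matrix gains two columns.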
The image is tiled into fixed-size blocks; each block is one sample. Features are the mean RGB and mean (x, y) over the block.
- Features: (mean R, mean G, mean B, mean x, mean y)
- Unit: Region (e.g. 32×32)
- Classifier: SVM
Result: best trade-off in our setup — stable, good accuracy, blocky but clean boundaries. Less noise than per-pixel.
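The tiling step can be sketched as follows; the 32×32 block size matches the example above, while the majority-vote labeling rule and the use of the block center as its (x, y) are assumptions about reasonable choices, not confirmed notebook details:

```python
import numpy as np

rng = np.random.default_rng(0)
H, W, B = 256, 256, 32  # image size and block size (32x32 as above)
image = rng.integers(0, 256, size=(H, W, 3)).astype(np.float32) / 255.0
mask = np.zeros((H, W), dtype=np.uint8)
mask[H // 2:, :] = 1  # toy mask: bottom half is floor

feats, labels = [], []
for by in range(0, H, B):
    for bx in range(0, W, B):
        block = image[by:by + B, bx:bx + B]
        mean_rgb = block.reshape(-1, 3).mean(axis=0)
        # Block center, normalized to [0, 1]
        cx, cy = (bx + B / 2) / W, (by + B / 2) / H
        feats.append(np.concatenate([mean_rgb, [cx, cy]]))
        # Majority vote over the block's pixels gives the block label
        labels.append(int(mask[by:by + B, bx:bx + B].mean() >= 0.5))

X = np.array(feats, dtype=np.float32)  # (64, 5) for a 256x256 image
y = np.array(labels)
```

Averaging over blocks is also why the boundaries look blocky: each 32×32 tile gets a single label, so the prediction map has block-level resolution.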
Patches are described by HOG; no color. We use two data regimes: small and large.
- Features: HOG descriptor
- Unit: Patch
- Classifier: SVM
- Small data: overfits (high training accuracy, low test accuracy).
- Large data: generalizes well and can reach strong recall.
HOG is discriminative but needs enough data and does not use color.
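A minimal HOG extraction sketch, assuming scikit-image; the patch size and HOG parameters here are common defaults, not necessarily the notebook's exact settings:

```python
import numpy as np
from skimage.feature import hog

rng = np.random.default_rng(0)

# Toy 32x32 grayscale patch; in the notebooks patches are cut from real images
patch = rng.random((32, 32))

# HOG ignores color: only local gradient orientations are encoded
desc = hog(patch, orientations=9,
           pixels_per_cell=(8, 8), cells_per_block=(2, 2))
```

With these parameters a 32×32 patch yields a 324-dimensional descriptor (3×3 block positions × 2×2 cells × 9 orientation bins), which is far higher-dimensional than the 3-to-5-dimensional features of Strategies A–C; that dimensionality is one reason the small-data regime overfits.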
| Strategy | Features | Test accuracy (approx.) | Notes |
|---|---|---|---|
| A | RGB | ~0.66 | Poor, noisy |
| B | RGB + (x,y) | ~0.86 | Good |
| C | Region mean RGB + (x,y) | ~0.91 | Very good |
| D (small) | HOG | ~0.70 | Overfitting |
| D (large) | HOG | ~0.95 | Strong |
- Low C: wider margin, more tolerance for errors, simpler boundary, less overfitting (can underfit).
- High C: fits training data tightly, complex boundary, more overfitting.
In our runs: the pixel-level strategies (A, B) worked best with moderate C; the region-based strategy (C) was less sensitive to the choice of C; HOG (D) overfit badly with high C on small data.
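The effect of C can be seen on a toy problem; this is an illustration of the regularization trade-off, not a reproduction of our runs (data and values of C are arbitrary):

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)

# Noisy 2-D toy data: two overlapping Gaussian blobs
X = np.concatenate([rng.normal(-1.0, 1.2, size=(200, 2)),
                    rng.normal(1.0, 1.2, size=(200, 2))])
y = np.array([0] * 200 + [1] * 200)

# Training accuracy rises with C as the boundary bends to fit the data;
# held-out accuracy would typically peak at some moderate C instead.
scores = {C: SVC(kernel="rbf", C=C).fit(X, y).score(X, y)
          for C in (0.01, 1.0, 100.0)}
```

In practice a grid search over C with cross-validation (e.g. scikit-learn's `GridSearchCV`) is the standard way to pick this value.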
- Linear: Fast and stable; worked well for B and C (color + layout, region).
- RBF: More expressive; helped D (HOG) when data was sufficient, but overfit on small HOG sets.
- Polynomial: No clear benefit here; slower and less stable, so we did not rely on it.
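Comparing kernels under an identical train/test split can be sketched like this, assuming scikit-learn; the 5-D Gaussian toy features stand in for our real descriptors, so the scores it prints illustrate the comparison procedure rather than our reported numbers:

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

rng = np.random.default_rng(0)

# Toy 5-D features standing in for the real descriptors
X = np.concatenate([rng.normal(-1.0, 1.0, size=(300, 5)),
                    rng.normal(1.0, 1.0, size=(300, 5))])
y = np.array([0] * 300 + [1] * 300)
Xtr, Xte, ytr, yte = train_test_split(X, y, test_size=0.3,
                                      random_state=0)

# Same split, same C: only the kernel changes
results = {k: SVC(kernel=k, C=1.0).fit(Xtr, ytr).score(Xte, yte)
           for k in ("linear", "rbf", "poly")}
```

Holding the split and C fixed is what makes the kernel comparison meaningful; changing several knobs at once would confound the results.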
- Raw RGB alone is not sufficient for reliable floor segmentation.
- Adding (x, y) brings a big improvement; layout matters a lot.
- Region-level (tiled) color + layout gave the best balance of robustness and simplicity in our setting.
- HOG works well with enough data but is data-hungry and ignores color.
- Careful feature design matters as much as the choice of classifier.
├── approach1_pixel_rgb.ipynb → Strategy A (color-only)
├── approach2_pixel_rgb_xy.ipynb → Strategy B (color + layout)
├── approach3_region_rgb_xy.ipynb → Strategy C (tiled color + layout)
├── approach4_hog_small_dataset.ipynb
├── approach4_hog_large_dataset.ipynb → Strategy D (HOG, two data sizes)
└── README.md
- Hybrid HOG + color features.
- Handling class imbalance (e.g. sampling or weighting).
- Comparison with a small deep segmentation model.
- Cross-dataset evaluation.