This project contains a Python script for classifying images as either indoor or outdoor scenes using a pre-trained CLIP model and a lightweight few-shot, prototype-based method. It was built in response to a practical classification task using real-world, unlabeled image data.
The classifier leverages OpenAI’s CLIP (clip-vit-base-patch32) as a visual encoder and classifies new images by comparing their embeddings to representative class prototypes created from a small set of hand-picked example images (few-shot learning).
It is:
- ✅ Training-free (no fine-tuning needed)
- ⚡ Fast, with support for CPU
- 🔧 Modular, and easy to improve further
- Python 3.8+
- Git
- Dependencies listed in `requirements.txt`
Follow these steps to set up and run the classifier on your own machine.
First, clone this repository to your local machine using Git:
```bash
git clone https://github.com/DrUkachi/inside-outside-classification.git
```

Change your current directory to the newly cloned project folder:

```bash
cd inside-outside-classification
```

Install the required Python libraries using the requirements.txt file:

```bash
pip install -r requirements.txt
```

This installs:
- PyTorch
- HuggingFace Transformers
- Pillow (for image handling)
- tqdm (for progress bars)
Before running the script, make sure your image data is placed in the data/ directory as ZIP files. The expected project structure should be:
```
project-root/
├── data/
│   ├── few_shot.zip
│   ├── validation.zip
│   └── unlabeled.zip
│
├── .gitignore
├── README.md
├── classify.py
├── experiment.ipynb
├── few_shot_images.json
└── requirements.txt
```
The script is designed to automatically unzip the few_shot.zip, validation.zip, and unlabeled.zip files into the root directory the first time you run it.
It will create the following folders:
- few_shot/
- validation/
- unlabeled/
💡 The script will only unzip the files if the corresponding folders don’t already exist, so you can safely rerun it without duplicating data.
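A minimal sketch of this unzip-on-first-run behavior (the helper name `ensure_extracted` is illustrative; the actual logic lives in `classify.py`, and the layout inside the ZIPs may differ):

```python
import zipfile
from pathlib import Path

def ensure_extracted(zip_path: str, target_dir: str) -> None:
    """Extract zip_path into target_dir only if the folder does not already exist."""
    target = Path(target_dir)
    if target.exists():
        return  # already extracted; rerunning is a no-op
    with zipfile.ZipFile(zip_path) as zf:
        zf.extractall(target)

# Unpack the three archives described above into the project root.
for name in ("few_shot", "validation", "unlabeled"):
    ensure_extracted(f"data/{name}.zip", name)
```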
You can run the script in two primary modes: classify or validate.
Use this command to classify all images inside the unlabeled/ directory.
```bash
python classify.py --mode classify --folder ./unlabeled
```

Use this command to run the classifier on the validation/ set and check its accuracy. This mode assumes the filenames in the validation set contain ground-truth labels (e.g., indoor_image_1.jpg).

```bash
python classify.py --mode validate --folder ./validation
```

After running, all processed images will be sorted into one of three directories: classified/indoor/, classified/outdoor/, or classified/review/ for ambiguous cases.
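For the validate mode, accuracy can be computed roughly as sketched below; `label_from_filename` and the `classify_image` callable are hypothetical stand-ins, not the actual functions in `classify.py`, and the real file extensions may vary:

```python
from pathlib import Path

def label_from_filename(path: Path) -> str:
    """Derive the ground-truth label from a filename prefix such as indoor_image_1.jpg."""
    return "indoor" if path.name.lower().startswith("indoor") else "outdoor"

def validate(folder: str, classify_image) -> float:
    """Return the accuracy of classify_image over all .jpg files in folder."""
    images = sorted(Path(folder).glob("*.jpg"))
    correct = sum(classify_image(p) == label_from_filename(p) for p in images)
    return correct / len(images) if images else 0.0
```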
- Uses `openai/clip-vit-base-patch32` from HuggingFace
- Embeddings are extracted for both:
  - Text prompts (e.g., “a photo taken indoors”)
  - Few-shot example images
- For each image:
  - Get its CLIP embedding
  - Compare it to the averaged prompt embeddings and few-shot image embeddings
  - Compute similarity to both classes
  - 🔄 If the top two scores are close (within 0.05), the image is sent to review
  - Otherwise, assign to the class with the highest similarity (a minimal sketch of this pipeline follows below)
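The scoring step can be sketched roughly as follows. This is a minimal illustration assuming the HuggingFace CLIP API named above; the helper names (`embed_images`, `embed_texts`, `build_prototype`, `similarity_to`) and the exact prompts, pooling, and weighting are assumptions, not the actual code in `classify.py`.

```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

def embed_images(paths):
    """Return unit-normalized CLIP embeddings for a list of image paths."""
    images = [Image.open(p).convert("RGB") for p in paths]
    inputs = processor(images=images, return_tensors="pt")
    with torch.no_grad():
        feats = model.get_image_features(**inputs)
    return feats / feats.norm(dim=-1, keepdim=True)

def embed_texts(prompts):
    """Return unit-normalized CLIP embeddings for a list of text prompts."""
    inputs = processor(text=prompts, return_tensors="pt", padding=True)
    with torch.no_grad():
        feats = model.get_text_features(**inputs)
    return feats / feats.norm(dim=-1, keepdim=True)

def build_prototype(prompts, example_paths):
    """Average prompt and few-shot image embeddings into one class prototype."""
    feats = torch.cat([embed_texts(prompts), embed_images(example_paths)])
    proto = feats.mean(dim=0)
    return proto / proto.norm()

def similarity_to(proto, image_path):
    """Cosine similarity between a new image and a class prototype."""
    return float(embed_images([image_path])[0] @ proto)
```

With one prototype per class (e.g., built from prompts like “a photo taken indoors” plus the indoor few-shot examples), classification reduces to comparing `similarity_to(indoor_proto, img)` against `similarity_to(outdoor_proto, img)`.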
When the model is unsure (i.e., the similarity difference between the top two classes is small), the image is routed to a review/ folder for manual inspection.
🧠 Why 0.05? A 5% margin was selected as a practical threshold for ambiguity. It captures borderline cases where CLIP's semantic similarity doesn't clearly favor one class. This value can be tuned depending on tolerance for false positives or the capacity for human review.
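Put differently, the routing rule is a single comparison of the two class similarities against the margin. A sketch, where the names `MARGIN` and `assign` are illustrative:

```python
MARGIN = 0.05  # tunable ambiguity threshold discussed above

def assign(sim_indoor: float, sim_outdoor: float) -> str:
    """Return the destination folder for an image given its two class similarities."""
    if abs(sim_indoor - sim_outdoor) < MARGIN:
        return "review"  # too close to call; send for manual inspection
    return "indoor" if sim_indoor > sim_outdoor else "outdoor"
```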
- No model training required
- Strong generalization from a few visual examples
- Prompt-based reasoning makes it adaptable to other classes
| Category | Example | Insight |
|---|---|---|
| Ambiguous scenes | `190881191_*.jpg` | Even humans disagree; routed to `review/` folder |
| Roofed or car interiors | `219636488_*.jpg`, `70939958_*.jpg` | Challenging without contextual metadata |
| Clear misclassifications | `79869777_*.jpg`, `227589596_*.jpg` | Could benefit from a secondary model (e.g., object detection) |
| Unexplainable predictions | `253900795_*.jpg` | Explaining CLIP decisions is non-trivial; visual interpretability tools could help |
| Prompt sensitivity | `50587842_*.jpg`, `56540294_*.jpg` | Slight changes in text can impact results; consider dynamic prompt ensembling |
| Environmental cues | `99454779_*.jpg` | Brightness, lighting, and framing may bias CLIP's perception |
- **Scene-Based Inference Engine**: Use models like Places365 to classify contextually confusing cases (e.g., parking lots, stadiums).
- **Explainability Tools**: Add SHAP, Grad-CAM, or embedding heatmaps to interpret classification decisions.
- **Prompt Augmentation & Tuning**: Dynamically improve text prompts using automated selection or fine-tuned language prompts.
- `classify.py`: Main classification and validation script
- `README.md`: Full guide and technical report (this file)
- Folder structure with `few_shot/`, `unlabeled/`, `validation/`, and `classified/` directories
This solution demonstrates:
- ✅ Effective use of large pre-trained vision-language models
- ✅ Lightweight and reproducible code
- ✅ Clear handling of edge cases
- ✅ Review mechanism for ambiguous images
- ✅ Good modularity for future extensions