🛡️ DeepSentry — AI Deepfake Detector

Detect deepfakes in images and videos using a fine-tuned Vision Transformer (ViT) model — ~92% accuracy.

Demo · Features · Quick Start · API Docs · How It Works

✨ Features

🖼️ Image detection — JPG, PNG, WEBP, BMP
🎬 Video detection — MP4, AVI, MOV, MKV, WEBM (samples up to 20 frames)
🤖 ViT-powered — prithivMLmods/Deep-Fake-Detector-v2-Model (~92% accuracy on 56k test images)
⚡ Fast REST API — FastAPI + Uvicorn with auto-generated Swagger UI at /docs
🎨 Polished frontend — drag-and-drop SPA, confidence ring, live activity log — no build step required
🔁 Frame averaging — video predictions average softmax probabilities across all sampled frames for robustness
💾 Cached model weights — ~330 MB downloaded once, then loaded from ~/.cache/huggingface/ on every run

🚀 Quick Start

1. Clone the repo

git clone https://github.com/YAXH64/Deepsentry---Deepfake-Detector.git
cd Deepsentry---Deepfake-Detector

2. Create a virtual environment

python -m venv .venv
source .venv/bin/activate        # Windows: .venv\Scripts\activate

3. Install dependencies

pip install -r requirements.txt

4. Start the backend

python main.py

First run only: the model weights (~330 MB) are downloaded automatically and cached. The download takes about a minute on a typical connection. Subsequent starts skip the download and load the weights from the local cache.

5. Open the frontend

Open index.html directly in your browser — no web server needed.

Backend API  →  http://localhost:8000
Swagger UI   →  http://localhost:8000/docs

📁 Project Structure

deepsentry/
├── index.html        # Frontend SPA — drag-and-drop UI, results panel
├── main.py           # FastAPI app — routes, CORS, response schema
├── model.py          # ViT model loader + inference engine
├── processor.py      # File decoding, video frame extraction
└── requirements.txt  # Python dependencies

🔌 API Reference

All endpoints return JSON. Full interactive docs at http://localhost:8000/docs.

`GET /`

Health check.

{ "status": "ok" }

`POST /detect/image`

Upload an image and get a deepfake prediction.

Request: multipart/form-data

Field	Type	Required	Notes
`file`	File	✅	`.jpg` `.jpeg` `.png` `.webp` `.bmp`

Response:

{
  "label":      "Real",
  "confidence": 94.72,
  "media_type": "image",
  "elapsed_ms": 312.5
}
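
The request above can be sketched with only the Python standard library. This is a minimal client sketch, not part of the project: `build_multipart`, `detect_image`, and the `sample.jpg` file name are hypothetical, and the backend is assumed to be running at the default `http://localhost:8000`.

```python
import json
import mimetypes
import urllib.request
import uuid
from pathlib import Path

API_BASE = "http://localhost:8000"  # default backend URL from the Quick Start


def build_multipart(field: str, filename: str, data: bytes) -> tuple[bytes, str]:
    """Encode one file as a multipart/form-data body.

    Returns the raw body and the matching Content-Type header value.
    """
    boundary = uuid.uuid4().hex
    mime = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        f"Content-Type: {mime}\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + data + tail, f"multipart/form-data; boundary={boundary}"


def detect_image(path: str, api_base: str = API_BASE) -> dict:
    """POST an image to /detect/image and return the decoded JSON response."""
    file_path = Path(path)
    body, content_type = build_multipart("file", file_path.name, file_path.read_bytes())
    request = urllib.request.Request(
        f"{api_base}/detect/image",
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


# Example call (needs the backend running and a local file to upload):
# result = detect_image("sample.jpg")
# print(result["label"], result["confidence"])
```

The same helper should work for `/detect/video` by changing the path in the request URL, since both endpoints take a single `file` field.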

`POST /detect/video`

Upload a video and get a deepfake prediction. Up to 20 evenly spaced frames are sampled, and their softmax probabilities are averaged.

Request: multipart/form-data

Field	Type	Required	Notes
`file`	File	✅	`.mp4` `.avi` `.mov` `.mkv` `.webm`

Response:

{
  "label":      "Deepfake",
  "confidence": 87.13,
  "media_type": "video",
  "elapsed_ms": 4821.0
}
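
One way to pick "up to 20 evenly spaced frames" is sketched below. The function name `sample_frame_indices` and the rounding strategy are assumptions for illustration; the actual logic in `processor.py` may differ.

```python
def sample_frame_indices(total_frames: int, max_frames: int = 20) -> list[int]:
    """Pick up to max_frames evenly spaced frame indices from a video."""
    if total_frames <= 0 or max_frames <= 0:
        return []
    # Short videos contribute every frame; longer ones are capped at max_frames.
    count = min(total_frames, max_frames)
    if count == 1:
        return [0]
    # Spread the picks from the first frame (0) to the last (total_frames - 1).
    step = (total_frames - 1) / (count - 1)
    return [round(i * step) for i in range(count)]
```

With OpenCV, each index could then be passed to `cap.set(cv2.CAP_PROP_POS_FRAMES, index)` before calling `cap.read()`. Seeking accuracy varies by codec, so treat the result as approximate.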

Error Codes

Code	Cause
`400`	No file, unsupported format, corrupted file, or no extractable frames
`422`	FastAPI validation error (missing required field)
`500`	Unhandled server error during inference
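
A client can avoid the unsupported-format `400` by checking the file extension before uploading. The sketch below uses the extension lists from the endpoint tables above. The helper name `endpoint_for` is hypothetical, and the server may apply further checks (for example, on file contents) that this does not cover.

```python
from pathlib import Path

# Extensions listed for each endpoint in the API reference.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".webp", ".bmp"}
VIDEO_EXTENSIONS = {".mp4", ".avi", ".mov", ".mkv", ".webm"}


def endpoint_for(filename: str) -> str:
    """Return the detection endpoint for a file, or raise ValueError if unsupported."""
    suffix = Path(filename).suffix.lower()
    if suffix in IMAGE_EXTENSIONS:
        return "/detect/image"
    if suffix in VIDEO_EXTENSIONS:
        return "/detect/video"
    raise ValueError(f"Unsupported file type: {suffix or '(none)'}")
```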

🧠 How It Works

DeepSentry uses a 4-step analysis pipeline:

┌─────────────────────┐
│  1. Facial Geometry  │  Analyzes 3D facial structure and landmark positions
├─────────────────────┤
│  2. Temporal Check   │  Frame-to-frame coherence (videos only)
├─────────────────────┤
│  3. Artifact Scan    │  GAN/diffusion model pixel-level fingerprints
├─────────────────────┤
│  4. ViT Inference    │  Final classification with confidence score
└─────────────────────┘

Steps 1–3 are displayed in the UI as animated checks. Step 4 is the actual model inference, which proceeds as follows:

Uploaded file → decoded by OpenCV → converted to PIL Image
ViTImageProcessor resizes to 224×224 and normalizes pixel values
ViT forward pass → softmax probabilities over ["Realism", "Deepfake"]
For video: probabilities are averaged across all sampled frames
Highest-probability class returned as the label with confidence %
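
The last three steps (softmax, frame averaging, label selection) can be sketched in plain Python. This is an illustration under assumptions, not the code in `model.py`: the function names are hypothetical, and the logit order `["Realism", "Deepfake"]` is taken from the list above. In real code the order would more safely be read from the model's `config.id2label`.

```python
import math

# Raw model label -> label returned by the API (from the Model table).
LABELS = {"Realism": "Real", "Deepfake": "Deepfake"}
CLASS_ORDER = ["Realism", "Deepfake"]  # assumed logit order


def softmax(logits: list[float]) -> list[float]:
    """Convert raw logits to probabilities (shifted by the max for stability)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(frame_logits: list[list[float]]) -> tuple[str, float]:
    """Average per-frame probabilities and return (label, confidence %)."""
    if not frame_logits:
        raise ValueError("at least one frame is required")
    probs = [softmax(logits) for logits in frame_logits]
    # Mean probability per class across all frames.
    mean = [sum(p[i] for p in probs) / len(probs) for i in range(len(CLASS_ORDER))]
    best = max(range(len(mean)), key=mean.__getitem__)
    return LABELS[CLASS_ORDER[best]], round(mean[best] * 100, 2)
```

An image is simply the single-frame case, so `classify([logits])` covers both endpoints.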

Model

Property	Value
Model ID	`prithivMLmods/Deep-Fake-Detector-v2-Model`
Architecture	ViT (vit-base-patch16-224-in21k fine-tuned)
Accuracy	~92% on 56,001 test images
Input size	224 × 224 px
Labels	`Realism` → Real · `Deepfake` → Deepfake
Device	CUDA (if available) · CPU fallback

⚙️ Configuration

Constant	File	Default	Description
`MAX_FRAMES`	`processor.py`	`20`	Max video frames sampled per upload
`MODEL_ID`	`model.py`	`prithivMLmods/...`	HuggingFace model identifier
`DEVICE`	`model.py`	auto	`cuda` if available, else `cpu`
`host`	`main.py`	`0.0.0.0`	Uvicorn bind address
`port`	`main.py`	`8000`	Uvicorn port
`API_BASE`	`index.html`	`http://localhost:8000`	Backend URL used by the frontend

Deploying remotely? Update API_BASE in the <script> block of index.html to point to your server's address.

📦 Dependencies

fastapi>=0.100.0
uvicorn[standard]>=0.23.0
python-multipart>=0.0.7
torch>=2.0.0
transformers>=4.35.0
Pillow>=9.0.0
opencv-python-headless>=4.8.0
numpy>=1.24.0
pydantic>=2.0.0

⚠️ Known Limitations

No file size limit — large video uploads are read fully into RAM. Add a size guard before production use.
CORS is open — allow_origins=["*"] is set for local dev. Restrict this before deploying publicly.
Single file at a time — batch upload is not currently supported.
Model accuracy — ~92% means roughly 1 in 12 predictions may be incorrect. Do not use as a sole source of truth.
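
The size guard mentioned in the first limitation could look roughly like the sketch below. It reads an upload in chunks and stops once a cap is exceeded. `read_limited` and the 50 MB value of `MAX_UPLOAD_BYTES` are hypothetical, not existing project settings.

```python
import io
from typing import BinaryIO

MAX_UPLOAD_BYTES = 50 * 1024 * 1024  # hypothetical 50 MB cap, not a project setting


def read_limited(stream: BinaryIO, max_bytes: int = MAX_UPLOAD_BYTES,
                 chunk_size: int = 1024 * 1024) -> bytes:
    """Read a stream in chunks, raising ValueError once it exceeds max_bytes."""
    buffer = io.BytesIO()
    while chunk := stream.read(chunk_size):
        # Reject before buffering the chunk that would cross the limit.
        if buffer.tell() + len(chunk) > max_bytes:
            raise ValueError(f"upload exceeds {max_bytes} bytes")
        buffer.write(chunk)
    return buffer.getvalue()
```

In a FastAPI route, the same check could wrap the upload's underlying file object (`UploadFile.file`) and translate the `ValueError` into a `400` or `413` response.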

🤝 Contributing

Pull requests are welcome! For major changes, please open an issue first to discuss what you'd like to change.

Fork the repo
Create your branch (git checkout -b feature/your-feature)
Commit your changes (git commit -m 'Add your feature')
Push to the branch (git push origin feature/your-feature)
Open a Pull Request

📄 License

This project is licensed under the MIT License. See LICENSE for details.

Built with ❤️ by Yash Yadav · Powered by prithivMLmods/Deep-Fake-Detector-v2-Model
