OpenGazeLab

A web-based toolkit for processing eye-tracking gaze data and classifying it into fixations (eyes holding still on a target) and saccades (rapid eye movements between targets). Supports both stationary eye trackers and head-mounted eye trackers. Provides a Python processing pipeline and a browser-based UI for researchers working with eye-tracking data.

Note: Python 3.10 recommended.

Quick Start

Installation

cd backend
pip install -r requirements.txt

cd frontend
npm install

Run the App

Option A — Windows: Double-click start_servers.bat in the project root. Two terminals open and start both servers automatically.

Option B — Manual:

# Terminal 1 — Backend
cd backend
python main.py
# → Backend runs at http://127.0.0.1:5000

# Terminal 2 — Frontend
cd frontend
npm run start
# → Frontend opens at http://localhost:8000

Open the frontend URL in your browser, pick a mode (Stationary or Head-Mounted), and follow the steps below.

How to Use — Stationary Eye Tracker

For screen-based / desktop eye trackers that record gaze coordinates in a CSV file.

Input

What	Required	Description
Gaze CSV file	Yes	One sample per row with x, y, and timestamp columns
Background image	No	Stimulus screenshot (PNG/JPG/BMP/GIF/WebP) — overlaid behind the gaze plot for context

CSV format

The CSV needs three columns. Names and units are auto-detected as either pixels or normalized (0–1) for gaze, and milliseconds, seconds, or epoch for time(common variants supported, see Input Reference)

Column meaning	Accepted names	Units
Horizontal gaze	`x`, `gaze_x`, `X`, …	pixels OR normalized (0–1) — auto-detected
Vertical gaze	`y`, `gaze_y`, `Y`, …	pixels OR normalized (0–1) — auto-detected
Time	`timestamp`, `time`, …	milliseconds, seconds, or epoch — auto-detected

Auto-detected delimiters: ;, ,, \t, |, space.

Example:

timestamp;x;y
1000;1280;720
1010;1281;720
1020;1282;720
1030;1500;800
1040;1501;801

Configuration

Parameter	Description
Algorithm	`I-DT` (dispersion-based) or `I-VT` (velocity-based)
Y-Origin	Coordinate origin convention for visualization
Display Resolution	Screen resolution in pixels (width,height)
Sampling Rate	Eye-tracker sampling rate
Min Fixation Duration	Minimum duration to count as a fixation
Detection Threshold	I-DT: dispersion in pixels. I-VT: velocity in px/ms
Merge Threshold	Max distance (px) to merge nearby fixations
Adaptive Threshold	Enable MAD-based adaptive thresholding

Output

After clicking "Process Gaze Data", you get:

Statistics panel — total events, fixation samples, saccade samples, invalid samples
Downloadable CSV — original data plus event classification columns (see Output Reference)
Stationary plot (interactive Plotly HTML) — gaze samples colored by event type, fixation centers numbered in scan order, scanpath lines, optional background image
Time-scrolling plot (animated Plotly HTML) — playback of fixations/saccades over time with play/pause controls and a time slider

How to Use — Head-Mounted Eye Tracker

For head-mounted eye trackers that record gaze data, optical flow from the scene camera, and a scene video.

Input

Upload two files:

Dataset ZIP (max 100 MB) — archive of dataset files
Scene Camera Video (max 5 GB) — .mp4 from the head-mounted camera

OpenGazeLab auto-detects the dataset layout from the ZIP contents. Two layouts ship out of the box; bring your own data in either shape (see Input Reference).

Layout A — Drews & Dierkes (DD) style: `.npy` arrays

Required files inside the ZIP

File	Shape	Description
`gaze.npy`	(N, 2)	Eye gaze position (x, y) in pixels
`time_gaze.npy`	(N,)	Gaze timestamps in seconds
`optic_flow.npy`	(M, 11, 11, 2)	Per-frame 11×11 optical flow grid
`time_optic_flow.npy`	(M,)	Optical flow frame timestamps in seconds
`time_scene_camera.npy`	(M,)	Scene camera frame timestamps in seconds

Optional files

File	Shape	Description
`gt_labels.npy`	(N,)	Ground truth labels (1 = Fixation, 0 = Saccade). Triggers automatic F1 score computation.

.npy files may live at the ZIP root or inside a single subfolder.

Layout B — Gaze-in-Wild (GiW) style: `.mat` files + pre-computed flow

Required files inside the ZIP

File	Description
`PrIdx_<P>_TrIdx_<T>.mat`	GiW signals file (exactly one). Filename participant/trial IDs are parsed to apply the labeler-priority rule. Provides gaze (`ProcessData.ETG.POR`), timestamps (`ProcessData.T`), and per-sample frame indices.
`PrIdx_<P>_TrIdx_<T>_Lbr_<N>.mat`	One or more labeler annotation files.
`optic_flow.npy`	`(M, 2)` per-frame mean optical flow

How GiW labels are handled

GiW labels have six classes (UNDEFINED, FIXATION, PURSUIT, SACCADE, BLINK, FOLLOWING). OpenGazeLab collapses them to binary so the existing F1 scoring applies:

stable gaze (1) = FIXATION ∪ FOLLOWING — i.e. eye-only fixation plus head+eye co-rotation tracking an attended target.
everything else (0) = SACCADE ∪ PURSUIT ∪ BLINK ∪ UNDEFINED

Strict-fixation alone is rare in real-world recordings (~1% of samples on the trials we inspected), which is why following is lumped in.

When multiple labelers are present, OpenGazeLab uses the priority rule from Kothari et al.: trial 1 → labeler 5; otherwise prefer labeler 6, then 5, 1, 2, 3.

Triggers automatic F1 score computation just like a DD upload with gt_labels.npy.

Files may live at the ZIP root or inside any subfolder.

Configuration

Parameter	Description
Algorithm	`I-DT` (relative dispersion) or `I-VT` (relative velocity) — both run with optical-flow compensation
Video Resolution	Scene camera resolution (width,height)
Sampling Rate	Gaze sampling rate
Min Fixation Duration	Minimum duration to count as a fixation
Detection Threshold	I-VT: relative-velocity threshold (px/ms). I-DT: relative-dispersion threshold (px). See Recommended Parameters
Adaptive Threshold	Enable flow-RMS-based adaptive thresholding

Output

After clicking "Process Video Data", you get:

Statistics panel — fixation samples, saccade samples, total duration, video resolution, FPS
F1 scores — fixation and saccade F1 (only when gt_labels.npy is provided)
Downloadable CSV — gaze samples with event classification (see Output Reference)
Video overlay HTML — the scene video with:
- Gaze samples drawn as colored dots (fixation vs. saccade)
- Fixation centers labeled with sequence numbers
- An optical-flow arrow showing head motion
- A clickable event timeline bar for seeking
- Side-by-side comparison with ground truth (if provided)

Input Reference

OpenGazeLab can be used with both stationary and head-mounted eye-tracking datasets. The required input files and their formats differ between these two modes. Below is a reference for the example expected inputs in each case.

Stationary Eye Tracker

Reference
Disagreement Detection	Paper	Dataset
gazeRE	Paper	Dataset

Head-Mounted Eye Tracker

Reference
Drews & Dierkes (DD)	Paper	Dataset
Gaze-in-Wild (GiW)	Paper	Dataset (Disclaimer: The website is no longer available, try this repository instead)

Recommended Parameters

Adaptive-threshold suggestions

The adaptive-threshold path on the Head-Mounted tab takes two user inputs, gain and window_size_ms. Both default to 0 when left blank in the UI.

gain — multiplier applied to the rolling RMS of optical-flow velocity. The per-sample threshold is computed as detection_threshold + gain × flow_rms_mag, so a larger gain pushes the threshold up more aggressively under head motion. With gain = 0, the motion-driven adjustment is disabled (the threshold equals detection_threshold for every sample).
window_size_ms — length of the centered rolling window used to compute the flow-RMS magnitude. Shorter windows track rapid head movements more closely; longer windows produce a smoother, less reactive threshold.

The recommendations below come from our parameter sweeps on the DD and GiW datasets listed in Input Reference. Treat them as practical starting points — tune for recordings on different headsets, scene cameras, or sampling rates.

Dataset	Algorithm	`gain`	`window_size_ms`
DD	I-DT	∈ {0.4, 0.6, 0.7, 0.8}	55
DD	I-VT	0	55
GiW	I-DT	0.05	155
GiW	I-VT	∈ {0.6, 0.8, 0.9, 1.0}	55

Detection-threshold suggestions

These values are the per-algorithm best detection thresholds found on the DD dataset:

Algorithm	`detection_threshold`	Units
I-DT	30	relative-dispersion threshold in pixels
I-VT	1.5	relative-velocity threshold in px/ms

Tune these values for datasets recorded with a different headset, scene-camera resolution, or sampling rate.

DD parameters

Parameter	Suggested value
Video Resolution	`1088,1080`
Sampling Rate	`200`

GiW parameters

Parameter	Suggested value
Video Resolution	`1920,1080`
Sampling Rate	`300`

Output Reference

The output CSV contains every input gaze sample plus these classification columns:

Column	Description
`x`, `y`	Gaze coordinates (pixels)
`timestamp`	Time in milliseconds
`event_type`	`Fixation`, `Saccade`, `NaN` (missing data), or `Out of Range Gaze Samples`
`fixation_x`, `fixation_y`	Fixation centroid coordinates (filled for fixation rows)
`fixation_id`	Unique fixation identifier
`saccade_id`	Unique saccade identifier
`event_duration`	Event duration in ms
`start_time`, `end_time`	Event temporal bounds in ms

Head-mounted output additionally includes flow_x, flow_y, video_timestamp, frame, and gt_label (if ground truth was provided).

Toolkit Structure

OpenGazeLab/
├── start_servers.bat                      # Windows one-click startup
├── README.md                              # This file
│
├── backend/                               # Python FastAPI server (port 5000)
│   ├── main.py                            # API endpoints (upload, plot, video streaming)
│   ├── requirements.txt                   # Python dependencies
│   └── src/
│       ├── __init__.py                    # Public exports (EventDetection, EyeTrackingVisualizer)
│       ├── pipeline.py                    # EventDetection — orchestrates the full pipeline
│       ├── algorithms.py                  # I-DT and I-VT classifiers
│       ├── feature_extraction.py          # Head-mounted pipeline: Savgol smoothing, flow velocity, adaptive threshold
│       ├── preprocess_csv.py              # CSV parsing: delimiter/column/normalization auto-detection
│       ├── preprocess_headmounted/        # Head-mounted loader package
│       │   ├── __init__.py                # Re-exports the public API
│       │   ├── common.py                  # Shared helpers (extract_video_metadata)
│       │   ├── dd.py                      # DD (.npy) loader
│       │   ├── giw.py                     # GiW (.mat) loader
│       │   └── dispatcher.py              # Auto-routes DD vs GiW by ZIP contents
│       ├── utils.py                       # Velocity, MAD, fixation merging, timestamp helpers
│       └── visualization/
│           ├── __init__.py                # Visualization public exports
│           ├── stationary_plot.py         # Static Plotly: gaze + fixations + scanpath
│           ├── time_scrolling_plot.py     # Animated Plotly with playback controls
│           ├── video_overlay.py           # HTML5 video + canvas gaze overlay
│           └── _image_utils.py            # Encodes images as base64 for Plotly embedding
│   └── data/                              # Created at runtime
│       ├── events/                        # Processed event CSVs (downloadable)
│       └── visualization/                 # Generated HTML visualizations and stored scene videos
│
└── frontend/                              # Static web UI (port 8000)
    ├── index.html                         # HTML entry point
    ├── package.json                       # Uses http-server (no build step)
    ├── package-lock.json
    └── src/
        ├── App.js                         # React app — mode toggle, upload forms, results display
        └── App.css                        # Styles

What each backend file does

File	Role
main.py	FastAPI app. Defines endpoints `/api/upload`, `/api/upload-video`, `/api/plot/`, `/api/plot-video/`, `/api/video/*`
pipeline.py	`EventDetection` class — entry point that runs the full workflow: normalize → preprocess → detect → post-process
algorithms.py	The two core detection algorithms: `classify_idt` (dispersion) and `classify_ivt` (velocity)
feature_extraction.py	Head-mounted pipeline: Savitzky-Golay smoothing, flow velocity, relative velocity/dispersion, adaptive thresholds
preprocess_csv.py	Reads CSV files: detects delimiter, column names, and coordinate normalization
preprocess_headmounted/	Head-mounted loader package. Contains `dd.py` (Drews `.npy` loader), `giw.py` (Gaze-in-Wild `.mat` loader), `common.py` (shared video-metadata helper), and `dispatcher.py` (auto-routes uploads by inspecting the ZIP — any `.mat` entry → GiW, otherwise DD).
utils.py	Math helpers — velocity, MAD, fixation merging, timestamp normalization
visualization/stationary_plot.py	Builds the static Plotly chart
visualization/time_scrolling_plot.py	Builds the animated playback Plotly chart
visualization/video_overlay.py	Generates a self-contained HTML page with video + canvas gaze overlay
visualization/_image_utils.py	Encodes background images as base64 data URIs for embedding in Plotly

How the Pipeline Works

End-to-end data flow for a single upload:

1. UPLOAD              Browser sends file(s) + parameters → FastAPI
2. PARSE               preprocess_csv.py / preprocess_headmounted/ (auto-routes DD vs GiW) → DataFrame with x, y, timestamp (+ flow data for head-mounted)
3. NORMALIZE           pipeline.py — denormalize coords if needed, convert timestamps to ms, separate invalid samples
4. FEATURE EXTRACTION  feature_extraction.py (head-mounted only) → Savgol smoothing, gaze velocity, flow velocity, relative velocity / relative dispersion, adaptive threshold
5. CLASSIFY            algorithms.py — I-DT or I-VT labels each sample as Fixation or Saccade
6. POST-PROCESS        utils.py — merge nearby fixations, renumber IDs, reinsert invalid samples with reason
7. VISUALIZE           visualization/ — generate Plotly HTML and/or video-overlay HTML
8. RESPOND             Send statistics + visualization URLs to frontend

Detection Algorithms Explained

I-DT (Dispersion-Threshold Identification)

Classifies a window of samples as a fixation when their spatial dispersion stays below a threshold.

Dispersion formula: (max_x − min_x) + (max_y − min_y) in pixels
Best for: Low sampling rate, noisy data, stationary trackers

I-VT (Velocity-Threshold Identification)

Classifies each sample as a fixation when point-to-point velocity stays below a threshold.

Velocity formula: sqrt(dx² + dy²) / dt in pixels/ms
Best for: High sampling rate, clean data

Head-Mounted Enhanced Pipeline

A variant designed for head-mounted trackers, where head motion contaminates raw gaze velocity / dispersion. It feeds either the I-DT or the I-VT classifier with flow-compensated features:

Savitzky-Golay smoothing (55 ms window, 3rd order) on raw gaze coordinates
Flow velocity extracted from the optical flow grid (head/camera motion)
For I-VT: gaze velocity from smoothed coordinates, then relative velocity = gaze_velocity − flow_velocity — isolates true eye movement
For I-DT: relative dispersion — gaze dispersion measured against a flow-integrated "ideal" trajectory, removing apparent motion caused by head movement
Flow RMS = sqrt(mean(flow_x_vel²) + mean(flow_y_vel²)) where flow_x_vel, flow_y_vel = flow_x, flow_y / flow_t_delta — quantifies how much the head is moving
Adaptive threshold = base + gain × flow_rms — tightens during stillness, loosens during head movement
Classification compares the relative feature against the (adaptive or fixed) threshold

Adaptive Thresholding

Stationary (MAD-based):

adapted_threshold = original_threshold × (1 + tuning × MAD(velocity))

Head-mounted (flow-RMS-based):
```
threshold_i = base_threshold + gain × flow_rms_i
```
See Recommended Parameters for the gain and window_size_ms defaults.

Troubleshooting

Backend won't start

Ensure Python 3.10 or below is installed
Install dependencies: pip install -r backend/requirements.txt
Check whether port 5000 is in use; change it if needed
Read the console for the actual error

Frontend can't connect to backend

Backend must be running on http://127.0.0.1:5000
Frontend must be on http://localhost:8000
Check the browser console for CORS errors
Restart both servers

Processing fails

Check that the CSV has gaze columns and a timestamp (any of the supported names)
Verify numeric columns don't contain text
Try a different delimiter if auto-detection misfires
Look for NaN values in coordinate columns

No fixations detected

Lower the detection threshold (see Recommended Parameters for head-mounted starting points)
Increase the minimum fixation duration if data is noisy
Switch between I-DT and I-VT
Enable adaptive thresholding

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
backend		backend
frontend		frontend
.gitignore		.gitignore
README.md		README.md
start_servers.bat		start_servers.bat

Folders and files

Latest commit

History

Repository files navigation

OpenGazeLab

Table of Contents

Quick Start

Installation

Run the App

How to Use — Stationary Eye Tracker

Input

CSV format

Configuration

Output

How to Use — Head-Mounted Eye Tracker

Input

Layout A — Drews & Dierkes (DD) style: .npy arrays

Required files inside the ZIP

Optional files

Layout B — Gaze-in-Wild (GiW) style: .mat files + pre-computed flow

Required files inside the ZIP

How GiW labels are handled

Configuration

Output

Input Reference

Stationary Eye Tracker

Head-Mounted Eye Tracker

Recommended Parameters

Adaptive-threshold suggestions

Detection-threshold suggestions

DD parameters

GiW parameters

Output Reference

Toolkit Structure

What each backend file does

How the Pipeline Works

Detection Algorithms Explained

I-DT (Dispersion-Threshold Identification)

I-VT (Velocity-Threshold Identification)

Head-Mounted Enhanced Pipeline

Adaptive Thresholding

Troubleshooting

Backend won't start

Frontend can't connect to backend

Processing fails

No fixations detected

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Layout A — Drews & Dierkes (DD) style: `.npy` arrays

Layout B — Gaze-in-Wild (GiW) style: `.mat` files + pre-computed flow

Packages