A reproducible pipeline for EEG-based biomarker extraction using the Linear Observer Control Framework (LOCF), with a dashboard for cohort- and subject-level visualization.
Project Supervisor: Zheng Wang
This platform:
- Ingests raw EEG data (starting with the LEMON dataset)
- Preprocesses and quality-checks recordings
- Extracts standard EEG features and LOCF-derived biomarkers (K_norm, L_norm, etc.)
- Stores outputs in a structured biomarker database
- Displays results in an interactive dashboard
Longer-term targets: sleep, mood, depression, meditation, PBM, digital-twin applications.
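The stages above could eventually be chained in a single per-subject driver. The sketch below is purely illustrative — none of these function bodies exist in the repo; it only shows the intended data flow (ingest → preprocess → features/biomarkers → database row):

```python
# Illustrative data flow only; the real stages live under src/.
# All values and thresholds here are stand-ins, not project logic.

def run_subject(subject_id: str) -> dict:
    raw = {"subject": subject_id, "eeg": [0.1, -0.2, 3.5, 0.05]}  # ingestion stand-in
    clean = [x for x in raw["eeg"] if abs(x) < 1.0]               # crude artifact rejection
    features = {"mean_amp": sum(clean) / len(clean)}              # standard-feature stand-in
    biomarkers = {"K_norm": 0.0, "L_norm": 0.0}                   # LOCF computation goes here
    return {"subject": subject_id, **features, **biomarkers}

record = run_subject("sub-0001")  # one row destined for the biomarker database
```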
```
src/
  ingestion/      # data loading, metadata parsing
  preprocessing/  # artifact rejection, filtering, epoching
  features/       # standard EEG feature extraction
  biomarkers/     # LOCF biomarker computation
  database/       # schema, write/read utilities
  dashboard/      # Dash/Streamlit app
scripts/          # CLI entry points
notebooks/        # exploratory and onboarding notebooks
configs/          # pipeline configs and path templates
docs/             # biomarker dictionary, design docs
tests/            # unit and smoke tests
outputs/          # generated outputs (not committed)
```
| Name | Owns |
|---|---|
| Pawan | preprocessing, feature extraction, biomarker extraction, LEMON validation |
| Vedansh | database schema, backend/API, dashboard UI |
| Aditya | data ingestion, metadata parsing, orchestration, config, reproducibility |
```
git clone <repo-url>
cd eeg-biomarker-platform
```

```
conda env create -f environment.yml
conda activate eeg-biomarker-platform
```

or

```
pip install -r requirements.txt
```

```
cp configs/paths.example.yaml configs/paths.local.yaml
```

Edit `configs/paths.local.yaml` to point to your local LEMON data (synced from Google Drive).

```
jupyter notebook notebooks/00_environment_check.ipynb
python scripts/run_preprocessing.py --config configs/preprocessing.yaml --subject sub-0001
python scripts/run_biomarkers.py --config configs/biomarkers.yaml --subject sub-0001
python scripts/build_database.py
python scripts/launch_dashboard.py
```

- Do not commit raw dataset files to GitHub.
- Raw LEMON data and shared example notebooks live in Google Drive.
- Reference data paths through `configs/paths.local.yaml`; never hard-code paths. `configs/paths.example.yaml` is the committed template; `paths.local.yaml` is gitignored.
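A minimal `paths.local.yaml` might look like the fragment below. The key names are illustrative assumptions, not the committed schema — copy `configs/paths.example.yaml` for the real template:

```yaml
# Illustrative only; the actual keys are defined in configs/paths.example.yaml.
lemon_raw_dir: /path/to/LEMON/raw   # local copy synced from Google Drive
outputs_dir: ./outputs              # generated files, gitignored
```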
| Type | Location |
|---|---|
| QC summaries | outputs/qc/ |
| Subject biomarker tables | outputs/biomarkers/ |
| Cohort summary tables | outputs/cohort/ |
Example:

```
outputs/qc/sub-0001_ses-rest_run-01_qc.csv
outputs/biomarkers/sub-0001_ses-rest_run-01_biomarkers.parquet
```
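The output names follow a BIDS-style `sub-*_ses-*_run-*` pattern. A small helper like this hypothetical one (not existing project code) could keep naming consistent across modules:

```python
# Hypothetical helper to build output filenames in the
# sub-<id>_ses-<session>_run-<NN>_<kind>.<ext> pattern shown above.

def output_name(subject: str, session: str, run: int, kind: str, ext: str) -> str:
    """output_name('0001', 'rest', 1, 'qc', 'csv') -> 'sub-0001_ses-rest_run-01_qc.csv'"""
    return f"sub-{subject}_ses-{session}_run-{run:02d}_{kind}.{ext}"

print(output_name("0001", "rest", 1, "biomarkers", "parquet"))
# sub-0001_ses-rest_run-01_biomarkers.parquet
```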
- Every pull request must be linked to an issue.
- Notebook logic must be converted to scripts/modules before merging.
- Use standard file and variable naming across modules.
- Every major feature needs at least one test or smoke check.
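A smoke check can be as small as asserting one invariant of a module's output. This sketch assumes a hypothetical biomarker-record shape rather than real project code, and runs under pytest or as a plain script:

```python
# Hypothetical smoke check: a biomarker record must carry the expected
# columns with finite values before it enters the database.
import math

def check_biomarker_record(record: dict) -> bool:
    required = {"subject", "K_norm", "L_norm"}
    if not required <= record.keys():
        return False
    return all(math.isfinite(record[k]) for k in ("K_norm", "L_norm"))

def test_smoke_biomarker_record():
    assert check_biomarker_record({"subject": "sub-0001", "K_norm": 0.42, "L_norm": 0.87})
    assert not check_biomarker_record({"subject": "sub-0001"})  # missing columns

test_smoke_biomarker_record()
```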
| # | Milestone |
|---|---|
| 1 | Onboarding and replication |
| 2 | LEMON ingestion and preprocessing |
| 3 | Biomarker database |
| 4 | Dashboard prototype |
| 5 | Prototype release |
See docs/milestones.md for detailed checklists.