Tested on High-Performance Infrastructure using the KDD Cup '99 dataset (~494k samples).
| Metric | Result |
|---|---|
| Throughput | ~345,000 samples processed in < 6s |
| Global Accuracy | 98.2% |
| Anomaly Recall | 99.1% |
| Parallel Efficiency | Distributed across 12-core CPU Architecture |
| Scalability | Linear scaling with Batch-based Parallelism |
This implementation strictly follows the architectural framework of the P-ELM paper published in Applied Intelligence (Springer):
- ⚡ SVD-Augmented Initialization: Unlike standard ELMs, this version uses Singular Value Decomposition on augmented data matrices to initialize both Weights and Biases, ensuring superior numerical stability.
- 🧠 Intelligent Knowledge Base (KB): Features a fixed-length KB buffer that stores high-performing model weights, filtering out noise through eligibility criteria.
- 🔄 Master-Worker Synthesis: Parallel workers compute local output weights which are then synthesized by a central Master node using Centrality-based Model Averaging.
- 🛡️ Online Evaluator: Real-time feedback loop that validates learning quality before updating the Knowledge Base.
Project Structure & Components
src/elm_svd.py: Core ELM logic with SVD-based initialization for weights and biases.src/weight_synthesizer.py: Knowledge Base management and eligibility-based weight merging.src/elm_online.py: Parallel orchestration layer usingjoblibfor multi-core distribution.Demo.ipynb: Interactive visualization and performance analytics dashboard.
This project implements the four main components of the P-ELM framework:
- Parallel ELM Workers: Independent learners processing data chunks.
- Weight Synthesizer: Aggregates knowledge from workers.
- Knowledge Base (KB): Retains historical learning with a fixed-length memory.
- Evaluator: ensures the reliability of newly learned patterns.
Last updated: 2026-05-26 07:10:54
Dataset: data/kddcup.data_10_percent.gz
Samples used: 120,000 total, 84,000 train, 36,000 test
Configuration: 4096 hidden neurons, 40000 online batch size, 4 workers
| Model | Accuracy | Anomaly Recall | Train Time | Throughput | Peak CPU | Peak RAM |
|---|---|---|---|---|---|---|
| Sequential ELM | 0.9973 | 0.9993 | 73.265s | 1,147/s | 26.2% | 12882.8 MB |
| Sequential SVD-ELM | 0.9949 | 0.9960 | 66.774s | 1,258/s | 23.4% | 12936.9 MB |
| Online Parallel ELM | 0.9944 | 0.9942 | 192.278s | 437/s | 52.4% | 15871.3 MB |
Best accuracy: Sequential ELM (0.9973)
Fastest training: Sequential SVD-ELM (66.774s)
Based on the research paper:
Parallelized Extreme Learning Machine for Online Data Classification
Vidhya M. & Aji S. (2022)
Journal: Applied Intelligence, Springer.
DOI: 10.1007/s10489-022-03308-7



