Tim Davis GraphBLAS Interview Series Summary

This repository contains 14 video interview sessions with Dr. Tim Davis (Texas A&M University), author of the SuiteSparse GraphBLAS library. The series provides an in-depth exploration of GraphBLAS internals, covering code architecture, data structures, algorithms, and the JIT compilation system used to achieve high-performance graph processing through sparse linear algebra.

Series Overview

The interviews span from May 2024 to January 2025, progressively building understanding from foundational concepts to advanced implementation details. Topics include matrix storage formats, the five kernel types, JIT compilation infrastructure, and specific algorithm implementations for operations like transpose, reduce, apply, and element-wise operations. More sessions are planned in the future.

Future topics that need to be discussed (from the Source/ folders):

- assign: `GrB_assign` (C(I,J)=A) and variants
- concat: concatenat matrices
- container: load/unload from opaque objects into non-opaque arrays
- context: manage compute resources
- convert: convert sparse/hyper/full/bitmap
- cumsum: cumulative sum
- diag: create diagonal matrix from vector, get diagonal from matrix
- dup: duplicate a matrix or vector
- element: get/set a single entry
- emult: ewise multiply
- extract:  C=A(I,J) and variants
- extractTuples: extract COO from an opaque object
- `get_set`: `GrB_get` and `GrB_set` and variants
- global: global settings
- hyper: support methods for hypersparse matrices
- ij: properties of I and J for extract (C=A(I,J)) and assign (C(I,J)=A)
- init: `GrB_init`
- iso: support for iso-valued matrices and operations
- iterator: `GxB_Iterator` object and its operations
- `jit*`: the JIT
- kronecker: `GrB_kronecker`
- mask: mask/accum phase, `C<M>+=T`
- math: basic math operations
- matrix: support for basic operations on a `GrB_Matrix`
- memory: malloc/free wrappers, parallel memset, memcpy, etc
- mxm: `GrB_mxm` and variants
- nvals: `GrB_Matrix_nvals` and variants
- ok: debugging support
- omp: OpenMP support
- pending: pending tuples
- `pji_control`: 32/64 bit integers
- positional: support for positional operators
- print: printing opaque objects
- reshape
- resize
- scalar: support for `GrB_Scalar` objects
- select: `GrB_select`
- serialize: `GrB_serialize`; see also `lz4_wrapper` and `zstd_wrapper`
- sort
- split
- transplant
- type: support for `GrB_Type` objects

The two most complex topics in the list above are mxm and assign (each about 16 KLOC in size), and each of those will require multiple sessions. The jitifyer (in the jit* folders) is about 14 KLOC, but much of it has already been discussed, indirectly at least. Details of how the JIT works have not been discussed yet; this may merit at least a session. The rest of the folders are all under 4 KLOC; most are smaller (emult: 3.6 KLOC, sort: 3.4, select 2.7, etc, extract: 2.7 KLOC, and smaller), and could be done in one Session each, or less.

Session	Date	Topics	Link
Session 1	May 28, 2024	Code organization, 5 kernel types, JIT system intro, data structures overview	Summary
Session 2	June 4, 2024	Matrix/vector object structure, storage formats (Full/Bitmap/Sparse/Hypersparse), zombies & pending tuples	Summary
Session 3	June 11, 2024	Hyper hash data structure, format conversions, matrix multiply variants overview	Summary

JIT Compilation Deep Dive

Session	Date	Topics	Link
Session 4	June 17, 2024	JIT architecture (encodify/jitify/execute), kernel caching, Pre-JIT, CUDA integration	Summary
Session 5	June 25, 2024	JIT control modes, kernel loading, macrofication process, binary operators & monoids	Summary
Session 6	August 1, 2024	JIT synchronization, multi-threading/multi-process, macrofy system, error handling	Summary
Session 7	August 7, 2024	Kernel validation, macro system deep dive, type casting, compilation methods	Summary

Algorithm Implementations

Session	Date	Topics	Link
Session 8	August 13, 2024	GrB_apply operation, parallel execution, factory kernel system, three-tier execution	Summary
Session 9	August 19, 2024	Reduce operations, monoid vs binary operator debate, panel reduction algorithm	Summary
Session 10	September 18, 2024	Transpose algorithms (builder vs bucket), shallow copies, format selection	Summary
Session 11	October 2, 2024	Matrix build process, duplicate handling, 5-phase parallel algorithm, hyperhash	Summary
Session 12	October 16, 2024	Transpose/apply integration, 8 transpose methods, iso-valued optimization	Summary

Element-wise Operations

Video	Session	Date	Topics	Link
	Session 13	November 8, 2024	Element-wise add vs union, format-agnostic design, debugging infrastructure, factory vs JIT	Summary
	Session 14	January 10, 2025	32/64-bit integer infrastructure, workspace management (Werk), parallelism strategies, task slicing	Summary

Key Themes Across Sessions

Data Structures

Four matrix formats: Full, Bitmap, Sparse (CSR/CSC), Hypersparse
P-H-B-I-X arrays: Unified storage model across formats
Zombies & pending tuples: Dynamic graph support through deferred operations
Hyper hash: O(1) lookup for hypersparse matrices
Iso-valued matrices: Single-value optimization for structural matrices

Kernel Architecture

Five kernel types: Factory, JIT, Pre-JIT, Generic, CUDA
Three-tier execution: Factory → JIT → Generic fallback
Macrofication: Template specialization through macro expansion
Switch factory pattern: C-based runtime templating

JIT System

Encodify/Jitify/Execute: Three-phase kernel generation
28-bit encoding: Compact representation of operator/type combinations
Hash table caching: In-memory and on-disk kernel storage
Five control modes: OFF, PAUSE, RUN, LOAD, ON
Multi-process safety: File locking for shared cache directories

Parallelization Strategies

Coarse vs fine tasks: Adaptive granularity based on work distribution
EK_slice / EY_slice: Work partitioning for sparse operations
P_slice: Recursive load balancing
Atomic vs non-atomic bucket sort: Trade-offs in transpose algorithms

Design Philosophy

Format-agnostic algorithms: CSR/CSC unified through descriptor handling
Operation fusion: Transpose + typecast + operator in single pass
Shallow copies: O(1) array sharing for temporary matrices
Greppable code style: Consistent formatting for searchability
Debug as documentation: Reference implementations in debug code

Session Participants

Dr. Tim Davis - Creator of SuiteSparse GraphBLAS, Professor at Texas A&M University
Michel Pelletier - Primary interviewer
Piotr Luszczek - Participating researcher (some sessions)

File Structure

TimSessions/
├── README.md           # This file - series overview
├── CLAUDE.md           # Claude Code guidance
├── Session1/
│   ├── README.md       # Session summary with timestamps
│   ├── *.vtt           # WebVTT transcript
│   ├── *.m4a           # Audio recording
│   └── *.mp4           # Video recording
├── Session2/
│   └── ...
└── Session14/
    └── ...

Searching the Content

To find discussions of specific topics:

# Search all transcripts for a keyword
grep -r "hypersparse" Session*//*.vtt

# Search all summaries
grep -r "JIT" Session*/README.md

Each session summary includes timestamps in [HH:MM:SS] format for jumping to specific sections in the video recordings.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
Session1		Session1
Session10		Session10
Session11		Session11
Session12		Session12
Session13		Session13
Session14		Session14
Session2		Session2
Session3		Session3
Session4		Session4
Session5		Session5
Session6		Session6
Session7		Session7
Session8		Session8
Session9		Session9
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Tim Davis GraphBLAS Interview Series Summary

Series Overview

Table of Contents

Foundational Sessions

JIT Compilation Deep Dive

Algorithm Implementations

Element-wise Operations

Key Themes Across Sessions

Data Structures

Kernel Architecture

JIT System

Parallelization Strategies

Design Philosophy

Session Participants

File Structure

Searching the Content

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

GraphBLAS/SuiteSparseSessions

Folders and files

Latest commit

History

Repository files navigation

Tim Davis GraphBLAS Interview Series Summary

Series Overview

Table of Contents

Foundational Sessions

JIT Compilation Deep Dive

Algorithm Implementations

Element-wise Operations

Key Themes Across Sessions

Data Structures

Kernel Architecture

JIT System

Parallelization Strategies

Design Philosophy

Session Participants

File Structure

Searching the Content

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Packages