32 changes: 32 additions & 0 deletions _posts/ai/2026-04-13-mooncake_store_mechanism.md
@@ -0,0 +1,32 @@
---
title: "Mooncake Store"
subtitle: "An Analysis of the Storage Mechanism"
layout: chirpy-post
author: "Peter Lau"
published: true
header-style: text
categories:
- AI
tags:
- AI
- Engineering
---

## Mooncake Store

**This analysis is based on Mooncake v0.3.9.**

### Overall Architecture Design

<div>
<img class="shadow" src="/img/mooncake/mooncake-store-preview.png" width="600" height="240" alt="Mooncake Store Architecture">
</div>


## P2P Store

P2P Store is mainly used to distribute large-model checkpoints and is built on top of the Transfer Engine.

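Imagine that every GPU loads its weight shards from the same fixed source at the same time: the source's bandwidth saturates immediately, and transfer performance cannot improve any further.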

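What distinguishes this design is that once a GPU has finished loading a weight shard, it forwards that shard to the other GPUs that also need it. This relieves the bandwidth pressure on the source and improves overall transfer efficiency.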
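
The effect of peer forwarding can be illustrated with a toy round-based model. This is only a sketch under simplifying assumptions, not Mooncake's actual implementation: every node (the source included) can upload one full copy of a shard per round, all receivers need the same shard, and the function names `rounds_source_only` and `rounds_peer_to_peer` are hypothetical.

```python
def rounds_source_only(num_receivers: int) -> int:
    """Rounds needed when only the source uploads (one copy per round)."""
    return num_receivers


def rounds_peer_to_peer(num_receivers: int) -> int:
    """Rounds needed when every node that holds the shard also uploads it.

    Assumption: each holder sends one copy per round, so the number of
    holders can at most double in every round.
    """
    total_nodes = num_receivers + 1  # receivers plus the source
    holders = 1                      # initially only the source has the shard
    rounds = 0
    while holders < total_nodes:
        holders = min(total_nodes, holders * 2)
        rounds += 1
    return rounds


if __name__ == "__main__":
    for n in (1, 7, 63):
        print(n, rounds_source_only(n), rounds_peer_to_peer(n))
```

In this model the source-only scheme grows linearly with the number of GPUs, while peer forwarding grows roughly logarithmically, which is the intuition behind offloading the source.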
30 changes: 30 additions & 0 deletions _posts/ai/2026-04-13-mooncake_transfer_engine_mechanism.md
@@ -0,0 +1,30 @@
---
title: "Mooncake Transfer Engine"
subtitle: "An Analysis of the Transfer Mechanism"
layout: chirpy-post
author: "Peter Lau"
published: true
header-style: text
categories:
- AI
tags:
- AI
- Engineering
---

## Mooncake Transfer Engine

**This analysis is based on Mooncake v0.3.9.**

### Transfer engine

<div>
<img class="shadow" src="/img/mooncake/transfer_engine_arch.png" width="600" height="240" alt="Transfer engine Architecture">
</div>

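In the figure above, **vRAM** is GPU memory, **DRAM** is CPU main memory, and **NVMe** (used together with the NVMe-oF protocol) is externally attached disk storage.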


### Related Issues

1. Prefill transfer failed for request rank xxx
Binary file added img/mooncake/mooncake-arch.png
Binary file added img/mooncake/mooncake-store-preview.png
Binary file added img/mooncake/transfer_engine_arch.png