
Add unified cache #1

Open
NikitaEvs wants to merge 22 commits into master from add-unified-cache

Conversation


@NikitaEvs NikitaEvs commented Jan 17, 2023

Summary

PoC of the unified cache model, with new/delete integration, custom Allocator support, and integrations for the Uncompressed cache and the Marks cache.

Features

  • Allocations that go through new/delete or through custom BuddyAllocator instances can trigger eviction of entries from the Uncompressed cache and the Marks cache
  • All such allocations use a global memory arena that is initialized at startup
  • Evictions are triggered when consumed memory exceeds a configured threshold
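The threshold-based trigger can be sketched roughly as follows. This is a minimal illustration, not code from the PR: the names (`EvictionTrigger`, `onAlloc`, `onFree`) are hypothetical and only demonstrate the "consumed memory crosses a threshold → fire eviction" mechanism described above.

```cpp
#include <atomic>
#include <cstddef>
#include <functional>

/// Hypothetical sketch of a threshold-based eviction trigger: every allocation
/// bumps a consumed-memory counter, and crossing the threshold fires an
/// eviction callback (asynchronous in the PoC, so memory is not freed instantly).
class EvictionTrigger
{
public:
    EvictionTrigger(size_t threshold_, std::function<void()> evict_)
        : threshold(threshold_), evict(std::move(evict_)) {}

    void onAlloc(size_t size)
    {
        size_t now = consumed.fetch_add(size, std::memory_order_relaxed) + size;
        if (now > threshold)
            evict(); /// schedules eviction; no instant-release guarantee
    }

    void onFree(size_t size) { consumed.fetch_sub(size, std::memory_order_relaxed); }

    size_t consumedBytes() const { return consumed.load(std::memory_order_relaxed); }

private:
    std::atomic<size_t> consumed{0};
    const size_t threshold;
    std::function<void()> evict;
};
```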

Main known problems

  • The memory tracker doesn't work with this scheme
  • Evictions are asynchronous, so there is no guarantee that memory is released instantly in case of a memory shortage
  • The majority of allocations go through custom Allocator instances (for example, PODArray uses the Allocator class), so those need to be changed to BuddyAllocator to make these structures use BuddyArena
  • The buddy allocator scheme isn't very efficient
  • Potentially high contention on the global cache policy and the global allocator state
  • Some allocations can happen outside of BuddyArena (e.g. static constructors cannot use BuddyArena)
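On the efficiency point: a buddy allocator's main overhead is internal fragmentation, because every request is rounded up to the next power of two, so a request just above a power of two wastes almost half its block. A tiny self-contained illustration (the function is hypothetical but matches standard buddy rounding):

```cpp
#include <cstddef>

/// Buddy allocators serve each request from a block whose size is the next
/// power of two >= the request, so up to ~50% of a block can be wasted.
constexpr size_t buddyBlockSize(size_t request)
{
    size_t block = 1;
    while (block < request)
        block *= 2;
    return block;
}
```

For example, a 65-byte request occupies a 128-byte block, wasting 63 bytes.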

Implementation details

The high-level architecture of the PoC:

[diagram: high-level architecture of the caches]

More specific diagram:

[diagram: detailed component view of the caches]

You can find the components from the diagrams above in the following places:

Test plan

The main components of the allocator are covered by dedicated unit tests; more tests are WIP.

Benchmarks

WIP

Comment thread src/Common/memory.h Outdated
Comment thread src/Common/memory.h Outdated
Comment thread programs/server/Server.cpp Outdated

StackTrace::setShowAddresses(config().getBool("show_addresses_in_stack_traces", true));

const size_t buddy_arena_size = 2048 * (1ull << 20); // 2048 MiB

(could be written more succinctly using the KiB/MiB/GiB suffix literals in base/base/units.h)
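For illustration, assuming base/base/units.h provides binary-suffix literals along these lines (the exact names there may differ), the constant could shrink to `2048_MiB`. A self-contained sketch of such a literal:

```cpp
#include <cstddef>

/// Sketch of a MiB suffix literal similar to what the reviewer suggests
/// base/base/units.h provides (names here are illustrative).
constexpr size_t operator""_MiB(unsigned long long v) { return v * (1ULL << 20); }

const size_t buddy_arena_size = 2048_MiB; /// same value as 2048 * (1ull << 20)
```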


(not urgent, but it would be nice if we could ask the OS for the total available memory here and then allocate a percentage of it, e.g. 90%)
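Querying total physical memory and taking a fraction of it could look like the sketch below on Linux/macOS; `sysconf` with `_SC_PHYS_PAGES`/`_SC_PAGE_SIZE` is a common (though not strictly POSIX-mandated) extension, and the function name here is hypothetical:

```cpp
#include <cstddef>
#include <unistd.h>

/// Returns an arena size equal to `percent`% of total physical RAM,
/// queried via sysconf (available on Linux and macOS).
size_t arenaSizeFromTotalMemory(unsigned percent)
{
    size_t pages = static_cast<size_t>(sysconf(_SC_PHYS_PAGES));
    size_t page_size = static_cast<size_t>(sysconf(_SC_PAGE_SIZE));
    return pages * page_size / 100 * percent;
}
```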


I guess l. 659, l. 660 and l. 662 should be moved into the (private) constructor of the buddy arena (which gets called once when BuddyArena::instance() is first called)

Owner Author

Should we move the allocator arena parameters inside the private constructor? Otherwise, we can configure them using settings.
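What the reviewer suggests is the usual Meyers-singleton shape, sketched below. All names and the arena size are illustrative (the sketch uses 64 KiB instead of the PoC's 2048 MiB); in the real PR the parameters could still come from settings that are read before the first `instance()` call.

```cpp
#include <cstddef>
#include <vector>

/// Hypothetical sketch of BuddyArena as a Meyers singleton: the private
/// constructor runs exactly once, on the first call to instance().
class BuddyArena
{
public:
    static BuddyArena & instance()
    {
        static BuddyArena arena(64ULL << 10); /// 64 KiB for the sketch only
        return arena;
    }

    size_t size() const { return memory.size(); }

    BuddyArena(const BuddyArena &) = delete;
    BuddyArena & operator=(const BuddyArena &) = delete;

private:
    explicit BuddyArena(size_t bytes) : memory(bytes) {} /// one-time arena reservation
    std::vector<char> memory;
};
```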

Comment thread src/Common/CacheBase.h Outdated
Comment thread src/Common/UnifiedCache.h Outdated

char * initializeMetaStorage()
{
// Calculate sizes

For the final version, it would be nice to add a comment that explains the layout of the meta/directory structure.

Comment thread src/Common/UnifiedCache.h Outdated
meta_storage_size_round_up_to_power_of_2 *= 2;
}

/// Deallocate minimal blocks to fill the space between the meta storage and the size that is

I am not sure I understand the reason for the code below in this method.

Owner Author

Added more detailed comments; will add more in the final version.

Comment thread src/Common/UnifiedCache.h Outdated
{
auto level = calculateLevel(size);

std::lock_guard lock(mutex);

In case the locking becomes a bottleneck, you could try our new futex-based lock implementation (ClickHouse#44924). Or, alternatively, stripe the lock, i.e. introduce multiple locks (e.g. one per equally-large range of the lowest level of the buddy allocator)


We do not have a faster std::mutex reimplementation yet, only a faster std::shared_mutex (i.e. DB::SharedMutex). And I guess we won't get better performance than ordinary std::mutex the same way.
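The lock-striping idea from above can be sketched like this: one mutex per free-list level (or per address range) of the buddy structure instead of a single global mutex. All names are hypothetical; note that coalescing two buddies into a parent block may need the parent level's lock as well, which complicates a real implementation.

```cpp
#include <array>
#include <cstddef>
#include <mutex>

/// Hypothetical striped-lock sketch for the buddy allocator: allocations of
/// different size classes take different locks and so do not contend.
constexpr size_t MAX_LEVELS = 32;

class StripedLocks
{
public:
    std::mutex & forLevel(size_t level) { return locks[level % MAX_LEVELS]; }

private:
    std::array<std::mutex, MAX_LEVELS> locks;
};
```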

Comment thread src/Common/UnifiedCache.h Outdated
static void * alloc(size_t size, size_t alignment = 0)
{
checkSize(size);
CurrentMemoryTracker::alloc(size);

I remember our discussion that the memory tracker doesn't know about allocations at runtime using the buddy allocator (i.e. it just sees the initial allocation of the buddy allocator).

You added instrumentation for memory tracking here. Does that solve the issue?

Owner Author

For now we have a hack: a CurrentMemoryTracker::free call right after the initial arena allocation. Together with the instrumentation here, it should add memory tracker support to the project.
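The compensation hack can be illustrated with a toy tracker (purely illustrative; only the names mirror the discussion): the arena reservation is immediately "freed" in the tracker, so later per-allocation tracking from inside the arena is not double-counted against the reservation.

```cpp
#include <cstddef>

/// Toy model of the memory-tracker compensation hack.
struct ToyMemoryTracker
{
    long long tracked = 0;
    void alloc(size_t n) { tracked += static_cast<long long>(n); }
    void free(size_t n) { tracked -= static_cast<long long>(n); }
};

long long trackAfterCompensation(ToyMemoryTracker & t, size_t arena_size, size_t user_alloc)
{
    t.alloc(arena_size); /// initial arena reservation is seen by the tracker
    t.free(arena_size);  /// the hack: compensate immediately
    t.alloc(user_alloc); /// per-allocation instrumentation inside the arena
    return t.tracked;    /// tracker now reflects only live user allocations
}
```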
