@ericliang

Ekaf stores each prepare request in a dict and replies to it only when the worker-up message arrives. In my production setup, with many Kafka partitions, the wait for the worker-up messages is long enough to hit the timeout and make the caller process exit. There is also a small chance of missing a worker-up message, because the ekaf_server state-change logic is separate from the handling of the worker-up message.

First, I've changed prepare to reply immediately, since producing a sync message to a non-prepared topic already performs a pick operation.
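
A rough sketch of the difference between the two behaviours, written in Python rather than ekaf's Erlang. The class and method names are made up for illustration and are not ekaf's API; worker start-up is modelled as a simple flag.

```python
# Hypothetical model, not ekaf's actual code: every name here is illustrative.


class DeferredPrepare:
    """Old behaviour as described above: park the caller until worker-up."""

    def __init__(self):
        self.waiting = {}  # topic -> list of parked caller ids

    def prepare(self, topic, caller):
        # Store the request; no reply is sent yet, so the caller
        # may hit its own timeout while waiting.
        self.waiting.setdefault(topic, []).append(caller)
        return None

    def worker_up(self, topic):
        # Reply to every caller that was parked for this topic.
        return [(caller, "ok") for caller in self.waiting.pop(topic, [])]


class InstantPrepare:
    """New behaviour as described above: reply right away."""

    def __init__(self):
        self.starting = set()  # topics whose workers are being started

    def prepare(self, topic, caller):
        self.starting.add(topic)  # kick off worker start-up (modelled as a flag)
        return (caller, "ok")     # the caller never waits for worker-up
```

With the deferred version, a caller's reply depends on a later worker-up event; with the instant version, the reply does not depend on that event at all, so a slow or missed worker-up message cannot make the caller time out.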

I've also added three small changes: a purge operation, for operational convenience, that can drop messages when too many are buffered in memory; a connection timeout, for faster recovery after a Kafka cluster restart or a network problem; and a bug fix for worker restart, which previously caused a double reconnection on each connection failure.

We've been running this version in production for about a month, so I think it's time to send these changes back upstream. HTH.

Re-pull from #13

bosky101 added a commit that referenced this pull request Jul 31, 2015
Fix prepare timeout problem & other fixes from @ericliang
@bosky101 bosky101 merged commit 4622228 into helpshift:feature/prepare-timeout Jul 31, 2015