[axonframework] DomainEventEntry insert performance with a MySQL backed JPAEventStore

Hey Sebastian,

the main reason for using client-generated IDs is that a client can send multiple commands for the same aggregate without having to wait for the return value of the first command. I don’t think MySQL’s key generator will have any trouble keeping up.
The Disruptor command bus solves a different problem. I have seen cases where applications had so many concurrent (and active) connections that there was high contention. In such a case, using fewer connections in a smart way can help, and the Disruptor command bus does exactly that.
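
To make the client-generated ID point concrete, here is a minimal Java sketch of that dispatch pattern. It assumes Axon’s CommandGateway; the command classes are made up for illustration:

```java
import java.util.UUID;

import org.axonframework.commandhandling.gateway.CommandGateway;

// Hypothetical commands, for illustration only.
class CreateOrderCommand {
    final String orderId;
    CreateOrderCommand(String orderId) { this.orderId = orderId; }
}

class AddOrderItemCommand {
    final String orderId;
    final String item;
    AddOrderItemCommand(String orderId, String item) {
        this.orderId = orderId;
        this.item = item;
    }
}

class OrderClient {
    private final CommandGateway gateway;

    OrderClient(CommandGateway gateway) { this.gateway = gateway; }

    void placeOrder() {
        // The client generates the aggregate identifier itself...
        String orderId = UUID.randomUUID().toString();
        // ...so it can fire follow-up commands for the same aggregate
        // without waiting for the first command's result.
        gateway.send(new CreateOrderCommand(orderId));
        gateway.send(new AddOrderItemCommand(orderId, "book"));
    }
}
```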

The link you sent is interesting, to say the least. I don’t think the UUID itself has anything to do with it; it is probably purely the use of a unique index. It seems that MySQL has difficulty maintaining large unique indexes. With an auto-increment key, MySQL knows it doesn’t need to check anything: there can’t be any existing value equal to the generated one.

In the JPA Event Store, the unique key is the combination of the aggregate identifier and the event’s sequence number. Using an auto-increment key in MySQL is not really going to be a solution. But it may be interesting to find out whether there is a way to remove the need for a unique index…
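
As an aside, here is a simplified JPA sketch of the situation being discussed (not the actual Axon entity, whose mapping differs): even if you add a surrogate auto-increment id, the unique constraint on the triple, and thus the uniqueness check, remains:

```java
import javax.persistence.Column;
import javax.persistence.Entity;
import javax.persistence.GeneratedValue;
import javax.persistence.GenerationType;
import javax.persistence.Id;
import javax.persistence.Lob;
import javax.persistence.Table;
import javax.persistence.UniqueConstraint;

// Simplified sketch of an event entry; not the real Axon class.
@Entity
@Table(uniqueConstraints = @UniqueConstraint(
        columnNames = {"aggregateIdentifier", "sequenceNumber", "type"}))
public class DomainEventEntrySketch {

    @Id
    @GeneratedValue(strategy = GenerationType.IDENTITY) // surrogate auto-increment key
    private long id;

    @Column(nullable = false)
    private String aggregateIdentifier;

    @Column(nullable = false)
    private long sequenceNumber;

    @Column(nullable = false)
    private String type;

    @Lob
    private byte[] payload;

    protected DomainEventEntrySketch() {
        // required by JPA
    }
}
```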

To be continued…

Allard

From what I’ve understood, it has to do with how MySQL lays indexes out on pages. A unique index is fine as long as inserts that arrive together end up on roughly the same pages. If they do, those pages will be in the InnoDB buffer pool and uniqueness can be checked without going to disk. This is the case if, for example, you are doing inserts with an auto-increment key: the page where a possible duplicate would live is the same one we just inserted into, so it will definitely be in memory.

With the composite PK of DomainEventEntry, which is pretty random because of its aggregate identifier (and it just struck me that this only applies if you are creating many new short-lived aggregates), the page containing possible duplicates can be located anywhere. With only 10% of the table in memory, there is a 9/10 chance that you will have to read that page from disk to find out.

It would be very nice not to have to count on the specific event persistence mechanism to check for concurrent modifications. Only a single thread executing domain logic would do the trick : )

And isn’t there a locking repository in Axon? Using that you should be able to skip the unique part?

Hi Sebastian,

something that might help here is using time-based UUIDs. They are more sequential in nature (duh). They use the machine’s MAC address and a sequence number when multiple IDs are generated in the same (milli)second. The same machine will generate sequential identifiers, making the chance that a page is already in memory a lot larger.
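
For reference, a small sketch of generating such version-1 (time-based) UUIDs in Java, here using the FasterXML java-uuid-generator library as one option; any v1 generator would do:

```java
import java.util.UUID;

import com.fasterxml.uuid.EthernetAddress;
import com.fasterxml.uuid.Generators;
import com.fasterxml.uuid.impl.TimeBasedGenerator;

public class TimeBasedIds {

    // Version-1 UUIDs combine a timestamp, the MAC address of a local
    // network interface, and a clock sequence for same-instant collisions.
    private static final TimeBasedGenerator GENERATOR =
            Generators.timeBasedGenerator(EthernetAddress.fromInterface());

    public static UUID newId() {
        return GENERATOR.generate();
    }

    public static void main(String[] args) {
        // IDs generated on the same machine in quick succession end up
        // close together, improving index page locality.
        System.out.println(newId());
        System.out.println(newId());
    }
}
```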

If you have a single-machine setup, you don’t need the event store to check for uniqueness. By default, the repository uses pessimistic locking. The disruptor command bus uses a non-locking algorithm that has the same effect.
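
To illustrate the pessimistic locking idea, here is a toy sketch; Axon’s real lock manager is more involved (deadlock detection, lock cleanup), so this is only the core concept:

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.concurrent.locks.ReentrantLock;

// Toy illustration of per-aggregate pessimistic locking.
public class AggregateLocks {

    private final ConcurrentMap<String, ReentrantLock> locks = new ConcurrentHashMap<>();

    public void executeFor(String aggregateId, Runnable commandHandling) {
        ReentrantLock lock = locks.computeIfAbsent(aggregateId, id -> new ReentrantLock());
        lock.lock();
        try {
            // Only one thread at a time touches this aggregate, so no two
            // events with the same sequence number can be produced.
            commandHandling.run();
        } finally {
            lock.unlock();
        }
    }
}
```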

The way databases work with keys isn’t really optimized for event stores. An event store doesn’t really need to maintain all the values ever used for its keys; only the most recent ones per aggregate are likely to matter. But you can’t tell a database to only store the latest 100 keys per aggregate, unfortunately. These things are just shouting for a custom event store implementation…
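
As a thought experiment, here is a toy sketch of that “only the most recent keys matter” idea: track the last sequence number per aggregate in memory and reject conflicting appends there, instead of via a database unique index. All names are hypothetical, and it assumes a single JVM does all the writes:

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

// Toy sketch: detect sequence-number conflicts in memory instead of via a
// database unique index. Assumes a single writing JVM and that the cache
// is warmed with the last sequence number of each live aggregate.
public class SequenceGuard {

    private final ConcurrentMap<String, Long> lastSequence = new ConcurrentHashMap<>();

    /** Returns true if sequenceNumber is the expected next one for this aggregate. */
    public boolean tryAppend(String aggregateId, long sequenceNumber) {
        while (true) {
            Long previous = lastSequence.putIfAbsent(aggregateId, sequenceNumber);
            if (previous == null) {
                return true; // first event seen for this aggregate
            }
            if (sequenceNumber != previous + 1) {
                return false; // conflict: number already used, or a gap
            }
            if (lastSequence.replace(aggregateId, previous, sequenceNumber)) {
                return true;
            }
            // lost a race with another thread; re-read and try again
        }
    }
}
```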

Cheers,

Allard

Interesting thinking, and we are actually already using time-based UUIDs, which might mean the problem is not as big as it could have been. It would be interesting to do some tests though.

We are on a single-machine setup and it will probably suffice for our needs, except for a possible hot spare. So that’s great to hear.

And yes, using a relational database for an event store is somewhat crazy, apart from the fact that MySQL has been pretty well battle-tested when it comes to not screwing up the data you put into it. With the recent discussions about whether MongoDB can achieve that, you get a little reserved about trying out new and fancy storage engines, like Greg Young’s Event Store for example.

Let me share my results from doing the actual tests.

I created 3 tables, each with a different configuration, loaded a subset of 10M events from our production event store into each, and tested insert and replay speed. Inserts were performed sequentially with an SQL procedure, and the payload data was just md5(rand()). (A rough sketch of the table variants follows the list below.)

  • With the original primary key (aggId, seqNr, type) and replay index (timeStamp, seqNr), we got 1000 inserts done in 10 seconds.
  • With an auto_increment primary key and a non-unique agg-lookup index (aggId, seqNr, type), the insert speed was around 50% worse. Replay speed was marginally faster.
  • With a MyISAM table instead of InnoDB (created by accident via a CREATE TABLE … AS SELECT), we got 10000 inserts/sec and replays that were 3-4x faster.
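
For concreteness, here is a rough JDBC sketch of the three table variants described above. The column set is trimmed down and the names are guessed from this thread, not the exact production DDL:

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.Statement;

public class EventTableVariants {

    public static void main(String[] args) throws Exception {
        try (Connection c = DriverManager.getConnection(
                "jdbc:mysql://localhost/eventstore", "user", "password");
             Statement s = c.createStatement()) {

            // 1) Original layout: composite primary key + replay index.
            s.execute("CREATE TABLE events_original ("
                    + " aggId CHAR(36) NOT NULL,"
                    + " seqNr BIGINT NOT NULL,"
                    + " type VARCHAR(255) NOT NULL,"
                    + " timeStamp VARCHAR(24) NOT NULL,"
                    + " payload BLOB,"
                    + " PRIMARY KEY (aggId, seqNr, type),"
                    + " INDEX replay_idx (timeStamp, seqNr)"
                    + ") ENGINE=InnoDB");

            // 2) Auto-increment surrogate key + non-unique agg-lookup index.
            s.execute("CREATE TABLE events_autoinc ("
                    + " id BIGINT NOT NULL AUTO_INCREMENT,"
                    + " aggId CHAR(36) NOT NULL,"
                    + " seqNr BIGINT NOT NULL,"
                    + " type VARCHAR(255) NOT NULL,"
                    + " timeStamp VARCHAR(24) NOT NULL,"
                    + " payload BLOB,"
                    + " PRIMARY KEY (id),"
                    + " INDEX agg_lookup_idx (aggId, seqNr, type),"
                    + " INDEX replay_idx (timeStamp, seqNr)"
                    + ") ENGINE=InnoDB");

            // 3) Same layout as (1) but MyISAM: no transactions, no row locks.
            s.execute("CREATE TABLE events_myisam ("
                    + " aggId CHAR(36) NOT NULL,"
                    + " seqNr BIGINT NOT NULL,"
                    + " type VARCHAR(255) NOT NULL,"
                    + " timeStamp VARCHAR(24) NOT NULL,"
                    + " payload BLOB,"
                    + " PRIMARY KEY (aggId, seqNr, type),"
                    + " INDEX replay_idx (timeStamp, seqNr)"
                    + ") ENGINE=MyISAM");
        }
    }
}
```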

So, if we skip transactions and all the other safety measures of InnoDB, we get amazing performance. From just replacing the primary key with an auto-increment and dropping the unique index on (aggId, seqNr, type): nothing.

Cheers

Hi Sebastian,

that looks pretty good. The auto_increment primary key results amaze me; I expected it to be faster (as long as the primary key is not assigned as such in the JPA classes). Do you still have the indexes? Some have suggested that an auto-increment primary key makes for smaller indexes (each secondary index entry contains the primary key it refers to, and a BigInteger is much smaller than two strings plus a BigInteger).

MyISAM could be relatively safe to use when storing an entire commit per entry, instead of a single event. It’s an idea I’ve been playing with and have implemented in MongoDB. It might be worth investigating for the JPA Event Store as well.
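
One way to picture that “commit per entry” idea, as a hypothetical sketch (not the actual MongoDB implementation): one row per unit of work, holding all events produced in that commit:

```java
import java.util.List;

// Hypothetical shape of a "commit entry": one row per unit of work,
// holding every event produced in that commit, instead of one row per event.
public class CommitEntry {

    private final String aggregateIdentifier;
    private final long firstSequenceNumber;
    private final long lastSequenceNumber;
    private final List<byte[]> serializedEvents; // one blob per event in the commit

    public CommitEntry(String aggregateIdentifier,
                       long firstSequenceNumber,
                       long lastSequenceNumber,
                       List<byte[]> serializedEvents) {
        this.aggregateIdentifier = aggregateIdentifier;
        this.firstSequenceNumber = firstSequenceNumber;
        this.lastSequenceNumber = lastSequenceNumber;
        this.serializedEvents = serializedEvents;
    }
    // A torn write can then only lose a whole commit, never half of one,
    // which is what could make a non-transactional engine like MyISAM tolerable.
}
```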

Thanks for sharing the results!
Cheers,

Allard

The results amaze me as well. They are completely contrary to what the blog post reported.

I’m not sure if I made it clear, but the auto_increment primary key was added as an additional column on top of the natural key spanning the UUID aggId, seqNr, and type. Using a BigInteger (auto-incremented or not) instead of a UUID for aggId should definitely make the keys, the indexes, and the whole table smaller, but I can’t see that it would have any dramatic effect on performance.

About the index sizes in my test: the summed index size for the original table layout was 956MB, and for the table with a bigint primary key plus the aggLookup and replay indexes on top of that it was 1272MB, i.e. about 33% larger.

Sebastian