Security and Serialization in Axon

Hi Allard and all.

We are close to starting a fairly big project and we have (almost) decided to use Axon in a “progressive” way. That means we are planning to use it to model a Human Task processor, integrate it with our existing structure, and later gradually expand the use of Axon to implement a bunch of micro-services.

For now we have identified 3 areas of concern:

Security: our infrastructure is heavily based on Spring, including Spring Security. As far as I know this won’t play nicely once we start to use distributed services, since Spring Security authentication is based on thread-bound variables. What we are trying to do is create a UAA service for all the distributed services to use, which will probably be based on the Cloud Foundry UAA project and the JWT standard. But I have no idea where to start this integration. Is there any “how to” or set of best practices for security in Axon?

Events and Sagas storage: the storage of events and sagas is completely separate and uses different structures. Is there a way to “unify” this storage?

And related to this, Events and Sagas serialization: at the moment Sagas are serialized using binary serialization. Is there a way to do all the serialization in XML or JSON?

And of course, if someone has implemented Human Tasks in Axon I would love to hear about it…

Thanks all.

Hmmm, it seems I forgot a very important thing: to read the documentation… I just saw there is a JdbcSagaRepository that serves our purpose of “unification” and serialization, so the only remaining question is about security.

Thanks, and sorry about this…

Wadda ya know, it seems Sagas can only be stored as byte[] after all… I had to extend PostgresSagaSqlSchema and JdbcSagaRepository.storeSaga to get rid of the SagaEntry class, which serializes as byte[] and is not extensible…

Shouldn’t be a big deal to make it extensible, though.

Still looking at security now…

Cheers.

Well, after successfully getting both Events and Sagas serialized with XStreamSerializer and JacksonSerializer, the powers that be decided that in fact they have to be serialized to XML using JAXB, which is causing me too much pain…

Any alternative here besides writing my own?

Cheers.

Oh, I forgot to talk about security…

Hi,

For the security and authorization related concerns I would implement a custom CommandDispatchInterceptor which would collaborate with the UAA service to verify the JWT signature, and then decide whether the command should be dispatched to the command handler or denied. Hope that helps.
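If it helps to make that concrete, here is a rough, framework-free sketch of the idea. The Command type, the metadata map and the TokenVerifier are stand-ins for Axon’s real command message API and for your UAA client, not actual Axon classes:

```java
import java.util.Map;

// Stand-in for a command message; Axon's real CommandMessage carries a
// payload plus metadata in a similar way.
class Command {
    final Object payload;
    final Map<String, Object> metaData;
    Command(Object payload, Map<String, Object> metaData) {
        this.payload = payload;
        this.metaData = metaData;
    }
}

// Hypothetical collaborator that asks the UAA service whether the JWT
// signature checks out.
interface TokenVerifier {
    boolean isValid(String jwt);
}

// The dispatch-time gate: verify the token before the command ever
// reaches a handler, otherwise refuse to dispatch it.
class SecurityInterceptor {
    private final TokenVerifier verifier;
    SecurityInterceptor(TokenVerifier verifier) { this.verifier = verifier; }

    Command handle(Command command) {
        Object jwt = command.metaData.get("jwt");
        if (!(jwt instanceof String) || !verifier.isValid((String) jwt)) {
            throw new SecurityException("Command rejected: invalid or missing JWT");
        }
        return command; // dispatch proceeds unchanged
    }
}
```

In a real setup the equivalent interceptor would be registered on the command bus so every outgoing command passes through it.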

Yes, that would definitely help. I still have to finish my UAA side, but it looks like a good idea indeed.

Cheers.

Hi Antonio,

personally, I don’t think serializing using JAXB is a sensible choice to make. I’d like the “powers that be” to focus on the outcome of serialization formats, instead of the technologies used to generate them. Creating a serializer shouldn’t be too hard; there are a number of examples in Axon already. Let’s hope JAXB can do the job. Otherwise, why don’t you ask the powers that be to develop it? They made the decision, after all…

That, together with Nedo’s answer about security (which I completely agree with), should address your concerns, right?

Cheers,

Allard

Hi Allard.

Well, the “powers that be” also sign my check so it’s better to leave it that way… :slight_smile: No, really, the issue is that our entire model is schema based and we use JAXB to create the model classes from it. And we plan to use that also for our command and event classes, so it makes sense to use JAXB only. I did some work implementing a JAXB serializer; it’s far from complete but I’m happy with the proof of concept.

Regarding security, I think Nedo’s solution does the trick, yes. However I couldn’t test it yet because I’m having a heck of a time just getting authentication to work using the Cloud Foundry UAA service, let alone the authorization part… I’ve been in Spring configuration hell for 2 days now… Do you have anything (examples, pointers) on using an SSO solution with Axon?

What I’m concerned about now is using Event Sourcing to implement Human Tasks and the corresponding control (like a workflow, but less rigid). This was my idea so it’s my *** on the line, and I’m starting to be uncomfortable with it because it’s new territory for me. We don’t need an implementation of the WS-HT specification; it’s more like a glorified to-do list. Do you have any pointers to something like that? I think I saw an Axon example of a to-do list, but written for old versions of Axon…

I’ll try to get rid of this authentication problem today so I can start “real” tests with Axon.

Thanks for your support.

Hi,

I’m afraid I don’t have any examples with a full-fledged UAA.
You should use dispatch interceptors to trigger authentication. Attach the user id and perhaps a bit of profile information (e.g. roles) to the outgoing command’s meta data.
For the authorization part, you can use dispatch interceptors if your command classes have authorization configuration annotated on them. If the command handlers need a full profile, you can use command handler interceptors to fetch the full user profile based on the id in the meta data and attach it to the command. You can also attach the profile to a ThreadLocal and use a ParameterResolverFactory to allow the profile to be injected into an @CommandHandler method parameter.
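As a very rough illustration of that dispatch/handler split, with plain maps standing in for Axon’s MetaData and an in-memory map standing in for the UAA profile lookup:

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Sketch of the two interceptor roles: the dispatch side attaches the
// bare identity to the command's metadata, the handler side expands it
// into a full profile only when needed.
class IdentityMetaData {

    // Dispatch interceptor role: merge user id and roles into the
    // outgoing command's metadata.
    static Map<String, Object> attachIdentity(Map<String, Object> metaData,
                                              String userId, List<String> roles) {
        Map<String, Object> enriched = new HashMap<>(metaData);
        enriched.put("userId", userId);
        enriched.put("roles", roles);
        return enriched;
    }

    // Handler interceptor role: resolve the full profile from the id
    // carried in the metadata (the map is a stand-in for your UAA call).
    static Map<String, Object> resolveProfile(Map<String, Object> metaData,
                                              Map<String, Map<String, Object>> profileStore) {
        String userId = (String) metaData.get("userId");
        return profileStore.getOrDefault(userId, Map.of());
    }
}
```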

Cheers,

Allard

Hi again.

I finally have the UAA working with my Axon prototype, so I’ll begin the tests today. I also modeled a simple Task that is basically an id, a name and an owner, plus a half-dozen commands and events to handle the task lifecycle (create, assign, start, complete). So I have a Task Aggregate for this.

But now (and this is the question of the day) I have concluded that one task is in fact a composition of several tasks. For instance, an Audit task can be comprised of a HandlePaper task plus a PhoneCompany task. The HandlePaper task can be comprised of MakePhotocopies and ArchivePapers. And actually, since these are Human Tasks, a human can create “ad hoc” tasks as “children” of any task. All of these tasks have the same lifecycle, with the rule that the Audit task can only be completed when all its child tasks are completed, and so on.

So now my problem is: how can I model this as an aggregate? My design until now is only for an “independent” task.

Any suggestion?

Thanks.

PS. Did you see my comment above about the Sagas?

Sagas can only be stored as byte[] after all… I had to extend PostgresSagaSqlSchema and JdbcSagaRepository.storeSaga to get rid of the SagaEntry class, which serializes as byte[] and is not extensible

Don’t you think it would be good to allow Sagas to be serialized in other formats, like the Events (JSON, XML, …)? If you want I can send you what I did, but it’s extended only for my particular case…

If you have aggregates whose behavior depends on things happening in other aggregates, two things can be going on. First, it’s possible that your aggregate boundary is wrong. Maybe subtasks should be part of the parent task’s aggregate, so that it can manage the workflow properly. Note that this decision is completely independent of the query model structure, where each task is likely to be an ‘independent’ entity.
Secondly, if they are separate aggregates, you’ll need a Saga to orchestrate behavior between the related tasks.

My gut feeling here is that you should investigate option 1.

Cheers,

Allard

About the Sagas: you can store them any way you like. However, the database table used by the generic JPA SagaRepository (and the JDBC one) uses a byte[] column type to store the data. That still allows you to store JSON, XML, etc. in there.
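To make that point concrete: the byte[] column is format-agnostic, so whatever textual form the serializer produced (XML, JSON, …) survives the round-trip unchanged. A tiny sketch:

```java
import java.nio.charset.StandardCharsets;

// The saga table's byte[] column is just a container: the serialized
// form goes in as bytes and comes back out as the same text.
class SerializedSaga {
    static byte[] store(String serializedForm) {
        return serializedForm.getBytes(StandardCharsets.UTF_8);
    }
    static String load(byte[] column) {
        return new String(column, StandardCharsets.UTF_8);
    }
}
```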

Cheers,

Allard

Hmmm, I was thinking of using something like you describe in section 4.2.3, “Complex Aggregate structures”, of the Reference Guide, but since you didn’t mention it I assume my understanding of Non-Aggregate Root Entities is wrong… Are there any “real” examples of those “complex aggregates”?

Regarding your point 1, what I was thinking of was a kind of “nested” aggregates, with parent/child relations. The idea was to maintain the lifecycle of each child task within its own aggregate, but now that I’m saying it out loud, maybe it’s just bad design. Also, my boss says that a “task” per se doesn’t have a “business meaning”, so in DDD terms it should not be an Aggregate. So maybe I should put all this “lifecycle” logic inside the Task class itself and create the Aggregates around one or more tasks that make business sense, like Audit, BookFlight and things like that.

That would also avoid the use of Sagas to coordinate behaviour, but in fact some of those “business tasks” may be long lived, and that would indicate the use of a Saga, correct?

Regarding the Saga store, our plan to store everything (events, sagas, metadata) in XML is meant to make use of the XML capabilities of Postgres, for instance updating columns with the result of an XSLT transformation applied to their values.

Nevertheless, my main concern for now is to test the security part.

Thanks again for your support.

Well, what I was talking about with the Complex Aggregates seems to work like this:

```
public class RootTaskAggregate extends AbstractAnnotatedAggregateRoot {

    @EventSourcedMember
    private List<ChildTaskEntity> children = new ArrayList<>();
}
```

and

```
public class ChildTaskEntity extends AbstractAnnotatedEntity {
    // child entity state and event handler methods
}
```

but I have the feeling that I turned the complicometer on… I guess it’s going to be difficult to decide which children an event should be applied to.

What do you think of this?

Cheers.

This is the intended way. When working with entities in collections, make sure the event handler methods of these entities check whether the event ‘belongs’ to that specific entity.
Nothing complicated…
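A minimal, framework-free sketch of that ownership check (the types are simplified stand-ins, not Axon’s entity classes):

```java
import java.util.ArrayList;
import java.util.List;

// Illustrates "check whether the event belongs to this entity": every
// child sees every event applied to the aggregate, and only the child
// whose id matches reacts to it.
class TaskCompletedEvent {
    final String taskId;
    TaskCompletedEvent(String taskId) { this.taskId = taskId; }
}

class ChildTask {
    final String id;
    boolean completed = false;
    ChildTask(String id) { this.id = id; }

    void handle(TaskCompletedEvent event) {
        if (!id.equals(event.taskId)) {
            return; // the event targets a sibling; ignore it
        }
        completed = true;
    }
}

class ParentTask {
    final List<ChildTask> children = new ArrayList<>();

    void apply(TaskCompletedEvent event) {
        for (ChildTask child : children) {
            child.handle(event); // each member gets the event, as Axon does
        }
    }
}
```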

Cheers,

Allard

Yes, but actually after my PoC I decided there was too much complexity in one place. I reimplemented the Aggregate with only one root Task that includes other tasks, which include… It’s running very close to what I envisaged in the first place. And yet…

My boss is not very convinced by the model (a “task” is hardly a domain model) and he argues that a “task” should not have any knowledge of the business process; it should be an agnostic service to be used by other services (services in the sense of independent micro-services). That means a Task aggregate shouldn’t know anything about the more business-oriented tasks like AuditDocument, BookFlight, ByTheBook, and thus can’t control these processes. He suggests, with reason, controlling those high-level processes with Sagas, which would be responsible for sending commands to the Task Aggregate and coordinating all the necessary tasks.

That seems OK to me. However, I was under the impression that the “normal” flow of ES should be Command->Aggregate->Event->Saga, and with this approach we’ll in fact have Event->Saga->Command->Aggregate. Other than a question of “style”, I can’t see anything wrong with this from a technical point of view. But it’s a question of “it works in practice, but will it work in theory?”

Do you see any drawbacks to using this Saga-first approach?

Cheers.

Hi Antonio,

using Sagas to control these processes will work fine. Modelling is a complex process. I can only assume that you have more in-depth knowledge of all the intricacies of your domain. It’s the little things that can make all the difference.

My advice would be to create the model and see if you can do so without resorting to “strange” commands and events. If those start to appear, it’s an indication your model is wrong.
In the end, theory will have to accept the fact that things have worked out in practice :wink:

Cheers,

Allard

Hi again.

Finally it’s official: we are indeed going to use Axon in our project :slight_smile: and my PoC is now being “migrated” to a full production-level service. For that I did some work on:

Serialization:
We have to support Postgres with XML data types. That means serializing Metadata, Events and Sagas to XML (or a string representation thereof) and storing them as XML. For that I had to implement an Axon Serializer and extend GenericEventSqlSchema, PostgresSagaSqlSchema and JdbcSagaRepository. Not a big problem here, but the code could be cleaner with some small changes in Axon itself; if you want to discuss that I’ll be happy to do it.

Security:
I’m going to implement a CommandDispatchInterceptor for identity and a CommandHandlerInterceptor for access control. For the first I don’t see any problems; for the second, however, I did get into trouble. The intention was to access the Aggregate itself from the CommandHandlerInterceptor, to check some constraints on the aggregate entity. I do have access to the unitOfWork there and I was hoping to reach the aggregate from it, but I couldn’t find a way. Did I forget something?

A few more doubts I’m having. I tend to have a command raise only one event. However, in some cases it may make sense to raise more than one event, or to raise different events depending on some conditions. Is this OK, or is it considered bad practice?

Sometimes it makes sense to me to do the “work” in the command handler and pass the results to the event, while other times I tend to do that “work” in the event handlers themselves. But I think this second option can be dangerous because of replays. Should the “work” always be done in the command handler? An example:

1) “work” done in the event handler

```
@CommandHandler
public TaskAggregate(CreateTaskCommand command) throws Exception {
    apply(new TaskCreatedEvent(command.getId()));
}

@EventSourcingHandler
public void on(TaskCreatedEvent event) throws Exception {
    this.id = event.getId();
    Task task = new Task();
    task.creationDate = DateTime.now();
    task.setState(TaskState.CREATED);
}
```

2) “work” done in the command handler

```
@CommandHandler
public TaskAggregate(CreateTaskCommand command) throws Exception {
    Task task = new Task();
    task.creationDate = DateTime.now();
    apply(new TaskCreatedEvent(task));
}

@EventSourcingHandler
public void on(TaskCreatedEvent event) throws Exception {
    this.id = event.getId();
    Task task = event.getTask();
    task.setState(TaskState.CREATED);
}
```

3) hybrid “work”

```
@CommandHandler
public TaskAggregate(CreateTaskCommand command) throws Exception {
    apply(new TaskCreatedEvent(command.getId(), DateTime.now()));
}

@EventSourcingHandler
public void on(TaskCreatedEvent event) throws Exception {
    this.id = event.getId();
    Task task = new Task();
    task.creationDate = event.getDate();
    task.setState(TaskState.CREATED);
}
```

This is a simple example, but taking into account that the creation of an entity will take more than just creating an object (calling injected services, accessing repositories, etc.), what is the “correct” way of doing it? Assuming there is such a thing…

Thanks again for your precious support.

Cheers.

Hi Antonio,

events should always provide enough information about their context, so that a handler can find all relevant information in the event itself. Doing DateTime.now() upon receiving a TaskCreatedEvent is not likely to produce the result you expect: when loading an aggregate and streaming events onto it, DateTime.now() is no longer the creation time of the Task. Therefore, include it in the event.
The CREATED task state is a logical result of a Created event, so you could omit it for that reason. On the other hand, explicitly including it can be a good thing, too.

Be careful not to expose entities (such as Task) in your event. Entities are stateful and are very bad candidates to include in events (as they effectively make the event stateful as well).
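To illustrate (using java.time.Instant in place of Joda’s DateTime): the event records the clock reading taken at command-handling time and carries only immutable values, so replaying the handler later still reproduces the original creation time.

```java
import java.time.Instant;

// The event carries plain, immutable values: an id and the creation
// time captured when the command was handled, never a stateful entity.
final class TaskCreatedEvent {
    final String taskId;
    final Instant createdAt;
    TaskCreatedEvent(String taskId, Instant createdAt) {
        this.taskId = taskId;
        this.createdAt = createdAt;
    }
}

class TaskAggregate {
    String id;
    Instant creationDate;

    // Replaying this handler months later still yields the original
    // creation time, because the clock was read at command time.
    void on(TaskCreatedEvent event) {
        this.id = event.taskId;
        this.creationDate = event.createdAt;
    }
}
```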

Cheers,

Allard

So, as a general rule of thumb, would you say that the “work” should be done in the command handler and the strictly relevant information put in the event, so that the event handler only uses it to set the aggregate’s internal state?

I don’t quite understand what you mean by “Be careful not to expose entities (such as Task) in your event”. In my example, do you mean the Task should be created in the event handler, as in 3), instead of in the command handler, as in 2)? Note that a Task isn’t the “entity” per se, it’s the “main” entity, because there can be several child tasks on a task, which can also have children, and so on (I kept them in a flat structure, though). The Aggregate, however, is of type Task. I forgot to add the lines below in the 3 examples:


```
task.setState(TaskState.CREATED);
tasks.put(task.getId(), task);
```

I do agree that in this case the CREATED state is not needed, but I kept it there to be more explicit.

And yes, I knew 1) was wrong because of the date; it was the first error I fixed :slight_smile: