Few questions out of curiosity ...
Did you benchmark RTPengine with the kernel module or without ?
I have the impression that this solution is using libnetfilter, queue, etc ?
Doe it mean that everything is self contain or the code to manage libnetfiler is somewhere else ?