[SR-Users] problem unreferencing dialog in dialog module

Anton Roman antonroman at gmail.com
Thu Mar 3 10:19:05 CET 2011


Hello,

thanks for your quick reply, my answer is inline.

2011/3/2 Daniel-Constantin Mierla <miconda at gmail.com>

>  Hello,
>
> looks like related to the callbacks for dialog module. Are you loading
> other modules that require dialog module?
>

we are using some features of dialog module such as ending dialogs after a
timeout period, and we are using engage_mediaproxy() function, as well. It's
an old configuration we had to put in production with no  time enough to
test. Do you recommend not to use dialog module if not strictly required?


>
> Checking the time staps from the acc and the crash log, the BYE for the
> dialog was before the crash but the To-tag is not printed from dlg_hash.c,
> although it is in the acc for INVITE and BYE. Do you have parallel forking
> in front of this SIP server? I mean, is there another proxy that can do
> parallel forking then send two or more branches to this instance?
>
> AFAIK the the client who is sending that calls is not doing parallel
forking, they are sending calls over a SIP trunk to our Kamailio. They are
calling to PSTN numbers and we are sending that calls to a gateway, so they
shouldn't do parallel forking, I'll get some traces to check it.


> I will dig in more to see what went wrong there.
>
> I got the attached debug level logs for the reported problem, hope it
helps. I'm still trying to find out why the core is not being generated.


> Thanks,
> Daniel
>
Thanks a lot,
Anton


>
>
> On 3/2/11 4:34 PM, Anton Roman wrote:
>
> Hi all,
>
> we are running Kamailio 3.1.2 in a production environment, using the dialog
> module, and it crashed two hours ago.
>
>
> Here you have the logs we got (addtional log fragments with the acc records
> involved in this call are appended at the end of the mail):
>
> Mar  2 14:43:05 kamailio2 /usr/local/sbin/kamailio[28927]: CRITICAL: dialog
> [dlg_hash.c:599]: bogus ref -1 with cnt 1 for dlg 0x7f23f472db30
> [2490:1070436595] with clid 'e0a20cb844d211e0acd8001422093865@<CLIENT IP>'
> and tags '1577886432-3759264324-335599788-1698171170' ''
> Mar  2 14:43:05 kamailio2 /usr/local/sbin/kamailio[28927]: : <core>
> [mem/q_malloc.c:446]: BUG: qm_free: freeing already freed pointer, first
> free: dialog: dlg_cb.c: destroy_dlg_callbacks_list(80) - aborting
> Mar  2 14:43:05 kamailio2 /usr/local/sbin/kamailio[28896]: ALERT: <core>
> [main.c:741]: child process 28927 exited by a signal 6
> Mar  2 14:43:05 kamailio2 /usr/local/sbin/kamailio[28896]: ALERT: <core>
> [main.c:744]: core was not generated
> Mar  2 14:43:05 kamailio2 /usr/local/sbin/kamailio[28896]: INFO: <core>
> [main.c:756]: INFO: terminating due to SIGCHLD
> Mar  2 14:43:05 kamailio2 /usr/local/sbin/kamailio[28948]: INFO: <core>
> [main.c:807]: INFO: signal 15 received
> Mar  2 14:43:05 kamailio2 /usr/local/sbin/kamailio[28942]: INFO: <core>
> [main.c:807]: INFO: signal 15 received
>
> We get the kamailio code from git last week:
>
> sercmd> core.info
> {
>     version: kamailio 3.1.2
>     id: 4ace86
>     compiler: gcc 4.3.2
>     compiled: 09:12:36 Feb 23 2011
>     flags: STATS: Off, USE_IPV6, USE_TCP, USE_TLS, TLS_HOOKS,
> USE_RAW_SOCKS, DISABLE_NAGLE, USE_MCAST, DNS_IP_HACK, SHM_MEM, SHM_MMAP,
> PKG_MALLOC, DBG_QM_MALLOC, USE_FUTEX, FAST_LOCK-ADAPTIVE_WAIT,
> USE_DNS_CACHE, USE_DNS_FAILOVER, USE_NAPTR, USE_DST_BLACKLIST,
> HAVE_RESOLV_RES
> }
>
> The problem looks like this other one already fixed:
> http://lists.sip-router.org/pipermail/sr-users/2009-November/027351.html
>
> We set the Kamailio to debug level in case it happens again.
>
> On the other side, I need to know why the core is not been generated. I
> have already checked the points mentioned in http://www.kamailio.org/dokuwiki/doku.php/troubleshooting:corefiles
>
> 1. disable_core_dump is not set in the config file.
>
> 2. From /etc/default/kamailio:
> ...
> DUMP_CORE=yes
> ...
>
> 2. From /etc/init.d/kamailio:
> ...
> if test "$DUMP_CORE" = "yes" ; then
>     # set proper ulimit
>     ulimit -c unlimited
>
>     # directory for the core dump files
>      COREDIR=/home/corefiles
>      [ -d $COREDIR ] || mkdir $COREDIR
>      chmod 777 $COREDIR
>      echo "$COREDIR/core.%e.sig%s.%p" > /proc/sys/kernel/core_pattern
> fi
> ...
>
> 4. Writting permissions of $COREDIR
>
> ls -hall /home
> ...
> drwxrwxrwx  2 root   root   4.0K 2010-12-21 09:15 corefiles
> ...
>
> What else should I check?
>
> Thanks in advance,
> regards
>
> Antón
>
>
> *Acc records related to the dialog whose destruction causes the problem:*
>
> Mar  2 14:42:44 kamailio2 /usr/local/sbin/kamailio[28902]: NOTICE: acc
> [acc.c:275]: ACC: transaction answered:
> timestamp=1299073364;method=INVITE;from_tag=1577886432-3759264324-335599788-1698171170;to_tag=5FFAEA34-6A;call_id=e0a20cb844d211e0acd8001422093865@<client
> IP>;code=200;reason=OK;src_user=<caller number>;src_domain=<client IP>;dst_ouser=<called
> number>;dst_user=<called number>;dst_domain=10.90.1.251;src_ip=<client IP>
>
> ...
>
> Mar  2 14:42:44 kamailio2 /usr/local/sbin/kamailio[28920]: NOTICE: acc
> [acc.c:275]: ACC: request acknowledged:
> timestamp=1299073364;method=ACK;from_tag=1577886432-3759264324-335599788-1698171170;to_tag=5FFAEA34-6A;call_id=e0a20cb844d211e0acd8001422093865@<client
> IP>;code=200;reason=OK;src_user=<caller number>;src_domain=<client
> IP>;dst_ouser=<called number>;dst_user=<called
> number>;dst_domain=10.90.1.251;src_ip=<client IP>
> ...
>
>
> Mar  2 14:43:00 kamailio2 /usr/local/sbin/kamailio[28903]: ERROR:
> <script>:  ACK WITHOUT MATCHING TRANSACTION in
> e0a20cb844d211e0acd8001422093865@<client IP> call... ignore and discard.
>
> ...
>
> Mar  2 14:43:00 kamailio2 /usr/local/sbin/kamailio[28904]: NOTICE: acc
> [acc.c:275]: ACC: transaction answered:
> timestamp=1299073380;method=BYE;from_tag=1577886432-3759264324-335599788-1698171170;to_tag=5FFAEA34-6A;call_id=e0a20cb844d211e0acd8001422093865@<client
> IP>;code=200;reason=OK;src_user=<caller number>;src_domain=<client
> IP>;dst_ouser=<called number>;dst_user=<called
> number>;dst_domain=10.90.1.251;src_ip=<client IP>
>
>
> _______________________________________________
> SIP Express Router (SER) and Kamailio (OpenSER) - sr-users mailing listsr-users at lists.sip-router.orghttp://lists.sip-router.org/cgi-bin/mailman/listinfo/sr-users
>
>
> --
> Daniel-Constantin Mierlahttp://www.asipto.com
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.sip-router.org/pipermail/sr-users/attachments/20110303/445d88de/attachment-0001.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: log_kamailio_error_unreferencing_dialog.log
Type: application/octet-stream
Size: 9466 bytes
Desc: not available
URL: <http://lists.sip-router.org/pipermail/sr-users/attachments/20110303/445d88de/attachment-0001.obj>


More information about the sr-users mailing list