This is the mail archive of the libc-help@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

threads hung in __nptl_setxid and __lll_mutex_lock_wait


Hi All,
I have a suse 9 box, where once in a while I see a grid engine process
in hung state with the following stack traces:

(gdb) info thread
  6 Thread 1082132832 (LWP 22931)  0x0000002a958d1e22 in
pthread_cond_timedwait@@GLIBC_2.3.2 () from /lib64/tls/libpthread.so.0
  5 Thread 1090525536 (LWP 22932)  0x0000002a958d1e22 in
pthread_cond_timedwait@@GLIBC_2.3.2 () from /lib64/tls/libpthread.so.0
  4 Thread 1098918240 (LWP 22933)  0x0000002a95a8f152 in poll () from
/lib64/tls/libc.so.6
  3 Thread 1107310944 (LWP 22934)  0x0000002a958d1e22 in
pthread_cond_timedwait@@GLIBC_2.3.2 () from /lib64/tls/libpthread.so.0
  2 Thread 1124096352 (LWP 22936)  0x0000002a958d3fdb in
__lll_mutex_lock_wait () from /lib64/tls/libpthread.so.0
  1 Thread 182901033728 (LWP 22929)  0x0000002a958ceb88 in
__nptl_setxid () from /lib64/tls/libpthread.so.0

I found the threads 1 and 2 mysterious where the stack trace is:

thread 1

#0  0x0000002a958ceb88 in __nptl_setxid () from /lib64/tls/libpthread.so.0
#1  0x0000002a95a90cca in seteuid () from /lib64/tls/libc.so.6

And

Thread 2

#0  0x0000002a958d3fdb in __lll_mutex_lock_wait () from
/lib64/tls/libpthread.so.0

#1  0x000000000073f2a0 in first.1 ()
#2  0xffffffffffffffff in ?? ()
#3  0x0000002a958cfe8a in __make_stacks_executable () from
/lib64/tls/libpthread.so.0
#4  0x0000002a959eace8 in ?? () from /lib64/tls/libc.so.6

#5  0x0000002a957704e0 in ?? ()
#6  0x0000000000000000 in ?? ()
#7  0x00000000007657b8 in ?? ()
#8  0x0000000000000000 in ?? ()
#9  0x0000002a95d272a0 in ?? ()
#10 0x0000000000000000 in ?? ()
#11 0x0000000000000001 in ?? ()

#12 0x0000000000000009 in ?? ()
#13 0x0000002a9556119f in fixup () from /lib64/ld-linux-x86-64.so.2
#14 0x0000000000000000 in ?? ()
#15 0x0000000000000009 in ?? ()

#16 0x0000002a95a90d6a in setegid () from /lib64/tls/libc.so.6

The glibc version is 2.3.3-98.61, am I hitting
http://sourceware.org/bugzilla/show_bug.cgi?id=3270

Or do the stack traces point to something else?
I have tried the test program provided with PR 3270 and yet to hit the race.

thanks and regards,
Davinc


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]