This is the mail archive of the gdb-patches@sourceware.org mailing list for the GDB project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [RFA] Fix tid-reuse sometimes blocks for a very long (infinite?) time.

From: Simon Marchi <simon dot marchi at polymtl dot ca>
To: Philippe Waroquiers <philippe dot waroquiers at skynet dot be>
Cc: gdb-patches at sourceware dot org
Date: Mon, 03 Dec 2018 21:56:35 -0500
Subject: Re: [RFA] Fix tid-reuse sometimes blocks for a very long (infinite?) time.
References: <20181104200048.2463-1-philippe.waroquiers@skynet.be>

On 2018-11-04 15:00, Philippe Waroquiers wrote:

A failure that seems to cause a long/infinite time is the following:

For a not clear reason, tid-reuse.c spawner thread sometimes gets anerror:

     tid-reuse:
/bd/home/philippe/gdb/git/build_moreaa/gdb/testsuite/../../../moreaa/gdb/testsuite/gdb.threads/tid-reuse.c:58:
spawner_thread_func: Assertion `rc == 0' failed.

which causes a SIGABRT to be trapped by gdb, and tid-reuse does notreach the

after_count breakpoint:
  Thread 2 "tid-reuse" received signal SIGABRT, Aborted.
  [Switching to Thread 0x7ffff7518700 (LWP 10368)]
  __GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:51
  51	../sysdeps/unix/sysv/linux/raise.c: No such file or directory.

(gdb) FAIL: gdb.threads/tid-reuse.exp: continue to breakpoint:after_count

After that, tid-reuse.exp gets the value of reuse_time, but this onekept its

initial value of -1 (as unsigned) :
  print reuse_time
  $1 = 4294967295
  (gdb) PASS: gdb.threads/tid-reuse.exp: get reuse_time

tid-reuse then dies, and the .exp script continues (with some FAIL)
till it executes:
  set timeout [expr $reuse_time * 2]

leading to the error:

  (gdb) ERROR: integer value too large to represent as non-long integer
      while executing
  "expect {
  -i exp8 -timeout 8589934590
          -re ".*A problem internal to GDB has been detected" {
              fail "$message (GDB internal error)"
              gdb_intern..."
      ("uplevel" body line 1)
      invoked from within
  "uplevel $body" ARITH IOVERFLOW {integer value too large to
represent as non-long integer} integer value too large to represent as
non-long integer
  ERROR: GDB process no longer exists

and then everything blocks.

This last 'GDB process no longer exists' is strange, as I still see thegdb

when this all blocks, e.g.
philippe 16058 31085  0 20:30 pts/15   00:00:00
 /bin/bash -c rootme=`pwd`; export rootme;
srcdir=../../../binutils-gdb/gdb/testsuite ; export srcdir ;
EXPECT=`if [
philippe 16386 16058  0 20:30 pts/15   00:00:00
   expect -- /usr/share/dejagnu/runtest.exp --status GDB_PARALLEL=yes
--outdir=outputs/gdb.threads/tid-reuse gdb.thre
philippe 24848 16386  0 20:30 pts/20   00:00:00

/bd/home/philippe/gdb/git/build_binutils-gdb/gdb/testsuite/../../gdb/gdb
-nw -nx -data-directory /bd/home/philip

This patch gives a default value of 60, so that if ever something wronghappensin tid-reuse, then the value retrieved by the .exp script stays in areasonable

range.

Note that I could not reproduce this failure often enough to be surethatinitializing to 60 ensures it does not block, but in any case, itshould

not harm.

I replaced the pthread_create call with the value 1 to simulate afailure, and the test succeeds to fail quickly with your patch applied.Without your patch, I get the infinite hang that you describe.


The patch LGTM, thanks!

Simon

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]