Re: segfaults building documentation when machine under load

octave-maintainers

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: segfaults building documentation when machine under load

From:	Daniel J Sebald
Subject:	Re: segfaults building documentation when machine under load
Date:	Tue, 19 May 2020 22:52:22 -0400
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.9.0

On 5/19/20 9:54 PM, John W. Eaton wrote:

On 5/19/20 4:11 PM, Dmitri A. Sergatskov wrote:
On Tue, May 19, 2020 at 4:02 PM John W. Eaton <address@hidden<mailto:address@hidden>> wrote:
    On 5/19/20 3:26 PM, Dmitri A. Sergatskov wrote:

     >     Should we switch to bug-tracker?
     >     I was able to get a crash when I bumped the jobs to 200.
     >     bt is attached. The relevant part seems to be:

    If I use a large number of jobs, I see

        error: imwrite: invalid empty image
        error: called from
            __imwrite__ at line 40 column 5
            imwrite at line 125 column 5
            print at line 755 column 13
            interpimages at line 72 column 5

    but no segfaults.

    It does look like a threading issue.


I used a simplified test by Andreas:
parallel -N0 -q octave --norc --silent --no-history --eval 'figure(1,"visible", "off");' ::: {1..200}
OK, I'm able to duplicate the problem using this method.
In the stack traces I've seen, Octave is crashing inside the interpreterobject destructor while attempting to close any remaining figures.
Changing the eval above to be

   figure (1, "visible", "off"); close ("all"); pause (1);
eliminates the crash for me, apparently because then there are no figurewindows to close when exiting. But attempting the same thing in thedoc/interpreter scripts that generate plots I see the "invalid emptyimage" error on every attempt to create a figure, at least when using alarge number of parallel Make jobs.
So clearly this kind of change is not a solution, but it may point ustoward one. Ultimately, we need to determine the correct sequence forshutting down the GUI and interpreter, including what actions can happenor need to be blocked, and what signals need to be sent or locksacquired so that there are no more races between the threads.
It's a bit tricky because figures can have callbacks set to run when thefigure is closed. Do we expect those to run when Octave is in theprocess of exiting or is it OK to skip them? Those functions couldregister code to run when Octave exits. Should that be possible? Wouldit be OK for an atexit function to display a graph? What is reasonableto expect or attempt to do?

Are a "close" command and upper-right close button the same path,essentially? So it is


1) "close"
2) call close callback
3) destroy figure object

? What is the route when Octave exits? Also, is the graphics enginestill valid in the exit scenario? At the end of


void
gh_manager::execute_callback (const graphics_handle& h,
                              const octave_value& cb_arg,
                              const octave_value& data)
{

is the following:

      // Redraw after interacting with a user-interface (ui*) object.
      if (Vdrawnow_requested)
        {
          if (go)
            {
              std::string go_name
                = go.get_properties ().graphics_object_name ();

              if (go_name.length () > 1
                  && go_name[0] == 'u' && go_name[1] == 'i')
                {
                  Fdrawnow (m_interpreter);
                  Vdrawnow_requested = false;
                }
            }
        }

Perhaps Fdrawnow() is where the failure is happening because thegraphics was shut down prior. To debug, place a


printf("About to redraw...")
                  Fdrawnow (m_interpreter);
printf("...finished redraw")

and retry the doc build.

Dan

[Prev in Thread]

Current Thread

[Next in Thread]

Re: segfaults building documentation when machine under load, (continued)

Prev by Date: Re: https://www.gnu.org/software/octave/commercial-support.html
Next by Date: Re: https://www.gnu.org/software/octave/commercial-support.html
Previous by thread: Re: segfaults building documentation when machine under load
Next by thread: Re: segfaults building documentation when machine under load
Index(es):
- Date
- Thread