[Ace-users] [ace-bugs] [ACE_Message_Queue] notify PIPE block causesSelect_Reactor deadlock.

Johnny Willemsen jwillemsen at remedy.nl
Tue Mar 11 09:28:38 CDT 2008


Hi,

 

Great work! Thanks for making the test. Can you put this in bugzilla
(http://deuce.doc.wustl.edu/bugzilla/index.cgi). Be aware of our bug fixing
policies as described in the docs/ACE-bug-process.html document

 

Regards,

 

 

Johnny Willemsen
Remedy IT
Postbus 101
2650 AC  Berkel en Rodenrijs
The Netherlands
www.theaceorb.nl / www.remedy.nl 

*** Integrated compile and test statistics see
http://scoreboard.theaceorb.nl ***
*** Commercial service and support for ACE/TAO/CIAO             ***
*** See http://www.theaceorb.nl/en/support.html                 *** 

 

 

From: Rudy Pot [mailto:rpot at aweta.nl] 
Sent: Tuesday, March 11, 2008 3:18 PM
To: Johnny Willemsen; ace-bugs at cs.wustl.edu
Subject: RE: [ace-bugs] [ACE_Message_Queue] notify PIPE block
causesSelect_Reactor deadlock.

 

 

Hi,

 

Below you can find an extended version of the
ACE_wrappers/tests/Message_Queue_Notifications_Test.cpp

 

The original source was taken from the head of
<https://svn.dre.vanderbilt.edu/viewvc/Middleware/> [Middleware] /
<https://svn.dre.vanderbilt.edu/viewvc/Middleware/trunk/> trunk /
<https://svn.dre.vanderbilt.edu/viewvc/Middleware/trunk/ACE/> ACE /
<https://svn.dre.vanderbilt.edu/viewvc/Middleware/trunk/ACE/tests/> tests /
Message_Queue_Notifications_Test.cpp

(Subversion web)

 

NEW TESTS

Two new tests are added which both can be (de)selected with defines:

1) TEST_CLOSEHOOK: This should test the usecase where notify() race
conditions can occur when an Task EH uses "delete this" in its close hook. 

However, the race condition is not reproducable for sure. 

I have had a segmentation fault only once and unfortunately at that time
coredump was not active on my test system.

I tried to increase the chance of race condition by adding an iteration loop
but that didn't help either...

Any ideas for getting the race condition addressed for sure might be useful.

 

2) TEST_FULLNOTIFY: This tests the deadlock-usecase (notified message
enqueue with full reactor notification pipe).

On our test scoreboard this test will timeout when the deadlock occurs
(maximum run time per test allowed)

The deadlock is always adressed whith the latest version of ACE.

 

NOTES:

If you want to use this, two things have to be looked at before checking in:

1) #includes section: I had to change some includes in order to get the test
through our compilers and the ACE versions used for the different platforms
(W32 BCB ACE-5.3.1, RH9 gcc ACE-5.3.1, FC6 gcc ACE-5.5.6)

2) "ACE_TMAIN (int, ACE_TCHAR *[])" probably to be replaced with "run_main
(int, ACE_TCHAR *[])" ?
(In subversion it was "run_main" but that doesn't work with the ACE versions
we use, which are probably already very old...).

By the way, some minor change has been made in the code so the
default_message = "ACE RULES" is used for all the test cases which enqueue
messages...

 

Message_Queue_T.cpp:

For now, I've patched this code to avoid deadlocks. In our applications we
don't use "delete this" on any EH so we won't get struck by the race
conditions.

A suggested fix for both of the Message_Queue_T.cpp notify problems (race /
deadlock) is in my second post in this bug-thread (Feb 27 2008 3:37pm). See
also FIX CODE EXAMPLE below in this post. However that is only pseudo code,
not fully worked out. It probably also needs modifications in the
Notification_Strategy and Reactor classes.

 

Regards,

 

Rudy Pot

Embedded Computer Systems

 

AWETA G&P b.v

Postbox 17

NL-2630 AA  Nootdorp

tel +31 (0)15 3109961 

fax +31 (0)15 3107321 

mail  <mailto:rpot at aweta.nl> rpot at aweta.nl

web www.aweta.com <blocked::http://www.aweta.com> 

 

//
============================================================================

 

 

// $Id: Message_Queue_Notifications_Test.cpp 10757 2008-03-10 11:41:40Z rpot
$

 

//
============================================================================
//
// = LIBRARY
//    tests
//
// = FILENAME
//    Message_Queue_Notification_Test.cpp
//
// = DESCRIPTION
//      There are two tests that test 2 different notification
//      mechanisms in Message Queue.
//
//      The first test illustrates the notification mechanisms in
//      Message_Queue and its integration with Reactor.
//
//      Note the following things about this part of the test:
//
//      1. Multiple threads are not required.
//      2. You do not have to explicitly notify the Reactor
//      3. This code will work the same with any Reactor Implementation
//      4. handle_input, handle_exception, handle_output are the only
//         callbacks supported by this mechanism
//      5. The notification mechanism need not notify the Reactor. You can
//         write your own strategy classes that can do whatever application
//         specific behavior you want.
//
//      The second test also makes sure the high/low water mark
//      signaling mechanism works flawlessly.
//
//      The third test handles the use case of notification strategy in 
//      combination with EH "delete this" termination.
//
//      There is also a fourth stress test added for addressing possible 
//      notification deadlocks. It look similar as the first test but needs 
//      threads. If deadlock occurs, this test has to be killed to
terminate.
//
// = AUTHOR
//    Irfan Pyarali <irfan at cs.wustl.edu> and Nanbor Wang
<nanbor at cs.wustl.edu>
//
//
============================================================================
#include "test_config.h"
#include "ace/Reactor.h"
#include "ace/Task.h"
#include "ace/Reactor_Notification_Strategy.h"
#include "ace/Atomic_Op.h"

 

#ifndef ACE_SYNCH_H
#include "ace/Synch.h"
#endif
//#include "ace/Barrier.h"
//#include "ace/Synch_Traits.h"
//#include "ace/Null_Condition.h"
//#include "ace/Null_Mutex.h"

 

// Are below includes needed?
//#include "ace/OS_NS_string.h"
//#include "ace/OS_NS_unistd.h"

 

ACE_RCSID(tests, Message_Queue_Notifications_Test, "$Id:
Message_Queue_Notifications_Test.cpp 10757 2008-03-10 11:41:40Z rpot $")

 

static int iterations = 10;

 

static const size_t worker_threads = 2;
static const char * default_message = "ACE RULES";
static const size_t default_high_water_mark = 20;
static const size_t default_low_water_mark = 10;
static const int watermark_iterations = 2 * default_high_water_mark;

 

#define TEST_CLOSEHOOK  1  // Test for Message_Queue_T notify race
conditions.
#define TEST_FULLNOTIFY 1  // Test for Message_Queue_T notify deadlock.

 

//--------------------------------------------------------------------------
---
class Message_Handler : public ACE_Task<ACE_NULL_SYNCH>
{
  // = TITLE
  //   This class implements a notification strategy for the Reactor.
public:
  // = Initialization and termination.
  Message_Handler (ACE_Reactor &reactor);
  // Constructor.

 

  // = Demuxing hooks.
  virtual int handle_input (ACE_HANDLE);
  virtual int handle_output (ACE_HANDLE fd = ACE_INVALID_HANDLE);
  virtual int handle_exception (ACE_HANDLE fd = ACE_INVALID_HANDLE);

 

private:
  int process_message (void);
  void make_message (void);

 

  ACE_Reactor_Notification_Strategy notification_strategy_;
};

 

//--------------------------------------------------------------------------
---
class Watermark_Test : public ACE_Task<ACE_SYNCH>
{
  // = TITLE
  //     This class test the correct functioning of build-in flow
  //     control machanism in ACE_Task.
public:
  Watermark_Test (void);

 

  virtual int svc (void);

 

  int consumer (void);
  int producer (void);
  int put_message (ACE_Time_Value* timeout = 0);
  int get_message (void);
  void print_producer_debug_message (void);

 

private:
  const size_t len_;
  const size_t hwm_;
  const size_t lwm_;
  ACE_Atomic_Op <ACE_SYNCH_MUTEX, int> role_;
#if defined (ACE_HAS_THREADS)
  ACE_Barrier mq_full_;
  ACE_Barrier mq_low_water_mark_hit_;
#endif /* ACE_HAS_THREADS */
};

 

//--------------------------------------------------------------------------
---
#if defined (ACE_HAS_THREADS)

 

#if defined (TEST_CLOSEHOOK)
class CloseHook_Handler : public ACE_Task<ACE_MT_SYNCH>
{
  // = TITLE
  //   This class tests the behaviour of the Task <close> hook called in 
  //   combination with notified MB_STOP message enqueing.
  //   It should normally cause this dynamically created task (EH) to 
  //   gracefully delete itself.
public:
  // = Initialization and termination.
  CloseHook_Handler (ACE_Reactor &reactor);

 

  // = Demuxing hooks.
  virtual int handle_input (ACE_HANDLE);

 

  // = Starts the threads needed for testing.
  virtual int open (void *args = 0); 

 

private:
  // = Ensure dynamic creation.
  virtual ~CloseHook_Handler ();

 

  // = Svc hook.
  virtual int svc (void);
  // = Hook called from <ACE_Thread_Exit> during thread exit.
  virtual int close (u_long flags = 0);

 

  ACE_Reactor_Notification_Strategy   notification_strategy_;
  ACE_Barrier                         startupBarrier_; 
  ACE_Message_Block::ACE_Message_Type messageType_;
};
#endif /* TEST_CLOSEHOOK */

 

//--------------------------------------------------------------------------
---
#if defined (TEST_FULLNOTIFY)
class FullNotify_Handler : public ACE_Task<ACE_MT_SYNCH>
{
  // = TITLE
  //   This class tests the behaviour of (notified) message enqueing at a
point 
  //   when the Reactor notification pipe is full. It should then just
block.
  //   After the (consumer) Reactor eventloop has handled some pending 
  //   notifications, the notifiers (producer threads) should just get
unblocked.
public:
  // = Initialization and termination.
  FullNotify_Handler (ACE_Reactor &reactor);
  virtual ~FullNotify_Handler ();

 

  // = Demuxing hooks.
  virtual int handle_input (ACE_HANDLE);

 

  // = Starts the threads needed for testing.
  virtual int open (void *args = 0); 

 

private:
  // = Svc hook.
  virtual int svc (void);

 

  void make_message (void);

 

  ACE_Reactor_Notification_Strategy notification_strategy_;
  static const int        iMAX_NR_THREADS = 2;              
  // Identifiers needed when we have more than one thread active.
  ACE_thread_t            thread_ids_[iMAX_NR_THREADS];  
};
#endif /* TEST_FULLNOTIFY */

 

#endif /* ACE_HAS_THREADS */

 

//--------------------------------------------------------------------------
---
Message_Handler::Message_Handler (ACE_Reactor &reactor)
  // First time handle_input will be called
  : notification_strategy_ (&reactor,
                            this,
                            ACE_Event_Handler::READ_MASK)
{
  this->msg_queue ()->notification_strategy (&this->notification_strategy_);
  this->make_message ();
}

 

int
Message_Handler::handle_input (ACE_HANDLE)
{
  ACE_DEBUG ((LM_DEBUG,
              ACE_TEXT ("Message_Handler::handle_input\n")));

 

  // Next time handle_output will be called.
  this->notification_strategy_.mask (ACE_Event_Handler::WRITE_MASK);

 

  return process_message ();
}

 

int
Message_Handler::handle_output (ACE_HANDLE fd)
{
  ACE_DEBUG ((LM_DEBUG,
              ACE_TEXT ("Message_Handler::handle_output\n")));
  ACE_UNUSED_ARG (fd);

 

  // Next time handle_exception will be called.
  this->notification_strategy_.mask (ACE_Event_Handler::EXCEPT_MASK);

 

  return process_message ();
}

 

int
Message_Handler::handle_exception (ACE_HANDLE fd)
{
  ACE_DEBUG ((LM_DEBUG,
              ACE_TEXT ("Message_Handler::handle_exception\n")));
  ACE_UNUSED_ARG (fd);

 

  // Next time handle_input will be called.
  this->notification_strategy_.mask (ACE_Event_Handler::READ_MASK);

 

  return this->process_message ();
}

 

int
Message_Handler::process_message (void)
{
  ACE_Message_Block *mb;

 

  if (this->getq (mb,
                  (ACE_Time_Value *) &ACE_Time_Value::zero) == -1)
    ACE_ERROR_RETURN ((LM_ERROR,
                       ACE_TEXT ("%p\n"),
                       ACE_TEXT ("dequeue_head")),
                      -1);
  else
    {
      ACE_DEBUG ((LM_DEBUG,
                  ACE_TEXT ("message received = %s\n"),
                  mb->rd_ptr ()));
      mb->release ();
    }

 

  this->make_message ();
  return 0;
}

 

void
Message_Handler::make_message (void)
{
  if (--iterations > 0)
    {
      ACE_Message_Block *mb;
      ACE_NEW (mb,
               ACE_Message_Block (default_message));

 

      ACE_DEBUG ((LM_DEBUG,
                  ACE_TEXT ("sending message\n")));
      this->putq (mb);
    }
}

 

//--------------------------------------------------------------------------
---
Watermark_Test::Watermark_Test (void)
  : len_ (ACE_OS::strlen (default_message) + 1),
    hwm_ (this->len_ * default_high_water_mark),
    lwm_ (this->len_ * default_low_water_mark),
    role_ (0)
#if defined (ACE_HAS_THREADS)
    , mq_full_ (worker_threads),
    mq_low_water_mark_hit_ (worker_threads)
#endif /* ACE_HAS_THREADS */
{
  this->water_marks (ACE_IO_Cntl_Msg::SET_LWM,
                     this->lwm_);
  this->water_marks (ACE_IO_Cntl_Msg::SET_HWM,
                     this->hwm_);
}

 

int
Watermark_Test::producer (void)
{
  int i = watermark_iterations;

 

  for (ssize_t hwm = this->hwm_;
       hwm >= 0 ;
       hwm -= this->len_)
    {
      this->put_message ();
      this->print_producer_debug_message ();
      i--;
      if (this->msg_queue ()->is_full ())
        break;

 

    }
  ACE_DEBUG ((LM_DEBUG,
              ACE_TEXT ("(%P|%t) Producer: High water mark hit ---- \n")));

 

  ACE_MT (this->mq_full_.wait ());

 

  // The following put_message should block until the message queue
  // has dropped under the lwm.
  this->put_message ();

 

  ACE_ASSERT (this->msg_queue ()-> message_bytes () <= this->lwm_ +
this->len_);

 

  this->print_producer_debug_message ();

 

  for (i--; i >= 0 ; i--)
    {
      this->put_message ();
      this->print_producer_debug_message ();
    }

 

  return 0;
}

 

int
Watermark_Test::consumer (void)
{
  ACE_MT (this->mq_full_.wait ());

 

  ACE_OS::sleep (1);

 

  // Let producer proceed and block in putq.

 

  for (int i = watermark_iterations; i >= 0; i--)
    {
      this->get_message ();
      ACE_OS::sleep (0);
    }

 

  return 0;
}

 

int
Watermark_Test::get_message (void)
{
  ACE_Message_Block *mb;

 

  if (this->getq (mb) == -1)
    ACE_ERROR_RETURN ((LM_ERROR,
                       ACE_TEXT ("%p\n"),
                       ACE_TEXT ("dequeue_head")),
                      -1);
  else
    {
      ACE_DEBUG ((LM_DEBUG,
                  ACE_TEXT ("(%P|%t) Consumer: message size = %3d, ")
                  ACE_TEXT ("message count = %3d\n"),
                  this->msg_queue ()-> message_bytes (),
                  this->msg_queue ()-> message_count ()));
      mb->release ();
    }

 

  return 0;
}

 

int
Watermark_Test::put_message (ACE_Time_Value *timeout)
{
  ACE_Message_Block *mb;

 

  ACE_NEW_RETURN (mb,
                  ACE_Message_Block (default_message,
                                     this->len_),
                  -1);

 

  return this->putq (mb, timeout);
}

 

void
Watermark_Test::print_producer_debug_message (void)
{
  ACE_DEBUG ((LM_DEBUG,
              ACE_TEXT ("(%P|%t) Producer: message size = %3d, ")
              ACE_TEXT ("message count = %3d\n"),
              this->msg_queue ()-> message_bytes (),
              this->msg_queue ()-> message_count ()));
}

 

int
Watermark_Test::svc (void)
{
  // this->role_ is an Atomic_Op object.
  int role = this->role_++;

 

  switch (role)
    {
    case 0:
      this->producer ();
      break;
    case 1:
      this->consumer ();
      break;
    default:
      break;
    }
  return 0;
}

 


//--------------------------------------------------------------------------
---
#if defined (ACE_HAS_THREADS)

 

#if defined (TEST_CLOSEHOOK)

 

CloseHook_Handler::CloseHook_Handler (ACE_Reactor &reactor)
  : notification_strategy_ (&reactor,
                            this,
                            ACE_Event_Handler::READ_MASK)
  , startupBarrier_(2)        // Barrier for ourself and the new thread.
  , messageType_(ACE_Message_Block::MB_NORMAL)
{
  this->msg_queue()->notification_strategy (&this->notification_strategy_);
}

 

CloseHook_Handler::~CloseHook_Handler()
{
  if (this->reactor() != 0)
  {
    // Remove ourself from reactor Eh queue (don't call handle_close).
    this->reactor()->remove_handler(this,
                                    ACE_Event_Handler::READ_MASK |
                                    ACE_Event_Handler::DONT_CALL  );
    this->reactor(0); 
  };
}

 

int
CloseHook_Handler::handle_input (ACE_HANDLE)
{
  ACE_DEBUG ((LM_DEBUG,
              ACE_TEXT ("CloseHook_Handler::handle_input\n")));

 

  // All messages handled in svc().
  return 0; // Stay registered.
}

 

int
CloseHook_Handler::open (void *args)
{
  ACE_UNUSED_ARG (args);
  if (this->activate(THR_NEW_LWP | THR_DETACHED) < 0)
  {
    ACE_ERROR_RETURN ((LM_ERROR
                       ,ACE_TEXT ("CloseHook_Handler::open, activate
ERROR(%s)!\n")
                       ,strerror(ACE_OS::last_error())
                      ),-1);
  };
  startupBarrier_.wait();
  return 0;
}

 

int
CloseHook_Handler::svc (void)
{
ACE_Message_Block *pMb=0;

 

  startupBarrier_.wait();
  while (messageType_ != ACE_Message_Block::MB_STOP)
  {
    if (this->msg_queue()->dequeue_head(pMb) >= 0) //  Block on input queue.
    {
      messageType_ = pMb->msg_type();
      pMb->release();
      ACE_DEBUG ((LM_DEBUG, ACE_TEXT ("CloseHook_Handler::svc[t=%t]:
pMb->release()\n")));
    }
    else
    {
      ACE_DEBUG ((LM_WARNING,
                  ACE_TEXT ("CloseHook_Handler::svc: Invalid message
dequeued!\n")
                  ));
    };
  };
  return 0;
}

 

int 
CloseHook_Handler::close (u_long flags)
{
  ACE_UNUSED_ARG (flags);
  ACE_DEBUG ((LM_DEBUG,ACE_TEXT ("CloseHook_Handler::close hook
called.\n")));
  // Service thread has exited, and this object is now invalid.
  delete this;
  return 0;
}

 

#endif /* TEST_CLOSEHOOK */

 

//--------------------------------------------------------------------------
---
#if defined (TEST_FULLNOTIFY)

 

FullNotify_Handler::FullNotify_Handler (ACE_Reactor &reactor)
  : notification_strategy_ (&reactor,
                            this,
                            ACE_Event_Handler::READ_MASK)
{
  this->reactor(&reactor);
  this->msg_queue()->notification_strategy (&this->notification_strategy_);
}

 

FullNotify_Handler::~FullNotify_Handler()
{
  this->wait();
}

 

int
FullNotify_Handler::handle_input (ACE_HANDLE)
{
  ACE_DEBUG ((LM_DEBUG,
              ACE_TEXT ("FullNotify_Handler::handle_input\n")));

 

  ACE_Message_Block *mb;

 

  if (this->msg_queue()->is_empty())
    ACE_DEBUG ((LM_DEBUG,
                ACE_TEXT ("Message queue is empty!\n")
                ));
  else if (this->getq (mb,
                  (ACE_Time_Value *) &ACE_Time_Value::zero) == -1)
    ACE_ERROR_RETURN ((LM_ERROR,
                       ACE_TEXT ("%p\n"),
                       ACE_TEXT ("dequeue_head")),
                      -1);
  else
    {
      ACE_DEBUG ((LM_DEBUG,
                  ACE_TEXT ("message received = %s\n"),
                  mb->rd_ptr ()));
      mb->release ();
    }

 

  return 0; // Stay registered.
}

 

int
FullNotify_Handler::open (void *args)
{
  ACE_UNUSED_ARG (args);
  // Activate the threads: Most are default parms, except n_threads 
  // and thread_ids_.
  // thread_ids_ is used to distinguise between the threads.
  if (this->activate(                             // Activate the svc() 
                      THR_NEW_LWP | THR_JOINABLE   // (default) flags.
                     ,iMAX_NR_THREADS              // - n_threads 
                     ,0                            // (default) force_active
                     ,ACE_DEFAULT_THREAD_PRIORITY  // (default) priority
                     ,-1                           // (default) grp_id
                     ,0                            // (default) *task
                     ,0                            // (default)
thread_handles[]
                     ,0                            // (default) *stack[]
                     ,0                            // (default) stack_size[]
                     ,thread_ids_                  // - thread_ids[]  
                    ) < 0)
  {
    ACE_ERROR_RETURN ((LM_ERROR
                       ,ACE_TEXT ("FullNotify_Handler::open, activate
ERROR(%s)!\n")
                       ,strerror(ACE_OS::last_error())
                      ),-1);
  };
  return 0;
}

 

int
FullNotify_Handler::svc (void)
{
int iNotifications=0;

 

  //ACE_DEBUG((LM_ERROR, ">> FullNotify_Handler::svc(%t)\n"));
  //--------------------------------------------
  if (ACE_Thread::self() == thread_ids_[0]) 
  {
    // [Thread 0]
    // Hit the reactor's notification queue until it is full.
    // We can only continue if main thread handles the reactor
notifications.
    // Global 'iterations' will countdown if next thread performs
'make_message'.
    while (iterations > 0)
    {
      ACE_DEBUG ((LM_DEBUG,
                  ACE_TEXT ("(%t) Reactor notification (%d)\n")
                  ,iNotifications++
                  ));

 

      // Notify the Reactor, which will call our <handle_input>.
      // Timeout when notify pipe is full? Somehow it isn't full when
timeout
      // occurs.
      //  FC6: 
      //  - With timeout parm:    108 x notify until timeout occurs.
      //  - Without timeout parm: 432 x notify until it blocks.
      //ACE_Time_Value timeout (10);
      //this-reactor()->notify (this, ACE_Event_Handler::READ_MASK,
&timeout);

 

      this->reactor()->notify (this, ACE_Event_Handler::READ_MASK);
    };
  }
  //--------------------------------------------
  else if (ACE_Thread::self() == thread_ids_[1])
  {
    // [Thread 1]
    // Use Message_Queue and hit the notification queue again (blocks).
    // We can only continue if main thread handles the reactor
notifications.

 

    // Sleep some time until notification pipe is full?
    ACE_OS::sleep(2); 
    while (iterations > 0)
    {
      this->make_message();
    };
  };

 

  //ACE_DEBUG((LM_ERROR, "<< FullNotify_Handler::svc(%t)\n"));
  return 0;
}

 

void
FullNotify_Handler::make_message (void)
{
  if (--iterations > 0)
    {
      ACE_Message_Block *mb;
      ACE_NEW (mb,
               ACE_Message_Block (default_message));

 

      ACE_DEBUG ((LM_DEBUG,
                  ACE_TEXT ("sending message\n")));
      this->putq (mb);
    }
}

 

#endif /* TEST_FULLNOTIFY */
#endif /* ACE_HAS_THREADS */

 

//--------------------------------------------------------------------------
---
int
//run_main (int, ACE_TCHAR *[])
ACE_TMAIN (int, ACE_TCHAR *[])
{
  ACE_START_TEST (ACE_TEXT ("Message_Queue_Notifications_Test"));

 

  {
    ACE_DEBUG ((LM_DEBUG,
                ACE_TEXT ("\nStarting message queue reactive notification
test...\n")));
  
    ACE_Reactor reactor;
    Message_Handler mh (reactor);
  
    while (iterations > 0)
      reactor.handle_events ();
  }
#if defined (ACE_HAS_THREADS)
  {
    ACE_DEBUG ((LM_DEBUG,
                ACE_TEXT ("\nStarting message queue watermark test...\n")));
    Watermark_Test watermark_test;
    ACE_DEBUG ((LM_DEBUG,
                ACE_TEXT ("High water mark is %d\n")
                ACE_TEXT ("Low water mark is %d\n"),
                default_high_water_mark,
                default_low_water_mark));
  
    watermark_test.activate (THR_NEW_LWP | THR_DETACHED,
                             worker_threads);

 

    // This frees the memory used by the DETACHED thread.
    ACE_Thread_Manager::instance ()->wait ();
  }
#else
  ACE_DEBUG ((LM_INFO,
              ACE_TEXT ("Message queue watermark test not performed because
threads are not supported\n")));
#endif /* ACE_HAS_THREADS */

 

#if !defined (TEST_CLOSEHOOK)
  ACE_DEBUG ((LM_INFO,
              ACE_TEXT ("Task <close> hook trigger test not used.\n")));
#elif defined (ACE_HAS_THREADS)
  {
    ACE_DEBUG ((LM_DEBUG,
                ACE_TEXT ("\nStarting test: Task <close> hook triggered by
notified message enqueue...\n")));

 

    ACE_Reactor        reactor;
    CloseHook_Handler* pChh  = 0;
    ACE_Message_Block* pMBlk = 0;

 

    for (int iIterations=0; iIterations<10; iIterations++)
    {
      // Dynamically allocate Task EH which deletes itself after MB_STOP 
      // is received.
      ACE_NEW_RETURN (pChh, CloseHook_Handler(reactor), -1);
  
      // Start svc thread.
      pChh->open();

 

      // Just enqueue some normal messages.
      ACE_NEW_RETURN( pMBlk, ACE_Message_Block(0,
ACE_Message_Block::MB_NORMAL), -1);
      pChh->putq( pMBlk );
      ACE_NEW_RETURN( pMBlk, ACE_Message_Block(0,
ACE_Message_Block::MB_NORMAL), -1);
      pChh->putq( pMBlk );
      ACE_NEW_RETURN( pMBlk, ACE_Message_Block(0,
ACE_Message_Block::MB_NORMAL), -1);
      pChh->putq( pMBlk );

 

      // Quit svc thread and let task delete itself (and see what happens).
      ACE_NEW_RETURN( pMBlk, ACE_Message_Block(0,
ACE_Message_Block::MB_STOP), -1);
      pChh->putq( pMBlk );

 

      // This frees the memory used by the DETACHED thread.
      ACE_Thread_Manager::instance ()->wait ();
    };

 

  }
#else
  ACE_DEBUG ((LM_INFO,
              ACE_TEXT ("Task <close> hook trigger test not performed
because threads are not supported\n")));
#endif /* ACE_HAS_THREADS */

 

#if !defined (TEST_FULLNOTIFY)
  ACE_DEBUG ((LM_INFO,
              ACE_TEXT ("Blocking (full) notification test not used.\n")));
#elif defined (ACE_HAS_THREADS)
  {
    ACE_DEBUG ((LM_DEBUG,
                ACE_TEXT ("\nStarting test: message enqueue with blocking
(full) notification...\n")));

 

    // Global message enqueue iterations.
    iterations = 10;  

 

    ACE_Reactor reactor;
    FullNotify_Handler fnh (reactor); 

 

    // Start producer threads.
    fnh.open();

 

    // Sleep until notification pipe is full and all producer threads block.
    ACE_OS::sleep(4); 

 

    // Empty the notification pipe so producers should normally unblock.
    while (iterations > 0)
      reactor.handle_events ();
  }
#else
  ACE_DEBUG ((LM_INFO,
              ACE_TEXT ("Message enqueue with full notification test not
performed because threads are not supported\n")));
#endif /* ACE_HAS_THREADS */

 

  ACE_END_TEST;
  return 0;
}

 

  _____  

From: Greg Popovitch [mailto:gpy at altair.com] 
Sent: woensdag 27 februari 2008 21:35
To: Johnny Willemsen; Rudy Pot; ace-bugs at cs.wustl.edu
Subject: RE: [ace-bugs] [ACE_Message_Queue] notify PIPE block
causesSelect_Reactor deadlock.

Johnny, I understand completely. I hope someone will provide the regression
test (I can't). Thanks for the help.

 

greg

 

From: Johnny Willemsen [mailto:jwillemsen at remedy.nl] 
Sent: Wednesday, February 27, 2008 3:32 PM
To: Greg Popovitch; 'Rudy Pot'; ace-bugs at cs.wustl.edu
Subject: RE: [ace-bugs] [ACE_Message_Queue] notify PIPE block
causesSelect_Reactor deadlock.

 

Hi,

 

Even when a deadlock can be shown on paper, we do need to have a regression
test that triggers it. That way we can see it ourselves as core developers
and we can also make sure we don't introduce the problem back in the future
by any other change.

 

Johnny

 

From: Greg Popovitch [mailto:gpy at altair.com] 
Sent: Wednesday, February 27, 2008 8:14 PM
To: Johnny Willemsen; Rudy Pot; ace-bugs at cs.wustl.edu
Subject: RE: [ace-bugs] [ACE_Message_Queue] notify PIPE block
causesSelect_Reactor deadlock.

 

Johnny, I agree with the usefulness of having regression tests. 

 

Still, this issue is a clear deadlock situation, which can be easily
explained and understood (see below). I think it is a  no-brainer to remove
a known deadlock, occuring when the program is running and busy, even if
there is a chance of a race condition at exit.  

 

Deadlock explaination:

 

In my enqueuing thread:

----------------------

 

1/ enqueue a message for the other thread on my own message queue

------------------------------------------------------------------

 

        if (_msg_queue) _msg_queue->enqueue_head(mb);

 

2/ in ACE DLL, enqueue and notify reactor

-----------------------------------------

 

template <ACE_SYNCH_DECL> int

ACE_Message_Queue<ACE_SYNCH_USE>::enqueue_head (ACE_Message_Block *new_item,

                                                ACE_Time_Value *timeout) {

  ACE_TRACE ("ACE_Message_Queue<ACE_SYNCH_USE>::enqueue_head");

  int queue_count = 0;

  {

    ACE_GUARD_RETURN (ACE_SYNCH_MUTEX_T, ace_mon, this->lock_, -1);

 

    if (this->state_ == ACE_Message_Queue_Base::DEACTIVATED)

      {

        errno = ESHUTDOWN;

        return -1;

      }

 

    if (this->wait_not_full_cond (ace_mon, timeout) == -1)

      return -1;

 

    queue_count = this->enqueue_head_i (new_item);

 

    if (queue_count == -1)

      return -1;

 

=>    this->notify ();

  }

  return queue_count;

}

 

3/ notification proceeds through, nothing interesting there

-----------------------------------------------------------

 

      ACEd.dll!ACE_Reactor_Notification_Strategy::notify()  Line 29

C++

      ACEd.dll!ACE_Reactor::notify(ACE_Event_Handler *
event_handler=0x000000002971bc70, unsigned long mask=1, ACE_Time_Value *
tv=0x00

      ACEd.dll!ACE_WFMO_Reactor::notify(ACE_Event_Handler *
event_handler=0x000000002971bc70, unsigned long mask=1, ACE_Time_Value *
timeo

 

4/ getting into WFMO reactor which enqueues on its own message queue

--------------------------------------------------------------------

 

int

ACE_WFMO_Reactor_Notify::notify (ACE_Event_Handler *event_handler,

                                 ACE_Reactor_Mask mask,

                                 ACE_Time_Value *timeout) {

  if (event_handler != 0)

    {

      ACE_Message_Block *mb = 0;

      ACE_NEW_RETURN (mb,

                      ACE_Message_Block (sizeof (ACE_Notification_Buffer)),

                      -1);

 

      ACE_Notification_Buffer *buffer =

        (ACE_Notification_Buffer *) mb->base ();

      buffer->eh_ = event_handler;

      buffer->mask_ = mask;

 

      // Convert from relative time to absolute time by adding the

      // current time of day.  This is what <ACE_Message_Queue>

      // expects.

      if (timeout != 0)

        *timeout += timer_queue_->gettimeofday ();

 

=>      if (this->message_queue_.enqueue_tail

          (mb, timeout) == -1)

        {

          mb->release ();

          return -1;

        }

 

      event_handler->add_reference ();

    }

 

  return this->wakeup_one_thread_.signal (); }

 

5/ enqueue blocks because queue is full (deadlock because calling thread is
hung in step 1 and will never dequeue)

---------------------------------------

 

      message_queue_    {head_=0x000000006deaf1d0

tail_=0x000000007a5826b0 low_water_mark_=16384 ...}

ACE_Message_Queue<ACE_MT_SYNCH>           ACE_Message_Queue_Base

{state_=1 } ACE_Message_Queue_Base

            head_ 0x000000006deaf1d0 {rd_ptr_=0 wr_ptr_=0

priority_=0 ...}  ACE_Message_Block *

            tail_ 0x000000007a5826b0 {rd_ptr_=0 wr_ptr_=0

priority_=0 ...}  ACE_Message_Block *

            low_water_mark_   16384 unsigned __int64

            high_water_mark_  16384 unsigned __int64

            cur_bytes_  16384 unsigned __int64

            cur_length_ 0     unsigned __int64

            cur_count_  1024  unsigned __int64

 

template <ACE_SYNCH_DECL> int

ACE_Message_Queue<ACE_SYNCH_USE>::enqueue_tail (ACE_Message_Block *new_item,

                                              ACE_Time_Value *timeout) {

  ACE_TRACE ("ACE_Message_Queue<ACE_SYNCH_USE>::enqueue_tail");

  int queue_count = 0;

  {

    ACE_GUARD_RETURN (ACE_SYNCH_MUTEX_T, ace_mon, this->lock_, -1);

 

    if (this->state_ == ACE_Message_Queue_Base::DEACTIVATED)

      {

        errno = ESHUTDOWN;

        return -1;

      }

 

=>    if (this->wait_not_full_cond (ace_mon, timeout) == -1)

      return -1;

 

    queue_count = this->enqueue_tail_i (new_item);

 

    if (queue_count == -1)

      return -1;

 

    this->notify ();

  }

  return queue_count;

}

 

greg

 

 

 

 

From: Johnny Willemsen [mailto:jwillemsen at remedy.nl] 
Sent: Wednesday, February 27, 2008 1:22 PM
To: Greg Popovitch; 'Rudy Pot'; ace-bugs at cs.wustl.edu
Subject: RE: [ace-bugs] [ACE_Message_Queue] notify PIPE block
causesSelect_Reactor deadlock.

 

Hi,

 

Also, in order to make such an important change to the core of ACE, we do
need an automated regression test that reproduces the problem. This must be
written like the ACE regression tests under ACE_wrappers/tests. If someone
can make this test and add it to bugzilla, the issue gets higher in the todo
list, without such a test we have to write one, which consumes more time.

 

Johnny

 

From: Greg Popovitch [mailto:gpy at altair.com] 
Sent: Wednesday, February 27, 2008 5:37 PM
To: Johnny Willemsen; Rudy Pot; ace-bugs at cs.wustl.edu
Subject: RE: [ace-bugs] [ACE_Message_Queue] notify PIPE block
causesSelect_Reactor deadlock.

 

Johnny, I didn't mean any offense. You do confirm my observation with the
statement: "people don't have time anymore to just fix these kind of issues
without funding". I am sorry that this is the case, as I really like ACE.
But I strongly feel that  if there is not anyone who feels enough sense of
ownership to fix clear bugs like this one, then it is hard to justify using
ACE in any project which doesn't allocate specific funding for ACE
maintenance.

 

Now again that's fine with me, and I certainly don't ask anyone to provide a
service for free. However in the light of my own experience (serious bug
fixed in 2002, reintroduced in 2003 and still there in 2008), I tend to
disagree with the statement in ACE's overview: "ACE continues to improve and
its future <http://www.cs.wustl.edu/%7Eschmidt/ACE-future.html>  is bright".

 

greg

 

From: Johnny Willemsen [mailto:jwillemsen at remedy.nl] 
Sent: Wednesday, February 27, 2008 11:20 AM
To: Greg Popovitch; 'Rudy Pot'; ace-bugs at cs.wustl.edu
Subject: RE: [ace-bugs] [ACE_Message_Queue] notify PIPE block
causesSelect_Reactor deadlock.

 

Hi,

 

Please read docs/ACE-bug-process.html, the issue is more that people don't
have time anymore to just fix these kind of issues without funding. This is
a change in the core of ACE which has to be investigated, reviewed, tested,
all taking time. At the moment someone is willing to fund such a change
there are multiple people that would be willing to work on this. We as
Remedy IT do fix issues on the doc_group release without funding, but our
budgets for doing that are limited, at the end we also have to pay our
bills.

 

Johnny

 

From: Greg Popovitch [mailto:gpy at altair.com] 
Sent: Wednesday, February 27, 2008 5:13 PM
To: Rudy Pot; Johnny Willemsen; ace-bugs at cs.wustl.edu
Subject: RE: [ace-bugs] [ACE_Message_Queue] notify PIPE block
causesSelect_Reactor deadlock.

 

Rudy, I reported the exact same issue and diagnostic a couple of months ago
(see mailing list on 11/29/2007 and 11/30/2007). Even though this is a clear
bug, it seemed to me that there was no one in particular planning to fix the
code. I ended up patching my own version, moving the notify() in the code
below one line down after the closing brace, so it would be out of the scope
of the ACE_GUARD. This fixes the issue and  I have not seen any termination
race conditions.

 

template <ACE_SYNCH_DECL> int

ACE_Message_Queue<ACE_SYNCH_USE>::enqueue_head (ACE_Message_Block *new_item,

                                                ACE_Time_Value *timeout) {

  ACE_TRACE ("ACE_Message_Queue<ACE_SYNCH_USE>::enqueue_head");

  int queue_count = 0;

  {

    ACE_GUARD_RETURN (ACE_SYNCH_MUTEX_T, ace_mon, this->lock_, -1);

 

    if (this->state_ == ACE_Message_Queue_Base::DEACTIVATED)

      {

        errno = ESHUTDOWN;

        return -1;

      }

 

    if (this->wait_not_full_cond (ace_mon, timeout) == -1)

      return -1;

 

    queue_count = this->enqueue_head_i (new_item);

 

    if (queue_count == -1)

      return -1;

 

=>    this->notify ();

  }

  return queue_count;

}

 

 

From: ace-bugs-bounces at cse.wustl.edu [mailto:ace-bugs-bounces at cse.wustl.edu]
On Behalf Of Rudy Pot
Sent: Wednesday, February 27, 2008 10:30 AM
To: Johnny Willemsen; ace-bugs at cs.wustl.edu
Subject: Re: [ace-bugs] [ACE_Message_Queue] notify PIPE block
causesSelect_Reactor deadlock.

 

Hi Johnny,

 

Thanks for your quick response!

 

I have had a closer look at all related bug messages and now I see that my 
problem is already quite old (sorry for that).

 

Correct me if I'm wrong but as far as I can see now there are two problems 
with ace/Message_Queue_T.cpp:

 

1) Sun May  5 19:14:34 2002  Douglas C. Schmidt
<schm... at macarena.cs.wustl.edu>

 

        * ace/Message_Queue_T.cpp: Modified all the enqueue*() methods so
that
          their calls to notify() occur *outside* of the monitor lock.
          This change prevents deadlock from occurring when a reactor's
          notification pipe is full.  Thanks to Sasha Agranov
          <sagra... at COMGATES.co.il> for reporting this.

 

2) Sat Mar 22 11:58:12 2003  Douglas C. Schmidt
<schm... at tango.doc.wustl.edu>

 

        * ace/Message_Queue_T.cpp: Moved the notify() hook calls within
          the protection of the guard lock critical section to prevent
          race conditions on cleanup.  Thanks to Ron Muck <r... at sdiusa.com>
          for this suggestion. 

 

Currently, the latter (2) is solved but because of that (1), which is my 
problem, has returned.

 

Is it not possible to solve both problems with adding an extra notify lock? 

 


                              <MessageQueue1, lock1>              handle_*
     [Producer-1]-enqueue-->[|||||||||||||||||||||||||]--->[EH-1] <-----+
                   |                                                     \
                    \ <notifyLock>          <PIPE?>                       |
                     +-notify----->[|||||||||||||||||||||]--->[Reactor]---+
                    /                                        (consumer)   |
                   |                                                     /
                   |          <MessageQueue-n, lock-n>                  /
     [Producer-n]-enqueue-->[|||||||||||||||||||||||||]--->[EH-n] <----+ 
                                                                  handle_*

 


I will point out my idea about this in the following fix code example.

 

This fix might be a problem because I think it needs some interface change
in 
ACE_Notification_Strategy too (to get a notification lock somehow) but you
have
more insight in the concequences of that. 

 

Maybe you first could have a look at this proposal?

 

Thanks in advance,
Rudy Pot

 

//==========================================================================
====
FIX CODE EXAMPLE  (changes applicable for: enqueue_prio, enqueue_head, 
                                       enqueue_deadline, enqueue_tail )

 


template <ACE_SYNCH_DECL> int
ACE_Message_Queue<ACE_SYNCH_USE>::enqueue_prio (ACE_Message_Block *new_item,
                                                ACE_Time_Value *timeout)
{
  ACE_TRACE ("ACE_Message_Queue<ACE_SYNCH_USE>::enqueue_prio");
  int queue_count = 0;
  {
    ACE_GUARD_RETURN (ACE_SYNCH_MUTEX_T, ace_mon, this->lock_, -1);

 

    if (this->state_ == ACE_Message_Queue_Base::DEACTIVATED)
      {
        errno = ESHUTDOWN;
        return -1;
      }

 

    if (this->wait_not_full_cond (ace_mon, timeout) == -1)
      return -1;

 

+   {
+     ACE_GUARD_RETURN (ACE_SYNCH_MUTEX_T
+                       ,ace_notify_mon
+                       // Would be handy if ACE_Notification_Strategy
+                       // has a lock. To be gathered from notify target
+                       // (reactor?)
+                       // Problem: ACE_Reactor_Notification_Strategy has
stored  
+                       // Reactor reference but ACE_Notification_Strategy
not.
+                       ,this->notification_strategy->lock()
+                       , -1);

 

+     // Somehow check if the (reactor?) notification channel is full.
+     // Probably needs this->notification_strategy too.
+     if (this->wait_no_notify_full_cond (ace_notify_mon, timeout) == -1)
+       return -1; // We cannot enqueue without causing a deadlock, so exit.

 

+     // Because we hold the notify lock, no new notifications will occur on
our
+     // target and it is safe to enqueue & notify hereafter...

 

      queue_count = this->enqueue_i (new_item);

 

      if (queue_count == -1)
        return -1;

 

      this->notify (); 

 

+   }; // e.o. notify guarded area.

 

  }
  return queue_count;
}

 

END FIX CODE EXAMPLE 
//==========================================================================
====

 

 

 

 

  _____  

From: Johnny Willemsen [mailto:jwillemsen at remedy.nl] 
Sent: woensdag 27 februari 2008 13:08
To: Rudy Pot; ace-bugs at cs.wustl.edu
Subject: RE: [ace-bugs] [ACE_Message_Queue] notify PIPE block causes
Select_Reactor deadlock.

Hi,

 

Thanks for using the PRF form. I found the change below in svn 

 

Johnny

 

Sat Mar 22 11:58:12 2003  Douglas C. Schmidt  <schmidt at tango.doc.wustl.edu>

 

        * ace/Message_Queue_T.cpp: Moved the notify() hook calls within

          the protection of the guard lock critical section to prevent

          race conditions on cleanup.  Thanks to Ron Muck <rlm at sdiusa.com> 

          for this suggestion.

 

 

From: ace-bugs-bounces at cse.wustl.edu [mailto:ace-bugs-bounces at cse.wustl.edu]
On Behalf Of Rudy Pot
Sent: Wednesday, February 27, 2008 11:29 AM
To: ace-bugs at cs.wustl.edu
Subject: [ace-bugs] [ACE_Message_Queue] notify PIPE block causes
Select_Reactor deadlock.

 


    ACE VERSION: 5.5.6

 

    HOST MACHINE and OPERATING SYSTEM: i386, Linux 2.6.20-1.2933.fc6 

 

    TARGET MACHINE and OPERATING SYSTEM, if different from HOST: same.


    COMPILER NAME AND VERSION (AND PATCHLEVEL): gcc 4.1.1

 

    CONTENTS OF $ACE_ROOT/ace/config.h:  config-linux.h

 

    CONTENTS OF $ACE_ROOT/include/makeinclude/platform_macros.GNU (unless
    this isn't used in this case, e.g., with Microsoft Visual C++):
     platform-linux.GNU
    
    AREA/CLASS/EXAMPLE AFFECTED: 
    Message_Queue_T.cpp / ACE_Message_Queue<ACE_SYNCH_USE> / enqueue_prio
    ( But I think also:
       ::enqueue_head, 
       ::enqueue_deadline,
       ::enqueue_tail )

 

    See DESCRIPTION.        

 

    DOES THE PROBLEM AFFECT:
        COMPILATION? NO
        LINKING?     NO
        EXECUTION?   YES
        OTHER (please specify)? NO

 

    SYNOPSIS:
    Application sometimes hangs when main Reactor gets more busy. 
    (e.g. due to significant increasing external events)
    Coredump shows:
    1) Main Reactor thread is blocking to get the ACE_Message_Queue lock.
    2) Multiple threads block on ACE_Select_Reactor_Notify-> which ends up
in
       ->ACE::send() -> write().
    Result is deadlock. We have to kill the application for shutdown.

 

    I've read Sidebar17 of C++ Network Programming Volume 2 but our
situation 
    is different from what is stated there (no notify called from handle_*
of
    event_handler but producer-consumer deadlock).

 


    DESCRIPTION:

 

    The application in short.
    -------------------------
    The part of our program where this deadlock appears is dealing with 
    processing messages from an embedded (CAN) network. 
    There is a thread per CAN message center (hardware communication
channel)
    which puts the received message into ACE_Message_Queue's, depending on
who
    wants to observe the messages. These are the producer threads.

 

    The message queues belongs to observers who all have registered to get
    notified by one Reactor which runs in one main Reactor thread
(consumer).
    (they also have registered themselves by the message center threads as
being
     interested in the messages).

 

    Problem cause
    --------------
    What can happen now is that the main Reactor, which is used for many
other 
    things in our application, temporarily got other work todo, and
therefore
    the message center threads may fill up the ACE_Select_Reactor
notification
    PIPE. This causes the message center threads (producers) to block on the

    (FULL) PIPE write().

 

    The main Reactor thread (consumer), when ready with the other work,
wants to 
    proceed with handling the pending notifications, and so emptying the
PIPE, 
    but cannot do this because the current notification code also holds the 
    message QUEUE lock! 
   
    See code description below:

 


    In Message_Queue_T.cpp
    ======================
    ACE_Message_Queue<ACE_SYNCH_USE>::enqueue_prio,
    ( But also the code reveals:
       ::enqueue_head, 
       ::enqueue_deadline,
       ::enqueue_tail )

 

-------------------------------
DEADLOCK CODE: (Above rev. 46096 until HEAD code)

 

template <ACE_SYNCH_DECL> int
ACE_Message_Queue<ACE_SYNCH_USE>::enqueue_prio (ACE_Message_Block *new_item,
                                                ACE_Time_Value *timeout)
{
  ACE_TRACE ("ACE_Message_Queue<ACE_SYNCH_USE>::enqueue_prio");
  int queue_count = 0;
  {
    ACE_GUARD_RETURN (ACE_SYNCH_MUTEX_T, ace_mon, this->lock_, -1);

 

    if (this->state_ == ACE_Message_Queue_Base::DEACTIVATED)
      {
        errno = ESHUTDOWN;
        return -1;
      }

 

    if (this->wait_not_full_cond (ace_mon, timeout) == -1)
      return -1;

 

    queue_count = this->enqueue_i (new_item);

 

    if (queue_count == -1)
      return -1;

 

    this->notify ();  <<<< ERROR DEADLOCK (When blocking on notify in scope
of 
                                                               buffer
lock...)
  }
  return queue_count;
}

 


    In above code snippet: 
    this->notify (); Causes DEADLOCK when blocking on full notification
pipe.
    This happens because <notify()> is now called within the scope of the 
    ACE_GUARD_RETURN.

 

    In older versions of ACE, e.g. 5.3.1, the <notify()> was outside the
scope
    of the GUARD and we never had this deadlock.

 

    In SUBVERSION ACE, I can see this has been changed after revision
     r.46096 of Message_Queue_T.cpp (ChangeLogTag:Sat Mar 22 11:58:12 2003)
    But I don't know why.

 

    See Message_Queue_T.cpp 
      : enqueue_prio, enqueue_head, enqueue_deadline, enqueue_tail

 

-------------------------------
NON DEADLOCK CODE (rev. 46096)
    ...          
    ...          
    if (queue_count == -1)
      return -1;

 

  }  // e.o. scope ACE_GUARD_RETURN for queue lock.
  this->notify ();  // NO deadlock here, notify will unblock as soon PIPE is

                    // emptied.
  return queue_count;
}

 


-------------------------------
Concerning Coredump parts:

 

[ consumer ]

Thread 1 (process 16785):
#0  0x009bc5d9 in __lll_mutex_lock_wait () from /lib/libpthread.so.0
#1  0x009b8636 in _L_mutex_lock_85 () from /lib/libpthread.so.0
#2  0x009b817d in pthread_mutex_lock () from /lib/libpthread.so.0
#3  0x00a9f922 in ACE_OS::mutex_lock () from /usr/lib/libACE.so.5.5.6
#4  0x00fd83c8 in ACE_Message_Queue<ACE_MT_SYNCH>::is_empty ()
   from /opt/lib/libgcp_datadump.so
#5  0x0142d881 in can::CCanSvcDriverObserver::handle_output ()
   from /opt/lib/libcanResourceManager.so.2
#6  0x00ac03f6 in ACE_Select_Reactor_Notify::dispatch_notify ()
   from /usr/lib/libACE.so.5.5.6
#7  0x00ac057a in ACE_Select_Reactor_Notify::handle_input ()
   from /usr/lib/libACE.so.5.5.6
#8  0x00ac14da in ACE_Select_Reactor_Notify::dispatch_notifications ()
   from /usr/lib/libACE.so.5.5.6
#9  0x00a5ef8e in ACE_Asynch_Pseudo_Task::ACE_Asynch_Pseudo_Task$base ()
   from /usr/lib/libACE.so.5.5.6
#10 0x00a5f5fd in ACE_Asynch_Pseudo_Task::ACE_Asynch_Pseudo_Task$base ()
   from /usr/lib/libACE.so.5.5.6
#11 0x00a65b03 in ACE_OS::gettimeofday () from /usr/lib/libACE.so.5.5.6
#12 0x00abd0d3 in ACE_Reactor::run_reactor_event_loop ()
   from /usr/lib/libACE.so.5.5.6
#13 0x0804ae36 in main ()

 

 

[ producer ]

Thread 28 (process 17023):
#0  0x009bc8f1 in write () from /lib/libpthread.so.0
#1  0x00a55a3a in ACE::send () from /usr/lib/libACE.so.5.5.6
#2  0x00ac082e in ACE_Select_Reactor_Notify::notify ()
   from /usr/lib/libACE.so.5.5.6
#3  0x00a5edcc in ACE_Asynch_Pseudo_Task::ACE_Asynch_Pseudo_Task$base ()
   from /usr/lib/libACE.so.5.5.6
#4  0x00abdce6 in ACE_Reactor::notify () from /usr/lib/libACE.so.5.5.6
#5  0x00abeace in ACE_Reactor_Notification_Strategy::notify ()
   from /usr/lib/libACE.so.5.5.6
#6  0x0804b42a in ACE_Message_Queue<ACE_MT_SYNCH>::notify ()
#7  0x00fd7fa2 in ACE_Message_Queue<ACE_MT_SYNCH>::enqueue_prio ()
   from /opt/lib/libgcp_datadump.so
#8  0x0142d6f3 in can::CCanSvcDriverObserver::update ()
   from /opt/lib/libcanResourceManager.so.2
#9  0x0143078d in can::CCanSvcDriver_Base::svc_Read ()
   from /opt/lib/libcanResourceManager.so.2
#10 0x0143087f in can::CCanSvcDriverRemoteRequestImpl::svc ()
   from /opt/lib/libcanResourceManager.so.2
#11 0x00ad3026 in ACE_Task_Base::svc_run () from /usr/lib/libACE.so.5.5.6
#12 0x00ad39e8 in ACE_Thread_Adapter::invoke_i ()
   from /usr/lib/libACE.so.5.5.6
#13 0x00ad3bb6 in ACE_Thread_Adapter::invoke () from
/usr/lib/libACE.so.5.5.6
#14 0x00a67511 in ace_thread_adapter () from /usr/lib/libACE.so.5.5.6
#15 0x009b626a in start_thread () from /lib/libpthread.so.0
#16 0x92fff470 in ?? ()
#17 0x92fff470 in ?? ()
#18 0x92fff470 in ?? ()
#19 0x92fff470 in ?? ()
#20 0x00000000 in ?? ()

 

[ producer ]

Thread 34 (process 17012):
#0  0x009bc8f1 in write () from /lib/libpthread.so.0
#1  0x00a55a3a in ACE::send () from /usr/lib/libACE.so.5.5.6
#2  0x00ac082e in ACE_Select_Reactor_Notify::notify ()
   from /usr/lib/libACE.so.5.5.6
#3  0x00a5edcc in ACE_Asynch_Pseudo_Task::ACE_Asynch_Pseudo_Task$base ()
   from /usr/lib/libACE.so.5.5.6
#4  0x00abdce6 in ACE_Reactor::notify () from /usr/lib/libACE.so.5.5.6
#5  0x00abeace in ACE_Reactor_Notification_Strategy::notify ()
   from /usr/lib/libACE.so.5.5.6
#6  0x0804b42a in ACE_Message_Queue<ACE_MT_SYNCH>::notify ()
#7  0x00fd7fa2 in ACE_Message_Queue<ACE_MT_SYNCH>::enqueue_prio ()
   from /opt/lib/libgcp_datadump.so
#8  0x0142d6f3 in can::CCanSvcDriverObserver::update ()
   from /opt/lib/libcanResourceManager.so.2
#9  0x0143062b in can::CCanSvcDriver_Base::svc_Read ()
   from /opt/lib/libcanResourceManager.so.2
#10 0x014308cd in can::CCanSvcDriverReadImpl::svc ()
   from /opt/lib/libcanResourceManager.so.2
#11 0x00ad3026 in ACE_Task_Base::svc_run () from /usr/lib/libACE.so.5.5.6
#12 0x00ad39e8 in ACE_Thread_Adapter::invoke_i ()
   from /usr/lib/libACE.so.5.5.6
#13 0x00ad3bb6 in ACE_Thread_Adapter::invoke () from
/usr/lib/libACE.so.5.5.6
#14 0x00a67511 in ace_thread_adapter () from /usr/lib/libACE.so.5.5.6
#15 0x009b626a in start_thread () from /lib/libpthread.so.0
#16 0x96bff470 in ?? ()
#17 0x96bff470 in ?? ()
#18 0x96bff470 in ?? ()
#19 0x96bff470 in ?? ()
#20 0x00000000 in ?? ()

 

[ producer ]

Thread 26 (process 17025):
#0  0x009bc8f1 in write () from /lib/libpthread.so.0
#1  0x00a55a3a in ACE::send () from /usr/lib/libACE.so.5.5.6
#2  0x00ac082e in ACE_Select_Reactor_Notify::notify ()
   from /usr/lib/libACE.so.5.5.6
#3  0x00a5edcc in ACE_Asynch_Pseudo_Task::ACE_Asynch_Pseudo_Task$base ()
   from /usr/lib/libACE.so.5.5.6
#4  0x00abdce6 in ACE_Reactor::notify () from /usr/lib/libACE.so.5.5.6
#5  0x00abeace in ACE_Reactor_Notification_Strategy::notify ()
   from /usr/lib/libACE.so.5.5.6
#6  0x0804b42a in ACE_Message_Queue<ACE_MT_SYNCH>::notify ()
#7  0x00fd7fa2 in ACE_Message_Queue<ACE_MT_SYNCH>::enqueue_prio ()
   from /opt/lib/libgcp_datadump.so
#8  0x0142d6f3 in can::CCanSvcDriverObserver::update ()
   from /opt/lib/libcanResourceManager.so.2
#9  0x0143078d in can::CCanSvcDriver_Base::svc_Read ()
   from /opt/lib/libcanResourceManager.so.2
#10 0x0143087f in can::CCanSvcDriverRemoteRequestImpl::svc ()
   from /opt/lib/libcanResourceManager.so.2
#11 0x00ad3026 in ACE_Task_Base::svc_run () from /usr/lib/libACE.so.5.5.6
#12 0x00ad39e8 in ACE_Thread_Adapter::invoke_i ()
   from /usr/lib/libACE.so.5.5.6
#13 0x00ad3bb6 in ACE_Thread_Adapter::invoke () from
/usr/lib/libACE.so.5.5.6
#14 0x00a67511 in ace_thread_adapter () from /usr/lib/libACE.so.5.5.6
#15 0x009b626a in start_thread () from /lib/libpthread.so.0
#16 0x91bff470 in ?? ()
#17 0x91bff470 in ?? ()
#18 0x91bff470 in ?? ()
#19 0x91bff470 in ?? ()
#20 0x00000000 in ?? ()

 


    REPEAT BY:
    See description

 

    SAMPLE FIX/WORKAROUND:
    Change ACE code so that <this->notify();> is outside of GUARD scope?
    (like it was before in rev. 46096).

 

 

[* end of PRF *]

 

 

Aside from this deadlock problem:

One thing what also comes up now is that I have to look at the notification
pipe

buffer length too when I want to increase the  ACE_Message_Queue size?

(enqueue will block if notification pipe is full). 

Or, I have to use #define ACE_HAS_REACTOR_NOTIFICATION_QUEUE and

recompile ACE?

 

Best regards,

 

 

Rudy Pot

Embedded Computer Systems

 

AWETA G&P b.v

Postbox 17

NL-2630 AA  Nootdorp

tel +31 (0)15 3109961 

fax +31 (0)15 3107321 

mail  <mailto:rpot at aweta.nl> rpot at aweta.nl

web www.aweta.com <blocked::http://www.aweta.com> 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://list.isis.vanderbilt.edu/pipermail/ace-users/attachments/20080311/583875e6/attachment-0001.html 


More information about the Ace-users mailing list