TURBOMOLE Forum
Welcome, Guest. Please login or register.
September 08, 2010, 05:47:33 AM

Login with username, password and session length
Search:     Advanced search
1374 Posts in 495 Topics by 411 Members
Latest Member: shanisi
* Home Help Search Login Register
+  TURBOMOLE Forum
|-+  TURBOMOLE Modules
| |-+  Aoforce and Numforce
| | |-+  core-dump with aforce, TURBOMOLE v.6.0
« previous next »
Pages: [1] Print
Author Topic: core-dump with aforce, TURBOMOLE v.6.0  (Read 998 times)
sassy
Newbie
*
Posts: 2


« on: December 02, 2009, 01:26:29 PM »

Dear all,

I am having a problem getting aforce to run for a larg molecule (85 atoms, one Fe) using BP86/def2-TZVP.
The input file was generated with the latest version of the GUI and then copied over to the cluster. That worked well for
benzene example. The production run got that far (aforce.out):
 CONSTRUCTING second deriv. of 2e energy    -> Hessian
   treating Coulomb (and exchange) contribution
     integrals will be neglected if total contribution <
   2.5746652935118433E-010
      ...terminated. cpu:   87595.82       wall:   87595.82
   treating exchange-correlation contribution

and then generated a core dump due to a segfault:
aoforce[15165]: segfault at 000000004266d000 rip 0000000000be4905 rsp 00007fffffe7b4b0 error 6

As I am having 8 GB on that quadcore Opteron node, I decided to use:
$maxcor    1600
as aforce only runs on a single core and other people are using that node as well.

Would increasing the memory solve the problem or is there another issue here?

All the best

Jörg
Logged
antti_karttunen
Full Member
***
Posts: 75


« Reply #1 on: December 02, 2009, 02:30:08 PM »

Hello Jörg,

Segmentation faults are usually caused by too small stacksize limits. More info here: http://www.turbo-forum.com/index.php?topic=23.0

Regards,
Antti
Logged
uwe
Global Moderator
Sr. Member
*****
Posts: 214


« Reply #2 on: December 04, 2009, 01:13:58 PM »

Hi,

TmoleX can start Turbomole jobs on remote systems itself with the 'run network' button in the start-job panel. It generates a small script that is being executed on the target system. The script automatically includes the line

ulimit -s unlimited

This should help to get rid of a too small stack size limit which causes almost all segfaults.

Uwe
Logged
sassy
Newbie
*
Posts: 2


« Reply #3 on: December 22, 2009, 04:02:58 PM »

Hi all,

thanks for your feedback and sorry for my late reply.

I was actually using the start-job file from the GUI which contains the lines:

#!/bin/sh

rm -f Job_Ended
touch Job_Started
export TURBODIR=/usr/local/TURBOMOLE_6.0/
export PATH=$TURBODIR/scripts:$PATH
ulimit -s unlimited
export PARA_ARCH=MPI
export PARNODES=4
export PATH=$TURBODIR/bin/`sysname`:$PATH
aoforce >aoforce.out
#ridft >ridft.out
rm -f Job_Started
touch Job_Ended

So the ulimit -s unlimited was already set. I found that the easiest way to start the job.
I was starting the job manually (i.e. qrsh to a free node and then start it from the shell), so
the stacksize should have been unlimited.

Any other ideas?

Marry Christmas and all the best for 2010

Jörg
Logged
uwe
Global Moderator
Sr. Member
*****
Posts: 214


« Reply #4 on: December 22, 2009, 05:06:44 PM »

Hi,

Turbomole 6.0 - there was a problem with the AMD math library on some Opteron systems with the 6.0 version of Turbomole.

Please try either Turbomole 6.1 or contact the Turbomole support to get a fixed binary of 6.0 to check if your crash is related to this known problem.

Regards,

Uwe
Logged
Pages: [1] Print 
« previous next »
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.9 | SMF © 2006-2009, Simple Machines LLC Valid XHTML 1.0! Valid CSS!