[adf-list]问题:'MPI应用程序排名13在MPI_Finalize()之前退出状态154'

Hans Van Schoot. vanschoot在scm.com
5月12日08:13:25 Cest 2014

Dear Karina,

Reuti is right, this could very well be due to a wall clock time (the
amount of time you allow your calculation to take on the machine), so
please check the time your job takes versus the specified wall time. If
it is not, please send the full input, output and logfile of the job to
support at scm.com, so we can take a closer look.

Best regards,
Hans van Schoot
- SCM developer


On 05/10/2014 10:27 AM, Reuti wrote:
> Hi,
>
> Am 09.05.2014 um 23:06 schrieb Karina Muñoz:
>
>> I am using a cluster (from a supercomputer centre), which works as a queuing system. Unfortunately, I am just an ADF user, and I am not a specialist in computing programming. Could you explain me which it could be the problem (E.g. Torque ???), in order to transmit the information to the experts. 
> I don't know the details of your cluster and its limits either. Ask the admins what limits (like wall clock time) they impose.
>
> -- Reuti
>
>
>> Thank you very much
>> Best regards,
>>
>> Karina
>>
>>
>> 2014-05-09 15:21 GMT-04:00 Reuti <reuti at staff.uni-marburg.de>:
>> Hi,
>>
>> Am 09.05.2014 um 17:39 schrieb Karina Muñoz:
>>
>>> I am having a recurrent problem with some (large) opt calculations (large outputs). The output crashes down when the program is printing large information, showing the next warning:
>>> (in this case, after the sentence:'This molecular quadrupole moment is calculated with analytic integration') -->
>> Are you running these interactively or inside a queuing system? SIGTERM could be a warning from E.g. Torque.
>>
>> -- Reuti
>>
>>
>>> MPI Application rank 13 exited before MPI_Finalize() with status 154
>>> forrtl: error (78): process killed (SIGTERM)
>>> Image              PC                Routine            Line        Source
>>> libc.so.6          0000003DEC0CEBB7  Unknown               Unknown  Unknown
>>> libpcmpi.so        00002AEEC4CD94C6  Unknown               Unknown  Unknown
>>> libpcmpi.so        00002AEEC4CE976A  Unknown               Unknown  Unknown
>>> libpcmpi.so        00002AEEC4CC7FBF  Unknown               Unknown  Unknown
>>> libpcmpi.so        00002AEEC4D55FB0  Unknown               Unknown  Unknown
>>> libpcmpi.so        00002AEEC4D55C1F  Unknown               Unknown  Unknown
>>> libmpi.so          00002AEEC4B4C3E9  Unknown               Unknown  Unknown
>>> adf.exe            000000000148EC96  Unknown               Unknown  Unknown
>>> adf.exe            0000000001067613  Unknown               Unknown  Unknown
>>> adf.exe            0000000001042F9D  Unknown               Unknown  Unknown
>>> adf.exe            0000000000C77322  Unknown               Unknown  Unknown
>>> adf.exe            00000000008A7747  Unknown               Unknown  Unknown
>>> adf.exe            000000000088FB90  Unknown               Unknown  Unknown
>>> adf.exe            0000000000598DA1  Unknown               Unknown  Unknown
>>> adf.exe            0000000000442CA8  Unknown               Unknown  Unknown
>>> adf.exe            00000000004127F8  Unknown               Unknown  Unknown
>>> adf.exe            000000000041254C  Unknown               Unknown  Unknown
>>> ....
>>> ....
>>> ....(continue...)
>>>
>>> (Sometimes crashes down when the program is printing another information, not necessarily the Quadrupole Moment, as in this example)
>>>
>>> Thank you for your help
>>>
>>> Karina
>>>
>>> _______________________________________________
>>> ADFlist mailing list
>>> ADFlist at scm.com
>>> http://lists.tofoba.com/mailman/listinfo/adflist
>> _______________________________________________
>> ADFlist mailing list
>> ADFlist at scm.com
>> http://lists.tofoba.com/mailman/listinfo/adflist
>>
>>
>>
>>
>> _______________________________________________
>> ADFlist mailing list
>> ADFlist at scm.com
>> http://lists.tofoba.com/mailman/listinfo/adflist
> _______________________________________________
> ADFlist mailing list
> ADFlist at scm.com
> http://lists.tofoba.com/mailman/listinfo/adflist



有关Adflist邮件列表的更多信息