GSI Forum
GSI Helmholtzzentrum für Schwerionenforschung

PandaRoot jobs on GSI farm [message #6862] Mon, 09 June 2008 14:37
Bertram Kopf
Messages: 110
Registered: March 2006
continuous participant
From: *ep1.ruhr-uni-bochum.de
Dear experts for the PandaRoot event production,

there are still 106 PandaRoot jobs running on the GSI machines. Most of them (82) were started on the 28th of May or even earlier.
The remaining 25 jobs were started on the 25th of June. Moreover, they have used only a few seconds of CPU time so far. It seems that these are endless jobs, with the consequence that the CPU resources for the other groups are very limited right now.
Therefore I would kindly ask you to check whether everything is going well and, in case of endless jobs, to kill them.

Many thanks in advance,
Bertram.
Re: PandaRoot jobs on GSI farm [message #6884 is a reply to message #6862] Tue, 10 June 2008 10:56
Kilian Schwarz
Messages: 91
Registered: June 2004
Location: GSI, Darmstadt
continuous participant
From: *gsi.de
Hi Bertram,

indeed, those jobs did nothing useful.
They have been killed.

Cheers and thanks,

Kilian
Re: PandaRoot jobs on GSI farm [message #6886 is a reply to message #6862] Tue, 10 June 2008 11:06
Kilian Schwarz
Messages: 91
Registered: June 2004
Location: GSI, Darmstadt
continuous participant
From: *gsi.de
Hi Bertram,

those "jobs" were JobAgents submitted automatically by AliEn, waiting for real jobs to be submitted by users. Grid jobs will be picked up by those JobAgents as soon as a Grid user submits some.
By accident, the number of Grid jobs configured for GSI was still 100 (from DC times), which is why 100 JobAgents were running continuously.

I will reduce the number of Grid jobs since currently no Grid production is going on.

Regards,

Kilian
Re: PandaRoot jobs on GSI farm [message #6887 is a reply to message #6862] Tue, 10 June 2008 12:26
Bertram Kopf
Messages: 110
Registered: March 2006
continuous participant
From: *ep1.ruhr-uni-bochum.de
Hi Kilian,
thank you very much for killing these jobs. This helps us to produce and analyze more data for the Physics Book studies.

Cheers,
Bertram.
Re: PandaRoot jobs on GSI farm [message #6906 is a reply to message #6862] Thu, 12 June 2008 12:17
Bertram Kopf
Messages: 110
Registered: March 2006
continuous participant
From: *ep1.ruhr-uni-bochum.de
Hi Johan, Kilian and all others,

after the killing of 100 PandaRoot endless jobs a few days ago (many thanks to Kilian), there are now again approx. 150 jobs that have been running for more than 30 hours on the GSI batch system. It seems to me that these jobs are again placeholder JobAgents waiting for real jobs to be submitted by users. The consequence is that, on average, fewer than 10 jobs could run for the Physics Book mass production and analyses during that time. Therefore I would kindly ask you to kill most of these jobs in order to free CPU resources for the remaining groups as well.

BTW: As you certainly know, the Physics Book mass production should have the highest priority within the PANDA activities right now. Therefore it would be great to organize a fair sharing of the CPU resources within PANDA.

Thanks in advance,
Bertram.
Re: PandaRoot jobs on GSI farm [message #6907 is a reply to message #6906] Thu, 12 June 2008 12:38
Kilian Schwarz
Messages: 91
Registered: June 2004
Location: GSI, Darmstadt
continuous participant
From: *gsi.de
sorry for that. The jobs have been killed.

Cheers,

Kilian
Re: PandaRoot jobs on GSI farm [message #6909 is a reply to message #6907] Thu, 12 June 2008 13:48
Bertram Kopf
Messages: 110
Registered: March 2006
continuous participant
From: *ep1.ruhr-uni-bochum.de
Hi Kilian,

great! Thank you for your prompt reaction!

Cheers,
Bertram.
Re: PandaRoot jobs on GSI farm [message #6932 is a reply to message #6906] Mon, 16 June 2008 10:45
Bertram Kopf
Messages: 110
Registered: March 2006
continuous participant
From: *ep1.ruhr-uni-bochum.de
Hi Johan, Kilian and all others,
sorry, but I have to report that there are again 124 PandaRoot placeholder jobs, which have been running on the GSI batch farm since the 13th of June.

Best regards,
Bertram.
Re: PandaRoot jobs on GSI farm [message #6934 is a reply to message #6932] Mon, 16 June 2008 11:22
Johan Messchendorp
Messages: 693
Registered: April 2007
Location: University of Groningen
first-grade participant

From: *KVI.nl
Hi,

Weird. Kilian, is something being started automatically?

Johan.
Re: PandaRoot jobs on GSI farm [message #6944 is a reply to message #6934] Mon, 16 June 2008 15:17
Kilian Schwarz
Messages: 91
Registered: June 2004
Location: GSI, Darmstadt
continuous participant
From: *gsi.de
Hi Johan,

yes, AliEn automatically restarts the JobAgents so that they always stay at the nominal value, which is supposed to be 20 running and 20 queueing.
But AliEn submits in bunches of 100, which I suppose is one of the reasons why the nominal value can be exceeded when such small numbers are configured.
The Grid is designed for large numbers of jobs :-)
The CE seems to count correctly, though. At the moment it finds 20 queueing jobs and submits nothing more.
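
To illustrate the overshoot described above, here is a minimal Python sketch. It is not AliEn code: the function name `agents_to_submit`, its parameters, and the rule of rounding up to whole bunches are assumptions made purely for illustration. Only the numbers (20 running, 20 queueing, bunches of 100) are taken from this post.

```python
def agents_to_submit(running, queued,
                     nominal_running=20, nominal_queued=20,
                     bunch_size=100):
    """Hypothetical model: how many JobAgents a bunch-based submitter sends.

    Assumption: the submitter tops up any deficit against the nominal
    values, but can only submit whole bunches of `bunch_size` agents.
    """
    deficit = (max(0, nominal_running - running)
               + max(0, nominal_queued - queued))
    if deficit == 0:
        # Nominal values are reached, so nothing more is submitted.
        return 0
    # Round the deficit up to a whole number of bunches.
    bunches = -(-deficit // bunch_size)
    return bunches * bunch_size


if __name__ == "__main__":
    # Empty queue: 40 agents are wanted, but a full bunch goes out.
    print(agents_to_submit(running=0, queued=0))    # 100
    # Nominal values reached: no further submission.
    print(agents_to_submit(running=20, queued=20))  # 0
```

Under these assumptions, a single missing agent is enough to trigger a full bunch of 100, which would be consistent with seeing well over 40 JobAgents on the farm even though the nominal total is 40.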

I will try to keep things under control.

Cheers and sorry for the inconvenience.

Cheers,

Kilian