125 pts.
 AS/400 Batch job Issue
Hi One of our job, for which code was untouched from an year , has now started taking excptionally long time to complete. It used to take hardly 20 mins to complete. Now from last two days it is taking almost 7-8 hrs. The only observation i made was that it is not getting cpu time .5- .8 . I tried changing the priorities. but no use. Could you please help me and guide me what can be done with the job. Also the other jobs in the system are running fine.

Software/Hardware used:
ASKED: September 16, 2011  3:27 PM
UPDATED: March 31, 2012  4:18 PM

Answer Wiki:
As always the first question is "What changed"? Have any of the files changed so that your I/O is different? Large files? Access Paths? Have you changed anything regarding pool sizes? Is something new running at that time that is using all the cycles? If you run the job at a different TOD, will it run faster?
Last Wiki Answer Submitted:  September 16, 2011  4:18 pm  by  CharlieBrowne   33,710 pts.
All Answer Wiki Contributors:  CharlieBrowne   33,710 pts.
To see all answers submitted to the Answer Wiki: View Answer History.


Discuss This Question:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _


 

this particular job… nothing changed from like an year…
And the other jobs in Qbatch are running normally…

Its only this job that’s not getting CPU time..

 125 pts.

 

This job is active for all that time???? not just waiting in the JobQ?

 44,630 pts.

 

…nothing changed from like an year…

It’s almost certain that something changed. That is, batch jobs that run for 20 minutes usually are processing significant amounts of data. If nothing changes in or about the data, there’s usually no point in running the job again — you’ll just get the same results if you run over unchanged data.

So, elements such as number of records or changes in authorities or just about anything may have effects.

One trouble is that if there is no benchmark to compare against, it can be difficult locating a bottleneck. Does the job do any kind of status logging as it runs? If it logged periodic messages (e.g., every 10000 records or every new job step), it might be possible to isolate where to look.

Tom

 110,135 pts.

 

Hi Ankita,

Did you had look at the priorities of the Jobs that are parallally running Batch.
How does this Jon work? I mean, will it updates Database Transactions or Generates Report?
I suggest, Compare the Job logs of the Current Job and the previous Job which used to take 20 mins.
If you have already compared let us know the results, that maybe helpful for tracking issue.

Pradeep.

 3,520 pts.

 

Thanks for your suggesions…

I checked the following but still no success –

1. Changes the Run priority to max in the subsystem
2. Changes the time slice
3. checked the job log… couldnt fine anything..
4. Tried running the job in QINTER. As it is a nightly job.. there were no other jobs in the subsystem and gave it a run priority of 18.
5. There are no SQL or OPNQRYF used.
6. No change in the logical files used.
7. It still runas at its usual time.. sharing the memory with same jobs.. it used to from quite a while now.

And just to let you know it is a SYNON RPG program called by a driver CL program.

One of the file gets cleared first based on some RST parameters. and then the job builds the file from another file.

Please help.. if you have any other clue on how to get this done…

 125 pts.

 

are you backups running during this time? Could there be a hardware issue like a raid rebuild on a hard drive?

Spend some more time reviewing your system events and more time on the joblog of the long running job. Ensure your message logging setting on the job is:
Message logging:
Level . . . . . . . . . . . . 4
Severity . . . . . . . . . . . 00
Text . . . . . . . . . . . . . *SECLVL
Log CL program commands . . . . *YES

 2,680 pts.

 

the prod box was slow. Even other teams were facing similar problems as us. Finally it was figured out that the box was slow because of unexpectedly high volume of journaling being done accidently. So journaling of few files which wasnt required was switched off.

 125 pts.