Monitoring for Run Away Jobs

25 pts.
Tags:
Critical storage
run away jobs
I'm an admin on a v5r4 system. We have a report utility that eats up storage for no apparent reason. Yesterday I checked the wrksyssts and we were at 34% which is normal for us. At about 12pm I got a critical storage alert message that we were at 90% then mins later 95%. I found the job and forced it to end and we went back to about 40%. Here's the thing. This job does not produce data or write to a splf. It didn't have a big joblog. I can find no reason that it would chew up the system like that.  I would like to monitor jobs for the space/storage/system resources (?) they take so I can catch run away jobs before we reach a critical point on our system. I've already changed our lower limit alert to 75% to warn us sooner but I'd like to monitor at the job level instead. Any ideas? User limits maybe?

Software/Hardware used:
iseries 550
ASKED: December 14, 2011  9:18 PM
UPDATED: February 28, 2012  3:03 PM

Answer Wiki

Thanks. We'll let you know when a new response is added.
Send me notifications when members answer or reply to this question.

REGISTER or login:

Forgot Password?
By submitting you agree to receive email from TechTarget and its partners. If you reside outside of the United States, you consent to having your personal data transferred to and processed in the United States. Privacy

Discuss This Question: 4  Replies

 
There was an error processing your information. Please try again later.
Thanks. We'll let you know when a new response is added.
Send me notifications when members answer or reply to this question.

REGISTER or login:

Forgot Password?
By submitting you agree to receive email from TechTarget and its partners. If you reside outside of the United States, you consent to having your personal data transferred to and processed in the United States. Privacy
  • TomLiotta
    You can set up system monitors with Management Central that fire when disk storage crosses thresholds that you choose. I suppose you could have one call programming that listed active jobs and checked storage in each of them. If any seemed excessive, it could start sending notices. And if a dangerous threshold was exceeded, it could hold the job. (Avoid holding system jobs.) There are commercial products for this. In this case, it seems best to fix whatever your job is doing wrong. Without knowing something about the job, there's not much to suggest. Does it retain its debug data? Tom
    125,585 pointsBadges:
    report
  • Vjeter00
    Is Management Central a 3rd party? My company will not invest in another software. I'm looking nto the cause and solution to this one job but I was hoping to develope somthing that would prevent us from getting another nasty surprise. I like your idea of the home grown monitor job. Do you know what command would check the size of storage a job is using?
    25 pointsBadges:
    report
  • Vjeter00
    Tom, I looked up MC. I see that it's part of iSeries Navigator. I tried finding it there this afternoon but was unsuccessful. Do I have to load/add something? I look a little more tomorrow at work. All your help & suggestions are really appreciated.
    25 pointsBadges:
    report
  • TomLiotta
    Do I have to load/add something? If you don't see it when you open iSeries Navigator, then you will need to run the iSeries Access Selective Setup option and select it to be installed. After it's installed in your Navigator, it might also need to be configured and started in your AS/400. Tom
    125,585 pointsBadges:
    report

Forgot Password

No problem! Submit your e-mail address below. We'll send you an e-mail containing your password.

Your password has been sent to:

To follow this tag...

There was an error processing your information. Please try again later.

REGISTER or login:

Forgot Password?
By submitting you agree to receive email from TechTarget and its partners. If you reside outside of the United States, you consent to having your personal data transferred to and processed in the United States. Privacy

Thanks! We'll email you when relevant content is added and updated.

Following