Compare cert salaries and plan your next career move
YuckTheFankees wrote: » I want to understand how system admins on TechExams look at a problem and resolve it from start to finish (what's your first step, cmd's you may use, tools, strategy, most common issues, important files).
ptilsen wrote: » I would answer that I would first gather as much information as possible. In the case of a chronic rebooting issue, this means the frequency and timing of the reboots, any possible or identified correlating events, when the problem originated or is suspected to have originated, and any error messages, logs, or other possible indicators as to what could be causing the issue. After collecting information, if the evidence does not point to a conclusive cause, I would then try to determine likely causes and seek to eliminate them as possibilities. That is a general response to a partially specific question, and it doesn't involve specific tools or checking for specific technical issues.
CodeBlox wrote: » For servers rebooting (Or any windows workstation for that matter) you could check the files in C:\windows\Minidump When a BSOD occurs, the **** gets stored there if enabled (which it usually is). You will need a third party tool to get any useful data from the *.dmp files. Often times, the results tell you exactly whats causing the BSOD. On workstations, the BSOD can come up and go pretty quickly giving it the illusion that the workstation just rebooted for no reason. We have a domain controller which occasionally BSODS at work.
pram wrote: » Not directly related to rebooting issues, but strace can be a lifesaver. I've used it to debug quite a few scripts. If you've never used it, it essentially attaches to a process and lets you view the system calls. This can be helpful in determining why a program is having issues, as you can see errors being generated that typically don't end up in any log. For example, one of my clients had a java program that ran through jboss that was causing random humongous load spikes. Rather than wait on the programmer to debug it I decided to take a look at what it was doing. Ultimately by using 'strace -e trace=network' on it I discovered it was having name resolution issues, and the threads were getting stuck in an infinite loop. It can be fairly daunting to use at first because the output isn't very intuitive, but its truly invaluable for troubleshooting.
Compare salaries for top cybersecurity certifications. Free download for TechExams community.