I've got a RedHat Linux box that is a VM Guest (at this time, I don't know what the host is, I'm working on getting that info), and we are having alot of issues with running out of RAM at this time, and I think a memory leak. We have Oracle running on this box along with performing backups too. Sometimes a simple backup of 40 GB that should only take 1 to 2 hours can take 10 to 12 hours to complete. The other day we dropped a bunch of tables, indexes and views in the database and then rebuilt them for some maintenance and it took more time when what it should have.
My co-workers and I have setup a cron job to run every 3 hours that frees pagecache, dentries and inodes in cache memory
# sync; echo 3 > /proc/sys/vm/drop_caches
Got the info for this from this website:
http://tecadmin.net/flush-memory-cache-on-linux-server/
This works, however it is a temporary band-aid, and we want a solution.
Along with monitoring free -m and top, I haven't had a chance to dig into any of the logs under /var/log to figure out where the culprit may lie.
We won't be able to install 3rd party software such as
Valgrind to try and troubleshoot, because a request like that has to go thru multiple layers of management, which will take some time.
Is is a good idea to look thru various logs to figure out where the issue or any other recommendations on how to approach to fix the issue?
thanks