So with my company we tend to I think "over do it" when it comes to a failed devices on the network. Example: Major core switch fails, we generally shut down physical interfaces AND routing protocols to that device.
docrice wrote: » Some places have a strict procedure. Sure, shutting an interface down can effectively cut off the device's utility, but if there are many cooks in the kitchen it's possible that someone may come around and "fix it" by turning it back on not knowing about the issue that caused the problem. Another example - if you remove a server from a network, shutting the port down seems enough ... until someone later sees that port as available, plugs an arbitrary device in, turns the port back on, and all of a sudden you have a new device on a VLAN that it's not supposed to be on. This can be bad news.
networker050184 wrote: » I usually don't remove routing protocols but I do raise the metric. That way you can turn it all the way up and gracefully shift traffic back on.
daveyb wrote: » While that does sound a bit extreme, its always wise to bring a device into service gracefully. You generally don't want to plug a device in, turn up its interfaces, and boom its forwarding traffic. What if there is a faulty patch that has become damaged during the swap out? You may get a whole load of TCP retransmits and severely limit traffic flow. Policy got fluffed when restoring config? Could have routing loops/other undesirable behaviour. There are many issues that could occur. Generally a good way of going about things is something like the following: Bring up interfaces. Run a few pings across, make sure no errors on any ports. Bring up IGP with high metrics on all links. Ensure all adjacencies are established and that all routes are being distributed/learnt. Bring up BGP. Ensure all neighbours establish, routes are being distributed. Drop IGP metrics back to what is normal. Restore any FHRP that are usually master on this box. YMMV, but that is roughly the kind of plan I would follow. It always pays to be safe.
networker050184 wrote: » Those people with more experience have probably seen devices comeback online and wreak havoc which is why they're so cautious. Either that or someone that has told them war stories.