I do wish Cisco would hire some decent mechanical engineers. Yes, I know they left and went to Juniper, but you've had a few years now to recruit some more.
Had an ECC error on a 6509-NEBS card. These are sticky errors, and the card needs a power restart to get rid of them (and thus determine if they are random or not). With any other manufacturer I could drop the power to the card from the command line, but Cisco are "friendlier" to enterprise customers than that, so I need to physically eject the card.
When I reinsert it the box reboots. WTF! Turns out this is a known flaw in the chassis design -- the workaround is to push the top end of the card in first. Nowhere is this mentioned in the documentation. Nor was it pointed out by the Technical Assistance Centre staffer who asked for the card to be reinserted -- rather, that person was upset that I'd kept the task open whilst arranging a 3AM slot for hazardous work, saying that it was risk free and could be done at any time. [This is typical behaviour for the TAC: they get a task into Customer-Pend as quickly as possible under almost any pretext, then try to close it within a few days, no matter what the actual state on the ground.] Hmmm.
Rang the TAC manager to bitch. They simply aren't interested. Cisco staff seem to think that a spontaneous reboot on a physical event isn't unreasonable. Very much like the early days of Windows when people came to ignore operating system crashes as just part of the whole computing experience rather than as an engineering failure.