Soft Skills Engineering
Episode 436: Paralyzed by checkboxes and I'm on a "must keep happy" list
- Autor: Vários
- Narrador: Vários
- Editor: Podcast
- Duración: 0:33:34
- Mas informaciones
Informações:
Sinopsis
In this episode, Dave and Jamison answer these questions: Marcus Zackerberg asks, I work at a megacorp whose recent focus has been on reliability. The company already has mature SLO coverage outage response standards, but my org has taken it to the extreme this year. For example… There is now a dashboard of “service health” that is reviewed by engineering leadership. In it, services are marked “unhealthy” permanently upon a failing check (think HTTP /health). To return to a “healthy” state, one must manually explain the failure with an entry in a spreadsheet, which must be reviewed and signed off. Increasingly I feel this has the opposite effect, discouraging nuanced work to improve reliability and instead becoming “checkbox driven development”, as well as impacting our ability to ship on our existing roadmap items. Additionally, our tech lead is fairly junior and frequently fails to communicate the org’s expectations to the team, leading to us being under the gun of the reliability da