Skip to article frontmatterSkip to article content

OOM kill of enforce-xfs-quota on neurohackademy

FieldValue
Impact TimeOct 16 at 15:00 to Oct 16 at 16:14
Duration1h 14m

Overview

The enforce-xfs-quota container experienced OOM following Angus quota changes

What Happened

The hard quota for the neurohackademy prod hub was modified, Oct 16 at 15:00 to Oct 16 at 16:14 leading to an update of all projects. The generator script encountered an OOM error when processing a large directory (_shared).

Where We Got Lucky

The engineer noticed this when comparing configurations between clusters.

Action Items

Timeline

Oct 16, 2025

TimeEvent
3:42 PMAn engineer notices the enforce-xfs-quota container restarting
3:47 PMThe engineer creates a debug container and runs the generator
4:00 PMThe engineer successfully ran the script, and investigated why it was crashing for the main container