{% if mode == "primary" %}
📦
Box plots show the distribution of performance scores across multiple test runs, with boxes representing the interquartile range (middle 50% of values).
{% elif mode == "comparison" %}
🔄
This comparison shows how different models' performance distributions vary under the same perturbation conditions.
{% endif %}