The reftest broke when font size lead to labels > 16px.
This test is a bit brittle because it doesn't properly rely on CSS properties but needs to use widget style properties to turn of extra sizing from widgets. It might break in the future when porting widgets to draw properly.