2.8m Gmail.txt 🎁 Direct

) to ensure the generated code matches the visual intent [11].

: Uses 11k pairs with a balance of textual and visual rewards ( 2.8M GMAIL.txt

: The model is tested on subsets ranging from 200k to 2.8 million samples. ) to ensure the generated code matches the