I built a finetuned codellama-7B model with David Fu and Alexandra Duan that codes in three.js as our final project in NLP at Columbia. Training data was composed of official three.js samples online, and our evaluation benchmark used a VLM (Qwen3-VL) to judge the visual accuracy of the generated scene.
Watch our model in action here: