In just four days, they programmed the robots using GPT-4o, creating a visual language ... sophisticated multimodal agent ...