A tiny vision-language model
Generate text based on input prompts
let's talk about the meaning of life