This is a submission for the Google AI Studio Multimodal Challenge
What I Built
I created Look & Learn, a daily challenge app for language learners. Each day, one new image is generated, and for every language and skill level, a quiz is created. You can always go back and play quizzes from earlier days, and if a quiz for that day doesn't exist yet, it will be generated when you first try it. You can also see which days you've already played, so you can get a streak going!
The app will then ask you some questions about the image, in the language you're trying to learn. For beginners, all the questions are multiple choice. For intermediate and advanced levels, you'll have to type out your answers.
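The streak idea mentioned above can be sketched in a few lines. This is only an illustration, not the app's actual code: the function name `current_streak` and the rule that a streak stays alive until the end of the following day are my assumptions.

```python
from datetime import date, timedelta


def current_streak(played_days: set[date], today: date) -> int:
    """Count consecutive played days ending today.

    Assumption: if today's quiz has not been played yet, the streak is
    counted up to yesterday, so it is not lost until the day is over.
    """
    day = today if today in played_days else today - timedelta(days=1)
    streak = 0
    while day in played_days:
        streak += 1
        day -= timedelta(days=1)
    return streak
```

With this rule, a learner who played the last three days but not yet today still sees a streak of three.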
Demo
Screenshots:
Screenshot from an intermediate-level Dutch quiz:
How I Used Google AI Studio
I wanted to see how far I could take Google AI Studio while touching the code by hand as little as possible. While I'm mostly skeptical of vibe coding, I thought this challenge was an interesting opportunity to give it a try. So, I mostly wrote prompts, and would give the model feedback in natural language.
Multimodal Features
At the start of each daily challenge, the application always uses the image for that day, generating it if it doesn't exist yet. That image is then the basis for all the quizzes created that day.
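The "generate it if it doesn't exist yet" behaviour is a get-or-create pattern. Here is a minimal sketch of it, assuming an in-memory store and two injected generator functions. The class name `DailyCache` and both callables are hypothetical stand-ins for whatever storage and model calls the app really uses.

```python
from datetime import date
from typing import Any, Callable, Dict, Tuple


class DailyCache:
    """Get-or-create store for the daily image and its quizzes.

    The two callables are placeholders for the real model calls;
    they are assumptions for this sketch, not the app's API.
    """

    def __init__(
        self,
        generate_image: Callable[[date], Any],
        generate_quiz: Callable[[Any, str, str], Any],
    ) -> None:
        self._generate_image = generate_image
        self._generate_quiz = generate_quiz
        self._images: Dict[date, Any] = {}
        self._quizzes: Dict[Tuple[date, str, str], Any] = {}

    def image_for(self, day: date) -> Any:
        # One image per day, created on first request.
        if day not in self._images:
            self._images[day] = self._generate_image(day)
        return self._images[day]

    def quiz_for(self, day: date, language: str, level: str) -> Any:
        # One quiz per (day, language, level), always built on that day's image.
        key = (day, language, level)
        if key not in self._quizzes:
            image = self.image_for(day)
            self._quizzes[key] = self._generate_quiz(image, language, level)
        return self._quizzes[key]
```

Because every quiz goes through `image_for`, all languages and levels for a given day share the same image, and replaying an earlier day never triggers a second generation.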
Gemini-2.5-flash is used to generate questions about the image, with prompts that include guidelines for the questions as well as the user's fluency level. For multiple choice questions, the correct answer is clear and immediate feedback is given. For text entry questions, the user's response is sent back to Gemini-2.5-flash to evaluate correctness and provide grammar and vocabulary feedback.
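To make the level-dependent behaviour concrete, here is a rough sketch of how the question format and prompt text could be derived from the fluency level. The prompt wording, the function names, and the default of three questions are all my assumptions, and the actual call to the model is deliberately left out.

```python
LEVELS = ("beginner", "intermediate", "advanced")


def question_format(level: str) -> str:
    """Beginners get multiple choice; other levels type their answers."""
    if level not in LEVELS:
        raise ValueError(f"unknown level: {level!r}")
    return "multiple_choice" if level == "beginner" else "text_entry"


def build_question_prompt(language: str, level: str, num_questions: int = 3) -> str:
    """Assemble a hypothetical prompt to send along with the daily image."""
    fmt = question_format(level)
    lines = [
        f"Write {num_questions} questions about the attached image in {language}.",
        f"The learner's fluency level is {level}.",
    ]
    if fmt == "multiple_choice":
        lines.append("Each question must have exactly one correct option.")
    else:
        lines.append("Each question must be answerable in a short written sentence.")
    return "\n".join(lines)
```

Only the text entry path needs a second round trip to the model, since a multiple choice answer can be checked locally against the known correct option.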
Finally, the image is also passed to Gemini-2.5-flash to generate an alt text. This description contains all the information needed to answer the quiz questions, but it's provided in the learner's native language so that they still have to make the effort of translating and connecting the details. I've also made sure that any text in other languages is wrapped with the correct lang attribute, so screen readers pronounce them properly.
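As an illustration of the lang attribute wrapping, here is a small sketch that produces the markup as a string. The helper name `wrap_foreign` and the choice of a span element are assumptions; the real app may render this differently.

```python
from html import escape


def wrap_foreign(text: str, lang: str) -> str:
    """Wrap text in a span carrying its language code (e.g. "nl").

    Both the text and the language code are HTML-escaped so that
    model output cannot inject markup.
    """
    return f'<span lang="{escape(lang, quote=True)}">{escape(text)}</span>'
```

For example, `wrap_foreign("de fiets", "nl")` gives `<span lang="nl">de fiets</span>`, which tells a screen reader to switch to Dutch pronunciation for that phrase.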
Top comments (4)
Great Project, fun way to test language proficiency!
Thx!
Looks great
Thx!