

Look & Learn: a Google AI Multimodal Challenge Entry

This is a submission for the Google AI Studio Multimodal Challenge

What I Built

I created Look & Learn, an app for language learners. The app generates an image of an interesting scene, then asks you questions about that image in the language you're trying to learn. At the beginner level, all questions are multiple choice; at the intermediate and advanced levels, you have to type out your answers.

Demo

Try Look & Learn here

Screenshots:

Screenshot of the Look & Learn start screen. Here, the user has options to select the language they speak, the language they want to learn, and their level of fluency. A button at the bottom lets them start the quiz.

Image showing a multiple choice question where the user has picked the wrong answer. The wrong answer is highlighted in red, while the correct one is shown in green. At the bottom, an explanation in the user's native language is shown.

Screenshot from an intermediate-level Dutch quiz:
Screenshot showing a question where the user had to type in an answer. At the bottom, a message indicates that the user answered correctly, but points out and explains a verb-conjugation error.

How I Used Google AI Studio

I wanted to see how far I could take Google AI Studio while touching the code by hand as little as possible. While I'm mostly skeptical of vibe coding, this challenge seemed like an interesting opportunity to give it a try. So I mostly wrote prompts and gave the model feedback in natural language.

Multimodal Features

At the start of the quiz, the application either generates an interesting image with Imagen or takes an existing one from Google Cloud Storage; currently it picks a stored image 80% of the time. It then uses Gemini 2.5 Flash to generate questions about the image, giving it the image together with a prompt that includes guidelines for the questions as well as the user's fluency level.
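In code, that flow looks roughly like the sketch below. This is a minimal sketch assuming the @google/genai TypeScript SDK that AI Studio apps typically use; the Imagen model id, the prompt text, and the pickStoredImageUrl helper are illustrative, not the app's actual code:

```typescript
import { GoogleGenAI } from "@google/genai";

const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

// Hypothetical helper: picks a random image from the Cloud Storage pool.
declare function pickStoredImageUrl(): Promise<string>;

// Reuse a stored image 80% of the time; otherwise generate a fresh one with Imagen.
async function getSceneImage(): Promise<string> {
  if (Math.random() < 0.8) {
    return pickStoredImageUrl();
  }
  const result = await ai.models.generateImages({
    model: "imagen-3.0-generate-002", // illustrative model id
    prompt: "A lively street market with many distinct, describable details",
    config: { numberOfImages: 1 },
  });
  const bytes = result.generatedImages?.[0]?.image?.imageBytes;
  if (!bytes) throw new Error("Imagen returned no image");
  return `data:image/png;base64,${bytes}`;
}

// Ask Gemini 2.5 Flash for questions about the image, tuned to the user's level.
async function generateQuestions(imageBase64: string, targetLanguage: string, level: string) {
  const response = await ai.models.generateContent({
    model: "gemini-2.5-flash",
    contents: [
      { inlineData: { mimeType: "image/png", data: imageBase64 } },
      { text: `Write quiz questions in ${targetLanguage} about this image for a ${level} learner.` },
    ],
  });
  return response.text;
}
```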

For multiple-choice questions there is a well-defined correct answer, so the application gives the user feedback immediately. For text-entry questions, we again send the image to Gemini 2.5 Flash, along with the question and the user's answer, to get an evaluation of correctness as well as of the user's vocabulary and grammar.
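Under the same assumptions, the evaluation step could be sketched like this; only the pattern of sending image, question, and answer to Gemini 2.5 Flash comes from the app, while the prompt wording and the JSON shape are illustrative:

```typescript
import { GoogleGenAI } from "@google/genai";

const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

// Evaluate a free-text answer: Gemini sees the same image, the question, and the
// learner's answer, and returns a verdict plus vocabulary/grammar feedback.
async function evaluateAnswer(imageBase64: string, question: string, answer: string) {
  const response = await ai.models.generateContent({
    model: "gemini-2.5-flash",
    contents: [
      { inlineData: { mimeType: "image/png", data: imageBase64 } },
      {
        text:
          `Question: ${question}\nLearner's answer: ${answer}\n` +
          `Judge whether the answer is correct for the image, and comment on the ` +
          `learner's vocabulary and grammar. ` +
          `Reply as JSON: {"correct": boolean, "feedback": string}.`,
      },
    ],
    config: { responseMimeType: "application/json" }, // ask for machine-readable output
  });
  return JSON.parse(response.text ?? "{}");
}
```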

I also pass the image to Gemini 2.5 Flash to generate alt text for it. The alt text contains enough detail to answer all the quiz questions, and it's written in the user's native language, so screen reader users still have to go through the exercise of finding the correct answer in the description and translating it. I've also tried to ensure that every element that may appear in a different language has a matching lang attribute, so that screen readers pronounce it correctly.
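Both accessibility pieces could be sketched the same way; the prompt, the component, and the language codes below are illustrative examples, not the app's actual markup:

```tsx
import React from "react";
import { GoogleGenAI } from "@google/genai";

const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

// Generate alt text in the learner's NATIVE language, detailed enough
// to answer every quiz question from the description alone.
async function generateAltText(imageBase64: string, nativeLanguage: string) {
  const response = await ai.models.generateContent({
    model: "gemini-2.5-flash",
    contents: [
      { inlineData: { mimeType: "image/png", data: imageBase64 } },
      { text: `Describe this image in ${nativeLanguage} with enough detail to answer quiz questions about it.` },
    ],
  });
  return response.text ?? "";
}

// Mixed-language markup: lang attributes let screen readers switch
// pronunciation between the target and native languages.
function QuestionCard({ question, explanation }: { question: string; explanation: string }) {
  return (
    <div>
      <p lang="nl">{question}</p>     {/* question in the target language, e.g. Dutch */}
      <p lang="en">{explanation}</p>  {/* explanation in the learner's native language */}
    </div>
  );
}
```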

Top comments (1)

Pravesh Sudha

Great project, a fun way to test language proficiency!