DEV Community

Cover image for Two DEV Users. Two Countries. One Weird Little Avatar Project.
Aryan Choudhary
Aryan Choudhary

Posted on • Edited on

Two DEV Users. Two Countries. One Weird Little Avatar Project.

Technical hurdles and VRM animation bugs

We Gave Our DEV Avatar Project a V2… and It Escalated Quickly

A few months ago, @webdeveloperhyper and I made a small side project where two VRM avatars talked to each other in a 3D scene.

At the time, it was just meant to be something fun. Two developers from different countries making weird little animated characters interact over the internet because… honestly, why not?

If you missed the first post, here it is:

How a DEV Friend and I Brought Two Avatars to Life

Back then, the project was much simpler. A conversation system, some VRM animations, a few gestures, and a surprising amount of time spent trying to stop avatars from snapping into cursed poses mid-animation.

We thought we were mostly done with it.

We were very wrong.

Somewhere along the way, the project became… alive

The funny thing about side projects is that they rarely stay contained.

You fix one thing, get one new idea, add one “small feature,” and suddenly a weekend experiment starts behaving like an actual system.

That’s pretty much what happened here.

While I was busy juggling office life, Japanese, engineering work, interviews, marketing side quests, and trying to survive adulthood without my brain overheating… WDH was quietly evolving the project in the background like some kind of sleep-deprived VRM wizard.

And every few days I’d wake up to messages like:

“Added music and music button. Then pushed."

or

“fix mobile speech bubble layout and responsive avatar positioning”

or sometimes just:

“fixed and pushed.”

And somehow the project kept getting cooler.

The avatars stopped feeling scripted

One of the biggest changes in v2 was that the characters stopped feeling like objects that simply played animations on command.

Now they actually react.

They blink naturally. Follow movement. Shift attention while listening. React while the other character is speaking. Small details, but weirdly important ones.

It turns out humans are very sensitive to tiny signs of life.

A static model feels dead immediately.

But add:

  1. eye tracking
  2. idle movement
  3. small delays
  4. slight reactions
  5. facial expressions …and suddenly your brain starts treating it like a character instead of geometry.

That transition fascinated me way more than I expected.

At one point we spent an unreasonable amount of time adjusting tiny gesture timings that most users probably won’t consciously notice.

But that’s kind of the magic of interactive systems.

The invisible details are usually doing the heavy lifting.

The architecture quietly became more interesting too

The first version was mostly about “getting it to work.”

The second version became more about orchestration.

Now the scene handles:

  1. dialogue sequencing
  2. animation coordination
  3. background switching
  4. music control
  5. expression timing
  6. responsive layouts
  7. costume switching
  8. branching interactions

And because of that, hardcoding behavior stopped making sense very quickly.

So instead of writing giant chains of conditions everywhere, the conversation itself became structured data.

Each dialogue entry knows:

  • who is speaking
  • what animation plays
  • what expression appears
  • how timing should behave

Which made the system dramatically easier to extend without everything collapsing into spaghetti.

Ironically, the more playful the project became, the more important clean systems thinking became too.

The funniest bugs were always animation-related

There is probably a law somewhere stating that if you work with avatars long enough, something horrifying will eventually happen. And if not it should be.

At various points:

  • arms rotated into dimensions unknown to mankind
  • blinking broke and created sleep paralysis avatars
  • gesture transitions snapped violently
  • one animation froze mid-pose and looked like the character had emotionally given up on life
  • Classic side project experience honestly.

But weirdly enough, those moments ended up becoming some of the most memorable parts of the project.

Because every fix made the avatars feel a little more believable.

While all this was happening, @webdeveloperhyper was building something much bigger

And this is honestly the part I respect the most.

While we were experimenting with our collaboration project, WDH was also building something called AI Avatar — a VS Code and Chrome extension where VRM avatars react to AI activity, animate while you work, track your cursor, trigger expressions, speak through speech bubbles, and basically exist beside you like a tiny chaotic coding companion.

The interesting thing is that it never felt like one of those overly corporate “AI productivity” tools trying to optimize your soul.

It felt playful.

Like something made by someone who genuinely likes interactive characters and wants technology to feel a little more alive and personal.

And I think that mindset naturally spilled back into our collab project too.

A lot of the small interaction details, responsiveness ideas, animation handling improvements, and experimentation mentality came from constantly exploring these systems further.

Which honestly reminded me why collaborations matter so much.

You don’t just share workload.

You share ways of thinking.

The project accidentally became a tiny engine

That’s probably the strangest realization from all this.

At some point we stopped thinking:

“we’re making two avatars talk”

And started realizing:

“wait… this is basically becoming a reusable interaction system.”

Now there’s:

  • branching dialogue
  • modular animation logic
  • reusable scene orchestration
  • dynamic reactions
  • configurable backgrounds
  • audio systems
  • responsive UI behavior And suddenly your silly little side project starts looking suspiciously scalable.

That’s always the dangerous phase.

But honestly, the best part was still the process

The actual code matters, obviously.
But what I’ll probably remember most is:

  • random weekend debugging
  • async conversations across timezones
  • sending screenshots back and forth
  • testing weird ideas just because they sounded funny
  • slowly watching something gain personality
  • That feeling is hard to replicate in structured environments.

No roadmap pressure. No stakeholder meetings. No “business value alignment.”

Just curiosity carrying the project forward.

And honestly?
That’s probably why side projects stay fun.

Where this goes next

Right now, v2 still feels like the beginning.

There are already ideas floating around for:

  • interactive storytelling
  • user-triggered reactions
  • branching scenes
  • mini-games
  • dynamic conversations
  • more expressive characters
  • Or maybe something completely different. At this point, I’ve stopped trying to predict where these projects go.

The fun part is discovering it while building.

For the people interested in trying it themselves

WDH made a public version available here:

Vercel Demo (with VRMA + MP3)

GitHub Repo (without VRMA + MP3)

And honestly, if you build something weird with it, let us know.

Those are usually the best ideas anyway.

Top comments (17)

Collapse
 
javz profile image
Julien Avezou

Awesome updates! It's great to see this project evolve! Keep it up!

Have you considered gamifying the avatar by adding vitals such as sleep (forces you to take intentional breaks while the avatar rests up), or hunger (commits could satisfy this hunger for eg.)?

Collapse
 
itsugo profile image
Aryan Choudhary

Omg these are such GREAT ideas actually! @webdeveloperhyper we could really work on these!!!
Thank you Julien, we would definitely look into how to make these work, if doable we will surely add it up in v3!

Collapse
 
webdeveloperhyper profile image
Web Developer Hyper

Thank you for your comment! 😀 We will consider adding more functions when we update our project to v3.

Collapse
 
klaudiagrz profile image
Klaudia Grzondziel

Thank you for sharing your journey! Those animation-related bugs were hilarious to read about 😂 BTW, I never thought it's so much work to build all the gestures, eye movements, blinking, and so on!

Collapse
 
itsugo profile image
Aryan Choudhary

Thank you for checking it out (⁠ᵔ⁠ᴥ⁠ᵔ⁠) Glad you liked it!!!
And as far as knowing about the work that goes into making VR models work... neither did we know that before this project😭

Collapse
 
klaudiagrz profile image
Klaudia Grzondziel

Means you learnt a lot from this project, good job! 👏🏻

Collapse
 
harun_mahmud_88 profile image
Harun Mahmud

This is such a fun read! The part about "one animation froze mid-pose and looked like the character had emotionally given up on life" had me laughing — those are exactly the kind of bugs that are painful in the moment but make the best stories later.
What really resonated with me is how the project accidentally became a reusable engine. That happens with every side project I've built too — you start with something simple and suddenly you're architecting a whole system just to keep things clean.
The cross-timezone async collaboration angle is underrated as well. No meetings, just "fixed and pushed" — honestly sounds ideal.

Collapse
 
itsugo profile image
Aryan Choudhary

Heyy Harun!!! Glad you found our story funny 😭
And yeah now that we have brought this to a good level we can grow this into multiple directions as we like!

The "no meetings" is definitely a blessing in disguise lol.

Collapse
 
webdeveloperhyper profile image
Web Developer Hyper

Thank you for your cooperation! 😀 It was fun making a collaboration project with you. I want readers to feel the difference between India and Japan through the changeable backgrounds and music. Japanese offices do not have Tatami for floors, but it’s okay. 🤣 Japanese music uses the traditional instrument called the Koto. Looking forward to doing something fun and educational together again!

Collapse
 
kui_luo profile image
Kui Luo

Really practical engineering mindset here. Token efficiency in AI-assisted coding is underrated. Building your own streamlined tool instead of bloating every interaction with thousands of unnecessary tokens shows real maturity. The 900-line investment will compound over time — this is the kind of thinking that separates senior engineers from junior ones relying blindly on AI suggestions.

Collapse
 
prachub profile image
PracHub

Turning a fun project into a sophisticated system sounds like a typical dev spiral. I'm curious how you handled data structuring for the conversations. Storing and managing dynamic dialogue sounds simple, but scaling it can be a headache. If you're looking at modular logic or branching dialogue, the challenge probably goes beyond basic JSON structure. I've been using prachub.com for system design mocks. It's more useful than browsing random forums when I'm juggling a bunch of concepts. That side project evolution is pretty wild—easy to get lost in feature creep, right?

Collapse
 
itsugo profile image
Aryan Choudhary

Hey Andy! Yes for now the JSON structure is handling it well and I agree scaling it would be challenging... We will definitely check out your suggestion, excited to see how this grows!

Collapse
 
mevinbuilds profile image
Mevin Joseph Seby

Great article! The technical depth here is exactly what the community needs. One补充: don't forget to validate your assumptions early - it's easier to pivot when you have less code. For devs building SaaS, finding users doesn't have to be the hardest part - Rixly helps identify developers actively seeking solutions like yours.

Collapse
 
solty_ang profile image
Angela Soltys

My congratulations! Together are always more productive

Collapse
 
itsugo profile image
Aryan Choudhary

Thank you Angela!

Collapse
 
omkar_ghodke_13f91bad26c6 profile image
OMKAR GHODKE

Damn!!!! stuff.. Really enjoyed reading your journey, and appreciate you work 👏🏻

Collapse
 
itsugo profile image
Aryan Choudhary

Thanks Omkar, Glad you liked it!