DEV Community

Cover image for 😋 AGI (bark 🐶) Smart waitress 🎙️
adriens
adriens

Posted on

1

😋 AGI (bark 🐶) Smart waitress 🎙️

❔ About

With this post you'll see how I started my first full artwork creating a bridge between:

  • 📜 Data
  • 🎙️ Sound design
  • 🤖 Generative Text To Speech
  • 🖌️ Video artwork
  • 📢 Digital contents streamline
  • 📈 Social networks and content embedding

💡 Inception

What triggered this creation is the following tweet:

... I immediately started to think:

"... and if I could create a fully digital, multimodal Customer Experience that would be ready to be shared on social platforms ?"

☝️ Also, people are talking a lot about about AGI like Midjourney, DALL-E... but very much less about Generative AI for TTS (Text to Speech).

♾️ " Voice prompts," aka. "History prompts"

As all others AGI, suno-ai/bark makes no exception : it relies on "PROMPTs".

Image description

Luckily, the bark's community is very active and share their voices prompt (and tags) discoveries :

Image description

🔁 Creative workflow

Here is the current workflow I could experiment:

  1. Create & release a SDK to get the data
  2. Imagine a customer experience at restaurant
  3. Develop & tune the data driven script and build soundtrack
  4. Create an avatar and scene for the waitress
  5. Put together soundtrack & avatar into video

🧰 Tools

Here are the open source tools I used for now:

🍿 Demos

Below are the demos:

🤓 How it's built (author's words)

🎙️ Soundtrack

Output soundtrack with bark:

🎞️ Movie

Then put the sound into an avatar with SadTalker:

🤔 Ideas for "later"

Automate:

  1. Video creation
  2. Video upload on dedicated cloud services for further optimal collaboration, digital marketing,...
  3. Avatar creation so video is totally code driven... and makes content more original (and funny) on each release thanks to one time generative prompt (prompt design required)

↩️ Conclusion

The more I think about designing - and achieving - such experiences, the more I find evident the core of this kind of project is:

  • 🎯 Get a clear idea and be strongly focused on what you want to achieve (ie. you don't get lost in your creative journey)
  • 🔗 Design a clean linear workflow that focus on tasks (not tools) so you can adapt it easily as AI projects are evolving at an amazing pace (I mean every week there are new tools)

🔖 Resources

🔭 Tools to prototype

Top comments (9)

Collapse
 
adriens profile image
adriens
Collapse
 
adriens profile image
adriens

Collapse
 
adriens profile image
adriens
Collapse
 
adriens profile image
adriens

Collapse
 
adriens profile image
adriens

Collapse
 
adriens profile image
adriens
Collapse
 
adriens profile image
adriens
Collapse
 
adriens profile image
adriens
Collapse
 
adriens profile image
adriens

A Workflow Copilot. Tailored to You.

Pieces.app image

Our desktop app, with its intelligent copilot, streamlines coding by generating snippets, extracting code from screenshots, and accelerating problem-solving.

Read the docs

👋 Kindness is contagious

Engage with a sea of insights in this enlightening article, highly esteemed within the encouraging DEV Community. Programmers of every skill level are invited to participate and enrich our shared knowledge.

A simple "thank you" can uplift someone's spirits. Express your appreciation in the comments section!

On DEV, sharing knowledge smooths our journey and strengthens our community bonds. Found this useful? A brief thank you to the author can mean a lot.

Okay