DEV Community

Cover image for ๐Ÿง‘โ€โš•๏ธCareSetu AI: Instant Appointments, Intelligent Health Advice, and 24/7 Support Through Voice
Saket Jha
Saket Jha

Posted on • Edited on

๐Ÿง‘โ€โš•๏ธCareSetu AI: Instant Appointments, Intelligent Health Advice, and 24/7 Support Through Voice

AssemblyAI Voice Agents Challenge: Business Automation

This is a submission for the AssemblyAI Voice Agents Challenge for Business Automation Voice Agent and Domain Expert Voice Agent

๐Ÿ“ What I Built
As a software engineer at the healthcare startup CareSetu and a 3rd-year B.Tech student in Mathematics and Computing at institute of national importance, I've seen firsthand how technology can solve critical real-world problems. This voice-based web app allows users to schedule medical appointments, get answers to health questions like 'What precautions should I take for diabetes?', and manage their healthcare needs seamlessly. Itโ€™s designed to feel like you're having a conversation with a trusted health assistant, making healthcare more accessible for everyone.
From a business perspective, this directly impacts CareSetu an other business by automating front-desk tasks, reducing operational costs, and ensuring a steady flow of scheduled appointments, which is vital for the financial health of our partner clinics.
As you can see your result(appointment scheduled) which is 100% correct that means AssemblyAI STT conversion is most reliable for your other task also.

Tech Stack Used:

Backend Tech Stack
โœ…Core Framework & Runtime:
โœ… Python 3.11.9 - Main backend language
โœ… LiveKit Agents Framework- Real-time voice/video communication platform
โœ…AsyncIO - Asynchronous programming for handling concurrent operations
AI & Machine Learning:
โœ…Gemini flash - LLM integration for conversational AI
โœ…Cartesia/TTS - text-to-speech services
โœ…AssemblyAI - STT service with business optimizations
โœ…ElevenLabs - Premium text-to-speech service but as fallback
โœ…Google Cloud Speech - Additional TTS provider as fallback
โœ…Transformers/HuggingFace - ML model handling

PDFMiner/PDFPlumber/PyPDF2 - PDF document processing
โœ…NumPy/SciPy - Scientific computing
โœ…Scikit-learn - Machine learning utilities

Web Framework & APIs:
โœ…LiveKit Agents Framework - Real-time communication platform
โœ… Python HTTP Server - Simple token server for frontend integration
โœ… AIOHTTP - HTTP client library (for outbound requests

Integrations:
โœ…Google Calendar API - Appointment scheduling
โœ…Google Cloud APIs - Various Google services

Frontend Tech Stack
Core Framework:
โœ…React 19.1.0 - Modern React with latest features
โœ…Vite 7.0.4 - Fast build tool and dev server
โœ…TypeScript - Type-safe JavaScript development
โœ…UI & Styling:
โœ…Tailwind CSS 4.1.11- Utility-first CSS framework
โœ…PostCSS- CSS processing
โœ…Real-time Communication:
โœ…LiveKit Client - WebRTC client for voice/video
โœ…@livekit/components-react - Pre-built React components for LiveKit
Testing:
โœ…Vitest - Fast unit testing framework
โœ…Testing Library - React component testing utilities
โœ…JSdom- DOM simulation for testing
Development Tools:
โœ…ESLint - Code linting
โœ… Terser - JavaScript minification

๐Ÿ” STEP-BY-STEP DETAILED BREAKDOWN

  1. User Voice โ†’ Microphone โ†’ Web Audio API โ†’ LiveKit Stream

  2. Audio Stream โ†’ AssemblyAI โ†’ Text Transcript

3.Text Query โ†’ Query Processing โ†’ Knowledge Search โ†’ Context Building

4.Enhanced Context โ†’ Google Gemini โ†’ AI Response

5.Appointment Intent โ†’ Google Calendar API โ†’ Booking Result

6.AI Response โ†’ Cartesia/ElevenLabs/Google โ†’ Audio Stream

7.Audio Stream โ†’ Web Audio API โ†’ Speaker Output

8.Complete Interaction โ†’ Analysis โ†’ Knowledge Update

9.User continues โ†’ Loop to Step 1 | Timeout/Disconnect โ†’ End session

10.STT Error โ†’ Show error โ†’ Retry โ†’ Text input fallback
LLM Error โ†’ Show error โ†’ RAG-only response โ†’ Retry

TTS Error โ†’ Try next service โ†’ Text response fallback
Calendar Error โ†’ Show error โ†’ Manual booking โ†’ Retry

Note: This model currently supports the Appointment Intent and Query Intent (such as providing information based on FAQs, the Privacy Policy of CareSetu, health insurance details, various departments of CareSetu, and general modern scientific tips along with homemade remedies related to healthcare).

๐Ÿ’ปDemo

Explanation Video

๐Ÿ‘‰About Myself, Working Project and Repository explanation

Note:- As you can see at timestamp 7:07 model tell me my name this mean it remembered my name during conversation.

๐Ÿ‘‰Pure Backend Explanation

๐Ÿ‘‰Pure Frontend Explanation

The application is live at:
๐Ÿ‘‰Live Link

1) click on connect to agent
2) then click on start conversastion

๐Ÿ‘‰Backend is hosted on an AWS EC2 instance with Nginx as a reverse proxy.
๐Ÿ‘‰Frontend is hosted on Vercel.

๐Ÿ“ GitHub Repository

โœ…Frontend Code

โœ…Backend Code

Proof of Code Snippet and its result

Source:- caresetuAgent_3.0(Backend)

Code Snippet for Building a Voice Agent with AssemblyAI and LiveKit

Code Snippet for RAG Integration

*Calendar Integration *

๐Ÿง˜๐Ÿปโ€โ™‚๏ธConclusion

AssemblyAI played a crucial role in helping me successfully complete this challenge. From the start of the CareSetu agent project, the AssemblyAI team provided responsive support and guidance, answering my questions about technical requirements, deployment options, and permissible ways to share my project publicly. Whether it was clarifying best practices for publishing my work, assisting with integration details, or offering encouragement during each milestone, their team was always available whenever I needed help, as evidenced by the direct conversations with team members like Lee Vaughn , Dan Ince , Amanda DiNoto and Ryan Seams. Their willingness to address any issues and interest in seeing my progress not only boosted my confidence but also ensured technical obstacles never became roadblocks. This support allowed me to focus fully on building an impactful, reliable voice agent for healthcare automation and customer supportโ€”demonstrating AssemblyAIโ€™s genuine commitment to the success of developers using their platform.

Comment your thoughts, and follow me!

โœ…Twitter Post Of This Blog 5k+ Impression

โœ…Published This Story in Artificial Intelliegence having 27k+ followers + email subscriber

๐Ÿ”—Connect with Me
Medium:- Profile Link
Twitter/X: Profile Link
LinkedIn: Profile Link

Top comments (28)

Collapse
 
alok_maurya_dc6a114d6187c profile image
Alok Maurya

Bro this is genuinely impressive! Loved how youโ€™ve combined instant healthcare access with AI-powered supportโ€”feels like something the system really needs. Proud of you for building something this impactful ๐Ÿ‘ Keep going!"

Collapse
 
saket_jha_a89aca5daba5e8c profile image
Saket Jha • Edited
Collapse
 
praneshsharma profile image
PRANESH

Saket, this is fantastic!
A huge step forward for healthcare accessibility.

Collapse
 
saket_jha_a89aca5daba5e8c profile image
Saket Jha • Edited

Thanks @praneshsharma. Ready to collaborate with you in upcoming @AssemblyAI or any AI/ML project

Collapse
 
praneshsharma profile image
PRANESH

Yes , let's catch up soon and build something cool together !!

Collapse
 
sudhanshu_kumar_998960661 profile image
Sudhanshu Kumar

An impressive blend of RAG and LLM .

Collapse
 
saket_jha_a89aca5daba5e8c profile image
Saket Jha

Thanks @sudhanshu .

Collapse
 
shubham_kumar_64dfdd60244 profile image
shubham Kumar

This is an impressive integration of AI into healthcare! The voice-enabled features and instant appointment system could be a game-changer for accessibility.

Collapse
 
saket_jha_a89aca5daba5e8c profile image
Saket Jha
Collapse
 
ayush_kumar_6b9069a2dfaa1 profile image
AYUSH KUMAR

Impressed by your effort bro .keep growing @saket_jha_a89aca5daba5e8c .

Collapse
 
saket_jha_a89aca5daba5e8c profile image
Saket Jha
Collapse
 
shaunak_lende_81d210ea9d9 profile image
Shaunak Lende

The use of AI in the most crucial area that is health is amazing. Also the feature of scheduling appointment is very well executed.

Collapse
 
saket_jha_a89aca5daba5e8c profile image
Saket Jha

Yes, it's working. It will reduce manpower in the appointment scheduling department and will be available 24/7.

Collapse
 
sarthak_kasaudhan_1084151 profile image
SARTHAK KASAUDHAN • Edited

Kudos, Saket! Integrating voice commands into a health platform is a forward- thinking idea. You're not just building an app- you're already making it happen.
Realy it will be a Game- changer. Well done!!

Collapse
 
rakesh_mishra_d7250d028e4 profile image
rakesh Mishra

Absolutely impressed by this unique and innovative project! ๐ŸŽฏ
Huge thanks to @saket Kumar Jha for introducing me to @AssemblyAI โ€” such a powerful tool!
Iโ€™m excited to share it with my friends and canโ€™t wait to use it in my upcoming BTP project. ๐Ÿš€๐Ÿ”

Collapse
 
saket_jha_a89aca5daba5e8c profile image
Saket Jha

Thanks @rakesh_mishra_d7250d028e4. You should definitely use in your project.

Collapse
 
anijeet_mani_c0029251ff5e profile image
Anijeet Mani

Great work with the use of AI in healthcare field. This has potential for the game changer in health care.

Collapse
 
saket_jha_a89aca5daba5e8c profile image
Saket Jha
Collapse
 
eishaan_khatri profile image
Eishaan Khatri

Great work, Saket! As someone working with NLP, I really appreciate the thoughtful integration of RAG, voice AI, and real-time systems. Clean execution with real-world impact, truly impressive!

Collapse
 
saket_jha_a89aca5daba5e8c profile image
Saket Jha

Thanks @eishaan_khatri.

Some comments may only be visible to logged-in visitors. Sign in to view all comments.