Building a Modern Business Card Scanner with AI: A Deep Dive into AWS and DeepSeek Integration
Hey there, fellow developers! ๐ Today, I'm excited to share my journey of building a modern business card scanner application that leverages AI to transform physical business cards into organized digital contacts. This project combines AWS services with the DeepSeek API to create a powerful networking tool.
๐ฏ The Problem
In today's digital age, we still exchange physical business cards, but managing them efficiently is a challenge:
- Cards get lost or damaged
- Manual data entry is time-consuming and error-prone
- It's hard to analyze your professional network effectively
- Finding specific contacts or seeing connections between companies is difficult
There are existing digital business card scanning solution in the market, but they are often:
- expensive
- not user friendly
- only supports internal address book, no integration with device contacts
- data stored in the providers' database, you don't own the data
๐ก The Solution
I built an open-source web application that solves these problems by:
- Using AI to extract information from business card images
- Organizing contacts with rich metadata
- Allows importing the contacts to your phone contacts in batch!
- self-deployable solution, meaning you own and control your data!
- Providing network analysis and visualization
- Offering an AI-powered chat interface for querying your contact database
๐๏ธ Technical Architecture
The application is built on a modern serverless stack:
Key Components:
-
Frontend:
- Frontend hosted on S3 and enhanced with CloudFront CDN
- Interactive network visualization using D3.js
- Real-time chat interface
- Responsive design with Tailwind CSS
-
Backend:
- AWS Lambda for serverless compute
- Amazon Textract for OCR
- DeepSeek API for intelligent data extraction
- DynamoDB for contact storage
- S3 for image storage
-
Authentication:
- AWS Cognito for user management
- Secure API access with IAM roles
๐ Key Features
- Intelligent Card Scanning
- Network Analysis
- AI-Powered Chat Interface
๐ Getting Started
Want to try it yourself?
- Click this demo url
- Sign in using the test credentials:
- Username: testUser
- Password: 12345678!
- Navigate to the "Scan Cards" tab
- Upload one or more business card images
- View processed contacts in the "My Contacts" tab
- Explore network insights in the "Network Analysis" tab
Want to deploy yourself? Check out the repo: https://github.com/john-ng-hk/Biz-card-scanner
๐จ User Experience
The application provides a seamless experience:
- Upload business card images (single or batch)
- AI automatically extracts and categorizes information
- View your contacts in a modern, filterable interface
- Explore network connections through interactive visualizations
- Chat with AI to analyze your professional network
- Easily export contacts as vcard files for your phone contacts
๐ฐ Cost Estimation
AWS: Total 12 months cost = 15.00 USD AWS Pricing Calculator
DeepSeek API: minimum spend = 10RMB = 1.38 USD (1USD = 7.24 RMB)
Total Annual Cost = AWS+DeepSeek key = 15+1.38=16.38= around 16.5 USD
main assumptions:
Number of API Requests 1000 API requests each day x 20 working days = 20000 API requests per month
Number of scan card requests: 1 card each day x 30 days = 30 requests
๐ฎ Future Enhancements
I'm planning to add:
- Maybe with https so that the can directly scan with device mobile (DONE! You can now directly scan with camera)
- Better UI/UX and improve latency with AI chatbot.
- The current serverless architecture is a "Lambdalith", which is great for quick development for PoC but does not scale well with more features coming in. So may need decouple this part in the future.
๐ค Key Learnings
Building this project taught me several valuable lessons:
- Serverless architectures need really thoughtful timeout configurations (I had to increaset the integration timeout to 60000ms via service quota request this time...)
- User experience is crucial for AI-powered application, I didnt expect most my dev time is on the frontend part.
๐ Open Source
This project is open source! Feel free to:
- Star the repository
- Submit issues or feature requests
- Contribute improvements
- Fork and adapt for your needs
๐ Resources
๐ Connect
I'd love to hear your thoughts and suggestions! Connect with me:
- GitHub: [https://github.com/john-ng-hk]
- LinkedIn: [https://www.linkedin.com/in/john-nch-hk/]
Let me know in the comments if you'd like to see more detailed posts about specific aspects of this project!
Top comments (0)