Mustafa ERBAY

Posted on May 18 • Originally published at mustafaerbay.com.tr

Eventual vs. Strong Consistency: The Indie Hacker's Tough Choice

#life #consistency #distributedsystems #indiehacker

Whether you're building enterprise software or working on your own side product, decisions about database and system architecture are always on the table. For me, one of these decisions, especially as an indie hacker working with limited resources, has been choosing a "consistency" model. This means, should I expect data to be consistent everywhere, all the time, or am I willing to accept that it will eventually become consistent, even with a slight delay? This isn't just a technical choice; it's a matter of philosophy that directly impacts the future of a project and the user experience.

I’ve oscillated between these two approaches while working on a manufacturing ERP or developing my own financial calculators. Each has its own set of pros and cons. Here, I want to detail which decision I made in which scenario, what those decisions cost me, and why I chose the path I did.

What is Consistency and Why is it Critical for an Indie Hacker?

Data consistency is one of the cornerstones of distributed systems architecture. Simply put, it defines how "the same" data should be across multiple access points or when data is distributed across replicas. There are two main types: Strong Consistency and Eventual Consistency. For an indie hacker, this choice isn't just a theoretical concept; it's a reality that directly affects my project's cost, performance, and development speed.

For instance, when I was developing my own anonymous data platform for Turkey, there were areas where users needed to see real-time, up-to-date data. However, for some statistics, it was acceptable for them to update with a few seconds' delay. Making the right decision here meant ensuring user satisfaction while avoiding the setup of unnecessarily complex and expensive systems. A wrong choice could lead to unnecessary complexity, high latency, or worse, incorrect data.

ℹ️ A Reminder about the CAP Theorem

In distributed systems design, the CAP theorem tells us that we can only achieve two out of three properties (Consistency, Availability, Partition Tolerance) simultaneously. Usually, P (Partition Tolerance) is always desired because network failures are inevitable. In this case, the choice is between C (Consistency) and A (Availability). My journey as an indie hacker has always involved a trade-off with these choices.

Strong Consistency: Safe Harbor or Slow Sailing?

Strong Consistency guarantees that any write operation to a system will be immediately visible to all subsequent read operations. This means that the moment data is updated, everyone accessing that data sees its most current version. This model is indispensable in places where critical data like financial transactions, inventory management, or user authentication is involved. When I was writing the "purchase/manufacture/ship/invoice" flow for a manufacturing ERP, I couldn't compromise on Strong Consistency. The moment a product's stock was updated, it was unacceptable for another order to be processed with incorrect stock information.

The biggest advantage of this consistency model is that it simplifies application logic because you don't have to deal with data inconsistencies. However, this ease comes at a price: performance and scalability. To ensure Strong Consistency, you often need to use distributed transactions, two-phase commit protocols, or consensus algorithms (like Paxos, Raft). This increases latency and system complexity. In my ERP project, we used PostgreSQL, and the READ COMMITTED isolation level was generally sufficient. However, there were critical moments where I had to escalate to SERIALIZABLE isolation.

-- Example of Strong Consistency: A bank transfer
BEGIN;
UPDATE accounts SET balance = balance - 100 WHERE account_id = 'sender_id';
UPDATE accounts SET balance = balance + 100 WHERE account_id = 'receiver_id';
COMMIT;

In this example, once the COMMIT is successful, both account balances are updated immediately and consistently. If any part of the transfer fails, everything reverts to its previous state with a ROLLBACK. This forms the basis of Strong Consistency and is vital for financial systems. I worked with similar precision in my own side product, financial calculators, because even the slightest inconsistency was unacceptable when users' money was involved. [related: Database Transaction Management]

Eventual Consistency: The Allure of Speed and Scalability

Eventual Consistency offers the opposite approach: when a write operation occurs, it might take some time for that data to propagate to all its replicas. However, the system guarantees that after a while (eventually), all replicas will become identical. Eventual Consistency is widely used in places like social media feeds, e-commerce product recommendations, and instant messaging app "read" receipts. For example, even if you don't immediately see a friend's new post, it appearing in your feed within a few seconds usually doesn't cause a problem.

For indie hackers, Eventual Consistency is very appealing, especially when working with limited budget and hardware resources. It offers lower latency, high scalability, and a simpler distributed system architecture. For instance, in the backend of my Android spam app, I implemented blacklist updates with Eventual Consistency. When a user added a new spam number, it wasn't critical for this information to propagate to all users instantly; a delay of a few minutes was acceptable. This allowed me to serve more users with fewer server resources.

⚠️ Challenges of Eventual Consistency

Eventual Consistency places more responsibility on application developers. Managing data conflicts, handling scenarios where stale data is displayed in the user interface, and debugging inconsistent states are more complex. This can be a significant cost in terms of time and effort, especially for indie hackers.

When implementing Eventual Consistency, message queues (like Redis Streams or a Message Queue service) and idempotent operations are commonly used.

# Example of Eventual Consistency: User profile update
# Assume a user is updating their profile
user_profile_data = {"user_id": "123", "name": "Mustafa", "email": "mustafa@example.com"}

# Write to the database
db.save_user_profile(user_profile_data)

# Send a message to a queue to propagate the change to other services
message_queue.publish("user_profile_updated", user_profile_data)

In this scenario, when the write operation to the database completes, db.save_user_profile is successful. However, updating other services (like a search index or cache) with this change happens asynchronously via message_queue.publish and might take some time. This is the fundamental working principle of Eventual Consistency.

Trade-offs from an Indie Hacker's Perspective and My Choices

My choice between Eventual and Strong Consistency as an indie hacker has always been shaped by the triangle of my project's requirements, my budget, and my technical skills. Both models have serious trade-offs, and it's crucial to understand them thoroughly.

Cost: Strong Consistency is generally more expensive. Managing distributed transactions requires more server resources (CPU, RAM) and often involves setting up complex database clusters. For the side products I use on my own VPS, this cost was a significant factor. Eventual Consistency can reduce costs by enabling higher scalability with simpler architectures.
Complexity: Strong Consistency complicates system architecture and operations, especially in multi-region deployments. Factors like replication strategies, consensus algorithms, and network latency come into play. Eventual Consistency, on the other hand, might require conflict resolution logic on the application side, which is its own form of complexity. The operational challenges I faced managing Strong Consistency in my ERP project could be a topic for a separate post under [related: System Management and Observability].
Performance: Strong Consistency typically has higher latency because it must wait for all replicas to be updated before acknowledging an operation. Eventual Consistency can complete write operations much faster and can improve performance by reading from cache or nearby replicas. This difference can be vital for the responsiveness of a web application's user interface.
Development Speed: As an indie hacker, one of the most critical factors for me is development speed. Strong Consistency might offer a faster initial development process by requiring less dealing with "data inconsistency scenarios." However, this advantage can turn into a disadvantage when the need for scaling arises. Eventual Consistency, requiring the setup of inconsistency management mechanisms from the start, might progress more slowly initially but can be more flexible in the long run.

💡 An Example from My Experience

In my own anonymous data platform, I used Strong Consistency (PostgreSQL transactions) for the parts where users entered data. However, for the parts where this data was processed into statistical reports and displayed, I opted for Eventual Consistency. The reports could update with a few seconds' delay, which incredibly improved the system's overall performance and scalability. If I had tried to do everything with Strong Consistency, I would likely have needed significantly more server resources, multiplying the project's cost.

Practical Application Scenarios and My Choices

At different stages of my life and projects, I've made clear choices between Eventual and Strong Consistency. These choices were usually directly related to the nature of the project and its fault tolerance level.

Financial Calculators (Strong Consistency)

In my own financial calculators, it was imperative that the data entered by users and the calculation results were 100% accurate and consistently available in real-time. The slightest inconsistency could have negatively impacted users' financial decisions. Therefore, I opted for Strong Consistency using PostgreSQL transactions and the READ COMMITTED isolation level. The atomicity of operations, meaning they either completed entirely or not at all, was paramount. Performance loss or scalability challenges could not outweigh data accuracy here.

For example, when displaying a user's historical calculation records, I used a simple transaction like the following to ensure they always saw the latest version:

-- Updating a user's calculation history
BEGIN;
INSERT INTO user_calculations (user_id, calculation_type, input_data, result, timestamp)
VALUES ('user_123', 'mortgage_calc', '{"amount": 300000}', '{"monthly_payment": 2500}', NOW());
-- If there's another related operation, it would be added here
COMMIT;

This simple BEGIN/COMMIT block guaranteed the integrity and real-time consistency of the data.

Manufacturing ERP (Hybrid Approach)

Working on an ERP for a manufacturing company presented a more complex situation. Strong Consistency was indispensable for core workflows (orders, production, inventory, invoicing). When an order's stock was decremented, this information needed to be immediately updated across the entire system. Here, we even used the SERIALIZABLE isolation level in scenarios where complex production planning algorithms affected multiple records simultaneously.

However, we allowed Eventual Consistency in places like operator screens or real-time production tracking dashboards. When an operator recorded the completion of a machine output, this information could be delayed by 5-10 seconds before reflecting on the global dashboard. This delay was not critical for operational decisions and significantly reduced the system's overall load. This hybrid approach ensured the accuracy of business-critical processes while offering flexibility in performance and scalability. For the dashboards, we set up a separate Redis cache layer and asynchronously pushed database changes to this cache.

# Completing a production order and updating the dashboard
# Strong Consistency: Update production order status
db_session.execute(
    "UPDATE production_orders SET status = 'COMPLETED' WHERE order_id = :order_id",
    {"order_id": "PO001"}
)
db_session.commit()

# Eventual Consistency: Update cache for the dashboard (asynchronous)
redis_client.publish("production_update_channel", {"order_id": "PO001", "status": "COMPLETED"})

Here, db_session.commit() ensures Strong Consistency, while redis_client.publish triggers Eventual Consistency. My post on [related: Redis OOM Eviction Policy Choices] delves into the details of such cache usage.

My Own Side Product Backend (Eventual Consistency Dominates)

For the backend of my Android spam app or my own blog site, I mostly leaned towards Eventual Consistency. When a user added a spam number, it was acceptable for this information to take a few minutes to propagate to other users. This meant a simpler architecture, lower server costs, and faster development. Particularly in a process where I ingested and processed logs from journald, the transformation of logs into statistics and their display also relied on Eventual Consistency.

Here, I would save the data using simple INSERT and UPDATE operations in PostgreSQL, and a worker service running in the background would process this data and write it to a cache or another Eventual Consistent store. Setting up retry mechanisms for error scenarios was critical in these types of systems.

# Log processing worker (simplified)
import time
import redis
import psycopg2

redis_conn = redis.Redis(host='localhost', port=6379)
pg_conn = psycopg2.connect("dbname=mydatabase user=myuser password=mypass")

def process_log_entry(log_data):
    # Save log data to PostgreSQL (Strong)
    cur = pg_conn.cursor()
    cur.execute("INSERT INTO raw_logs (data, processed) VALUES (%s, FALSE)", (log_data,))
    pg_conn.commit()
    cur.close()

    # Process log data and save summary statistics to Redis (Eventual)
    # This operation can be asynchronous and more tolerant to errors
    parsed_data = parse_log(log_data)
    redis_conn.incr(f"log_counts:{parsed_data['type']}")
    redis_conn.set(f"last_processed_log:{parsed_data['id']}", time.time())

# Main loop
while True:
    # Get log from queue
    log_entry = get_log_from_queue()
    if log_entry:
        process_log_entry(log_entry)
    time.sleep(1)

In this example, while saving the raw log to PostgreSQL is done with Strong Consistency, updating statistics in Redis relies on Eventual Consistency. This separation allowed me to leverage the advantages of both worlds.

Challenges Faced Implementing Consistency Models

While implementing these two different consistency models across various projects, I, as an indie hacker, encountered numerous challenges. These were not only technical but also related to user experience and debugging processes.

Data Conflicts

One of the biggest challenges with Eventual Consistency arises when concurrent writes to the same data occur from multiple sources. For instance, when two users try to edit the same article simultaneously, it becomes necessary to determine which version is correct. Simple strategies like Last-Writer-Wins are sometimes sufficient, but other times, more complex merge algorithms are needed. Debugging such situations, especially in distributed systems, can be quite time-consuming. I've had to trace these conflicts using specific timestamps, like a WAL rotation alarm occurring at 03:14, while examining journald logs.

User Experience Challenges

When transitioning from Strong Consistency to Eventual Consistency, I had to manage the possibility of users seeing stale data. For example, a user not seeing a change they made immediately might be perceived as an error. To solve this, I used techniques like loading spinners, "Your data is updating..." messages, or optimistic UI updates. This means showing the user as if the action happened instantly, while waiting for Eventual Consistency to complete in the background. This approach was particularly effective in improving user satisfaction on my mobile app. [related: Mobile App Performance Optimization]

Debugging and Observability

When working with Eventual Consistency in distributed systems, finding the root cause of a problem (root cause analysis) becomes much more challenging. Data inconsistencies can manifest at different times across different services. Therefore, observability (metrics, logs, traces) is incredibly important. Collecting metrics with Prometheus, creating dashboards with Grafana, and using Distributed Tracing tools have been lifesavers in such scenarios. Especially when I hit soft limits like cgroup memory.high and got OOM-killed, these tools were crucial in understanding why the system reached that state.

# Inspecting logs of a related service with journalctl
journalctl -u my_async_worker.service --since "2 hours ago" | grep "consistency_error"

This command allows me to scan the logs of a specific service for the last 2 hours, finding keywords like "consistency_error," helping me understand when and where an inconsistency began.

Tips for Making the Right Decision and My Approach

When deciding between Eventual and Strong Consistency, as an indie hacker, I ask myself a few fundamental questions. These questions have effectively guided me toward the right path:

Is Data Loss or Incorrect Data Acceptable? If data loss or showing incorrect data momentarily would impact the criticality of the task (finance, inventory, security), then Strong Consistency is mandatory. If a delay of a few seconds or minutes is tolerable, Eventual Consistency might be an option.
How Important is Scalability? What is the future growth potential of my project? If I aim to reach millions of users or high transaction volumes, Eventual Consistency generally offers a better starting point. Strong Consistency requires more complex and costly solutions for scaling.
What are My Budget and Resources? As an indie hacker, I have a limited budget and time. The complexity and cost associated with Strong Consistency can be overwhelming for small projects. Eventual Consistency allows me to achieve more with fewer server resources. When I recently wrote sleep 360 and a worker got OOM-killed, I optimized resource usage by switching to polling-wait, which was a result of the flexibility offered by Eventual Consistency.
What are the User Experience Expectations? Do users expect to see changes they make immediately? Or do they have tolerance for slight delays? This directly impacts interface design and backend architecture.

ℹ️ My Practical Rule

Generally, I prefer Strong Consistency for the "core" and money-related parts of the business. For other, less critical, or purely "informational" parts, I lean towards Eventual Consistency. This hybrid approach allows me to balance security with performance.

When making these decisions, I always lay out the potential trade-offs. For instance, the logic of "We would have done X, but because of Y, we chose Z" is a golden rule for me. Every choice has a cost, and foreseeing this cost from the outset prevents major problems down the line.

Conclusion

The choice between Eventual and Strong Consistency is a recurring technical and strategic decision in an indie hacker's journey. In my 20 years of field experience, I've repeatedly seen these decisions directly impact a project's success or failure. While prioritizing data accuracy in my own financial calculators, I aimed for scalability and cost-effectiveness in my anonymous data platform. In both cases, choosing the consistency model that best suited the project's needs allowed me to achieve the best results.

Remember, even with an "it'll be fine" attitude, understanding these fundamental principles and making the right decision in the right place will save you headaches in the long run. In my next project, perhaps I'll delve deeper into a more complex Eventual Consistency model, CRDTs (Conflict-free Replicated Data Types), and share my experiences on this topic with you.

DEV Community