<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Isha Mudgal</title>
    <description>The latest articles on DEV Community by Isha Mudgal (@isha_mudgal_8f5c04b6e8b54).</description>
    <link>https://dev.to/isha_mudgal_8f5c04b6e8b54</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3266714%2F85f99273-d966-4478-9e9d-a098de4b01f4.png</url>
      <title>DEV Community: Isha Mudgal</title>
      <link>https://dev.to/isha_mudgal_8f5c04b6e8b54</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/isha_mudgal_8f5c04b6e8b54"/>
    <language>en</language>
    <item>
      <title>System Design Pillars</title>
      <dc:creator>Isha Mudgal</dc:creator>
      <pubDate>Wed, 18 Jun 2025 13:55:43 +0000</pubDate>
      <link>https://dev.to/isha_mudgal_8f5c04b6e8b54/system-design-pillars-25o5</link>
      <guid>https://dev.to/isha_mudgal_8f5c04b6e8b54/system-design-pillars-25o5</guid>
      <description>&lt;p&gt;&lt;strong&gt;1. ⚡ Availability&lt;/strong&gt;&lt;br&gt;
Definition:&lt;br&gt;
The ability of a system to remain accessible and operational at all times.&lt;/p&gt;

&lt;p&gt;Key Points:&lt;/p&gt;

&lt;p&gt;Measured as a percentage uptime (e.g., 99.99% availability = ~52 minutes/year downtime).&lt;/p&gt;

&lt;p&gt;Requires redundancy: multiple instances, failovers.&lt;/p&gt;

&lt;p&gt;Common techniques: Load balancers, health checks, replicas.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. 🛡️ Reliability&lt;/strong&gt;&lt;br&gt;
Definition:&lt;br&gt;
The system's ability to function correctly and consistently over time.&lt;/p&gt;

&lt;p&gt;Key Points:&lt;/p&gt;

&lt;p&gt;Reliability ≠ Availability. A system can be available but return incorrect results.&lt;/p&gt;

&lt;p&gt;Achieved through: fault detection, retries, data replication, and monitoring.&lt;/p&gt;

&lt;p&gt;Measured with metrics like MTBF (Mean Time Between Failures).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. 📈 Scalability&lt;/strong&gt;&lt;br&gt;
Definition:&lt;br&gt;
A system’s ability to handle increased load without performance loss.&lt;/p&gt;

&lt;p&gt;Key Points:&lt;/p&gt;

&lt;p&gt;Vertical Scaling: Add more power to a single machine.&lt;/p&gt;

&lt;p&gt;Horizontal Scaling: Add more machines (preferred for web-scale).&lt;/p&gt;

&lt;p&gt;Involves sharding, caching, stateless services, and distributed queues.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;4. 🔧 Maintainability&lt;/strong&gt;&lt;br&gt;
Definition:&lt;br&gt;
How easily a system can be understood, updated, and fixed.&lt;/p&gt;

&lt;p&gt;Key Points:&lt;/p&gt;

&lt;p&gt;High maintainability = faster iterations and fewer bugs.&lt;/p&gt;

&lt;p&gt;Achieved with: clean code, modular architecture, automated tests, observability.&lt;/p&gt;

&lt;p&gt;Reduces system downtime and tech debt over time.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;5. 🧯 Fault Tolerance&lt;/strong&gt;&lt;br&gt;
Definition:&lt;br&gt;
The system’s ability to keep running even when some components fail.&lt;/p&gt;

&lt;p&gt;Key Points:&lt;/p&gt;

&lt;p&gt;Examples: retry logic, failover systems, circuit breakers.&lt;/p&gt;

&lt;p&gt;Closely tied with availability and reliability.&lt;/p&gt;

&lt;p&gt;Design principle: “Design for failure” — assume things will go wrong.&lt;/p&gt;

</description>
      <category>systemdesign</category>
    </item>
    <item>
      <title>Abstraction</title>
      <dc:creator>Isha Mudgal</dc:creator>
      <pubDate>Tue, 17 Jun 2025 15:59:44 +0000</pubDate>
      <link>https://dev.to/isha_mudgal_8f5c04b6e8b54/abstraction-2fi1</link>
      <guid>https://dev.to/isha_mudgal_8f5c04b6e8b54/abstraction-2fi1</guid>
      <description>&lt;p&gt;&lt;strong&gt;1. Why Are Abstractions Important?&lt;/strong&gt;&lt;br&gt;
Abstractions simplify complex systems by hiding lower-level implementation details. They:&lt;/p&gt;

&lt;p&gt;Allow developers to focus on solving higher-level problems.&lt;/p&gt;

&lt;p&gt;Enable modularity: changes in one layer don’t affect others.&lt;/p&gt;

&lt;p&gt;Promote reuse, scalability, and testability.&lt;/p&gt;

&lt;p&gt;Are foundational in both OS (e.g., file systems) and distributed systems (e.g., databases, RPC).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Network Abstractions:&lt;/strong&gt; Remote Procedure Calls (RPC)&lt;br&gt;
RPC enables communication between services across a network, hiding the complexity of:&lt;/p&gt;

&lt;p&gt;Data serialization/deserialization&lt;/p&gt;

&lt;p&gt;Network communication protocols&lt;/p&gt;

&lt;p&gt;Retry logic and error handling&lt;/p&gt;

&lt;p&gt;Advantages:&lt;/p&gt;

&lt;p&gt;Makes remote calls look like local method calls.&lt;/p&gt;

&lt;p&gt;Reduces boilerplate for service-to-service interactions.&lt;/p&gt;

&lt;p&gt;Popular frameworks: gRPC, Thrift, Apache Avro.&lt;/p&gt;

&lt;p&gt;Caveats:&lt;/p&gt;

&lt;p&gt;Network issues can introduce latency/failures.&lt;/p&gt;

&lt;p&gt;RPC is not the same as function calls: developers must account for distributed system challenges (timeouts, retries, etc.).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Spectrum of Consistency Models&lt;/strong&gt;&lt;br&gt;
Consistency in distributed systems balances availability, latency, and correctness.&lt;/p&gt;

&lt;p&gt;Models include:&lt;/p&gt;

&lt;p&gt;Strong Consistency – Reads always reflect latest writes. Simplifies reasoning but affects availability.&lt;/p&gt;

&lt;p&gt;Eventual Consistency – All replicas eventually converge. Faster but may show stale data.&lt;/p&gt;

&lt;p&gt;Causal Consistency – Preserves cause-effect relationships.&lt;/p&gt;

&lt;p&gt;Read-Your-Writes / Monotonic Reads – Guarantees that a user's session observes its own updates.&lt;/p&gt;

&lt;p&gt;Tradeoffs: CAP theorem dictates that in network partitions, you must choose between consistency and availability.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;4. The Spectrum of Failure Models&lt;/strong&gt;&lt;br&gt;
Failures are inevitable in distributed systems. Types include:&lt;/p&gt;

&lt;p&gt;Crash Failures – Server stops working (e.g., power loss).&lt;/p&gt;

&lt;p&gt;Omission Failures – Messages dropped or not sent.&lt;/p&gt;

&lt;p&gt;Timing Failures – Response not within expected time.&lt;/p&gt;

&lt;p&gt;Byzantine Failures – Arbitrary/malicious behavior.&lt;/p&gt;

&lt;p&gt;Key takeaways:&lt;/p&gt;

&lt;p&gt;Design for failure: Use timeouts, retries, replication, and monitoring.&lt;/p&gt;

&lt;p&gt;Systems must tolerate partial failures without breaking completely.&lt;/p&gt;

&lt;p&gt;Consensus protocols (e.g., Paxos, Raft) handle agreement in unreliable environments.&lt;/p&gt;

</description>
      <category>systemdesign</category>
      <category>grokking</category>
    </item>
  </channel>
</rss>
