<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Reetesh kumar</title>
    <description>The latest articles on DEV Community by Reetesh kumar (@reetesh_kumar).</description>
    <link>https://dev.to/reetesh_kumar</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F2675177%2F25f371af-70bc-4b6c-b4ed-80f4ec420e22.jpg</url>
      <title>DEV Community: Reetesh kumar</title>
      <link>https://dev.to/reetesh_kumar</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/reetesh_kumar"/>
    <language>en</language>
    <item>
      <title>[Boost]</title>
      <dc:creator>Reetesh kumar</dc:creator>
      <pubDate>Sat, 18 Apr 2026 19:34:21 +0000</pubDate>
      <link>https://dev.to/reetesh_kumar/-4bhg</link>
      <guid>https://dev.to/reetesh_kumar/-4bhg</guid>
      <description>&lt;div class="ltag__link--embedded"&gt;
  &lt;div class="crayons-story "&gt;
  &lt;a href="https://dev.to/reetesh_kumar/the-40-architecture-processing-1-billion-api-requests-with-9999-uptime-1p45" class="crayons-story__hidden-navigation-link"&gt;The $40 Architecture: Processing 1 Billion API Requests with 99.99% Uptime&lt;/a&gt;


  &lt;div class="crayons-story__body crayons-story__body-full_post"&gt;
    &lt;div class="crayons-story__top"&gt;
      &lt;div class="crayons-story__meta"&gt;
        &lt;div class="crayons-story__author-pic"&gt;

          &lt;a href="/reetesh_kumar" class="crayons-avatar  crayons-avatar--l  "&gt;
            &lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F2675177%2F25f371af-70bc-4b6c-b4ed-80f4ec420e22.jpg" alt="reetesh_kumar profile" class="crayons-avatar__image" width="800" height="1032"&gt;
          &lt;/a&gt;
        &lt;/div&gt;
        &lt;div&gt;
          &lt;div&gt;
            &lt;a href="/reetesh_kumar" class="crayons-story__secondary fw-medium m:hidden"&gt;
              Reetesh kumar
            &lt;/a&gt;
            &lt;div class="profile-preview-card relative mb-4 s:mb-0 fw-medium hidden m:inline-block"&gt;
              
                Reetesh kumar
                
              
              &lt;div id="story-author-preview-content-3520450" class="profile-preview-card__content crayons-dropdown branded-7 p-4 pt-0"&gt;
                &lt;div class="gap-4 grid"&gt;
                  &lt;div class="-mt-4"&gt;
                    &lt;a href="/reetesh_kumar" class="flex"&gt;
                      &lt;span class="crayons-avatar crayons-avatar--xl mr-2 shrink-0"&gt;
                        &lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F2675177%2F25f371af-70bc-4b6c-b4ed-80f4ec420e22.jpg" class="crayons-avatar__image" alt="" width="800" height="1032"&gt;
                      &lt;/span&gt;
                      &lt;span class="crayons-link crayons-subtitle-2 mt-5"&gt;Reetesh kumar&lt;/span&gt;
                    &lt;/a&gt;
                  &lt;/div&gt;
                  &lt;div class="print-hidden"&gt;
                    
                      Follow
                    
                  &lt;/div&gt;
                  &lt;div class="author-preview-metadata-container"&gt;&lt;/div&gt;
                &lt;/div&gt;
              &lt;/div&gt;
            &lt;/div&gt;

          &lt;/div&gt;
          &lt;a href="https://dev.to/reetesh_kumar/the-40-architecture-processing-1-billion-api-requests-with-9999-uptime-1p45" class="crayons-story__tertiary fs-xs"&gt;&lt;time&gt;Apr 18&lt;/time&gt;&lt;span class="time-ago-indicator-initial-placeholder"&gt;&lt;/span&gt;&lt;/a&gt;
        &lt;/div&gt;
      &lt;/div&gt;

    &lt;/div&gt;

    &lt;div class="crayons-story__indention"&gt;
      &lt;h2 class="crayons-story__title crayons-story__title-full_post"&gt;
        &lt;a href="https://dev.to/reetesh_kumar/the-40-architecture-processing-1-billion-api-requests-with-9999-uptime-1p45" id="article-link-3520450"&gt;
          The $40 Architecture: Processing 1 Billion API Requests with 99.99% Uptime
        &lt;/a&gt;
      &lt;/h2&gt;
        &lt;div class="crayons-story__tags"&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/architecture"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;architecture&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/devops"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;devops&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/performance"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;performance&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/cloud"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;cloud&lt;/a&gt;
        &lt;/div&gt;
      &lt;div class="crayons-story__bottom"&gt;
        &lt;div class="crayons-story__details"&gt;
          &lt;a href="https://dev.to/reetesh_kumar/the-40-architecture-processing-1-billion-api-requests-with-9999-uptime-1p45" class="crayons-btn crayons-btn--s crayons-btn--ghost crayons-btn--icon-left"&gt;
            &lt;div class="multiple_reactions_aggregate"&gt;
              &lt;span class="multiple_reactions_icons_container"&gt;
                  &lt;span class="crayons_icon_container"&gt;
                    &lt;img src="https://assets.dev.to/assets/sparkle-heart-5f9bee3767e18deb1bb725290cb151c25234768a0e9a2bd39370c382d02920cf.svg" width="24" height="24"&gt;
                  &lt;/span&gt;
              &lt;/span&gt;
              &lt;span class="aggregate_reactions_counter"&gt;1&lt;span class="hidden s:inline"&gt; reaction&lt;/span&gt;&lt;/span&gt;
            &lt;/div&gt;
          &lt;/a&gt;
            &lt;a href="https://dev.to/reetesh_kumar/the-40-architecture-processing-1-billion-api-requests-with-9999-uptime-1p45#comments" class="crayons-btn crayons-btn--s crayons-btn--ghost crayons-btn--icon-left flex items-center"&gt;
              Comments


              &lt;span class="hidden s:inline"&gt;Add Comment&lt;/span&gt;
            &lt;/a&gt;
        &lt;/div&gt;
        &lt;div class="crayons-story__save"&gt;
          &lt;small class="crayons-story__tertiary fs-xs mr-2"&gt;
            3 min read
          &lt;/small&gt;
            
              &lt;span class="bm-initial"&gt;
                

              &lt;/span&gt;
              &lt;span class="bm-success"&gt;
                

              &lt;/span&gt;
            
        &lt;/div&gt;
      &lt;/div&gt;
    &lt;/div&gt;
  &lt;/div&gt;
&lt;/div&gt;

&lt;/div&gt;


</description>
    </item>
    <item>
      <title>The $40 Architecture: Processing 1 Billion API Requests with 99.99% Uptime</title>
      <dc:creator>Reetesh kumar</dc:creator>
      <pubDate>Sat, 18 Apr 2026 19:16:41 +0000</pubDate>
      <link>https://dev.to/reetesh_kumar/the-40-architecture-processing-1-billion-api-requests-with-9999-uptime-1p45</link>
      <guid>https://dev.to/reetesh_kumar/the-40-architecture-processing-1-billion-api-requests-with-9999-uptime-1p45</guid>
      <description>&lt;p&gt;In the world of cloud computing, there is a "Managed Service Tax." Managed API gateways often charge around $1.00 per million requests. At a billion requests, that is a &lt;strong&gt;$1,000 bill&lt;/strong&gt;. However, by optimizing the underlying architecture, that same volume can be handled for roughly $40, or about &lt;strong&gt;$0.00000004 per request&lt;/strong&gt; ($0.04 per million).&lt;/p&gt;

&lt;p&gt;Here is the deep dive into the strategy that balances microscopic costs with "four nines" reliability.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0xj74ecpeqb7opws6qcv.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0xj74ecpeqb7opws6qcv.png" alt=" " width="800" height="436"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  1. The Dual-Layer Load Balancing Strategy
&lt;/h2&gt;

&lt;p&gt;Reliability at scale requires a clear separation between public-facing traffic and internal service communication.&lt;/p&gt;

&lt;h3&gt;
  
  
  External Load Balancer (The Entry Point)
&lt;/h3&gt;

&lt;p&gt;The external layer acts as the "Public Guard." The goal here is &lt;strong&gt;L4 (TCP) Load Balancing&lt;/strong&gt;. &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Why it works:&lt;/strong&gt; Unlike L7 (HTTP) balancers, which parse every request, an L4 balancer operates at the transport layer. It is typically faster and cheaper because it forwards connections to the Gateway without the overhead of inspecting HTTP content.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Key Role:&lt;/strong&gt; TLS termination (on balancers that support TLS listeners) and DDoS mitigation happen here, shielding the internal network from the raw internet.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Internal Load Balancer (The Service Mesh)
&lt;/h3&gt;

&lt;p&gt;Once traffic is inside the network, an Internal LB manages "East-West" traffic between microservices.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Service Discovery:&lt;/strong&gt; It allows services to find each other dynamically. If a "User Service" instance dies, the Internal LB automatically reroutes traffic to a healthy node.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Security:&lt;/strong&gt; Because this balancer has no public IP, it cannot be reached directly from the internet, which makes the internal architecture much harder to exploit.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  2. The Core: Crafting a Custom API Gateway
&lt;/h2&gt;

&lt;p&gt;The "DIY" Gateway is the secret to high-density performance. While managed tools are great for startups, they often include "feature bloat" that consumes unnecessary CPU and RAM.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;The Architectural Choice:&lt;/strong&gt; To maximize control and tailor operations precisely, building a custom API gateway is the superior path. This DIY approach is fantastic for those who want to optimize every detail, although it requires more upfront effort. If you prefer ready-made solutions, tools like Kong or Tyk can also serve well without the extra development overhead.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fjd8v0twfhqwl1g6jon79.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fjd8v0twfhqwl1g6jon79.png" alt=" " width="800" height="436"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  Why a DIY Gateway Wins at Scale:
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Resource Efficiency:&lt;/strong&gt; A custom gateway written in a high-performance language like &lt;strong&gt;Go&lt;/strong&gt; or &lt;strong&gt;Rust&lt;/strong&gt; can handle thousands of concurrent requests using less than 128MB of RAM.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Minimalist Middleware:&lt;/strong&gt; You only run the code you need (e.g., JWT validation and Rate Limiting), which helps keep the Gateway's own processing time per request under 5ms.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Smart Routing:&lt;/strong&gt; Custom gateways can implement "circuit breaker" patterns that are specifically tuned to the application's unique failure modes.&lt;/li&gt;
&lt;/ol&gt;
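
&lt;p&gt;The "circuit breaker" idea from the list above can be sketched in a few lines. This is only an illustration in Python (a production gateway would more likely be written in Go or Rust, as noted above), and the class name, thresholds, and injectable clock are all assumptions made for the example rather than part of the original design.&lt;/p&gt;

```python
import time


class CircuitBreaker:
    """Minimal circuit breaker with closed, open, and half-open states."""

    def __init__(self, failure_threshold=3, reset_timeout=5.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold  # consecutive failures before opening
        self.reset_timeout = reset_timeout          # seconds to wait before a trial call
        self.clock = clock                          # injectable so the logic can be tested
        self.failures = 0
        self.opened_at = None                       # None means the circuit is closed

    def state(self):
        if self.opened_at is None:
            return "closed"
        if self.clock() - self.opened_at >= self.reset_timeout:
            return "half-open"
        return "open"

    def call(self, func, *args, **kwargs):
        if self.state() == "open":
            # Fail fast instead of sending more traffic to a struggling upstream.
            raise RuntimeError("circuit open")
        try:
            result = func(*args, **kwargs)
        except Exception:
            self.failures += 1
            if self.failures >= self.failure_threshold or self.opened_at is not None:
                # Open the circuit, or re-open it after a failed half-open trial.
                self.opened_at = self.clock()
            raise
        # A success closes the circuit and clears the failure count.
        self.failures = 0
        self.opened_at = None
        return result
```

&lt;p&gt;A gateway would wrap each upstream call in &lt;code&gt;breaker.call(...)&lt;/code&gt;, typically with one breaker per upstream service, and tune the threshold and timeout to that service's failure behavior.&lt;/p&gt;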




&lt;h2&gt;
  
  
  3. The Math of $0.00000004 per Request
&lt;/h2&gt;

&lt;p&gt;To achieve these economics, the architecture must leverage &lt;strong&gt;Resource Density&lt;/strong&gt; rather than "Pay-as-you-go" pricing.&lt;/p&gt;

&lt;p&gt;$$\text{Cost per Request} = \frac{\text{Instance Hourly Rate} \times \text{Total Hours}}{\text{Total Requests}}$$&lt;/p&gt;
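
&lt;p&gt;The formula above can be checked with a short calculation. The sketch below assumes a 730-hour month, one billion requests in that month, and the $40 figure from the title as the total compute budget. These are illustrative numbers, not quotes from any price list.&lt;/p&gt;

```python
# Hypothetical figures for illustration only; not taken from any price list.
HOURS_PER_MONTH = 730            # average number of hours in a month
TOTAL_REQUESTS = 1_000_000_000   # one billion requests in the month
MONTHLY_BUDGET = 40.0            # the $40 target from the title


def cost_per_request(hourly_rate: float, hours: float, requests: int) -> float:
    """Instance hourly rate times total hours, divided by total requests."""
    return hourly_rate * hours / requests


# Combined hourly rate of the whole fleet that spends exactly the budget.
fleet_hourly_rate = MONTHLY_BUDGET / HOURS_PER_MONTH

per_request = cost_per_request(fleet_hourly_rate, HOURS_PER_MONTH, TOTAL_REQUESTS)
per_million = per_request * 1_000_000
average_rps = TOTAL_REQUESTS / (HOURS_PER_MONTH * 3600)

print(f"fleet budget:  ${fleet_hourly_rate:.4f} per hour")
print(f"per request:   ${per_request:.8f}")
print(f"per million:   ${per_million:.2f}")
print(f"average load:  {average_rps:.0f} requests/second")
```

&lt;p&gt;Under these assumptions the whole fleet has to cost only a few cents per hour while sustaining a few hundred requests per second on average, which is why the playbook below focuses on cheap, dense compute.&lt;/p&gt;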

&lt;h3&gt;
  
  
  The Cost-Optimization Playbook:
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;ARM-Based Compute:&lt;/strong&gt; Moving from x86 to ARM (like AWS Graviton) can offer &lt;strong&gt;up to 40% better price-performance&lt;/strong&gt;. For a simple Gateway task, ARM is often the more cost-efficient choice.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Spot Instance Strategy:&lt;/strong&gt; By designing the Gateway to be &lt;strong&gt;stateless&lt;/strong&gt;, the architecture can run on Spot instances. These are up to 90% cheaper than On-Demand instances. Because Spot capacity can be reclaimed at short notice, the 99.99% uptime goal calls for a small "On-Demand" base, with Spot used to scale up.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Batched Logging:&lt;/strong&gt; To save on I/O costs, logs should be buffered in memory and shipped in batches to cold storage, rather than writing to expensive high-speed disks for every single request.&lt;/li&gt;
&lt;/ul&gt;
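
&lt;p&gt;The buffered, batched logging described above can be sketched as follows. The &lt;code&gt;BatchedLogger&lt;/code&gt; class and its &lt;code&gt;ship&lt;/code&gt; callback are hypothetical names chosen for this example. In a real system &lt;code&gt;ship&lt;/code&gt; would upload one batch to object storage, and a timer would also flush partially filled buffers.&lt;/p&gt;

```python
import json


class BatchedLogger:
    """Buffer log records in memory and ship them in batches."""

    def __init__(self, ship, batch_size=1000):
        self.ship = ship              # hypothetical callable that uploads one batch
        self.batch_size = batch_size  # records per batch
        self.buffer = []

    def log(self, record: dict) -> None:
        self.buffer.append(json.dumps(record))
        if len(self.buffer) >= self.batch_size:
            self.flush()

    def flush(self) -> None:
        if not self.buffer:
            return
        batch, self.buffer = self.buffer, []
        # One write per batch instead of one write per request.
        self.ship("\n".join(batch))


# Usage: a list stands in for cold storage.
shipped = []
logger = BatchedLogger(ship=shipped.append, batch_size=3)
for i in range(7):
    logger.log({"request_id": i, "status": 200})
logger.flush()  # ship the remainder, for example on shutdown
print(len(shipped), "batches shipped")
```

&lt;p&gt;The trade-off is durability: records still in the buffer are lost if the process dies, so a Spot interruption notice should trigger a final &lt;code&gt;flush()&lt;/code&gt;.&lt;/p&gt;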




&lt;h2&gt;
  
  
  4. Achieving 99.99% Uptime
&lt;/h2&gt;

&lt;p&gt;Cost-cutting is useless if the system fails. High availability is built into this architecture through three specific pillars:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Multi-AZ Redundancy:&lt;/strong&gt; The architecture is never pinned to a single data center. The External Load Balancer distributes traffic across at least three Availability Zones.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Active Health Checks:&lt;/strong&gt; The Internal Load Balancer probes the "heartbeat" of every service. If a container hangs, it is removed from the rotation after a few failed probes, so users rarely see a 502 error.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Auto-Scaling Groups:&lt;/strong&gt; The system is configured to scale based on &lt;strong&gt;CPU utilization and request latency&lt;/strong&gt; rather than just "Request Count," helping the Gateway stay ahead of traffic spikes before they cause a bottleneck.&lt;/li&gt;
&lt;/ol&gt;
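
&lt;p&gt;To make the "four nines" target concrete, the sketch below converts an availability figure into a downtime budget and shows how redundant zones combine. The 99% single-zone availability is an assumed figure, and the calculation assumes zones fail independently, which real deployments only approximate.&lt;/p&gt;

```python
def downtime_budget_minutes(availability: float, days: float) -> float:
    """Minutes of allowed downtime over a period for a given availability."""
    return (1.0 - availability) * days * 24 * 60


def parallel_availability(single: float, replicas: int) -> float:
    """Availability of N redundant replicas, assuming independent failures."""
    return 1.0 - (1.0 - single) ** replicas


per_year = downtime_budget_minutes(0.9999, 365)
per_month = downtime_budget_minutes(0.9999, 30)
three_zones = parallel_availability(0.99, 3)  # 0.99 is an assumed per-zone figure

print(f"99.99% uptime allows {per_year:.2f} minutes of downtime per year")
print(f"99.99% uptime allows {per_month:.2f} minutes of downtime per 30 days")
print(f"three independent 99% zones give {three_zones:.6f} combined availability")
```

&lt;p&gt;A budget of only a few minutes per month is why fast health-check eviction and multi-zone redundancy matter more than any single instance being reliable.&lt;/p&gt;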




&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;This architecture proves that scale doesn't have to be expensive. By combining &lt;strong&gt;Layered Load Balancing&lt;/strong&gt;, a &lt;strong&gt;DIY API Gateway&lt;/strong&gt;, and &lt;strong&gt;ARM-based Spot compute&lt;/strong&gt;, any engineering team can process massive volumes of data for a fraction of the traditional cost.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The choice is simple:&lt;/strong&gt; You can pay for a managed service to handle the complexity, or you can build the architecture that turns that complexity into a competitive advantage.&lt;/p&gt;

</description>
      <category>architecture</category>
      <category>devops</category>
      <category>performance</category>
      <category>cloud</category>
    </item>
    <item>
      <title>🌟 Deploying a Live Project Without a Dockerfile Using Buildpacks 🌟</title>
      <dc:creator>Reetesh kumar</dc:creator>
      <pubDate>Fri, 10 Jan 2025 19:07:41 +0000</pubDate>
      <link>https://dev.to/reetesh_kumar/deploying-a-live-project-without-a-dockerfile-using-buildpacks-3f7c</link>
      <guid>https://dev.to/reetesh_kumar/deploying-a-live-project-without-a-dockerfile-using-buildpacks-3f7c</guid>
      <description>&lt;p&gt;Hello connection 👋&lt;/p&gt;

&lt;p&gt;Recently, &lt;a href="https://www.linkedin.com/in/reetesh-kumar-850807255/" rel="noopener noreferrer"&gt;I&lt;/a&gt; had the opportunity to deploy a project live without even creating a Dockerfile, thanks to the awesome Buildpacks. It’s a super efficient and simple way to package your applications for deployment. No more manual Dockerfile writing, just build, deploy, and go!&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flk4rnn8bvl7ww8kuix7m.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flk4rnn8bvl7ww8kuix7m.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;🌟Step-by-Step Guide to Deploying with Buildpacks&lt;br&gt;
1️⃣ Install the Buildpack CLI&lt;br&gt;
Start by installing the pack CLI tool for working with Buildpacks:&lt;/p&gt;

&lt;p&gt;curl -sSL "&lt;a href="https://lnkd.in/gnk2--ej" rel="noopener noreferrer"&gt;https://lnkd.in/gnk2--ej&lt;/a&gt; download/pack-$(uname -s)-$(uname -m)" -o /usr/local/bin/pack&lt;br&gt;
chmod +x /usr/local/bin/pack&lt;/p&gt;

&lt;p&gt;2️⃣ Prepare Your Project&lt;br&gt;
Make sure your project has the necessary files like:&lt;br&gt;
package.json (for Node.js apps)&lt;br&gt;
requirements.txt (for Python apps)&lt;br&gt;
Or other language-specific files.&lt;/p&gt;

&lt;p&gt;3️⃣ Build Your App Image&lt;br&gt;
pack build my-app-image --builder paketobuildpacks/builder:base&lt;br&gt;
my-app-image: The name you want for your app’s image.&lt;br&gt;
paketobuildpacks/builder:base: This builder works with many languages.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fe0sdelly6obzskdvqhxl.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fe0sdelly6obzskdvqhxl.png" alt=" " width="800" height="269"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;4️⃣ Test the Image Locally&lt;br&gt;
Run the image locally to check everything works:&lt;br&gt;
docker run -d -p 8080:8080 my-app-image&lt;br&gt;
Now, open &lt;a href="http://localhost:8080" rel="noopener noreferrer"&gt;http://localhost:8080&lt;/a&gt; in your browser. If it’s up and running, you’re good to go!&lt;/p&gt;

&lt;p&gt;5️⃣ Push the Image to a Registry&lt;br&gt;
Once you’re satisfied, push your image to DockerHub or any container registry:&lt;br&gt;
docker tag my-app-image YOUR_DOCKERHUB_USERNAME/my-app&lt;br&gt;
docker push YOUR_DOCKERHUB_USERNAME/my-app&lt;/p&gt;

&lt;p&gt;6️⃣ Deploy to the Cloud&lt;br&gt;
Finally, deploy the image to your preferred platform: AWS, GCP, Azure, or a Kubernetes cluster.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F421dnmbzw6g2q03e1b8o.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F421dnmbzw6g2q03e1b8o.jpeg" alt=" " width="800" height="400"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;🌟What Makes Buildpacks So Powerful?&lt;br&gt;
Buildpacks make things so much easier:&lt;br&gt;
🔹 Automatic Dependency Detection: It figures out all your app’s dependencies and installs them automatically.&lt;br&gt;
🔹 No Dockerfile Needed: Focus on coding, not Dockerfiles.&lt;br&gt;
🔹 Optimized for Production: It builds images that are ready to go live!&lt;br&gt;
🔹 Multi-language Support: Whether you’re using Node.js, Python, or others, it works across the board.&lt;/p&gt;

&lt;p&gt;Buildpacks are a game-changer for developers looking for a streamlined, hassle-free deployment process. You don’t have to get caught up in Dockerfile details — just pack and deploy!&lt;br&gt;
Special thanks to &lt;a href="https://www.linkedin.com/in/shubhamlondhe1996/" rel="noopener noreferrer"&gt;Shubham Londhe&lt;/a&gt; for introducing me to this amazing tool. 🙏&lt;br&gt;
If you haven’t tried Buildpacks yet, give it a shot. It’ll make your deployment process way smoother! 🌱&lt;/p&gt;

</description>
      <category>devops</category>
      <category>aws</category>
      <category>learning</category>
      <category>docker</category>
    </item>
  </channel>
</rss>
