<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Chetan Menge</title>
    <description>The latest articles on DEV Community by Chetan Menge (@chetan_menge).</description>
    <link>https://dev.to/chetan_menge</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1466612%2Ffe9fc46b-1b78-400f-a8b1-bf673bd3700c.png</url>
      <title>DEV Community: Chetan Menge</title>
      <link>https://dev.to/chetan_menge</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/chetan_menge"/>
    <language>en</language>
    <item>
      <title>Mastering DevOps Branching: Your Ultimate Guide to Git Flow, Trunk, Tag-Based, and Hybrid Strategies</title>
      <dc:creator>Chetan Menge</dc:creator>
      <pubDate>Tue, 13 Aug 2024 10:10:41 +0000</pubDate>
      <link>https://dev.to/chetan_menge/devops-branching-strategies-for-developers-a-guide-to-git-flow-trunk-tag-based-and-hybrid-approaches-2p0p</link>
      <guid>https://dev.to/chetan_menge/devops-branching-strategies-for-developers-a-guide-to-git-flow-trunk-tag-based-and-hybrid-approaches-2p0p</guid>
      <description>&lt;p&gt;This guide explores popular Git branching strategies for DevOps environments, focusing on Git Flow, Trunk-Based Development (TDD), Tag-Based, and Hybrid approaches. We'll compare their strengths and weaknesses, highlight challenges they address, and discuss their suitability for web applications and ETL processes / applications.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Branching Strategies:&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;strong&gt;Git Flow:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Workflow&lt;/strong&gt;: Separate branches for development (develop), feature development (feature/&lt;em&gt;), hotfixes (hotfix/&lt;/em&gt;), and releases (release/*). Features are merged into develop, releases are branched from develop, and hotfixes are applied to both develop and the release branch.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Pros&lt;/strong&gt;: Clear separation of concerns, promotes stable releases, good for large teams with complex projects.&lt;/li&gt;
&lt;li&gt; &lt;strong&gt;Cons&lt;/strong&gt;: Overhead of managing multiple branches, potential merge conflicts.&lt;/li&gt;
&lt;li&gt; &lt;strong&gt;Challenges Addressed&lt;/strong&gt;: Feature isolation, release stability, collaboration in large teams.&lt;/li&gt;
&lt;li&gt; &lt;strong&gt;Suitability&lt;/strong&gt;: Complex web applications, large-scale ETL projects.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Reference Diagram&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F9gzkai6z8agsq0ma5fb0.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F9gzkai6z8agsq0ma5fb0.jpg" alt="Image description" width="769" height="391"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Trunk-Based Development (TDD):&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Workflow&lt;/strong&gt;: All development happens directly on the main branch (master). Frequent integrations and deployments ensure a stable codebase.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Pros&lt;/strong&gt;: Encourages collaboration and rapid feedback, simplifies branch management.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Cons&lt;/strong&gt;: Requires disciplined development practices and strong CI/CD. Not ideal for large or complex projects.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Challenges Addressed&lt;/strong&gt;: Streamlined development workflow, continuous integration testing.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Suitability&lt;/strong&gt;: Small to medium-sized web applications, ETL processes requiring frequent updates.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ffusiyezn7xvhvkbxujx8.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ffusiyezn7xvhvkbxujx8.jpg" alt="Image description" width="659" height="230"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Tag-Based:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt; &lt;strong&gt;Workflow&lt;/strong&gt;: Short-lived feature branches are merged directly into the main branch (master). Releases are tagged on the main branch.&lt;/li&gt;
&lt;li&gt; &lt;strong&gt;Pros&lt;/strong&gt;: Simple, fast releases, good for small teams or projects with frequent deployments.&lt;/li&gt;
&lt;li&gt; &lt;strong&gt;Cons&lt;/strong&gt;: Main branch can become unstable, difficult to rollback changes.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Challenges Addressed&lt;/strong&gt;: Rapid development cycles, continuous integration and delivery (CI/CD).&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Suitability&lt;/strong&gt;: Simple web applications, small ETL pipelines.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Hybrid&lt;/strong&gt;:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt; &lt;strong&gt;Workflow&lt;/strong&gt;: Combines elements of Git Flow and Tag-Based. May involve short-lived feature branches merged into a develop branch before integration with the main branch, or feature branches directly merged to the main branch with frequent tagging for releases.&lt;/li&gt;
&lt;li&gt; &lt;strong&gt;Pros&lt;/strong&gt;: Flexible, adaptable to project needs, balances stability with development speed.&lt;/li&gt;
&lt;li&gt; &lt;strong&gt;Cons&lt;/strong&gt;: Requires careful planning and discipline to avoid complexity.&lt;/li&gt;
&lt;li&gt; &lt;strong&gt;Challenges Addressed&lt;/strong&gt;: Adaptability to project size and complexity, maintaining release stability.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Suitability&lt;/strong&gt;: Both web applications and ETL projects, depending on specific needs.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Faa2ud6ho6h3ktmr9mxvm.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Faa2ud6ho6h3ktmr9mxvm.jpg" alt="Image description" width="800" height="467"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Choosing the Right Strategy:&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The best strategy hinges on your project's specific needs. Consider these factors:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Project Size and Complexity: Larger projects with multiple developers might benefit from the structure of Git Flow.&lt;/li&gt;
&lt;li&gt;Release Frequency: Frequent releases might favor a Tag-Based or Hybrid approach.&lt;/li&gt;
&lt;li&gt;Team Experience: A Hybrid approach might require more discipline but offers greater flexibility.&lt;/li&gt;
&lt;li&gt;Deployment Frequency: TDD or a Hybrid with frequent main branch merges works well for rapid deployments.&lt;/li&gt;
&lt;/ul&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Additional Notes:&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;strong&gt;CI/CD Integration&lt;/strong&gt;: All these strategies can benefit from a strong CI/CD pipeline to automate testing, integration, and deployment.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Web Applications vs. ETL Apps&lt;/strong&gt;: Both strategies can be applied to both web applications and ETL processes. ETL might benefit from more frequent deployments and rollbacks, which could influence the choice.&lt;/p&gt;

&lt;p&gt;By understanding these strategies and their implications, developers can choose the approach that best promotes efficient development, stable releases, and collaboration within their DevOps environment.&lt;/p&gt;

</description>
      <category>cicd</category>
      <category>branching</category>
      <category>sdlc</category>
      <category>versioncontrol</category>
    </item>
    <item>
      <title>GCP Cloud Armor - How to Leverage and add extra layer of security</title>
      <dc:creator>Chetan Menge</dc:creator>
      <pubDate>Fri, 21 Jun 2024 11:59:31 +0000</pubDate>
      <link>https://dev.to/chetan_menge/gcp-cloud-armor-how-to-leverage-and-add-extra-layer-of-security-4ol7</link>
      <guid>https://dev.to/chetan_menge/gcp-cloud-armor-how-to-leverage-and-add-extra-layer-of-security-4ol7</guid>
      <description>&lt;p&gt;In today's digital world, securing your internet-facing applications is paramount.  Distributing Denial-of-Service (DDoS) attacks, web application vulnerabilities, and malicious bots can significantly disrupt your services and damage your reputation. Google Cloud Armor offers a robust solution to fortify your application's defences. This blog post, aimed at developers, explores how Cloud Armor bolsters your application security on Google Cloud Platform (GCP).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What is Cloud Armor?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Cloud Armor is a globally-distributed Web Application Firewall (WAF) and DDoS mitigation service offered by GCP. It acts as a security shield, positioned in front of your internet-facing applications, filtering malicious traffic before it reaches your backend servers. Cloud Armor offers a multi-layered defence against various threats, including:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;DDoS Attacks:&lt;/strong&gt; Cloud Armor safeguards your applications from volumetric (L3/L4) and Layer 7 DDoS attacks, ensuring service availability during traffic surges.&lt;br&gt;
&lt;strong&gt;Web Application Attacks:&lt;/strong&gt; Pre-configured WAF rules based on OWASP Top 10 risks help mitigate common web vulnerabilities like SQL injection and cross-site scripting (XSS).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Benefits of Cloud Armor&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Enhanced Security&lt;/strong&gt;: Cloud Armor provides a comprehensive security solution, safeguarding your applications from a broad spectrum of threats.&lt;br&gt;
&lt;strong&gt;Improved Performance&lt;/strong&gt;: By filtering malicious traffic at the edge, Cloud Armor reduces the load on your backend servers, enhancing application performance.&lt;br&gt;
&lt;strong&gt;Simplified Management&lt;/strong&gt;: Cloud Armor offers a user-friendly interface for managing security policies and monitoring traffic patterns.&lt;br&gt;
&lt;strong&gt;Global Scale&lt;/strong&gt;: Cloud Armor's globally distributed network ensures consistent protection across all your GCP regions.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Implementation with a Reference Diagram&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flytqbiklmyiiglmcobd3.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flytqbiklmyiiglmcobd3.jpg" alt="Image description" width="800" height="455"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Users access your application through the internet.&lt;/li&gt;
&lt;li&gt;Traffic is routed through Cloud Load Balancing, which can be integrated with Cloud Armor.&lt;/li&gt;
&lt;li&gt;Cloud Armor's WAF engine inspects incoming traffic, filtering out malicious requests based on pre-configured rules or custom policies.&lt;/li&gt;
&lt;li&gt;Legitimate traffic is forwarded to your application servers.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Sample Policy&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fgj6m49j3jyqw1ulm2ai3.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fgj6m49j3jyqw1ulm2ai3.png" alt="Image description" width="741" height="670"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fy0n9qqb17k3utmrnszgx.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fy0n9qqb17k3utmrnszgx.png" alt="Image description" width="728" height="587"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Pros and Cons of using Cloud Armor&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Pros:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Robust security against DDoS attacks and web application vulnerabilities.&lt;/li&gt;
&lt;li&gt;Improved application performance and availability.&lt;/li&gt;
&lt;li&gt;Simplified security management with a user-friendly interface.&lt;/li&gt;
&lt;li&gt;Scalable protection that adapts to your application's traffic patterns.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Cons:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Additional cost associated with Cloud Armor usage.&lt;/li&gt;
&lt;li&gt;May require configuration adjustments for existing applications.&lt;/li&gt;
&lt;li&gt;Might introduce slight latency due to additional processing at the edge.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Cost Considerations&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Cloud Armor charges are based on incoming and outgoing request counts. You can leverage GCP's free tier for limited usage. Pay-as-you-go pricing applies for exceeding the free tier limits.  Refer to GCP's pricing documentation for detailed cost information &lt;a href="https://cloud.google.com/armor/pricing"&gt;https://cloud.google.com/armor/pricing&lt;/a&gt;&lt;/p&gt;

</description>
      <category>gcp</category>
      <category>cloudarmor</category>
      <category>websecurity</category>
      <category>cloud</category>
    </item>
    <item>
      <title>101- Databricks Model Serving- Saving Cost</title>
      <dc:creator>Chetan Menge</dc:creator>
      <pubDate>Wed, 08 May 2024 10:12:21 +0000</pubDate>
      <link>https://dev.to/chetan_menge/101-llm-dbrx-instruct-model-serving-saving-cost-nnj</link>
      <guid>https://dev.to/chetan_menge/101-llm-dbrx-instruct-model-serving-saving-cost-nnj</guid>
      <description>&lt;p&gt;Started exploring and trying Databricks instruct LLM. Was going over the Databricks Marketplace and installed and served the model by following steps listed in the Sample provided Notebook.&lt;/p&gt;

&lt;p&gt;Was able to serve model successfully and interacted with it very well by proving few prompts. Its was after couple of days, got realised when saw budget alert notification that, planned budget got exceeded way beyond.&lt;/p&gt;

&lt;h2&gt;
  
  
  Lesson Learned,
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;LLM Download from Market place is free &lt;/li&gt;
&lt;li&gt;Serving LLM - Similar to cloud hosted resources cost saving, there is way to &lt;strong&gt;scale down&lt;/strong&gt; served LLM endpoint when not in use&lt;/li&gt;
&lt;li&gt;Model which is accessed from Marketplace can be serve using "Databricks Model Serving" approach which server Model as REST endpoint using serverless compute.&lt;/li&gt;
&lt;/ol&gt;




&lt;p&gt;Please find below details with screenshot for reference, for downloading and serving DBRX Model.&lt;/p&gt;

&lt;h2&gt;
  
  
  Model Download
&lt;/h2&gt;

&lt;p&gt;On Databricks Workspace portal, we can go to Marketplace and search for LLM. E.g search for DBRX models.&lt;/p&gt;

&lt;p&gt;Model and its details will be shown as below,&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fu3ohwntsdv9xbktt7tqw.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fu3ohwntsdv9xbktt7tqw.png" alt="Image description" width="800" height="369"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;You can select / Click on  Get instant access, to download model into your environment.&lt;/p&gt;

&lt;h2&gt;
  
  
  Validation of Model in Unity Catalog
&lt;/h2&gt;

&lt;p&gt;Once downloaded, model will be available in unity catalog as shown below&lt;br&gt;
&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fjbx9bpdxxao6wwif3w2q.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fjbx9bpdxxao6wwif3w2q.png" alt="Image description" width="747" height="356"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;If its listed in unity catalog that means model got downloaded and available for use.&lt;/p&gt;
&lt;h2&gt;
  
  
  Serving Model thru Endpoint
&lt;/h2&gt;

&lt;p&gt;You can go to unity catalog and select specific model e.g. dbrx_instruct. You can create the endpoint and server model by clicking the “Serve this model” button above in the model UI.&lt;/p&gt;

&lt;p&gt;Below page will be prompted to select the configuration before serving the model &lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fpuya3d77s50514q90ikf.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fpuya3d77s50514q90ikf.png" alt="Image description" width="800" height="278"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h2&gt;
  
  
  Saving Cost of Serving Model Endpoint
&lt;/h2&gt;

&lt;p&gt;While serving the model , make sure to expand the Advance Configuration section, which has option of &lt;strong&gt;"Scale to Zero"&lt;/strong&gt; Please refer below screenshot for the details.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fojepzrkwafdwqo0hmwrc.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fojepzrkwafdwqo0hmwrc.png" alt="Image description" width="800" height="304"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;If the "scale to zero" is not selected, the minimum charge will depend on the minimum provisioned concurrency specified by the chosen concurrency range.&lt;/p&gt;

&lt;p&gt;If ‘scale to zero’ is selected, scale to zero happens automatically after 30 minutes of no requests, at which time the endpoint enters the fully scaled-to-zero (idle) state. You are not charged during this time period. When a new request is made, the endpoint exits this idle state and begins scaling up at which point you begin getting charged.&lt;/p&gt;
&lt;h2&gt;
  
  
  Reference :-
&lt;/h2&gt;


&lt;div class="crayons-card c-embed text-styles text-styles--secondary"&gt;
      &lt;div class="c-embed__cover"&gt;
        &lt;a href="https://www.databricks.com/product/pricing/model-serving" class="c-link s:max-w-50 align-middle" rel="noopener noreferrer"&gt;
          &lt;img alt="" src="https://res.cloudinary.com/practicaldev/image/fetch/s--HJXcaWP9--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://www.databricks.com/wp-content/uploads/2022/11/Databricks-AWS-Pricing-OG.png" height="420" class="m-0" width="800"&gt;
        &lt;/a&gt;
      &lt;/div&gt;
    &lt;div class="c-embed__body"&gt;
      &lt;h2 class="fs-xl lh-tight"&gt;
        &lt;a href="https://www.databricks.com/product/pricing/model-serving" rel="noopener noreferrer" class="c-link"&gt;
          Model Serving Pricing | Databricks
        &lt;/a&gt;
      &lt;/h2&gt;
        &lt;p class="truncate-at-3"&gt;
          Databricks Model Serving simplifies the deployment of machine learning models as APIs, enabling real-time predictions within seconds or milliseconds.
        &lt;/p&gt;
      &lt;div class="color-secondary fs-s flex items-center"&gt;
          &lt;img alt="favicon" class="c-embed__favicon m-0 mr-2 radius-0" src="https://res.cloudinary.com/practicaldev/image/fetch/s--7-n24L7A--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://www.databricks.com/en-website-assets/favicon-32x32.png%3Fv%3Dc9b9916c3b27dc51866c46b79a6e9b88" width="32" height="32"&gt;
        databricks.com
      &lt;/div&gt;
    &lt;/div&gt;
&lt;/div&gt;


</description>
      <category>llm</category>
      <category>databricks</category>
      <category>beginners</category>
      <category>cost</category>
    </item>
  </channel>
</rss>
