DEV Community

Karthik Sakthivel
Karthik Sakthivel

Posted on

2

Amazon Bedrock announces support for cost allocation tags on inference profiles

What's new at AWS πŸ“’

πŸ”° Amazon Bedrock now supports customers to allocate and track on-demand foundation model usage.

πŸ”° With this, customers can categorize their GenAI inference costs by department, team, or application using AWS cost allocation tags.

⚠️ What is Amazon Bedrock:
βœ”οΈ It is a fully managed service that offers a choice of high-performing foundation models from leading AI companies via a single API.
βœ”οΈ It also provides a broad set of capabilities such as security, privacy, and responsible AI capabilities built in.

πŸ”° These capabilities help customer to build tailored applications for multiple use cases across different industries.

πŸ”° Importantly it is helping organizations by ensuring customer trust and data governance.

πŸ”° You can leverage this feature by creating an application inference profile and tagging it.

⚠️ What is Inference profiles:
βœ”οΈ These profiles are a resource in Amazon Bedrock that define a model and one or more Regions
βœ”οΈ Inference profile can route model invocation requests.

⚠️ Types of inference profiles:
1️⃣ Cross region inference profiles
2️⃣ Application inference profiles

⚠️ When to use inference profiles:
βœ”οΈ Track usage metrics
βœ”οΈ Use tags to monitor costs
βœ”οΈ Cross-region inference

πŸ“Œ Explore more about cross-region inference profiles:
https://aws.amazon.com/blogs/machine-learning/getting-started-with-cross-region-inference-in-amazon-bedrock/

Image of Docusign

Bring your solution into Docusign. Reach over 1.6M customers.

Docusign is now extensible. Overcome challenges with disconnected products and inaccessible data by bringing your solutions into Docusign and publishing to 1.6M customers in the App Center.

Learn more

Top comments (0)

A Workflow Copilot. Tailored to You.

Pieces.app image

Our desktop app, with its intelligent copilot, streamlines coding by generating snippets, extracting code from screenshots, and accelerating problem-solving.

Read the docs