Matthew Evan Eichler

Posted on Jul 1

AWS Bedrock Managed Knowledge Bases: Should We Use Them?

#aws #bedrock #knowledgebase

AWS released Managed Knowledge Bases for Amazon Bedrock on 17 June 2026. The feature significantly reduces the operational complexity of building Retrieval-Augmented Generation (RAG) solutions by allowing Bedrock to manage the vector storage, indexing, embeddings, and retrieval infrastructure on your behalf.

For teams looking to deliver an Agent Core proof of concept or their first production RAG workload quickly, this can be a compelling option. However, there are some important trade-offs to understand before committing to the managed approach.

Traditionally, a Bedrock Knowledge Base required a customer-managed vector store such as:

OpenSearch Serverless
OpenSearch Managed Clusters
Aurora PostgreSQL with pgvector
Pinecone
DocumentDB
Other supported vector databases With a Managed Knowledge Base, Bedrock handles the underlying vector infrastructure and embedding model selection for you.

Creating one from the AWS CLI is straightforward:

aws bedrock-agent create-knowledge-base \
      --name "my-managed-kb" \
      --role-arn "arn:aws:iam::${AWS_ACCOUNT_ID}:role/service-role/AmazonBedrockExecutionRoleForKnowledgeBase_ihv1p" \
      --knowledge-base-configuration '{
        "type": "MANAGED",
        "managedKnowledgeBaseConfiguration": {
          "embeddingModelType": "MANAGED"
        }
      }'

Advantages

Lower operational overhead

There is no need to provision, secure, monitor, patch, or scale a separate vector database.

Lower costs

S3 storage is cheaper than database storage. Pay only for each ingestion and retrieval operation. Indexing and searching compute are free. No 24/7 server costs.

Faster time-to-value

Managed Knowledge Bases make it possible to stand up a RAG solution in minutes rather than days.

Automatic embedding management

Bedrock manages embedding selection and indexing, reducing the number of architectural decisions required from development teams.

Cost-effective for smaller workloads

The managed model can be attractive for:

Proofs of Concept
Departmental knowledge bases
Agent Core pilots
Workloads with highly variable usage patterns

Good fit for multiple small knowledge bases

Organizations experimenting with several independent knowledge domains can often benefit from the simplified operational model.

Disadvantages

Less control

The primary trade-off is reduced visibility and control over the underlying vector store and retrieval implementation.

Potential performance limitations

Organizations with extremely high query-per-second (QPS) requirements may still prefer dedicated OpenSearch-based solutions where indexing, scaling, and query performance can be tuned explicitly.

Fewer advanced search capabilities

If your use case requires sophisticated filtering, custom ranking strategies, or hybrid lexical/vector search, a self-managed OpenSearch solution may remain the better choice.

Infrastructure-as-Code support is not yet mature

At the time of writing, Managed Knowledge Bases are available through the Bedrock APIs and Console, but CloudFormation and CDK support have not yet fully caught up.

CloudFormation and CDK Considerations

One of the biggest surprises for infrastructure engineers is the current gap between the Bedrock APIs and CloudFormation support.

While a Managed Knowledge Base can be created through the Bedrock API and AWS CLI, there is currently no equivalent CloudFormation resource definition that maps cleanly to the managed configuration. This means CDK users cannot yet create Managed Knowledge Bases using standard CDK constructs.

For teams that require full Infrastructure-as-Code automation, a common workaround is a Lambda-backed Custom Resource that invokes the Bedrock API during deployment.

This situation is not unusual for newly released AWS services. Historically, CloudFormation and CDK support often follow shortly after the underlying service APIs become available.

Recommendation

If your goal is to deliver a RAG-enabled Agent Core solution quickly and with minimal operational overhead, a Managed Knowledge Base is currently the most attractive starting point.

If, however, your organization requires:

Advanced search capabilities
Full infrastructure control
Sophisticated indexing strategies
Complete CloudFormation/CDK automation

... then a traditional OpenSearch-based Knowledge Base remains the more flexible option.

TL;DR

Use Managed Knowledge Bases when speed, simplicity, and lower operational overhead are more important than infrastructure control.

Postscript: Cloud Formation and CDK Engineering Insights

As with most new AWS Products, the Cloud Formation types and CDK support comes at some later date; forget including this in your codebase unless you want to spend the time writing a Lambda backed Custom Resource.

You might find it interesting that the original OpenSearch Serverless Vector Collection also requires a Custom Resource in CDK. The reason is the cluster is created using a Cloud Formation type but there are no default indexes defined. An "on-the-fly"Lambda is created which initialises the cluster with an index that the Knowledge Base can use.

If you want to take a deep dive into this approach try Dirk Michel's pipelineburst example from his Bedrock examples repository in his CDK stacks section (written about 2 years ago).

Top comments (1)

Luis Cruz • Jul 1

Great breakdown—this is exactly the kind of topic where “managed convenience” can quietly turn into “opaque complexity.”

Bedrock Managed Knowledge Bases look attractive because they remove a lot of undifferentiated heavy lifting (chunking, embedding, vector store ops, ingestion pipelines). But in practice, you’re also accepting a strong opinionated layer over your RAG architecture that can be hard to tune when things go wrong.

The real trade-off I keep seeing is:

Pros: fast setup, fewer infra decisions, good default retrieval pipeline, easier for teams without dedicated ML/RAG infra
Cons: limited control over chunking strategy, retrieval tuning, ranking behavior, and observability at the vector layer

For early-stage or internal tools, it’s often a strong default. But for production systems where retrieval quality is a core differentiator (support bots, search, agent memory systems), I still prefer a more explicit RAG pipeline where embeddings, vector DB, and reranking are independently controllable.

So I’d frame it less as “should we use it?” and more as:
👉 “How much retrieval control are we willing to trade for operational simplicity?”

That answer changes a lot depending on scale and use case.