<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Bonkur Harshith Reddy</title>
    <description>The latest articles on DEV Community by Bonkur Harshith Reddy (@harshith_reddy_dev).</description>
    <link>https://dev.to/harshith_reddy_dev</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3377232%2F174e49e8-7611-42a1-94de-0782969983c8.png</url>
      <title>DEV Community: Bonkur Harshith Reddy</title>
      <link>https://dev.to/harshith_reddy_dev</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/harshith_reddy_dev"/>
    <language>en</language>
    <item>
      <title>A Deep Technical Chronicle of the AWS Data and AI Meetup in Hyderabad: Unified Studio, Bedrock, and Modern Migration</title>
      <dc:creator>Bonkur Harshith Reddy</dc:creator>
      <pubDate>Thu, 20 Nov 2025 15:16:01 +0000</pubDate>
      <link>https://dev.to/harshith_reddy_dev/a-deep-technical-chronicle-of-the-aws-data-and-ai-meetup-in-hyderabad-unified-studio-bedrock-and-10c9</link>
      <guid>https://dev.to/harshith_reddy_dev/a-deep-technical-chronicle-of-the-aws-data-and-ai-meetup-in-hyderabad-unified-studio-bedrock-and-10c9</guid>
      <description>&lt;h2&gt;
  
  
  &lt;strong&gt;Introduction&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;The &lt;a href="https://www.meetup.com/awsughyd/events/311804035/" rel="noopener noreferrer"&gt;AWS Data and AI Meetup&lt;/a&gt; in Hyderabad offered an entire day of hands-on learning across analytics, machine learning, generative AI, and large-scale data migration. Through a combination of conceptual sessions and practical workshops, the event demonstrated how AWS services integrate to build modern, scalable data and AI systems.&lt;/p&gt;

&lt;p&gt;This article documents the full experience in depth, covering both the architectural discussions and the step-by-step implementations we followed throughout the workshops.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Organized By&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;This event was organized by &lt;a href="https://www.linkedin.com/in/faizal-khan/" rel="noopener noreferrer"&gt;&lt;strong&gt;Hafiz Mohammad Khan&lt;/strong&gt;&lt;/a&gt;, an AWS Community Hero who actively leads and supports AWS events and developer communities across Hyderabad.&lt;/p&gt;

&lt;p&gt;The &lt;a href="https://builder.aws.com/community/heroes" rel="noopener noreferrer"&gt;&lt;strong&gt;AWS Community Heroes&lt;/strong&gt;&lt;/a&gt; program recognizes technologists who consistently contribute knowledge, organize events, and support developers across the global AWS ecosystem. Hafiz coordinated the sessions, workshops, and overall flow of the meetup, ensuring a smooth and engaging technical experience.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Why I Attended This Meetup&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;My background is primarily in Google Cloud Platform, where I have worked with BigQuery, data processing workflows, and the broader GCP AI ecosystem. Over time, I grew increasingly curious about how AWS approaches the same large-scale data engineering, ML, and generative AI challenges.&lt;/p&gt;

&lt;p&gt;I wanted to see firsthand how AWS enables:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Unified Analytics&lt;/strong&gt;&lt;br&gt;
Combining structured, unstructured, and streaming data into a single platform so SQL, ML, and BI workloads operate from one unified layer.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;ML Lifecycle Management&lt;/strong&gt;&lt;br&gt;
Managing data preparation, training, tuning, deployment, and monitoring through a standardized and automated process.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Dataset Governance&lt;/strong&gt;&lt;br&gt;
Managing access, lineage, quality, security policies, and compliance across complex datasets.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Lakehouse Architectures&lt;/strong&gt;&lt;br&gt;
Combining the flexibility of data lakes with the reliability and performance of data warehouses using open formats like Iceberg.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;GenAI Integration&lt;/strong&gt;&lt;br&gt;
Building applications powered by embeddings, foundation models, and orchestration features through services like Amazon Bedrock.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Large-Scale Migration&lt;/strong&gt;&lt;br&gt;
Moving enterprise databases and analytical workloads into AWS using tools like DMS Serverless and SCT.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This event offered the perfect opportunity to explore the AWS ecosystem from end to end.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Event Flow&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;The day followed this sequence:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Session 1 → Workshop 1 → Lunch → Workshop 2 → High Tea → Session 2&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This structure created a balanced mix of learning and networking while giving time to interact with speakers, AWS specialists, and fellow participants.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Speakers and Their Expertise&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frts4ke06zbysc169lp7y.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frts4ke06zbysc169lp7y.jpeg" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Neha Prasad&lt;/strong&gt;&lt;br&gt;
Analytics Specialist at AWS&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Anirudh Chawla&lt;/strong&gt;&lt;br&gt;
Analytics Specialist at AWS&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Shivapriya&lt;/strong&gt;&lt;br&gt;
Solutions Architect at AWS&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Vishal Alhat&lt;/strong&gt;&lt;br&gt;
Developer Advocate at AWS&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Harsha Mathan&lt;/strong&gt;&lt;br&gt;
Principal Data Engineer at Verisk&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h1&gt;
  
  
  &lt;strong&gt;Session 1: The Modern Data and AI Problem Landscape&lt;/strong&gt;
&lt;/h1&gt;

&lt;p&gt;&lt;strong&gt;Speaker: &lt;a href="https://www.linkedin.com/in/neha-prasad-66586a64/" rel="noopener noreferrer"&gt;Neha Prasad&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fd06qr8nuukrx0v8mdvux.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fd06qr8nuukrx0v8mdvux.jpeg" alt=" " width="400" height="400"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The opening session focused on the challenges enterprises face while scaling data and AI initiatives.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F8mlj7xhtrwzj42oaqkym.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F8mlj7xhtrwzj42oaqkym.jpeg" alt=" " width="800" height="512"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;High-Effort Machine Learning Systems&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;Enterprises often rely on disconnected tools for exploration, feature engineering, training, and deployment. This fragmentation slows iteration and increases operational complexity.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Persona Fragmentation&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;Data engineers, analysts, data scientists, and ML engineers use different tools with varying governance standards, making collaboration and reproducibility difficult.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Data Growth vs. Data Utilization&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;Although organizations collect massive amounts of data, only a small portion gets used effectively because ingestion, governance, analytics, and ML pipelines lack tight integration.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Governance Challenges&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;Access control, lineage tracking, quality checks, and cataloging tools often operate in silos, lowering confidence in large-scale pipelines.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Why SageMaker Unified Studio&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;Unified Studio solves these problems by centralizing analytics, data preparation, ML workflows, governance, and lineage into a single tightly integrated environment.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Understanding SageMaker Unified Studio&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fn7lksgncgkq4h37qol60.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fn7lksgncgkq4h37qol60.jpeg" alt=" " width="800" height="598"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;A Single Workspace&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;Unified Studio allows users to perform:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;SQL Analytics&lt;/strong&gt;&lt;br&gt;
Run SQL queries directly inside SageMaker to explore structured datasets.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Notebook-Based Experimentation&lt;/strong&gt;&lt;br&gt;
Use Jupyter-style notebooks for prototyping and model development.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Data Preparation&lt;/strong&gt;&lt;br&gt;
Clean, transform, and preprocess raw data for ML or analytics.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Pipeline Creation&lt;/strong&gt;&lt;br&gt;
Build automated workflows for ingestion, training, evaluation, and deployment.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Training&lt;/strong&gt;&lt;br&gt;
Run scalable distributed training jobs.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Deployment&lt;/strong&gt;&lt;br&gt;
Publish models as endpoints or batch jobs for real applications.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Lineage Tracking&lt;/strong&gt;&lt;br&gt;
Track dataset evolution, transformations, and model dependencies.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Kernel Per Cell Model&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;Users can run SQL, Python, Bash, or PySpark within the same notebook, enabling hybrid workflows without switching tools.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fg0pi8e8f8laud0yozti5.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fg0pi8e8f8laud0yozti5.png" alt=" " width="800" height="305"&gt;&lt;/a&gt;&lt;/p&gt;
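&lt;p&gt;The kernel-per-cell idea can be pictured as a tiny dispatcher that routes each cell to an executor for its language. This is a conceptual toy only; Unified Studio notebooks do this natively, and none of the names below are its APIs:&lt;/p&gt;

```python
import sqlite3
import subprocess

# Toy "kernel per cell" dispatcher: each cell declares its language and is
# routed to a matching executor (conceptual illustration only).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (x INTEGER)")
conn.execute("INSERT INTO t VALUES (41)")

def run_cell(language, source):
    if language == "sql":
        return conn.execute(source).fetchall()
    if language == "bash":
        return subprocess.run(source, shell=True,
                              capture_output=True, text=True).stdout.strip()
    if language == "python":
        return eval(source)
    raise ValueError(f"no kernel for {language}")

print(run_cell("sql", "SELECT x FROM t"))   # [(41,)]
print(run_cell("bash", "echo hello"))       # hello
print(run_cell("python", "1 + 41"))         # 42
```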




&lt;h2&gt;
  
  
  &lt;strong&gt;Integrated Governance&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Unified Studio connects directly to the AWS Data Catalog, enabling:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Dataset Versioning&lt;/strong&gt;&lt;br&gt;
Automatically track dataset changes to enable rollback, comparison, and reproducibility.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Metadata Management&lt;/strong&gt;&lt;br&gt;
Store schema information, owners, classifications, and descriptions.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Schema Rules&lt;/strong&gt;&lt;br&gt;
Enforce structural and validation requirements across data pipelines.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Access Controls&lt;/strong&gt;&lt;br&gt;
Manage who can view or modify datasets for secure and compliant usage.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
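&lt;p&gt;Schema rules of the sort described above can be pictured as a small validator that checks each record against declared column constraints. The rule format and column names here are invented for illustration and are not Data Catalog APIs:&lt;/p&gt;

```python
# Illustrative catalog-style schema rules: each rule names a column, its
# expected type, and whether nulls are allowed (all names are made up).
SCHEMA_RULES = {
    "order_id": {"type": int, "nullable": False},
    "region":   {"type": str, "nullable": False},
    "discount": {"type": float, "nullable": True},
}

def violations(row):
    """Return a list of rule violations for one record."""
    problems = []
    for column, rule in SCHEMA_RULES.items():
        value = row.get(column)
        if value is None:
            if not rule["nullable"]:
                problems.append(f"{column}: null not allowed")
        elif not isinstance(value, rule["type"]):
            problems.append(f"{column}: expected {rule['type'].__name__}")
    return problems

print(violations({"order_id": 7, "region": "south", "discount": None}))  # []
print(violations({"order_id": None, "region": 12}))
```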




&lt;h2&gt;
  
  
  &lt;strong&gt;Iceberg Support&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Apache Iceberg integration enables:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;ACID Compliance&lt;/strong&gt;&lt;br&gt;
Ensures consistent concurrent reads and writes at any scale.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Schema Evolution&lt;/strong&gt;&lt;br&gt;
Modify tables without breaking downstream jobs.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Time Travel&lt;/strong&gt;&lt;br&gt;
Query historical versions for debugging or audits.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Partition Evolution&lt;/strong&gt;&lt;br&gt;
Change partition strategies without reprocessing data.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These capabilities are essential for large-scale analytic pipelines.&lt;/p&gt;
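&lt;p&gt;Time travel in particular can be pictured with a toy model: every commit freezes an immutable snapshot that can be read back by ID. This is conceptual only; real Iceberg tracks snapshots as table metadata over immutable data files rather than copying rows:&lt;/p&gt;

```python
import copy

# Toy model of snapshot-based time travel (conceptual sketch, not the
# actual Iceberg implementation).
class SnapshotTable:
    def __init__(self):
        self.rows = []
        self.snapshots = {}   # snapshot_id mapped to a frozen copy of rows
        self.next_id = 1

    def commit(self, new_rows):
        """Append rows and record a new immutable snapshot."""
        self.rows.extend(new_rows)
        snapshot_id = self.next_id
        self.snapshots[snapshot_id] = copy.deepcopy(self.rows)
        self.next_id += 1
        return snapshot_id

    def as_of(self, snapshot_id):
        """Read the table as it existed at a given snapshot (time travel)."""
        return self.snapshots[snapshot_id]

table = SnapshotTable()
s1 = table.commit([{"id": 1}])
s2 = table.commit([{"id": 2}])
print(len(table.as_of(s1)), len(table.as_of(s2)))  # 1 2
```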




&lt;h2&gt;
  
  
  &lt;strong&gt;What I Learned From This Session&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Before this session, I only had a surface-level idea of how AWS unified analytics and ML workflows actually worked. Seeing Unified Studio in action made it clear how AWS connects data preparation, analytics, training, deployment, and governance inside one seamless environment.&lt;/p&gt;

&lt;p&gt;I realized how powerful features like dataset versioning, schema evolution, time travel, lineage tracking, and multi-kernel execution are in reducing friction across teams and tools. These capabilities solve many of the coordination and reproducibility challenges I’ve faced in real projects.&lt;/p&gt;

&lt;p&gt;This session showed me how mature and integrated the AWS data platform has become. It made me want to explore Iceberg tables, Unified Studio pipelines, and governed ML workflows in much more depth.&lt;/p&gt;




&lt;h1&gt;
  
  
  &lt;strong&gt;Workshop 1: End-to-End Analytics to ML Pipeline Using Unified Studio&lt;/strong&gt;
&lt;/h1&gt;

&lt;p&gt;&lt;strong&gt;Speaker: &lt;a href="https://www.linkedin.com/in/chawla-anirudh/" rel="noopener noreferrer"&gt;Anirudh Chawla&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2x344s317iqtc2d18j7r.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2x344s317iqtc2d18j7r.jpeg" alt=" " width="259" height="259"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This workshop demonstrated how to build a complete analytics-to-ML workflow using a sales dataset.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Creating Analytics and ML Projects&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;We created two environments:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Analytics Project&lt;/strong&gt;&lt;br&gt;
Used for dataset exploration.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;ML Project&lt;/strong&gt;&lt;br&gt;
Used for feature engineering and model training.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Unified Studio automatically provisioned infrastructure and configurations.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F924mpmfrutd9dieu5n89.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F924mpmfrutd9dieu5n89.png" alt=" " width="800" height="413"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Dataset Exploration&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Inside the Analytics Project, we:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Uploaded the sales dataset&lt;/strong&gt;&lt;br&gt;
Imported the raw CSV so it could be profiled, queried, and analyzed.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Used SQL for exploratory queries&lt;/strong&gt;&lt;br&gt;
Ran SQL statements to inspect row counts, filter data, aggregate metrics, and validate data quality.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Viewed auto-generated visualizations&lt;/strong&gt;&lt;br&gt;
Quickly explored trends and anomalies with built-in charts.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Examined column-level statistics&lt;/strong&gt;&lt;br&gt;
Reviewed min, max, mean, distinct counts, and missing values to assess readiness.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
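&lt;p&gt;The exploratory checks above can be approximated locally on a toy sales table. Here sqlite stands in for Unified Studio's SQL editor, and the columns and values are invented:&lt;/p&gt;

```python
import sqlite3

# Toy stand-in for the exploratory SQL step: row counts, a data-quality
# check, and column-level statistics.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (order_id INTEGER, region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [(1, "south", 120.0), (2, "north", None), (3, "north", 75.5)],
)

# Row count and data-quality check: how many rows are missing an amount?
total_rows = conn.execute("SELECT COUNT(*) FROM sales").fetchone()[0]
missing = conn.execute(
    "SELECT COUNT(*) FROM sales WHERE amount IS NULL"
).fetchone()[0]

# Column-level statistics of the kind surfaced automatically in the UI.
lo, hi, avg = conn.execute(
    "SELECT MIN(amount), MAX(amount), AVG(amount) FROM sales"
).fetchone()

print(total_rows, missing, lo, hi, avg)  # 3 1 75.5 120.0 97.75
```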

&lt;p&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fmj1duxaxbvdt2mvm8ilc.png" alt=" " width="800" height="315"&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;Publishing the Dataset&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Once the exploration phase was complete inside the Analytics Project, we published the cleaned and analyzed dataset to the AWS Data Catalog. This step essentially “promoted” the dataset from a local working copy into a governed, shareable asset. Publishing added metadata, schema details, and access controls, making the dataset discoverable to other projects inside Unified Studio. This also ensured that downstream teams or ML pipelines always referenced a validated, consistent version of the data rather than ad-hoc files.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Switching to the ML Project&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;After publishing, we switched from the Analytics Project into the ML Project to begin the machine learning workflow. Instead of manually uploading files again, we simply imported the published dataset from the Data Catalog. This guaranteed that the ML pipeline consumed the same curated data we explored earlier, with all transformations and schema definitions preserved. Once imported, the dataset became available inside Data Wrangler and the training workflows, allowing us to begin feature engineering, validation, and model development without repeating any exploration steps.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Data Wrangler Transformation&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Using Data Wrangler, we:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Cleaned missing values&lt;/strong&gt;&lt;br&gt;
Filled or removed incomplete entries.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Engineered features&lt;/strong&gt;&lt;br&gt;
Created derived variables to enrich model performance.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Applied validation rules&lt;/strong&gt;&lt;br&gt;
Ensured the dataset met quality and formatting requirements.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Prepared the dataset for training&lt;/strong&gt;&lt;br&gt;
Output the processed data into a training-ready format.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fykubtkjehiln6aj9h0qq.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fykubtkjehiln6aj9h0qq.png" alt=" " width="800" height="187"&gt;&lt;/a&gt;&lt;/p&gt;
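&lt;p&gt;The four steps above can be sketched in plain Python. Data Wrangler itself is a visual tool, so the columns, fill strategy, and derived feature below are purely illustrative:&lt;/p&gt;

```python
# Pure-Python sketch of the Data Wrangler-style transformations.
raw = [
    {"units": 3, "unit_price": 10.0},
    {"units": 2, "unit_price": None},   # incomplete entry
    {"units": 5, "unit_price": 8.0},
]

# 1. Clean missing values: fill unit_price with the mean of known prices.
known = [r["unit_price"] for r in raw if r["unit_price"] is not None]
fill = sum(known) / len(known)
cleaned = [
    dict(r, unit_price=r["unit_price"] if r["unit_price"] is not None else fill)
    for r in raw
]

# 2. Engineer a derived feature: revenue = units * unit_price.
for r in cleaned:
    r["revenue"] = r["units"] * r["unit_price"]

# 3. Apply a validation rule before handing off to training.
assert all(r["revenue"] is not None for r in cleaned)
print([r["revenue"] for r in cleaned])  # [30.0, 18.0, 40.0]
```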




&lt;h2&gt;
  
  
  &lt;strong&gt;Pipeline Construction&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;We built a complete ML pipeline consisting of:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Preprocessing&lt;/strong&gt;&lt;br&gt;
Automated data cleaning, transformations, and feature engineering.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Training&lt;/strong&gt;&lt;br&gt;
Triggered a job to train an ML model using the prepared data.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Evaluation&lt;/strong&gt;&lt;br&gt;
Assessed model accuracy using validation metrics.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Conditional model registration&lt;/strong&gt;&lt;br&gt;
Registered the model only if it met required quality thresholds.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fjrgw1d1hih20zrgorvjr.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fjrgw1d1hih20zrgorvjr.png" alt=" " width="800" height="232"&gt;&lt;/a&gt;&lt;/p&gt;
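&lt;p&gt;The conditional registration step can be pictured as a small quality gate. The threshold, metric name, and registry below are invented for illustration; in Unified Studio this is a configured pipeline step rather than hand-written code:&lt;/p&gt;

```python
import operator

# Sketch of a conditional model-registration gate.
QUALITY_THRESHOLD = 0.85
model_registry = []

def evaluate(model):
    """Stand-in for the evaluation step; returns a validation metric."""
    return model["validation_accuracy"]

def maybe_register(model):
    """Register the model only if it meets the quality threshold."""
    if operator.ge(evaluate(model), QUALITY_THRESHOLD):
        model_registry.append(model["name"])
        return True
    return False

print(maybe_register({"name": "sales-model-v1", "validation_accuracy": 0.91}))  # True
print(maybe_register({"name": "sales-model-v2", "validation_accuracy": 0.70}))  # False
print(model_registry)  # ['sales-model-v1']
```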




&lt;h2&gt;
  
  
  &lt;strong&gt;Model Deployment and Lineage&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;The model was deployed as an endpoint. Unified Studio displayed full lineage from ingestion to deployment, supporting reproducibility and auditability.&lt;/p&gt;




&lt;h1&gt;
  
  
  &lt;strong&gt;What I Learned From This Workshop&lt;/strong&gt;
&lt;/h1&gt;

&lt;p&gt;Workshop 1 finally showed me how an end-to-end ML workflow actually comes together inside SageMaker Unified Studio. I’ve used separate tools for data exploration, feature engineering, pipeline orchestration, and deployment before, but I had never seen all of them integrated so tightly in one environment.&lt;/p&gt;

&lt;p&gt;I learned how Unified Studio simplifies every step: exploring datasets with SQL, transforming them with Data Wrangler, and automating the entire process using ML Pipelines. Seeing preprocessing, training, evaluation, and conditional model registration run seamlessly in a single pipeline made it clear how mature the AWS MLOps ecosystem has become.&lt;/p&gt;

&lt;p&gt;The hands-on demo also highlighted features I previously underestimated, like dataset publishing, lineage tracking, project-level separation, and automatic environment provisioning. These capabilities remove a lot of friction that usually slows down real-world ML workflows.&lt;/p&gt;

&lt;p&gt;After this workshop, I now understand how to build production-ready ML pipelines the AWS way, and I’m excited to experiment more with Data Wrangler flows, conditional pipeline steps, and automated model deployment from end to end.&lt;/p&gt;




&lt;h1&gt;
  
  
  &lt;strong&gt;Workshop 2: Generative AI Image Editing Using Bedrock&lt;/strong&gt;
&lt;/h1&gt;

&lt;p&gt;&lt;strong&gt;Speakers: &lt;a href="https://www.linkedin.com/in/vishalalhat/" rel="noopener noreferrer"&gt;Vishal Alhat&lt;/a&gt; and &lt;a href="https://www.linkedin.com/in/shivapriyap/" rel="noopener noreferrer"&gt;Shivapriya&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fldxrrq1v3w711vgcevm5.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fldxrrq1v3w711vgcevm5.jpeg" alt=" " width="800" height="1066"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This workshop focused on building a generative AI application using a fully serverless architecture.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Architecture Components&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;The application used:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;AWS Amplify&lt;/strong&gt;&lt;br&gt;
Hosted and served the frontend with CI/CD capabilities.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Amazon Cognito&lt;/strong&gt;&lt;br&gt;
Handled authentication and user session management.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;API Gateway&lt;/strong&gt;&lt;br&gt;
Routed frontend requests to backend Lambda functions.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;AWS Lambda&lt;/strong&gt;&lt;br&gt;
Executed backend logic, triggered Bedrock requests, and returned results.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Amazon Bedrock&lt;/strong&gt;&lt;br&gt;
Performed generative AI image manipulation using foundation model APIs.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Amazon DynamoDB&lt;/strong&gt;&lt;br&gt;
Stored metadata such as prompts, job IDs, timestamps, and output references.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F581yoq1e47v0nn45pfpp.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F581yoq1e47v0nn45pfpp.jpg" alt=" " width="800" height="504"&gt;&lt;/a&gt;&lt;/p&gt;
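&lt;p&gt;A sketch of what the Lambda piece of this flow might look like. The request fields follow the Amazon Titan Image Generator style, but the payload shape, identifiers, and metadata record should all be treated as illustrative assumptions; the live boto3 call is left commented out:&lt;/p&gt;

```python
import json

# Sketch of the Lambda side: build a Bedrock image-generation request and
# the DynamoDB metadata record described above. Field names are assumptions
# modelled on the Titan Image Generator request format.
def build_request(prompt):
    return {
        "taskType": "TEXT_IMAGE",
        "textToImageParams": {"text": prompt},
        "imageGenerationConfig": {"numberOfImages": 1, "width": 512, "height": 512},
    }

def handler(event):
    prompt = event["prompt"]
    body = json.dumps(build_request(prompt))
    # In the real function, boto3 would invoke the model, e.g.:
    #   bedrock = boto3.client("bedrock-runtime")
    #   response = bedrock.invoke_model(
    #       modelId="amazon.titan-image-generator-v1", body=body)
    # Metadata record of the kind stored in DynamoDB:
    record = {"job_id": event["job_id"], "prompt": prompt, "status": "submitted"}
    return {"statusCode": 200, "body": body, "metadata": record}

result = handler({"job_id": "job-001", "prompt": "a watercolor skyline"})
print(result["metadata"]["status"])  # submitted
```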




&lt;h2&gt;
  
  
  &lt;strong&gt;Application Flow&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Users authenticated through Cognito and submitted prompts or images through the Amplify frontend. API Gateway routed requests to Lambda, which invoked Bedrock models for image generation or editing. DynamoDB stored metadata for tracking and retrieval.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Hands-On Takeaway&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;This workshop showcased how generative AI applications can be built without provisioning GPUs or managing ML infrastructure. Bedrock simplifies foundation model usage, while serverless components handle scalability.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;My Takeaways from Workshop 2&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Workshop 2 showed me how quickly a complete GenAI application can be built when every component is serverless. Seeing Amplify, Cognito, API Gateway, Lambda, Bedrock, and DynamoDB working together helped me understand how each service fits into the overall flow. I realized how much complexity disappears when authentication, API routing, backend logic, model invocation, and database storage are all managed for you by AWS.&lt;/p&gt;

&lt;p&gt;The hands-on demo made it clear that Bedrock is not just an AI model hosting service. It becomes much more powerful when paired with Lambda for orchestration and DynamoDB for storing metadata and user context. I also learned how frontend and backend pieces communicate through API Gateway and how Amplify simplifies deployment.&lt;/p&gt;

&lt;p&gt;Overall, this workshop gave me confidence that building a production-ready GenAI feature does not require managing GPUs or heavy ML infrastructure. The serverless architecture made the entire workflow feel simple, scalable, and practical for real applications.&lt;/p&gt;




&lt;h1&gt;
  
  
  &lt;strong&gt;Session 2: Database Migration Deep Dive (DMS, SCT, Snowflake)&lt;/strong&gt;
&lt;/h1&gt;

&lt;p&gt;&lt;strong&gt;Speaker: &lt;a href="https://www.linkedin.com/in/hvmathan/" rel="noopener noreferrer"&gt;Harsha Mathan&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ffmxp0bjvnigndm6q4wus.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ffmxp0bjvnigndm6q4wus.jpeg" alt=" " width="800" height="600"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This session walked through an enterprise migration from a legacy SQL Server system to Snowflake.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Migration Challenges&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Large migrations often encounter:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Unpredictable CDC volume&lt;/strong&gt;&lt;br&gt;
Change Data Capture streams may spike unexpectedly, causing lag or replication issues.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Schema incompatibilities&lt;/strong&gt;&lt;br&gt;
Source and destination do not always align, requiring transformations.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;High operational overhead&lt;/strong&gt;&lt;br&gt;
Migration jobs require careful monitoring, troubleshooting, and coordination.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Infrastructure saturation during spikes&lt;/strong&gt;&lt;br&gt;
Sudden load surges can overwhelm legacy systems and slow migration.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;End-to-End Migration Architecture&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;The full migration pipeline included:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;SQL Server (source)&lt;/strong&gt;&lt;br&gt;
The transactional system that supplied both full and incremental data.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Step Functions (orchestration)&lt;/strong&gt;&lt;br&gt;
Managed workflow sequencing, retries, and state tracking.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;AWS DMS (replication)&lt;/strong&gt;&lt;br&gt;
Performed full load and continuous CDC replication.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Amazon S3 (Parquet staging)&lt;/strong&gt;&lt;br&gt;
Stored incoming replicated data in Parquet format.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;AWS Glue (schema adjustments)&lt;/strong&gt;&lt;br&gt;
Cleaned and transformed schema mismatches between the source and Snowflake.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Snowflake (destination)&lt;/strong&gt;&lt;br&gt;
The cloud data warehouse used for analytics consumption.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fk33lal70i7wqs8dbol5r.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fk33lal70i7wqs8dbol5r.jpeg" alt=" " width="800" height="711"&gt;&lt;/a&gt;&lt;/p&gt;
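&lt;p&gt;The DMS replication step is scoped by a table-mapping document. A minimal selection rule that includes every table in the SQL Server &lt;code&gt;dbo&lt;/code&gt; schema looks roughly like this (the schema name is illustrative):&lt;/p&gt;

```python
import json

# Table-mapping document of the kind supplied to a DMS replication task:
# a single selection rule including all tables in one schema.
table_mappings = {
    "rules": [
        {
            "rule-type": "selection",
            "rule-id": "1",
            "rule-name": "include-dbo",
            "object-locator": {"schema-name": "dbo", "table-name": "%"},
            "rule-action": "include",
        }
    ]
}

# Serialized form as it would be passed to the task definition.
print(json.dumps(table_mappings, indent=2))
```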




&lt;h2&gt;
  
  
  &lt;strong&gt;Full Load and CDC Separation&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Separating historical full loads from ongoing CDC streams created a much more stable migration flow. Full load jobs typically involve large volumes of static historical data, while CDC streams handle real-time incremental updates. Running them together often leads to contention, latency, and unnecessary retries. By isolating these two phases, the team ensured that the heavy historical batch did not interfere with the continuous replication pipeline. This also simplified troubleshooting, improved throughput, and enabled the migration to progress predictably without overwhelming the source system.&lt;/p&gt;
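&lt;p&gt;This split maps onto DMS's MigrationType setting, which distinguishes a one-time "full-load" task from an ongoing "cdc" task. The task identifiers below are invented; only the MigrationType values mirror the DMS API:&lt;/p&gt;

```python
# Sketch of the two-phase split: one task definition per phase, using the
# MigrationType values from DMS ("full-load" for the historical batch,
# "cdc" for ongoing changes). Identifiers are illustrative.
def task_definition(phase):
    migration_type = {"historical": "full-load", "incremental": "cdc"}[phase]
    return {
        "ReplicationTaskIdentifier": f"sales-migration-{phase}",
        "MigrationType": migration_type,
    }

full_load = task_definition("historical")
cdc = task_definition("incremental")
print(full_load["MigrationType"], cdc["MigrationType"])  # full-load cdc
```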




&lt;h2&gt;
  
  
  &lt;strong&gt;Parquet and Glue Integration&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Storing replicated data in Parquet format offered significant performance and cost benefits. Parquet’s columnar structure compressed better, reduced storage footprint, and accelerated analytical queries compared to raw formats like CSV or JSON. AWS Glue then stepped in to handle schema alignment, type corrections, and transformation of fields that did not map cleanly from SQL Server to Snowflake. This combination of Parquet and Glue provided a clean, optimized staging layer that ensured data was structured correctly and efficiently before being loaded into Snowflake for analytics.&lt;/p&gt;
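To give a flavor of the schema alignment involved, here is a hedged Python sketch of a SQL Server-to-Snowflake type map; the mappings shown are common choices picked for illustration, not the exact rules used in the workshop.

```python
# Illustrative SQL Server -> Snowflake type mapping of the kind a
# Glue transformation might apply; the real rules may differ.
TYPE_MAP = {
    "datetime2": "TIMESTAMP_NTZ",
    "nvarchar": "VARCHAR",
    "bit": "BOOLEAN",
    "money": "NUMBER(19,4)",
}

def map_column(name, source_type):
    """Return (name, snowflake_type), defaulting to VARIANT for
    source types that do not map cleanly."""
    return name, TYPE_MAP.get(source_type.lower(), "VARIANT")

print(map_column("is_active", "BIT"))  # ('is_active', 'BOOLEAN')
print(map_column("payload", "xml"))    # ('payload', 'VARIANT')
```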




&lt;h2&gt;
  
  
  &lt;strong&gt;DMS Serverless&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Using DMS Serverless removed much of the operational burden typically associated with managing migration infrastructure. Instead of manually allocating resources or worrying about capacity planning during CDC spikes, DMS Serverless automatically scaled replication capacity in response to workload changes. This eliminated throughput bottlenecks and reduced the chances of lag building up during peak periods. It also simplified administrative overhead, as there were no servers to patch, monitor, or resize. Overall, it made the migration pipeline more resilient and hands-off, especially for long-running enterprise workloads.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Generative AI in SCT&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;AWS SCT uses generative AI to automatically convert SQL Server stored procedures and functions into Snowflake-compatible syntax, reducing manual rewriting.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fab6ip3bjgdsxz8fllift.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fab6ip3bjgdsxz8fllift.jpeg" alt=" " width="800" height="410"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;My Key Takeaways&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;By the end of the meetup, I gained a deeper understanding of how modern data and AI systems are built on AWS:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;I learned how SageMaker Unified Studio brings data exploration, feature engineering, ML pipelines, and deployment into a single governed workspace, removing the friction of switching between multiple tools.&lt;/li&gt;
&lt;li&gt;I understood how features like dataset versioning, lineage tracking, schema evolution, and access controls play a critical role in building trustworthy and compliant analytics pipelines.&lt;/li&gt;
&lt;li&gt;The Apache Iceberg discussion helped me see how open table formats enable scalable lakehouse architectures with ACID guarantees and reproducibility.&lt;/li&gt;
&lt;li&gt;The GenAI workshop showed me how serverless components such as Amplify, Cognito, API Gateway, Lambda, Bedrock, and DynamoDB work together to form a simple, scalable, production-ready application architecture.&lt;/li&gt;
&lt;li&gt;The migration deep dive clarified how enterprise systems move from legacy databases to modern warehouses using DMS Serverless, Step Functions, Glue transformations, and Parquet staging.&lt;/li&gt;
&lt;li&gt;Overall, the event helped me connect analytics, ML, GenAI, and migration patterns into one cohesive view of how AWS approaches end-to-end data engineering and AI workflows.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4khquehzp99n1lgkutka.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4khquehzp99n1lgkutka.png" alt=" " width="800" height="600"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;What’s Next: AI for Bharat Program&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;During the meetup, the speakers also highlighted the &lt;strong&gt;AI for Bharat&lt;/strong&gt; initiative, a nationwide program designed to help developers across India build real-world generative AI applications using AWS. The program combines structured workshops, hands-on labs, and a national-level hackathon focused on analytics, LLMs, Bedrock, agents, and scalable cloud architectures.&lt;/p&gt;

&lt;p&gt;You can explore the program here:&lt;br&gt;
🔗 &lt;strong&gt;&lt;a href="https://vision.hack2skill.com/event/ai-for-bharat" rel="noopener noreferrer"&gt;https://vision.hack2skill.com/event/ai-for-bharat&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;After attending this meetup and getting hands-on experience with Unified Studio, Bedrock, serverless application design, and migration workflows, the AI for Bharat program feels like the perfect next step. It offers an opportunity to apply these skills in a competitive setting, build production-ready AI solutions, earn certificates, and collaborate with developers across India.&lt;/p&gt;

&lt;p&gt;If you want to build with GenAI and cloud-native architectures on AWS, this is one of the best programs to join.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2ltdmkz9gnekcvcuobt7.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2ltdmkz9gnekcvcuobt7.jpeg" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Conclusion&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;The AWS Data and AI Meetup in Hyderabad provided a comprehensive look into modern cloud-native data engineering, machine learning, and generative AI practices. The combination of conceptual sessions, detailed architecture discussions, and immersive hands-on workshops made the event extremely valuable.&lt;/p&gt;

&lt;p&gt;For anyone exploring AWS for large-scale data and AI systems, this meetup offered a complete and practical blueprint for what modern cloud solutions look like in production.&lt;/p&gt;

</description>
      <category>aws</category>
      <category>data</category>
      <category>ai</category>
      <category>cloud</category>
    </item>
    <item>
      <title>Anatomy of a Cloud Collapse: A Technical Deep-Dive on the AWS Outage of October 2025</title>
      <dc:creator>Bonkur Harshith Reddy</dc:creator>
      <pubDate>Fri, 14 Nov 2025 12:06:06 +0000</pubDate>
      <link>https://dev.to/harshith_reddy_dev/anatomy-of-a-cloud-collapse-a-technical-deep-dive-on-the-aws-outage-of-october-2025-2mj4</link>
      <guid>https://dev.to/harshith_reddy_dev/anatomy-of-a-cloud-collapse-a-technical-deep-dive-on-the-aws-outage-of-october-2025-2mj4</guid>
      <description>&lt;h2&gt;
  
  
  TL;DR: The 15-Hour Outage
&lt;/h2&gt;

&lt;p&gt;On &lt;strong&gt;October 20, 2025&lt;/strong&gt;, AWS’s &lt;strong&gt;US-EAST-1 (Northern Virginia)&lt;/strong&gt; region experienced a &lt;strong&gt;15-hour outage&lt;/strong&gt; triggered by a rare race condition in &lt;strong&gt;DynamoDB’s DNS automation system&lt;/strong&gt;. This caused DynamoDB (a NoSQL database used across AWS control planes) to become unreachable.&lt;/p&gt;

&lt;p&gt;Because DynamoDB powers internal services like &lt;strong&gt;EC2&lt;/strong&gt;, &lt;strong&gt;IAM&lt;/strong&gt;, &lt;strong&gt;STS&lt;/strong&gt;, &lt;strong&gt;Lambda&lt;/strong&gt;, and &lt;strong&gt;Redshift&lt;/strong&gt;, over &lt;strong&gt;140 AWS services&lt;/strong&gt; were eventually affected.&lt;/p&gt;

&lt;p&gt;Independent measurements showed that &lt;strong&gt;20 to 30 percent of all internet-facing services&lt;/strong&gt; experienced disruptions — at the upper end, nearly &lt;strong&gt;one-third of the internet&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fanqgombmn1vk90he6h0u.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fanqgombmn1vk90he6h0u.jpg" alt=" " width="800" height="600"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h1&gt;
  
  
  AWS Infrastructure Context
&lt;/h1&gt;

&lt;p&gt;AWS organizes compute into:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Regions&lt;/strong&gt; (geographical clusters)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Availability Zones (AZs)&lt;/strong&gt; (isolated data centers within a region)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Control planes&lt;/strong&gt; (authentication, orchestration, routing)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Data planes&lt;/strong&gt; (actual compute, storage, execution)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This outage was a &lt;strong&gt;regional control-plane failure&lt;/strong&gt;, which is worse than a simple service crash because many systems depended on DynamoDB for metadata and operations.&lt;/p&gt;




&lt;h1&gt;
  
  
  After reading this article, you will understand:
&lt;/h1&gt;

&lt;ul&gt;
&lt;li&gt;How the DynamoDB DNS race condition happened&lt;/li&gt;
&lt;li&gt;Why a 2.5-hour bug turned into a 15-hour outage&lt;/li&gt;
&lt;li&gt;How metastable failure overwhelmed EC2&lt;/li&gt;
&lt;li&gt;How the failure cascaded across the internet&lt;/li&gt;
&lt;li&gt;How to architect systems to avoid such collapses&lt;/li&gt;
&lt;/ul&gt;




&lt;h1&gt;
  
  
  Part 1: The Root Cause (The “How” and “Why”)
&lt;/h1&gt;

&lt;h2&gt;
  
  
  DynamoDB DNS Automation Internals
&lt;/h2&gt;

&lt;p&gt;DynamoDB uses a two-part subsystem to maintain consistent DNS entries:&lt;/p&gt;

&lt;h3&gt;
  
  
  DNS Planner
&lt;/h3&gt;

&lt;p&gt;Generates routing configuration sets called &lt;strong&gt;plans&lt;/strong&gt; that describe:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Backend server lists&lt;/li&gt;
&lt;li&gt;Health and routing weights&lt;/li&gt;
&lt;li&gt;Failover settings&lt;/li&gt;
&lt;li&gt;DNS TTL values&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  DNS Enactors
&lt;/h3&gt;

&lt;p&gt;Distributed workers that read these plans and apply them to &lt;strong&gt;Route 53&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;They operate independently across Availability Zones for fault tolerance.&lt;/p&gt;




&lt;h2&gt;
  
  
  What Went Wrong
&lt;/h2&gt;

&lt;p&gt;On October 20:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;One Enactor stalled&lt;/strong&gt; while processing &lt;strong&gt;Plan-100&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Other Enactors applied &lt;strong&gt;Plan-101&lt;/strong&gt; and &lt;strong&gt;Plan-102&lt;/strong&gt; successfully.&lt;/li&gt;
&lt;li&gt;A cleanup job deleted old plans, including Plan-100.&lt;/li&gt;
&lt;li&gt;Hours later, the slow Enactor resumed and applied Plan-100.&lt;/li&gt;
&lt;li&gt;Because the plan no longer existed, it submitted an &lt;strong&gt;empty DNS update&lt;/strong&gt;.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;The endpoint:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;dynamodb.us-east-1.amazonaws.com
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;now pointed to no IP addresses.&lt;/p&gt;

&lt;p&gt;DynamoDB continued running internally, but DNS made it unreachable.&lt;br&gt;
This was the spark that triggered the larger cascade.&lt;/p&gt;
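The five steps above can be reduced to a toy simulation (the `Plan-…` numbering follows the article; the code illustrates the race, it is not AWS's implementation):

```python
# Toy DNS race: a stalled enactor applies a plan that a cleanup job
# already deleted, submitting an empty record set.
plans = {
    100: ["10.0.0.1", "10.0.0.2"],
    101: ["10.0.0.1", "10.0.0.3"],
    102: ["10.0.0.4"],
}
dns = {}

def enact(plan_id):
    # A deleted plan yields no addresses, i.e. an empty DNS update.
    dns["dynamodb.us-east-1.amazonaws.com"] = plans.get(plan_id, [])

enact(101); enact(102)   # healthy enactors apply the newer plans
del plans[100]           # cleanup removes "old" Plan-100
enact(100)               # hours later, the stalled enactor resumes

print(dns)  # {'dynamodb.us-east-1.amazonaws.com': []}
```

The missing safeguard in this sketch is a freshness check before applying: an enactor that refused to apply any plan older than the newest one already enacted would break the race.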




&lt;h2&gt;
  
  
  DNS Race Condition Diagram
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flov15a1eieyz5rzkwo38.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flov15a1eieyz5rzkwo38.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Explanation:&lt;/strong&gt; Shows how a delayed Enactor reapplied outdated state after deletion, erasing DynamoDB’s DNS entry.&lt;/p&gt;




&lt;h1&gt;
  
  
  Part 2: The Cascade (How a 2.5-Hour Bug Became a 15-Hour Outage)
&lt;/h1&gt;

&lt;p&gt;AWS fixed DNS in &lt;strong&gt;~2.5 hours&lt;/strong&gt;, but the region did not recover because it entered a &lt;strong&gt;metastable failure&lt;/strong&gt; state.&lt;/p&gt;

&lt;p&gt;A metastable system is “alive but stuck” because:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;backlog &amp;gt; processing capacity&lt;/li&gt;
&lt;li&gt;retry storms amplify load&lt;/li&gt;
&lt;li&gt;recovery cannot progress&lt;/li&gt;
&lt;/ul&gt;
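A toy model of that loop (the numbers are arbitrary, chosen only to show the shape): each tick the system serves a fixed capacity, every unserved request retries, and fresh traffic keeps arriving, so the backlog grows instead of draining.

```python
# Toy metastable-failure loop: once backlog exceeds capacity,
# retries plus fresh traffic regenerate the queue faster than it drains.
CAPACITY = 100      # requests served per tick (arbitrary)
NEW_TRAFFIC = 150   # fresh requests arriving per tick (arbitrary)
backlog = 500       # queue depth when DNS was restored

history = []
for tick in range(10):
    served = min(backlog, CAPACITY)
    retries = backlog - served      # every failure retries next tick
    backlog = retries + NEW_TRAFFIC
    history.append(backlog)

print(history[-1])  # 1000: the queue grew instead of draining
```

In the same model, raising effective capacity above the arrival rate (or throttling arrivals below capacity) lets the queue drain to a steady state — which is exactly what AWS's global throttling during recovery was buying.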




&lt;h2&gt;
  
  
  Step-by-Step Breakdown
&lt;/h2&gt;

&lt;h3&gt;
  
  
  1. EC2’s Droplet Workflow Manager Failed
&lt;/h3&gt;

&lt;p&gt;DWFM stores host leases and lifecycle metadata in DynamoDB.&lt;/p&gt;

&lt;p&gt;When DynamoDB became unreachable:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Lease renewals failed&lt;/li&gt;
&lt;li&gt;Autoscaling operations stalled&lt;/li&gt;
&lt;li&gt;Millions of internal control-plane writes backed up&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  2. Synchronized Retry Storm
&lt;/h3&gt;

&lt;p&gt;Once DNS was restored:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;EC2 hosts&lt;/li&gt;
&lt;li&gt;AWS internal services&lt;/li&gt;
&lt;li&gt;Customer workloads&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;all retried at the same time.&lt;/p&gt;

&lt;p&gt;This &lt;strong&gt;thundering herd&lt;/strong&gt; instantly saturated DynamoDB and EC2.&lt;/p&gt;

&lt;h3&gt;
  
  
  3. Congestive Collapse
&lt;/h3&gt;

&lt;p&gt;Symptoms:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;100 percent CPU&lt;/li&gt;
&lt;li&gt;Zero progress&lt;/li&gt;
&lt;li&gt;Endless retries&lt;/li&gt;
&lt;li&gt;Growing queues&lt;/li&gt;
&lt;li&gt;No way to drain backlog sequentially&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  4. Manual Recovery
&lt;/h3&gt;

&lt;p&gt;AWS engineers had to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Implement global throttling&lt;/li&gt;
&lt;li&gt;Purge corrupted internal queues&lt;/li&gt;
&lt;li&gt;Restart EC2 control-plane nodes&lt;/li&gt;
&lt;li&gt;Gradually rebuild DynamoDB state&lt;/li&gt;
&lt;li&gt;Slowly warm caches&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Most of the &lt;strong&gt;15-hour outage&lt;/strong&gt; was spent on recovery, not on fixing the root cause.&lt;/p&gt;




&lt;h2&gt;
  
  
  Metastable Failure Loop Diagram
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F45xigqdw73vbqpwn3zkx.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F45xigqdw73vbqpwn3zkx.png" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Explanation:&lt;/strong&gt; Shows how retries overloaded the control plane, preventing state from stabilizing even after DynamoDB’s DNS was fixed.&lt;/p&gt;




&lt;h1&gt;
  
  
  Part 3: The Blast Radius (Who Was Affected)
&lt;/h1&gt;

&lt;h2&gt;
  
  
  Internal AWS Failures
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;DynamoDB:&lt;/strong&gt; DNS unreachable&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;EC2:&lt;/strong&gt; Lifecycle and autoscaling halted&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;IAM / STS:&lt;/strong&gt; Auth failures cascaded to all clients&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Lambda:&lt;/strong&gt; Triggers, scaling, and invocations failed&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Redshift:&lt;/strong&gt; Control-plane operations stalled&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;NLB:&lt;/strong&gt; Health checks degraded&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;AWS Support Console:&lt;/strong&gt; Partially offline&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  External Impact (2,000+ Companies)
&lt;/h2&gt;

&lt;p&gt;More than &lt;strong&gt;8 million&lt;/strong&gt; user-facing errors occurred.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Category&lt;/th&gt;
&lt;th&gt;Examples&lt;/th&gt;
&lt;th&gt;Impact&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Social / Messaging&lt;/td&gt;
&lt;td&gt;Snapchat, Signal, Discord&lt;/td&gt;
&lt;td&gt;Login failures, message delays&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Gaming / Media&lt;/td&gt;
&lt;td&gt;Roblox, Fortnite, Disney+&lt;/td&gt;
&lt;td&gt;Playback and matchmaking failures&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Productivity&lt;/td&gt;
&lt;td&gt;Canva, Duolingo, Atlassian&lt;/td&gt;
&lt;td&gt;API failures, degraded workflows&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Finance&lt;/td&gt;
&lt;td&gt;Venmo, Coinbase, Banks&lt;/td&gt;
&lt;td&gt;Payments stuck, verification delays&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;IoT&lt;/td&gt;
&lt;td&gt;Alexa, Ring&lt;/td&gt;
&lt;td&gt;Device control and telemetry failures&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;US-EAST-1’s failure rippled across global internet infrastructure.&lt;/p&gt;




&lt;h2&gt;
  
  
  Cascade Dependency Tree Diagram
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkl8nig0danx88xzltuzk.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkl8nig0danx88xzltuzk.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Explanation:&lt;/strong&gt; Visualizes how DynamoDB sits at the foundation of multiple AWS control planes. Once its DNS failed, the outage propagated upward through EC2, IAM, Lambda, and into customer workloads.&lt;/p&gt;




&lt;h1&gt;
  
  
  Part 4: How to Architect for Resilience Next Time
&lt;/h1&gt;

&lt;p&gt;These lessons apply to any large distributed system.&lt;/p&gt;




&lt;h2&gt;
  
  
  1. Reduce Regional Blast Radius
&lt;/h2&gt;

&lt;p&gt;Use:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Multi-region architectures&lt;/li&gt;
&lt;li&gt;DynamoDB Global Tables&lt;/li&gt;
&lt;li&gt;Route 53 failover&lt;/li&gt;
&lt;li&gt;AWS Global Accelerator&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Critical workloads must not rely solely on US-EAST-1.&lt;/p&gt;




&lt;h2&gt;
  
  
  2. Prevent Thundering Herds
&lt;/h2&gt;

&lt;p&gt;Implement disciplined retry strategies:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Exponential backoff&lt;/li&gt;
&lt;li&gt;Full jitter&lt;/li&gt;
&lt;li&gt;Retry budgets&lt;/li&gt;
&lt;li&gt;Max retry caps&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Retries should help recovery, not destroy it.&lt;/p&gt;
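The four items above combine into the full-jitter pattern described in the AWS Builders Library; a minimal Python sketch:

```python
import random

def full_jitter_delay(attempt, base=0.1, cap=30.0):
    """Full-jitter exponential backoff: wait a random amount in
    [0, min(cap, base * 2**attempt)] before the next retry."""
    return random.uniform(0, min(cap, base * 2 ** attempt))

MAX_RETRIES = 5  # a hard cap keeps each caller's retry budget bounded

delays = [full_jitter_delay(a) for a in range(MAX_RETRIES)]
print(max(delays) <= 30.0)  # True: delays never exceed the cap
```

The randomness is the point: jitter spreads clients out in time so that a recovering dependency is not hit by everyone at once.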




&lt;h2&gt;
  
  
  3. Use Circuit Breakers
&lt;/h2&gt;

&lt;p&gt;Circuit breakers:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Detect repeated failures&lt;/li&gt;
&lt;li&gt;Stop calling the dependency&lt;/li&gt;
&lt;li&gt;Fail fast&lt;/li&gt;
&lt;li&gt;Reopen slowly&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This prevents your service from participating in a cascading overload.&lt;/p&gt;
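A minimal circuit-breaker sketch in Python (the "reopen slowly" half-open state is omitted for brevity; all names are illustrative):

```python
class CircuitBreaker:
    """Opens after `threshold` consecutive failures; open calls fail
    fast instead of hitting the struggling dependency again."""

    def __init__(self, threshold=3):
        self.threshold = threshold
        self.failures = 0

    @property
    def is_open(self):
        return self.failures >= self.threshold

    def call(self, fn):
        if self.is_open:
            raise RuntimeError("circuit open: failing fast")
        try:
            result = fn()
        except Exception:
            self.failures += 1   # count consecutive failures
            raise
        self.failures = 0        # any success closes the breaker
        return result

def flaky():
    raise TimeoutError("dependency down")

breaker = CircuitBreaker()
for _ in range(3):
    try:
        breaker.call(flaky)
    except TimeoutError:
        pass
print(breaker.is_open)  # True: the dependency is no longer being hammered
```

A production breaker would also move to a half-open state after a cooldown, letting a trickle of probe requests through before fully closing again.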




&lt;h2&gt;
  
  
  4. Test Disaster Recovery with Chaos Engineering
&lt;/h2&gt;

&lt;p&gt;Simulate:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Regional DynamoDB outages&lt;/li&gt;
&lt;li&gt;IAM / STS failures&lt;/li&gt;
&lt;li&gt;EC2 API throttling&lt;/li&gt;
&lt;li&gt;Partial DNS failures&lt;/li&gt;
&lt;li&gt;Cross-region failover&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;A DR plan is only real once tested.&lt;/p&gt;




&lt;h1&gt;
  
  
  Closing Thoughts
&lt;/h1&gt;

&lt;p&gt;The October 2025 AWS outage was a reminder that:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;A small bug can ripple across global infrastructure&lt;/li&gt;
&lt;li&gt;DNS misconfigurations can disable entire services&lt;/li&gt;
&lt;li&gt;Control-plane failures are more destructive than data-plane failures&lt;/li&gt;
&lt;li&gt;Regional dependence is a systemic risk&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Cloud resilience is not automatic.&lt;br&gt;
It must be intentionally engineered.&lt;/p&gt;

&lt;p&gt;Your architecture must assume US-EAST-1 can fail.&lt;br&gt;
Because one day, it will.&lt;/p&gt;




&lt;h1&gt;
  
  
  References and Further Reading
&lt;/h1&gt;

&lt;h3&gt;
  
  
  AWS Official
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;AWS Global Infrastructure
&lt;a href="https://aws.amazon.com/about-aws/global-infrastructure/" rel="noopener noreferrer"&gt;https://aws.amazon.com/about-aws/global-infrastructure/&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;DynamoDB Global Tables
&lt;a href="https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/GlobalTables.html" rel="noopener noreferrer"&gt;https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/GlobalTables.html&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;AWS Fault Injection Simulator
&lt;a href="https://aws.amazon.com/fis/" rel="noopener noreferrer"&gt;https://aws.amazon.com/fis/&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;AWS GameDay
&lt;a href="https://aws.amazon.com/gameday/" rel="noopener noreferrer"&gt;https://aws.amazon.com/gameday/&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;AWS Builders Library: Exponential Backoff and Jitter
&lt;a href="https://aws.amazon.com/builders-library/timeouts-retries-backoff/" rel="noopener noreferrer"&gt;https://aws.amazon.com/builders-library/timeouts-retries-backoff/&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  AWS Postmortem
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Why DynamoDB Failed in October 2025 (AWS Builder’s Library)
&lt;a href="https://builder.aws.com/content/34TzjGmCIBLhnT1b5tn6bgttlI1/por-que-fallo-dynamodb-en-octubre-de-2025" rel="noopener noreferrer"&gt;https://builder.aws.com/content/34TzjGmCIBLhnT1b5tn6bgttlI1/por-que-fallo-dynamodb-en-octubre-de-2025&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Independent Analysis
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Wired: What the AWS Outage Reveals About the Internet
&lt;a href="https://www.wired.com/story/what-that-huge-aws-outage-reveals-about-the-internet/" rel="noopener noreferrer"&gt;https://www.wired.com/story/what-that-huge-aws-outage-reveals-about-the-internet/&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Cloudflare Radar: Outage Impact
&lt;a href="https://radar.cloudflare.com/" rel="noopener noreferrer"&gt;https://radar.cloudflare.com/&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;ThousandEyes AWS Outage Breakdown
&lt;a href="https://www.thousandeyes.com/blog" rel="noopener noreferrer"&gt;https://www.thousandeyes.com/blog&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Reuters Report on AWS Outage
&lt;a href="https://www.reuters.com/" rel="noopener noreferrer"&gt;https://www.reuters.com/&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;The Guardian Coverage
&lt;a href="https://www.theguardian.com/" rel="noopener noreferrer"&gt;https://www.theguardian.com/&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Thundergolfer Deep Analysis
&lt;a href="https://thundergolfer.com/" rel="noopener noreferrer"&gt;https://thundergolfer.com/&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;




</description>
      <category>architecture</category>
      <category>aws</category>
      <category>cloud</category>
    </item>
    <item>
      <title>A Practical Guide to Passing the MongoDB Certified DBA Exam</title>
      <dc:creator>Bonkur Harshith Reddy</dc:creator>
      <pubDate>Wed, 24 Sep 2025 15:29:05 +0000</pubDate>
      <link>https://dev.to/harshith_reddy_dev/a-practical-guide-to-passing-the-mongodb-certified-dba-exam-314k</link>
      <guid>https://dev.to/harshith_reddy_dev/a-practical-guide-to-passing-the-mongodb-certified-dba-exam-314k</guid>
      <description>&lt;p&gt;I recently passed the &lt;a href="https://learn.mongodb.com/c/TIBAK4NBTa6_j0zdNlgYfA" rel="noopener noreferrer"&gt;MongoDB Certified DBA exam&lt;/a&gt;, and I want to share the straightforward, no-fluff study plan you can follow to do the same. This guide focuses on the official resources and strategies that actually work so no dumps, no guesswork, just a clear path to getting certified.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Quick Facts about the Exam&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Format&lt;/strong&gt;: 66 MCQs&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Time Limit&lt;/strong&gt;: 90 mins&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Cost&lt;/strong&gt;: ~$150 USD (This can change, so check MongoDB University for current pricing).&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  How to Get a Discount (or Even a Free Exam!)
&lt;/h3&gt;

&lt;p&gt;The $150 exam fee can be a barrier, but you should never have to pay the full price. Here’s how to claim the two most common vouchers.&lt;/p&gt;

&lt;h4&gt;
  
  
  The 50% Discount (For Everyone)
&lt;/h4&gt;

&lt;p&gt;This is the standard discount available to anyone who prepares using the official materials. The process is simple:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt; &lt;strong&gt;Enroll in the "&lt;a href="https://learn.mongodb.com/learning-paths/mongodb-database-admin-self-managed-path" rel="noopener noreferrer"&gt;MongoDB Database Admin Path&lt;/a&gt;"&lt;/strong&gt; on MongoDB University.&lt;/li&gt;
&lt;li&gt; &lt;strong&gt;Complete the entire learning path.&lt;/strong&gt; This means watching all the lectures and, most importantly, finishing all the labs and quizzes.&lt;/li&gt;
&lt;li&gt; Upon completion, MongoDB will automatically email you a &lt;strong&gt;50% discount code&lt;/strong&gt; to use when you register for the exam.&lt;/li&gt;
&lt;/ol&gt;

&lt;h4&gt;
  
  
  The 100% Free Voucher (For Students)
&lt;/h4&gt;

&lt;p&gt;If you are a student, you can get the exam for free through the &lt;strong&gt;GitHub Student Developer Pack&lt;/strong&gt;. This requires a few extra steps but is absolutely worth it.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt; &lt;strong&gt;Get the GitHub Student Developer Pack:&lt;/strong&gt; First, you must be verified as a student by GitHub. If you haven't already, sign up for the &lt;a href="https://education.github.com/pack" rel="noopener noreferrer"&gt;GitHub Student Developer Pack&lt;/a&gt;. This process may require you to submit proof of enrollment.&lt;/li&gt;
&lt;li&gt; &lt;strong&gt;Find the MongoDB Offer:&lt;/strong&gt; Once you have the pack, log in and look through the list of partner offers for MongoDB.&lt;/li&gt;
&lt;li&gt; &lt;strong&gt;Activate the Offer:&lt;/strong&gt; Click the link to activate the MongoDB offer. This will typically redirect you to MongoDB's website to create or link an account, granting you benefits like Atlas credits and, most importantly, the 100% exam voucher.&lt;/li&gt;
&lt;li&gt; &lt;strong&gt;Complete the Learning Path:&lt;/strong&gt; Just like with the 50% discount, you will still need to complete the "MongoDB Database Admin Path" on MongoDB University to be eligible to use your voucher.&lt;/li&gt;
&lt;/ol&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;What if you only get 50% off?&lt;/strong&gt;&lt;br&gt;
Sometimes, students who register through the GitHub Student Developer Pack might still only see the standard 50% discount. If this happens to you, don't worry. MongoDB has a support form to resolve this.&lt;br&gt;
&lt;strong&gt;&lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSed69BbIECPWyaNjRfl2DN6ba3S8B8fY0V-Nhd9zuM0FD31PQ/viewform" rel="noopener noreferrer"&gt;Fill out this official support form&lt;/a&gt;&lt;/strong&gt;, and the MongoDB education team will manually verify your status and apply the 100% voucher to your account.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;strong&gt;What’s the Passing Score?&lt;/strong&gt;&lt;br&gt;
This is a very common question, and the official answer is that &lt;em&gt;MongoDB does not publish the exact passing score&lt;/em&gt;. According to the official exam guide, the required percentage is determined through statistical analysis for each version of the exam and is not publicly shared.&lt;br&gt;
The important thing to know is that you only need to achieve an overall passing score. You do not need to pass each individual topic or domain.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The Best Free Resources (Use These, In This Order)&lt;/strong&gt;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;MongoDB University “&lt;a href="https://learn.mongodb.com/learning-paths/mongodb-database-admin-self-managed-path" rel="noopener noreferrer"&gt;MongoDB Database Admin Path&lt;/a&gt;”&lt;/strong&gt;: This is your single most important resource. Complete the entire path, including all the hands-on labs.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;&lt;a href="https://learn.mongodb.com/learn/course/mongodb-associate-dba-exam-study-guide/main/mongodb-associate-dba-exam-study-guide" rel="noopener noreferrer"&gt;Official Associate DBA Exam Study Guide&lt;/a&gt;&lt;/strong&gt;: Use this as your master checklist. If it's on the guide, you need to know it.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;&lt;a href="https://learn.mongodb.com/courses/associate-database-administrator-practice-questions" rel="noopener noreferrer"&gt;Official Practice Questions&lt;/a&gt;&lt;/strong&gt;: These are pure gold. The style is very similar to the real exam. Do them multiple times, and don't move on from a wrong answer until you understand why you got it wrong.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;&lt;a href="https://www.mongodb.com/docs/" rel="noopener noreferrer"&gt;MongoDB Documentation&lt;/a&gt;&lt;/strong&gt;: The ultimate source of truth. When a course lesson feels light on details, go to the docs.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;&lt;a href="https://www.mongodb.com/community/forums/" rel="noopener noreferrer"&gt;MongoDB Developer Community Forums&lt;/a&gt;&lt;/strong&gt;: Perfect for asking specific technical questions. You'll often get answers from MongoDB staff.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Reddit (&lt;a href="https://www.reddit.com/r/mongodb/" rel="noopener noreferrer"&gt;r/mongodb&lt;/a&gt;)&lt;/strong&gt;: Excellent for candid advice and real-world exam experiences from recent test-takers.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Helpful YouTube Channels&lt;/strong&gt;: For visual learners, channels like the &lt;a href="https://www.youtube.com/@MongoDB" rel="noopener noreferrer"&gt;Official MongoDB channel&lt;/a&gt;, freeCodeCamp, Edureka, and Bro Code offer excellent tutorials that can help reinforce complex topics.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;Key Topics to Master&lt;/strong&gt;&lt;br&gt;
The exam covers eight main domains. While you don't need to pass each one individually, focusing on the heavily weighted topics like Indexing and CRUD is key to achieving a high overall score.&lt;br&gt;
&lt;strong&gt;Here's a visual breakdown of the key topics&lt;/strong&gt;:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;CRUD&lt;/strong&gt; (26%): Query patterns, update operators, and aggregation fundamentals.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Indexes&lt;/strong&gt; (18%): Single-field, compound, and multikey indexes.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Security&lt;/strong&gt; (15%): Role-Based Access Control (RBAC), authentication, and authorization.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Replication&lt;/strong&gt; (14%): Replica set architecture, failover mechanics, and read preferences.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Server Administration&lt;/strong&gt; (10%): Configuration, backups, and monitoring.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Monitoring&lt;/strong&gt; (9%): Reading alerts, monitoring storage, and currentOp.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Philosophy &amp;amp; Features&lt;/strong&gt; (7%): Core concepts of the document model and sharding.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Backup and Recovery&lt;/strong&gt; (1%): Backup strategies and restore tooling.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Feu6tsys4dco81x38fx3u.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Feu6tsys4dco81x38fx3u.png" alt=" " width="726" height="463"&gt;&lt;/a&gt;&lt;/p&gt;
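&lt;p&gt;As a quick refresher on the two heaviest domains, here's a short &lt;code&gt;mongosh&lt;/code&gt; sketch (the &lt;code&gt;movies&lt;/code&gt; collection and its fields are just illustrative examples) covering a filtered query, an update operator, and a compound index:&lt;/p&gt;

&lt;pre&gt;&lt;code&gt;// CRUD: find with a query filter and a projection
db.movies.find({ year: { $gte: 2000 } }, { title: 1, _id: 0 })

// CRUD: update operator ($set) on one matching document
db.movies.updateOne({ title: "Inception" }, { $set: { rating: 8.8 } })

// Indexes: compound index, then confirm the planner actually uses it
db.movies.createIndex({ year: 1, rating: -1 })
db.movies.find({ year: 2010 }).sort({ rating: -1 }).explain("executionStats")
&lt;/code&gt;&lt;/pre&gt;

&lt;p&gt;Reading the &lt;code&gt;explain()&lt;/code&gt; output to tell an index scan (IXSCAN) from a collection scan (COLLSCAN) is exactly the kind of skill the Indexes domain rewards.&lt;/p&gt;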

&lt;p&gt;&lt;strong&gt;Suggested 4-Week Study Roadmap&lt;/strong&gt;&lt;br&gt;
This is an aggressive but achievable timeline if you can dedicate a few hours each day. Adjust it based on your prior experience.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Week 1&lt;/strong&gt;: The Fundamentals.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Complete the first half of the MongoDB University Admin Path (M001, CRUD, and Indexing courses).&lt;/li&gt;
&lt;li&gt;Build and query a simple database locally. Get comfortable in the shell.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Week 2&lt;/strong&gt;: Administration &amp;amp; High Availability.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Focus on the Server Administration and Replication courses.&lt;/li&gt;
&lt;li&gt;Hands-On Goal: Set up a basic 3-node replica set on your local machine and practice initiating a failover. This was a game-changer for my own understanding.&lt;/li&gt;
&lt;/ul&gt;
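
&lt;p&gt;The hands-on goal above can be sketched as follows (ports, data paths, and the replica set name are illustrative; adjust them for your machine):&lt;/p&gt;

&lt;pre&gt;&lt;code&gt;# Start three mongod instances that belong to the same replica set
mongod --replSet rs0 --port 27017 --dbpath /data/rs0-0 --fork --logpath /data/rs0-0.log
mongod --replSet rs0 --port 27018 --dbpath /data/rs0-1 --fork --logpath /data/rs0-1.log
mongod --replSet rs0 --port 27019 --dbpath /data/rs0-2 --fork --logpath /data/rs0-2.log
&lt;/code&gt;&lt;/pre&gt;

&lt;pre&gt;&lt;code&gt;// In mongosh: initiate the set, then practice a failover
rs.initiate({
  _id: "rs0",
  members: [
    { _id: 0, host: "localhost:27017" },
    { _id: 1, host: "localhost:27018" },
    { _id: 2, host: "localhost:27019" }
  ]
})

rs.stepDown()   // on the primary: forces an election, simulating failover
&lt;/code&gt;&lt;/pre&gt;

&lt;p&gt;Watching &lt;code&gt;rs.status()&lt;/code&gt; before and after the step-down makes the election mechanics concrete in a way no slide deck can.&lt;/p&gt;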

&lt;p&gt;&lt;strong&gt;Week 3&lt;/strong&gt;: Advanced Topics.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Work through the Sharding and Security courses.&lt;/li&gt;
&lt;li&gt;Review the Official Study Guide and dive into the documentation for any weak areas.&lt;/li&gt;
&lt;/ul&gt;
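
&lt;p&gt;For the Security course, one lab worth repeating until it's muscle memory is creating a user with a scoped role. A minimal &lt;code&gt;mongosh&lt;/code&gt; sketch (the user and database names are placeholders):&lt;/p&gt;

&lt;pre&gt;&lt;code&gt;// Create a user who can only read and write one database
use admin
db.createUser({
  user: "appUser",
  pwd: passwordPrompt(),   // prompts interactively instead of hard-coding the password
  roles: [ { role: "readWrite", db: "inventory" } ]
})

// Verify the grants
db.getUser("appUser")
&lt;/code&gt;&lt;/pre&gt;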

&lt;p&gt;&lt;strong&gt;Week 4&lt;/strong&gt;: Practice and Review.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Take the Official Practice Questions daily. Aim for a consistent score of 90% or higher.&lt;/li&gt;
&lt;li&gt;Practice under time pressure. Give yourself 90 seconds per question to simulate the real exam environment.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Final Pre-Exam Checklist&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Completed the entire MongoDB Database Admin learning path.&lt;/li&gt;
&lt;li&gt;Read through every topic on the Official Study Guide.&lt;/li&gt;
&lt;li&gt;Scored 90%+ on the official practice questions multiple times.&lt;/li&gt;
&lt;li&gt;Performed hands-on labs for setting up a replica set, performing a backup/restore, and configuring a user with specific roles.&lt;/li&gt;
&lt;li&gt;Verified the exam price and used your discount voucher on the MongoDB University site.&lt;/li&gt;
&lt;/ul&gt;
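
&lt;p&gt;For the backup/restore lab in that checklist, &lt;code&gt;mongodump&lt;/code&gt; and &lt;code&gt;mongorestore&lt;/code&gt; are the standard tools; a minimal sketch (the URIs and paths are examples):&lt;/p&gt;

&lt;pre&gt;&lt;code&gt;# Dump a single database to a local directory
mongodump --uri="mongodb://localhost:27017" --db=inventory --out=/backups/inventory-dump

# Restore it into a scratch deployment to verify the backup is actually usable
mongorestore --uri="mongodb://localhost:27018" /backups/inventory-dump
&lt;/code&gt;&lt;/pre&gt;

&lt;p&gt;Restoring into a separate deployment, rather than trusting that the dump "looked fine", is the habit the exam's backup questions tend to probe.&lt;/p&gt;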

&lt;p&gt;Passing the MongoDB Certified DBA exam is completely achievable with focused study and hands-on practice. Trust the official resources, adopt a practice-first mindset, and you'll be well on your way.&lt;br&gt;
Good luck and happy indexing! If you have any questions, feel free to drop a comment below. I'd be happy to help.&lt;/p&gt;

</description>
      <category>mongodb</category>
      <category>database</category>
      <category>beginners</category>
      <category>tutorial</category>
    </item>
  </channel>
</rss>
