Tilak Upadhyay

Posted on Oct 6

Unpacking Application Security: A Comprehensive Threat Modeling Guide

#security #threatmodeling #appsec #prodsec

In the fast-paced world of software development, building features quickly often takes precedence. However, neglecting security early in the development lifecycle can lead to significant technical debt, costly breaches, and reputational damage down the line. This is where Threat Modeling shines. It's not just a buzzword; it's a proactive, structured approach to identifying potential security weaknesses in your application before they become exploitable vulnerabilities.

It’s crucial to understand that threat modeling isn't a rigid, one-size-fits-all process. Different individuals, teams, or organizations may approach threat modeling with varying methodologies, tools, and perspectives based on their unique knowledge, risk appetite, and operational context. The walkthrough presented here represents one effective way to conduct a thorough threat model, but it's important to remember that other valid and insightful approaches exist. The core objective remains the same: to systematically think like an attacker to build a stronger defense.

Let's embark on a practical, end-to-end journey through a threat modeling exercise for a realistic application.

The Scenario: "PlexiDocs" - A Secure Document Collaboration Platform

To make this practical, imagine we are the architects behind PlexiDocs, a new cloud-based web application. PlexiDocs is designed for small to medium-sized businesses to securely store, share, and collaborate on highly sensitive documents such as legal contracts, financial reports, and HR records. The emphasis is on strong security and ease of collaboration.

Core Features of PlexiDocs:

User Roles & Access Control: The platform distinguishes between two primary user types: Admins (who can manage users, billing, and all documents) and Members (who can manage documents they own or are shared with them). Both can upload, view, edit, and delete documents within their permissions.
Document Management: Users can upload various document types (PDFs, DOCX, XLSX). For each document uploaded, the application automatically generates a small thumbnail preview to enhance user experience. Documents can be organized into a hierarchical folder structure.
Secure Collaboration: Users have the ability to share a specific document with external parties (individuals who are not PlexiDocs users). This is done by generating a unique, hard-to-guess "secret" link. Anyone possessing this link can view the document, but cannot edit or download it directly without further authentication.
User Authentication & Recovery: Users log in with a standard username and password. A "Forgot Password" feature is also implemented, which sends a password reset link to the user's registered email address.

Technical Architecture of PlexiDocs:

Frontend: A modern Single-Page Application (SPA) built using a popular JavaScript framework (e.g., React, Angular, Vue.js). This runs entirely in the user's web browser and communicates with the backend via APIs.
Backend API: A RESTful API layer (e.g., built with Node.js, Python/Django, Java/Spring Boot) that serves as the central brain of the application. It handles all business logic, user authentication requests, authorization checks, and orchestrates interactions with databases and file storage.
Database: A relational SQL database (e.g., PostgreSQL, MySQL) serves two primary functions:
- Stores user information, including hashed passwords, user roles, and profile details.
- Stores document metadata, such as file names, sizes, creation dates, owner IDs, folder paths, and access control lists (ACLs) for shared documents.
File Storage: An object storage service (e.g., Amazon S3, Google Cloud Storage) is used to store the actual, raw document files uploaded by users. The backend API generates temporary, pre-signed URLs, allowing the frontend to directly upload/download files to/from the object storage, which improves performance and reduces backend load.
Thumbnail Service: This is a separate, dedicated microservice. When a new document is uploaded and processed by the Backend API, a notification is sent to this service. It then securely retrieves the document from the file storage, processes it to generate a thumbnail image (e.g., a JPEG or PNG preview), and saves this thumbnail back into the file storage, updating the document metadata.

With a clear understanding of PlexiDocs, its features, and its underlying technology, we can now commence our security analysis. Let's put on our security hats and walk through the classic four-step threat modeling process.

Step 1: Decompose the Application (What Are We Building?)

The foundational step in threat modeling is to thoroughly understand the system we are trying to secure. We cannot identify vulnerabilities in something we don't fully comprehend. Application decomposition involves breaking down the application into its constituent parts and mapping out how data flows between them. The most effective tool for this is a Data Flow Diagram (DFD).

A DFD provides a visual representation using simple, standardized symbols:

External Entities (Rectangles): Users or external systems that interact with our application but are outside its direct control.
Processes (Circles): Components that transform or handle data within our application.
Data Stores (Parallel Lines): Places where data resides persistently.
Data Flows (Arrows): The paths that data takes between entities, processes, and stores.
Trust Boundaries (Dotted Lines): Crucially, these delineate areas with different levels of trust. Crossing a trust boundary is often where attackers look for weaknesses.

Here’s the DFD for PlexiDocs, outlining these components and their interactions:

Understanding Key Data Flows:

User Authentication:
- The User (E1) initiates a login, sending DF1: User Credentials to the Authentication Service (P1).
- P1 verifies these credentials against D1: User Database.
- Upon successful login, P1 issues DF2: Session Token back to the User's browser, enabling subsequent authenticated requests.
Document Upload:
- A logged-in User (E1) sends DF3: Document Upload Request to the Document Management Service (P2).
- P2 interacts with D2: Document Metadata Database to record initial metadata.
- P2 then generates DF4: Signed URL for Upload/Download and sends it back to the User.
- The User's browser uses this signed URL to directly upload the file to D3: File Storage.
Thumbnail Generation:
- After a successful document upload, P2: Document Management Service sends DF8: New File Notification to P4: Thumbnail Generation Service.
- P4 then performs DF9: File Download for Processing from D3: File Storage, generates the thumbnail, and executes DF10: Thumbnail Upload back to D3.
- Optionally, P4 might update D2: Document Metadata Database with details about the new thumbnail.

Trust Boundaries are Critical:
Observe the dotted lines. The Internet <-> Application Boundary is the most significant, separating our controlled environment from the untrusted public internet. We also have an Internal Service Boundary encapsulating the Thumbnail Generation Service, indicating that this service is potentially isolated and might operate with different trust assumptions or permissions. Any interaction that crosses these boundaries demands heightened security scrutiny.

Step 2: Identify Threats (What Can Go Wrong?)

With our DFD clearly illustrating the system's architecture and data flows, we can now systematically brainstorm potential security vulnerabilities. This is where we shift our mindset to think like an adversary. A highly effective and widely adopted framework for this is STRIDE, a mnemonic for six primary categories of threats, developed by Microsoft. We apply STRIDE to each element of our DFD (External Entities, Processes, Data Stores, Data Flows).

Spoofing: Pretending to be someone or something you're not. (e.g., impersonating a user or a trusted service)
Tampering: Modifying data or code. (e.g., altering a document, changing a user's role)
Repudiation: Denying that an action took place. (e.g., a user denying they deleted a file, lacking proof)
Information Disclosure: Exposing information to unauthorized individuals. (e.g., sensitive data leaks, viewing private documents)
Denial of Service (DoS): Making a system or resource unavailable to legitimate users. (e.g., flooding a server, resource exhaustion)
Elevation of Privilege: Gaining capabilities or access beyond what one is authorized for. (e.g., a 'Member' acting as an 'Admin')

Let's apply STRIDE to several key components of PlexiDocs to build a list of potential threats:

Category	Component / Data Flow	Threat Description
Information Disclosure	Data Flow: User Credentials	(Threat #1) Credentials are sent over an unencrypted channel (HTTP), allowing a Man-in-the-Middle attacker to intercept and steal the password.
Elevation of Privilege	Process: Document Mgmt Service	(Threat #2) An authenticated `Member` user modifies the document ID in an API call (e.g., `/api/docs/123` to `/api/docs/124`), bypassing authorization and accessing another user's document (Insecure Direct Object Reference - IDOR).
Repudiation	Process: Authentication Service	(Threat #3) An `Admin` user performs a sensitive action (e.g., deleting another user account) and later denies having done so. The system lacks robust, immutable audit logging for such critical actions to provide non-repudiable proof.
Denial of Service	Process: Thumbnail Service	(Threat #4) An attacker uploads a computationally intensive file (e.g., a "zip bomb" or a massive image file) designed to exhaust the CPU and memory of the `Thumbnail Service`, preventing it from processing legitimate documents.
Information Disclosure	Process: Link Sharing Service	(Threat #5) The "secret" links generated for external sharing use predictable or sequential identifiers, allowing an attacker to enumerate URLs and discover links to other private documents.
Spoofing	Process: Thumbnail Service	(Threat #6) The `Thumbnail Service` can be tricked into downloading a file from an arbitrary URL supplied by an attacker instead of from the internal file storage (Server-Side Request Forgery - SSRF), allowing the attacker to scan the internal network.
Tampering	Data Store: User Database	(Threat #7) An attacker with SQL injection or direct database access modifies a user's role in the database from `Member` to `Admin`, gaining full administrative control.

Step 3: Rate the Threats (What Do We Fix First?)

After identifying a multitude of potential threats, we face a critical challenge: we can't fix everything at once. Resources (time, budget, personnel) are always limited. Therefore, we must prioritize the threats based on their potential impact and likelihood of occurrence. A widely used model for this is DREAD, which provides a quantitative way to assess and compare risks.

From our list generated in Step 2, let's select two distinct and highly illustrative threats to analyze and prioritize:

Threat #1: Information Disclosure due to no HTTPS.
Threat #3: Repudiation due to no audit logs.

DREAD evaluates each threat across five dimensions, typically on a scale of 1 (low) to 10 (high):

Damage Potential: How bad would it be if this threat were successfully exploited? (e.g., minor data loss vs. full system compromise)
Reproducibility: How easy is it for an attacker to reliably reproduce the conditions necessary for the attack? (e.g., requires specific timing vs. happens every time)
Exploitability: How much skill or effort is required to perform the attack? (e.g., expert hacker vs. script kiddie with automated tools)
Affected Users: How many users would be impacted by a successful exploit? (e.g., a single user vs. the entire user base)
Discoverability: How easy is it for an attacker to find this vulnerability or weakness? (e.g., hidden deep in code vs. immediately visible)

The total DREAD score is typically the average of these five ratings, with a higher score indicating a more critical threat requiring immediate attention.

Threat Analysis 1: Information Disclosure due to no HTTPS (Threat #1)

This threat describes a scenario where an attacker intercepts user credentials because the login or application pages are served over unencrypted HTTP, allowing for easy snooping of sensitive data over a public network.

Damage Potential (D): 8
- Justification: A successful MITM attack on an HTTP login page leads directly to credential compromise, which can result in full account takeover. This gives the attacker access to all of the user's documents and potentially other sensitive functions. This is a severe, high-impact outcome.
Reproducibility (R): 9
- Justification: If a site serves content over HTTP (especially a login form), an attacker on the same local network (e.g., public Wi-Fi) can reliably intercept traffic. Tools for setting up such attacks are common and effective, making this highly reproducible.
Exploitability (E): 7
- Justification: While not a one-click exploit, there are numerous readily available tools (like Wireshark, MITMproxy, SSLStrip) and public guides that significantly lower the bar for executing this type of attack. It doesn't require advanced zero-day expertise.
Affected Users (A): 3
- Justification: This attack typically targets users on a specific compromised network segment or in a particular physical location. While not impacting all users globally, it can impact any user connected to that network, potentially affecting a significant portion of a small business's workforce using a shared office network.
Discoverability (D): 10
- Justification: The absence of HTTPS is trivially discoverable. Any user or attacker simply needs to look at the URL bar in their browser (which displays "Not Secure" or lacks the padlock icon). Automated scanners also flag this immediately. It requires no special effort to find.

Threat #1 - Average Risk Score: (8+9+7+3+10) / 5 = 7.4 (High Priority)

Threat Analysis 2: Repudiation due to no audit logs (Threat #3)

This threat describes a situation where an Admin user performs a critical action (like deleting sensitive documents or user accounts) and later falsely claims they did not perform the action, and the system lacks the necessary immutable audit trails to prove otherwise.

Damage Potential (D): 8
- Justification: The business impact can be severe. Losing the ability to prove who performed a critical action can lead to legal disputes, regulatory non-compliance, financial penalties, internal mistrust, and inability to perform forensic investigations.
Reproducibility (R): 9
- Justification: If the audit logging mechanism for specific actions is non-existent or easily tampered with, an attacker (or rogue insider) can perform the action and deny it 100% of the time. The lack of a log is a consistent flaw.
Exploitability (E): 2
- Justification: This threat requires virtually no technical skill to "exploit." The "attacker" is a legitimate user simply using the application's existing functionality and then lying about it, knowing there's no proof.
Affected Users (A): 5
- Justification: While it might involve a single Admin user's action, the impact often cascades. If a crucial document is deleted without accountability, it can disrupt an entire team, a project, or even the whole business, affecting many indirect users.
Discoverability (D): 2
- Justification: This is a subtle, hidden architectural weakness. It's not something an attacker can "find" by probing the application directly. It's usually discovered only after an incident occurs and an investigation attempts (and fails) to find evidence in logs.

Threat #3 - Average Risk Score: (8+9+2+5+2) / 5 = 5.2 (Medium Priority)

Prioritization Conclusion:
Based on our DREAD analysis, Threat #1 (Information Disclosure due to no HTTPS) emerges as the highest priority with a score of 7.4. While Threat #3 (Repudiation) is also significant, the immediate and widespread exploitability and discoverability of unencrypted communications make Threat #1 a more urgent and easily exploitable risk that must be addressed first.

Step 4: Determine Countermeasures (How Do We Fix It?)

The final step is to design and implement countermeasures (or mitigations) to address the identified and prioritized threats. A robust security strategy emphasizes defense-in-depth, employing multiple layers of controls across different categories: prevention, detection, and response.

Let's focus on our highest-priority threat: Threat #1: Information Disclosure due to no HTTPS.

Strategy	Countermeasure
Prevention	1. Enforce HTTPS Everywhere: This is the most critical and fundamental step. Configure all web servers (frontend and API) to serve only HTTPS traffic. Implement server-side redirects (HTTP Strict Transport Security - HSTS) to automatically forward any HTTP request to its HTTPS equivalent. HSTS specifically instructs browsers to always connect via HTTPS for your domain, even on subsequent visits, preventing downgrade attacks.
Reduce Impact (Resilience)	2. Implement Multi-Factor Authentication (MFA): Even if an attacker somehow manages to steal a username and password (e.g., via phishing, not just MITM), MFA adds a crucial second layer of verification (e.g., a code from an authenticator app, a biometric scan). This significantly raises the bar for an attacker to gain access, drastically reducing the impact of compromised credentials. 3. Utilize Secure and HttpOnly Flags for Cookies: Ensure all session tokens and other sensitive cookies are set with the `Secure` flag (cookies are only sent over HTTPS) and the `HttpOnly` flag (client-side JavaScript cannot access the cookie, mitigating XSS risks).
Detection	4. Implement Robust Logging & Anomaly Detection: Integrate all authentication logs (successful logins, failed attempts, password resets) into a centralized Security Information and Event Management (SIEM) system. Configure alerts for suspicious patterns, such as: - Multiple failed login attempts from a single IP address (brute-force/credential stuffing). - Impossible travel (logins from geographically distant locations in a short timeframe). - Logins from unknown or suspicious IP ranges.
Response	5. Develop a Comprehensive Incident Response (IR) Plan: A detailed IR plan is essential. This plan should clearly outline steps to take upon detection of a credential compromise or MITM attack. This includes: - Immediate invalidation of the compromised session token. - Forcing a password reset for the affected user. - Notifying the user of the potential compromise. - Escalation paths and communication protocols for internal teams and, if necessary, external authorities.

Concluding Thoughts

Threat modeling is a continuous, living process, not a one-time exercise. By integrating it into your development lifecycle, you transform security from a reactive, often panicked, afterthought into a proactive, engineering discipline. This structured approach forces us to challenge our assumptions, anticipate attacker behavior, and build security directly into the design, leading to more resilient, trustworthy, and ultimately successful applications. It saves time, reduces costs, and most importantly, protects your users and your business.

What Other Techniques Can Be Used for Threat Modeling?

While the STRIDE and DREAD frameworks are highly effective and beginner-friendly, the landscape of threat modeling methodologies is rich and diverse. Each offers a slightly different lens through which to view your system's security:

PASTA (Process for Attack Simulation and Threat Analysis): This is a risk-centric, seven-stage methodology that guides you from understanding business objectives and technical scope to simulating attacks and managing identified risks. It's more comprehensive and business-driven than STRIDE.
VAST (Visual, Agile, and Simple Threat modeling): Designed to integrate seamlessly into Agile and DevOps workflows, VAST emphasizes visual diagrams (like DFDs and process flow diagrams) and aims to be understandable by both technical and non-technical stakeholders across the organization.
Trike: A methodology that focuses heavily on defining security requirements, developing threat models, and using them as a security auditing tool. It has a strong emphasis on establishing clear data trustworthiness and authorization rules.
CVSS (Common Vulnerability Scoring System): While not a threat modeling methodology itself, CVSS is the industry standard for rating the severity of discovered vulnerabilities. It provides a standardized, objective framework (metrics for exploitability, impact, environmental factors) that can be used as an alternative or complementary approach to DREAD for prioritizing risks once identified.
Attack Trees: A formal, hierarchical diagramming technique that breaks down a high-level attack goal into more specific sub-goals, which are then broken down further. They help visualize all possible ways an attack could succeed.
Kill Chains (Cyber Kill Chain): Originally developed by Lockheed Martin, this framework describes the stages of a successful cyberattack (reconnaissance, weaponization, delivery, exploitation, installation, command & control, actions on objectives). It's more of an operational security framework but can inform threat modeling by helping visualize attacker progression.

Exploring these various techniques will broaden your threat modeling toolkit and allow you to select the best approach for the specific context of your project.

DEV Community