DEV Community: Khushi Singla

🎤 Crack Interviews with AI: Building a Voice-Powered Mock Interview Simulator

Khushi Singla — Tue, 12 May 2026 06:44:54 +0000

What I Built

AI Voice Interview Simulator is an intelligent, voice-enabled mock interview platform that helps job seekers practice interviews with real-time feedback, emotion analysis, and performance tracking.

Most interview preparation platforms today are text-based and fail to simulate real conversational pressure or provide actionable feedback on communication skills.

This project solves that by creating a voice-based AI interview experience that:

conducts real-time mock interviews
asks adaptive questions based on responses
evaluates interview performance
analyzes emotional signals
tracks progress over time

The goal was to build a more realistic and interactive interview preparation platform rather than a static chatbot experience.

Features

🎤 Voice Interview Simulation

AI asks questions using text-to-speech
Users can answer using their microphone
Automatic speech transcription using Whisper

🧠 Dynamic AI Questioning

The interview flow adapts based on previous answers.

The platform supports:

General interviews
Technical interviews
HR interviews
Sales interviews

Difficulty levels:

Entry Level
Mid Level
Senior Level

The system also supports resume-based personalized interview generation.

📊 Real-Time Performance Scoring

After every response, the platform evaluates:

Clarity
Confidence
Relevance

Users receive instant AI-generated feedback after each answer.

🎭 Emotion Analysis

The system analyzes emotional signals from candidate responses and detects:

confidence
nervousness
enthusiasm
hesitation

This helps users understand both technical and communication performance.

📈 Interview History Dashboard

The platform stores previous interview sessions and allows users to:

review past interviews
track score improvements
analyze trends over time
revisit previous feedback

🔄 Resume Upload Support

Users can upload resumes in PDF format.

The AI then generates personalized interview questions based on:

skills
projects
experience
technologies mentioned in the resume

Demo

🌐 Live Applications

🎥 Demo Video

tys-tdxs-dvd (2026-04-28 20_17 GMT+5_30).mp4 - Google Drive

drive.google.com

Tech Stack

Component	Technology
Frontend	Streamlit
Workflow Automation	n8n
LLM	Groq (llama-3.1-8b-instant)
Voice Output	Lemonfox TTS
Voice Input	Lemonfox Whisper STT
Emotion Analysis	Groq LLM
Database	Supabase (PostgreSQL)
Backend Deployment	Railway
Frontend Deployment	Streamlit Cloud

Architecture

Workflow 1 — AI Interview

Receives user answer and conversation history
Sends context to AI workflow
Generates next interview question dynamically

Workflow 2 — Answer Scorer

Receives question and candidate answer
Scores clarity, confidence, and relevance
Returns structured feedback

How It Works

User Starts Interview
            ↓
AI Generates Question
            ↓
User Answers via Voice/Text
            ↓
Speech-to-Text Processing
            ↓
AI Evaluation + Emotion Analysis
            ↓
Feedback + Next Question
            ↓
Session Stored in Database

Challenges Faced

Maintaining Conversational Flow

One challenge was preserving enough interview context so that follow-up questions felt natural and relevant.

Real-Time Voice Processing

Handling speech-to-text conversion while maintaining smooth interview flow required careful workflow orchestration.

Consistent Performance Evaluation

Interview scoring can be subjective, so tuning prompts for balanced evaluation across different answer styles required multiple iterations.

Future Improvements

Planned enhancements include:

real-time voice emotion detection
multilingual interview support
company-specific interview modes
coding round simulations
analytics dashboards
progress visualizations

Conclusion

Building AI Voice Interview Simulator demonstrated how conversational AI and voice workflows can improve interview preparation experiences.

The most exciting part of the project was creating adaptive interview interactions that respond dynamically to user answers while also providing structured performance feedback and emotion analysis.

The combination of voice interaction, AI evaluation, and adaptive questioning helped create a more realistic interview simulation platform.

Building a Safe AI Database Assistant with Azure OpenAI, LangChain & Function Calling

Khushi Singla — Fri, 19 Dec 2025 06:50:06 +0000

From raw CSVs to a production-ready AI assistant that queries data safely — without hallucinating SQL.

In this post, I’ll walk through how I built an AI-powered data analyst using:

Azure OpenAI
LangChain
LangGraph
Function Calling
SQLite

The assistant can:

Analyze CSV data using pandas
Query a SQL database safely
Choose predefined backend functions automatically
Explain results clearly
Avoid hallucinations and unsafe SQL

🧩 Problem Statement

When working with AI models and databases, common problems include:

❌ Hallucinated SQL queries
❌ Unsafe eval or raw SQL execution
❌ No control over what the model can access
❌ No explanation of how results were computed

Goal:
Build an AI assistant that:

Answers analytical questions about COVID data
Uses only allowed tools
Never guesses
Explains every answer

📊 Dataset

We use the COVID all-states history dataset, which includes:

state
date
hospitalizedIncrease
positiveIncrease
…and more

The dataset is first used as:

A pandas DataFrame
A SQLite database

🧱 Architecture Overview

User Question
     ↓
Azure OpenAI (Assistant / LangChain)
     ↓
Tool Selection (Function / SQL / DataFrame)
     ↓
Safe Backend Execution
     ↓
Result
     ↓
Final Explanation

Key idea:

The model decides WHAT to do.
Your backend decides HOW it is done.

🔹 Part 1: Talking to Azure OpenAI via LangChain

We start by connecting to Azure OpenAI using AzureChatOpenAI:

llm = AzureChatOpenAI(
    azure_endpoint="https://<your-endpoint>.cognitiveservices.azure.com/",
    api_key="YOUR_API_KEY",
    api_version="2024-12-01-preview",
    model="gpt-4o-mini"
)

A simple sanity check:

response = llm.invoke([
    HumanMessage(content="Hello, Azure OpenAI via LangChain!")
])
print(response.content)

🔹 Part 2: DataFrame Agent (CSV Analysis)

We load the CSV into pandas and expose controlled computation via a tool.

DataFrame Tool

@tool
def run_df(query: str) -> str:
    """Run Python code on the global dataframe `df` and return the result."""
    return str(eval(query))

⚠️ Note: In production, replace eval with a restricted execution layer.

Enforcing Tool Usage

llm_with_tools = llm.bind_tools([run_df])

The prompt forces the model to:

Use the tool
Perform actual pandas calculations
Explain results

🔹 Part 3: Moving from CSV → SQL (SQLite)

We convert the CSV into SQLite:

engine = create_engine("sqlite:///./db/test.db")

df.to_sql(
    name="all_states_history",
    con=engine,
    if_exists="replace",
    index=False
)

Now the same dataset can be queried via SQL.

🔹 Part 4: SQL Agent with LangGraph

Using LangGraph’s ReAct agent:

agent_executor_SQL = create_react_agent(
    model=llm,
    tools=toolkit.get_tools()
)

The system prompt enforces:

Only valid tables
Only specific columns
No hallucinated values
Markdown-only output

🔹 Part 5: Function Calling (No Raw SQL)

Instead of letting the model generate SQL, we define pre-approved backend functions.

Example Functions

def get_hospitalized_increase_for_state_on_date(state_abbr, specific_date):
    ...

def get_positive_cases_for_state_on_date(state_abbr, specific_date):
    ...

Function Registry (Critical!)

FUNCTION_MAP = {
    "get_hospitalized_increase_for_state_on_date": get_hospitalized_increase_for_state_on_date,
    "get_positive_cases_for_state_on_date": get_positive_cases_for_state_on_date,
}

This ensures:

✅ Only allowed functions run
❌ No arbitrary code execution

🔹 Part 6: Azure OpenAI Function Calling (No Assistant API)

Using Chat Completions + functions:

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=messages,
    functions=functions,
    function_call="auto"
)

If the model calls a function:

Extract arguments
Route via FUNCTION_MAP
Execute backend logic
Send result back
Get final grounded answer

🔹 Part 7: Assistant API (Persistent Context)

Now we level up.

Creating the Assistant

assistant = client.beta.assistants.create(
    name="Covid Data Assistant",
    model="gpt-4o-mini",
    tools=[{"type": "function", "function": fn} for fn in functions]
)

Assistant Loop (Key Concept)

while True:
    run_status = client.beta.threads.runs.retrieve(...)

    if run_status.status == "requires_action":
        # extract function name
        # dispatch via FUNCTION_MAP
        # submit tool output

    elif run_status.status == "completed":
        break

The assistant remembers conversation context,
but never caches database results.

🧠 Key Takeaways

✅ What This Design Solves

Prevents SQL hallucinations
Enforces backend safety
Keeps AI answers grounded in data
Scales cleanly as tools grow

🧩 Mental Model

Layer	Responsibility
LLM	Reasoning & intent
Assistant	Tool selection
Backend	Data access
Function Map	Security

🎯 When to Use What?

Use Case	Best Choice
One-shot queries	Chat + function calling
Multi-turn analysis	Assistant API
CSV exploration	DataFrame tools
Production DB	Predefined SQL functions

🚀 Final Thoughts

This approach mirrors how real production AI systems are built:

AI decides what
Backend controls how
Data remains authoritative
Explanations remain transparent

Connect With Me

Let’s learn and build cool data science and AI projects together!

💼 LinkedIn: https://www.linkedin.com/in/singla-khushi/
🔗 GitHub: https://github.com/KhushiSingla-tech
📩 Comments below are always welcome!

Predicting Football Player Market Value with a Simple ML Pipeline (Pandas + Scikit-Learn)

Khushi Singla — Mon, 10 Nov 2025 09:15:40 +0000

In this project, I explore how to predict football player market value using a clean and simple ML pipeline based on Python, Pandas, Seaborn, and Scikit-Learn.

📌 Full code and notebook available on GitHub:
👉 https://github.com/KhushiSingla-tech/Football-player-price-pridiction

Dataset & Setup

We start by loading a local data.csv.

import numpy as np 
import matplotlib.pyplot as plt 
import pandas as pd
import seaborn as sns

dataset = pd.read_csv('data.csv')

First look:

dataset.head()
dataset.columns
dataset.describe()
dataset.shape
dataset.dtypes
dataset['nationality'].value_counts()

Tip: keep an eye on data types and missing values. position_cat should already be numeric in this workflow—if it weren’t, we’d need to encode it first.

Quick EDA

Below are the core visuals I generated. I’m including placeholders so you can drop screenshots from your notebook. Keep the titles the same to stay consistent.

1. Name vs Age (top 50)

plt.figure(figsize=(10,6))
graph = sns.barplot(x='name', y='age', data=dataset[:50], palette="rocket")
graph.set(xlabel="Name", ylabel="Age", title="Name VS Age")
graph.set_xticklabels(graph.get_xticklabels(), rotation=90)
sns.set_context('talk'); sns.despine(); plt.show()

2. Members per Club

plt.figure(figsize=(10,6))
graph = sns.countplot(x='club', data=dataset, palette="vlag")
graph.set(xlabel="Club", ylabel="Member", title="Members per club")
graph.set_xticklabels(graph.get_xticklabels(), rotation=90)
sns.set_context('talk'); sns.despine(); plt.show()

3. Name vs Market Value (top 50)

plt.figure(figsize=(16,6))
graph = sns.barplot(x='name', y='market_value', data=dataset[:50], palette="colorblind")
graph.set(xlabel="Name", ylabel="Market Value", title="Name VS Market Value")
graph.set_xticklabels(graph.get_xticklabels(), rotation=90)
sns.set_context('notebook'); sns.despine(); plt.show()

4. Name vs Position Category (top 50)

plt.figure(figsize=(16,6))
graph = sns.pointplot(x='name', y='position_cat', data=dataset[:50], palette="deep")
graph.set(xlabel="Name", ylabel="Position category", title="Name VS Position Category")
graph.set_xticklabels(graph.get_xticklabels(), rotation=90)
sns.set_context('talk'); sns.despine(); plt.show()

5. Name vs Region (top 50)

plt.figure(figsize=(16,6))
graph = sns.pointplot(x='name', y='region', data=dataset[:50], palette="rocket")
graph.set(xlabel="Name", ylabel="Region", title="Name VS Region")
graph.set_xticklabels(graph.get_xticklabels(), rotation=90)
sns.set_context('poster'); sns.despine(); plt.show()

6. Players by Nationality

plt.figure(figsize=(20,6))
graph = sns.countplot(x='nationality', data=dataset, palette="muted")
graph.set(xlabel="Nationality", ylabel="Players", title="No. of players amoung different nationality")
graph.set_xticklabels(graph.get_xticklabels(), rotation=90)
sns.set_context('paper'); sns.despine(); plt.show()

7. Players by Region

graph = sns.countplot(x='region', data=dataset, palette="vlag")
graph.set(xlabel="Region", ylabel="Players", title="No. of players amoung various regions")
sns.set_context('paper'); sns.despine(); plt.show()

8. Name vs FPL Points (top 50)

plt.figure(figsize=(16,6))
graph = sns.barplot(x='name', y='fpl_points', data=dataset[:50], palette="pastel")
graph.set(xlabel="Name", ylabel="FPL Points", title="Name VS FPL points")
graph.set_xticklabels(graph.get_xticklabels(), rotation=90)
sns.set_context('poster'); sns.despine(); plt.show()

9. Name vs FPL Value (top 50)

plt.figure(figsize=(16,6))
graph = sns.pointplot(x='name', y='fpl_value', data=dataset[:50], palette="dark")
graph.set(xlabel="Name", ylabel="FPL Value", title="Name VS FPL value")
graph.set_xticklabels(graph.get_xticklabels(), rotation=90)
sns.set_context('notebook'); sns.despine(); plt.show()

10. New Foreign (Count)

graph = sns.countplot(x='new_foreign', data=dataset, palette="dark")
graph.set(xlabel="New Foreign", ylabel="Amount", title="How many are new signing from a different league")
sns.set_context('notebook'); sns.despine(); plt.show()

11. New Foreign (By Name)

plt.figure(figsize=(20,6))
graph = sns.pointplot(x='name', y='new_foreign', data=dataset[:100], palette="dark")
graph.set(xlabel="Name", ylabel="New Foreign", title="Whether a new signing from a different league")
graph.set_xticklabels(graph.get_xticklabels(), rotation=90)
sns.set_context('notebook'); sns.despine(); plt.show()

12. New Signing (Count)

graph = sns.countplot(x='new_signing', data=dataset, palette="rocket")
graph.set(xlabel="New Signing", ylabel="Amount", title="How many are new signing ")
sns.set_context('notebook'); sns.despine(); plt.show()

13. New Signing (By Name)

plt.figure(figsize=(20,6))
graph = sns.pointplot(x='name', y='new_signing', data=dataset[:100], palette="bright")
graph.set(xlabel="Name", ylabel="New Signing", title="Whether a new signing")
graph.set_xticklabels(graph.get_xticklabels(), rotation=90)
sns.set_context('notebook'); sns.despine(); plt.show()

Feature Selection

For modeling, I use the following five predictors:

dataset = pd.read_csv('data.csv') 
X = dataset[['age', 'fpl_value', 'fpl_points', 'page_views', 'position_cat']]
Y = dataset['market_value']

Why these?

age – price typically varies with age/prime years.
fpl_value, fpl_points – performance and fantasy value often correlate with perceived market value.
page_views – a soft proxy for popularity/visibility.
position_cat – price dynamics differ by position.

Train/Test Split + Scaling

from sklearn.model_selection import train_test_split 
X_train, X_test, Y_train, Y_test = train_test_split(
    X, Y, test_size=0.2, random_state=0
)

from sklearn.preprocessing import StandardScaler 
sc_X = StandardScaler() 
X_train = sc_X.fit_transform(X_train) 
X_test = sc_X.transform(X_test)

Why scaling? Linear models are sensitive to feature scales; standardization helps stable coefficients and convergence.

Model: Linear Regression

from sklearn.linear_model import LinearRegression 
regressor = LinearRegression() 
regressor.fit(X_train, Y_train)

Make predictions on the test set:

Y_pred = regressor.predict(X_test)
df = pd.DataFrame({'Actual': Y_test, 'Predicted': Y_pred})
df.head()

To see predictions across the full dataset:

X1 = sc_X.transform(X) 
Y_pred1 = regressor.predict(X1)

Output: -

array([ 6.28665327e+01,  4.63344124e+01,  1.71999185e+01,  2.70542543e+01,
        1.67576333e+01,  2.31341678e+01,  3.06614290e+01,  1.26441319e+01,
        1.88859345e+01,  1.61778959e+01,  1.68413424e+01,  1.87550870e+01,
        1.53217919e+01,  2.00342164e+01,  5.03340141e+00,  9.45484552e+00,
        8.81515716e+00,  1.65135952e+01,  2.15921316e+01,  1.14508041e+01,
        4.94468838e+00,  2.20113374e+01,  5.06376105e+00,  9.02558975e+00,
        5.66032107e+00,  5.82966111e+00,  1.40691543e+01,  3.47309764e+01,
        2.54661370e+01,  3.17429798e+01,  9.84330289e+00,  6.72138221e+00,
        1.06760045e+01,  1.16738703e+01,  1.04311591e+01,  1.11660553e+01,
        4.62853856e+00,  1.29618040e+01,  8.85989508e+00,  2.93968695e+00,
        1.09260466e+01,  1.40564779e+01,  5.83211310e+00,  1.66622152e+00,
        6.92188866e+00,  5.12775349e+00,  4.77049269e+00,  8.60744252e-01,
        1.76621052e+00,  5.65940283e+00,  3.01078473e+00,  6.11320445e+00,
        4.77661984e-01,  6.86850732e+00,  4.09268232e+00,  4.45776943e+00,
       -1.21186324e+00, -4.32716636e+00,  2.27473673e+00, -1.88866900e+00,
        1.57521202e+00,  1.59605849e+00,  9.93010176e+00,  1.14337555e+00,
        4.70114334e-01, -5.69705594e-01,  5.49109710e+00,  2.42127613e+00,
        1.60229262e+00,  1.68421329e+00,  6.44917242e+00,  5.78379753e+00,
        9.27755719e-01,  1.58683879e+00,  1.39307739e+01,  1.21129834e+01,
        1.60684486e+01,  6.89538099e+00,  5.06714267e+00,  6.16871528e+00,
        7.97262607e+00,  1.09000523e+01,  6.59496956e+00,  8.34852473e+00,
        7.41193892e-01,  2.90238204e+00,  4.10618827e+00,  1.04340149e+01,
        5.29097527e+00,  1.77164410e+00,  9.12243523e-01, -1.71626246e+00,
        5.39393523e+01,  5.02586215e+01,  2.33287914e+01,  3.38398821e+01,
        2.27397252e+01,  2.76023255e+01,  2.06741282e+01,  2.40857909e+01,
        2.55614322e+01,  1.97780734e+01,  2.45654940e+01,  1.00361670e+01,
        2.12049333e+01,  8.44089635e+00,  2.72273014e+01,  1.32175831e+01,
        1.00032029e+01,  2.07556329e-02,  1.34878506e+01,  8.99831431e+00,
        2.46911471e+01,  2.84420802e+01,  1.37693399e+01,  1.39753539e+01,
        4.14940785e+00,  7.89296807e+00,  4.59603873e+00,  9.33580921e+00,
        9.17322247e+00,  5.91166865e+00,  7.20501161e+00,  1.62985464e+00,
        9.77106295e+00,  5.24199276e+00,  6.77506385e+00, -1.10826023e-01,
        6.70876820e+00,  1.51229913e+00,  3.64913411e+00,  5.86341313e+00,
       -1.71934842e+00,  2.64918075e+01,  1.53767278e+01,  2.08364791e+01,
        1.28360458e+01,  1.42769184e+01,  1.83438607e+01,  8.39739780e+00,
        1.54972711e+01,  9.24879295e+00,  8.84735296e+00,  4.27543923e+01,
        1.00334222e+01,  6.67323145e+00,  4.24451728e+00,  1.64599406e+01,
        8.93319695e+00,  1.36017222e+01,  9.23623029e+00,  4.75455136e+00,
        5.70086565e+00,  5.26865228e+00,  6.05140101e+00,  1.27525174e+01,
        3.33916737e+00,  5.90391575e+00,  2.80368956e+00,  1.71560146e+01,
        1.77938304e+01,  4.82038725e+00,  4.59853329e+00,  3.02243318e+00,
        2.64504042e+00,  6.44877606e+00,  3.59123084e+00, -8.69915834e-01,
        2.93424055e+00, -2.94119546e+00,  3.14294737e+00,  3.46717211e+00,
        8.45537926e+00,  2.31326804e+00,  5.60399756e-01,  3.76573016e+00,
        1.30656073e-01,  1.83081491e+00, -2.04944876e+00,  2.08635844e-01,
       -6.24129449e-01,  8.17555691e+00,  9.17157126e+00,  7.59654639e+00,
        5.81263224e+00,  2.15563645e+00, -4.78803512e-01,  4.23712894e+00,
        8.80441122e+00,  3.50444501e+01,  3.07869954e+01,  1.59727357e+01,
        9.11841002e+00,  1.07657645e+01,  9.27215255e+00,  6.23564852e+00,
        1.96676802e+01,  1.10408239e+01,  8.46461158e+00,  4.94055844e+00,
        6.76284129e+00,  1.13816079e+01,  1.06885705e+01,  6.05253148e+00,
        2.99910714e+00,  1.44952259e+01,  3.61337549e+00,  2.60868278e+00,
        7.41538690e+00,  2.92184993e+00,  4.45153531e+00,  3.78618026e+00,
        8.64074595e+00,  3.50352917e+01,  4.00189086e+01,  4.30005449e+01,
        2.43905161e+01,  2.12379487e+01,  2.52051293e+01,  1.54203647e+01,
        1.33076223e+01,  1.40963410e+01,  1.33599626e+01,  1.71506799e+01,
        2.39610467e+01,  1.30875717e+01,  2.62851963e+01,  4.77089334e+00,
        5.82049375e+00,  1.00155278e+01,  1.43421271e+01,  8.23561008e+00,
        5.30518579e+00,  7.40927061e+00,  5.53990619e+00,  7.30884677e+00,
        5.55237174e+00,  2.70476182e+01,  7.80253031e+00,  6.98958048e+00,
        4.33994048e+01,  5.54702805e+01,  3.20615354e+01,  2.21769627e+01,
        2.53293890e+01,  3.42494372e+01,  1.31454153e+01,  1.21667107e+01,
        1.97419795e+01,  6.30015138e+00,  8.82081099e+00,  5.06400874e+01,
        1.57180792e+01,  1.44266939e+01,  2.25336943e+01,  1.35383098e+01,
        5.05587301e-01,  3.04413219e+00,  1.27785472e+01,  2.29855522e+01,
        5.81554453e+01,  2.31388392e+01,  2.16422756e+01,  5.10021411e+01,
        2.18816842e+01,  2.28615104e+01,  1.47747468e+01,  1.52930900e+01,
        3.17042563e+01,  1.46778860e+01,  3.35870220e+01,  3.10865733e+01,
        1.64592195e+01,  1.69076901e+01,  1.11555766e+01,  1.29238188e+01,
        9.61917212e+00,  1.25204674e+01,  5.30423360e+00,  5.54871109e+00,
        1.00654973e+01,  4.98946114e+00,  7.42955630e+00,  6.31904380e+00,
        1.30403887e+01,  1.00392522e+00,  6.06577036e+00,  6.72085484e+00,
        3.73134019e+00,  4.18149614e+00,  4.22383012e+00,  1.91381534e+00,
       -1.28088146e+00,  3.41045955e+00,  1.79497319e+00,  1.05829507e+01,
        9.43534298e+00,  3.28532558e+00,  1.35250312e+00,  4.38494855e+00,
        2.07244010e+00,  1.75906926e+00,  1.43509874e+01,  6.87198964e+00,
        6.13033140e+00, -2.33888264e+00,  1.35990916e+01,  1.80634481e+01,
        1.46927448e+01,  1.44239848e+01,  1.12612981e+01,  1.29791664e+01,
        6.24648586e+00,  9.03304108e+00,  5.25226415e+00,  1.51571160e+01,
        1.41645531e+01,  1.01954205e+01,  9.91526470e+00,  1.17886001e+01,
        3.27217678e+00,  5.94602737e+00,  2.11646419e+01,  4.44317689e+00,
        6.10421380e+00,  2.47182931e+00,  9.65090120e-01,  5.44983836e+00,
        5.29387882e+00,  1.22565818e+01,  1.78140601e+01,  7.59876326e+00,
        9.32675979e+00,  5.76379440e+00,  9.09217109e+00,  1.39168840e+01,
        9.92486243e+00,  1.19482597e+00,  5.87981968e+00,  3.78964322e+00,
        4.60747657e+00,  6.09054273e+00,  7.18585630e+00,  5.48772878e+00,
        1.01458041e+01,  2.99328230e+00,  1.21505520e+01,  3.70836344e+00,
        1.08663018e+01,  3.86679739e+00,  7.27572286e+00,  2.95082669e+01,
        2.28551986e+01,  8.11729167e+00,  1.09926147e+01,  1.23986210e+01,
        6.04083954e+00,  5.54532532e+00,  5.13323449e+00,  8.13637450e+00,
        5.03507346e+00,  6.15626580e+00,  5.97910234e+00,  5.35986738e+00,
       -2.24795766e-01,  6.53422485e+00, -1.30726559e+00,  4.22683927e+00,
        3.46130466e+00,  2.62146976e+00,  5.16180438e+00, -6.12786456e-01,
        1.87310775e+00,  2.18184157e+00,  6.84458483e-01,  1.15822758e+01,
        5.43069136e+01,  6.54671081e+01,  3.79330854e+01,  3.21481995e+01,
        1.72037115e+01,  1.59411311e+01,  1.74161390e+01,  1.11834417e+01,
        1.28956970e+01,  1.40528607e+01,  1.94235419e+01,  9.06373403e+00,
        2.21515945e+01,  1.22072454e+01,  1.16944338e+01,  9.81890962e+00,
        1.34160836e+01,  6.47409346e+00,  8.09325423e+00,  5.34652950e+00,
        1.15204048e+01,  1.72578040e+01,  6.88453524e+00,  7.38962753e+00,
        3.82242989e+00,  4.63910853e+00,  3.52678463e+00,  4.17896473e+00,
        7.89789526e+00,  8.12813073e+00,  1.27843162e+00,  9.97198566e+00,
        8.03247501e+00,  6.45394674e+00,  5.12676355e+00,  3.96953706e+00,
        9.13592851e+00,  1.70473040e-01,  6.70820319e+00,  4.63104526e+00,
        4.59259892e+00,  3.44236036e+00,  2.69420533e+00,  6.32927942e+00,
        7.78319819e+00,  1.53732744e+01,  1.28845013e+01,  7.87680461e+00,
        1.26908110e+01,  9.99255503e+00,  8.46245403e+00,  8.29923735e+00,
        8.85993621e+00,  8.23448929e+00,  8.17877211e+00,  1.09487410e+01,
        2.08215886e+00,  4.43309931e+00,  5.53899424e+00,  3.45700961e+00,
        4.43442418e+00, -3.65736914e-01,  3.47071664e+00,  1.49935272e+01,
        1.97778766e+01,  2.34133710e+01,  9.59576755e+00,  9.65451702e+00,
        1.81178362e+01,  7.31740896e+00,  8.88841637e+00,  7.79515304e+00,
        3.52915171e+00,  1.30045353e+01,  6.92258405e+00,  8.99422402e+00,
        4.63065558e+00,  7.37182795e+00,  4.50268052e+00,  7.47405175e+00,
        5.66246456e+00,  6.41619652e+00,  6.22417826e+00,  3.21996267e+00,
        5.11293760e+00])

Evaluation: 10-Fold Cross-Validation

from sklearn.model_selection import cross_val_score 
accuracy = cross_val_score(estimator=regressor, X=X_train, y=Y_train, cv=10)
print(accuracy.mean())
print(accuracy.std())

Metric: By default, LinearRegression with cross_val_score uses the estimator’s .score() which is R².
Report:
- CV Mean R²: {{CV_MEAN_R2}}
- CV Std: {{CV_STD_R2}}

If you prefer error metrics, add: from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score and compute MAE/RMSE/R² on the held-out test set.

What I Learned

Even a small, clean feature set can drive a reasonable baseline.
fpl_value and fpl_points usually show strong signal; if you have access to richer performance data (minutes, xG, assists/90, age-curve features), add them.
page_views captures attention, which influences pricing; try other popularity proxies.
Consider regularized models (Ridge/Lasso) or tree ensembles (RandomForest, XGBoost) and compare CV scores.
Plot residuals vs. predicted to check for systematic under/over-valuation (especially on very high-value players).

Follow-Up Questions

I’d love to hear your thoughts!

Which additional features would you include to improve prediction accuracy?
Do you think football player value is more influenced by performance or popularity metrics?
Would you like to see a version of this project using RandomForest/XGBoost?
Should I deploy this model as an interactive web app where you can enter player stats and get predictions?

Feel free to comment below — I’d love to discuss and expand this project further!

Connect With Me

Let’s learn and build cool data projects together!

💼 LinkedIn: https://www.linkedin.com/in/singla-khushi/
🔗 GitHub: https://github.com/KhushiSingla-tech
📩 Comments below are always welcome!

Seamless API Test Automation: Integrating Karate Framework with Jenkins

Khushi Singla — Thu, 29 May 2025 08:29:02 +0000

Ensuring your APIs remain reliable with every new release can be time-consuming—unless you automate it. Integrating Karate, a powerful API testing framework, with Jenkins, your go-to CI/CD server, can drastically reduce manual testing efforts and catch issues early in the development cycle.

In this blog post, you’ll learn how to set up and run Karate tests in Jenkins, create beautiful reports, and leverage advanced configuration to fit your workflow.

What is Karate?

Karate is an open-source API test automation framework that combines API test-automation, mocks, performance-testing, and even UI automation into a single unified framework. Built on top of Cucumber, it uses Gherkin syntax for writing test scenarios in plain language, making it ideal for both developers and testers.

Why Integrate Karate with Jenkins?

Here's why this duo is a win-win:

Automated Testing — Run API tests automatically after each commit.
Early Bug Detection — Catch regressions before they hit production.
Efficient Workflow — Save time by reducing manual testing.
Real-Time Feedback — Get detailed, instantly available test reports.
Improved Code Quality — Continuous testing ensures confidence in code changes.
Scalability — Supports parallel testing for large test suites.
Toolchain Friendly — Plays nicely with other Jenkins plugins (like JUnit, HTML Publisher).
Centralized Management — Control everything from Jenkins' UI.

Karate and Jenkins Integration & Execution Flow

Integration Steps (Setup Phase)

Execution Steps (Per Build)

Prerequisites

Make sure the following are in place before you begin:

Jenkins installed and running
Plugins: Pipeline, Git, and JUnit
A Karate test project (Maven-based)
Your code hosted in a version control system (e.g., GitHub)

Jenkins + Karate Integration: Step-by-Step

Step 1. Create a New Jenkins Pipeline

Open Jenkins and log in.
Click on “New Item” from the dashboard.
Enter a name for your pipeline job.
Select “Pipeline” and click OK.

Step 2. Configure the Pipeline

Option 1: Using Inline Script
Paste the following pipeline script under the Pipeline section:

pipeline {
    agent {
        node {
            label 'spot_security'
        }
    }
    tools {
        maven 'sonar-maven'
        jdk 'OpenJDK-11'
    }
    stages {
        stage('Karate Test') {
            steps {
                script {
                    try {
                        sh "mvn test -Dtest=<TestRunner> -s mvnsettings.xml"
                    } catch (Exception e) {
                        currentBuild.result = 'UNSTABLE'
                    }
                }
            }
        }
    }
}

Option 2: Using Jenkinsfile from SCM
Select "Pipeline script from SCM"

Choose Git as SCM
Set Repository URL: https://github.com/your-org/your-repo.git
Set Script Path: Jenkinsfile

Step 3. Add Karate to your pom.xml

<dependencies>
    <dependency>
        <groupId>com.intuit.karate</groupId>
        <artifactId>karate-junit5</artifactId>
        <version>1.4.0</version> <!-- Or latest -->
        <scope>test</scope>
    </dependency>
</dependencies>

Advanced Configurations

1. Passing Environment Variables

Want to inject dynamic values (e.g., base URLs) into your tests?
Update the pipeline like so:

stage('Karate Test') {
    steps {
        script {
            try {
                sh "mvn test -Dtest=<TestRunner> -Dkarate.env=${params.TEST_ENV} -s mvnsettings.xml"
            } catch (Exception e) {
                currentBuild.result = 'UNSTABLE'
            }
        }
    }
}

Ensure your karate-config.js handles karate.env appropriately.

2. Publish Karate Test Report in Jenkins

To generate a nice HTML report for each test run, use the publishHTML plugin.

stage('Publish Karate Report') {
    steps {
        script {
            try {
                publishHTML(target: [
                    allowMissing: false,
                    alwaysLinkToLastBuild: true,
                    keepAll: true,
                    reportDir: 'target/cucumber-html-reports',
                    reportFiles: 'overview-features.html',
                    reportName: 'Karate Test Report'
                ])
            } catch (Exception e) {
                currentBuild.result = 'UNSTABLE'
            }
        }
    }
}

⚠️ Make sure karate.output.path is not overridden in your project if you rely on the default report location.

Running the Pipeline

Trigger manually from Jenkins UI
Or set up automatic triggers:
- Poll SCM (e.g., H/5 * * * *)
- Webhook-based triggering from GitHub/GitLab/etc.

Once triggered:

Watch logs in Console Output
Navigate to Published Reports for visual test summaries

Conclusion

Integrating Karate with Jenkins allows teams to automate API tests as part of their CI/CD pipeline. This ensures higher code quality, early bug detection, and streamlined collaboration between QA and development. With minimal setup, you’ll save hours of manual regression testing and boost release confidence.

A Developer’s Guide to API Testing with Karate Framework

Khushi Singla — Thu, 29 May 2025 06:48:49 +0000

In the world of modern software development, API testing is essential for validating backend services—ensuring they respond correctly, behave as expected, and remain reliable over time. Among the many tools available for API automation, Karate Framework stands out for its elegant syntax, powerful capabilities, and developer-friendly approach.

This guide walks you through what Karate is, why it's a compelling choice for API testing, how to write your first test, and how to integrate Karate into your CI/CD pipeline for continuous quality assurance.

What is Karate Framework?

Karate is an open-source test automation tool designed specifically for API testing. Built on top of the Cucumber JVM, it enables developers and testers to write expressive, readable tests using simple Gherkin syntax inside .feature files—no Java code required to get started.

Key Features of Karate

✅ Supports testing of REST, SOAP, GraphQL, and HTTP APIs
✅ Built-in assertions for JSON and XML payloads
✅ Data-driven testing using Scenario Outline
✅ Supports reusable test logic via call and external files
✅ Includes tools for API mocking and performance testing
✅ Parallel test execution out of the box
✅ Native support for reporting and logging
✅ Seamless integration with Maven, Gradle, and CI/CD pipelines

Why Choose Karate for API Testing?

Simplicity: Write clean and concise tests without boilerplate code.
Unified Testing Tool: Combine functional, mocking, and performance testing in one framework.
Data-Driven: Define test scenarios once and run them with multiple datasets.
Reusability: Easily modularize tests using call for maintainable test suites.
Active Community: Strong documentation and support from the open-source community.

Setting Up Karate Framework

1. Create a Maven Project

If you're using Maven, start by adding the Karate dependency to your pom.xml:

<dependency>
    <groupId>com.intuit.karate</groupId>
    <artifactId>karate-junit5</artifactId>
    <version>1.3.1</version>
    <scope>test</scope>
</dependency>

Alternatively, you can use Gradle—Karate supports both build tools.

2. Create the Directory Structure

src
└── test
    └── java
        └── api
            └── test.feature

Writing Your First Karate Test

Let’s start with a simple GET request to the GitHub API.

File: src/test/java/api/test.feature

Feature: Test GitHub API

  Scenario: Get user details
    Given url 'https://api.github.com/users/octocat'
    When method get
    Then status 200
    And match response.login == 'octocat'
    And match response.id == 583231

What’s Happening Here?

Given url sets the target API endpoint.
When method get performs a GET request.
Then status checks for HTTP 200 OK.
match asserts the content of the response.

Running Karate Tests

Create a Java test runner to execute your .feature files using JUnit 5:

import com.intuit.karate.junit5.Karate;

class ApiTestRunner {
    @Karate.Test
    Karate testAll() {
        return Karate.run("api/test").relativeTo(getClass());
    }
}

Run this test just like any JUnit test—either from your IDE or within a CI environment.

Advanced Examples

1. Data-Driven Testing

You can use Scenario Outline and an Examples table to test multiple variations of a scenario:

Feature: Test multiple GitHub users

  Scenario Outline: Validate GitHub users exist
    Given url 'https://api.github.com/users/<username>'
    When method get
    Then status 200
    And match response.login == '<username>'

  Examples:
    | username |
    | octocat  |
    | defunkt  |
    | mojombo  |

2. Sending POST Request with Headers and Params

Here’s how to test a POST endpoint with headers, query parameters, and a request body:

Feature: Create resource on mock API

  Scenario: Create a post with query params and headers
    Given url 'https://jsonplaceholder.typicode.com/posts'
    And param draft = true
    And header Authorization = 'Bearer dummyToken123'
    And header Content-Type = 'application/json'
    And request
      """
      {
        "title": "foo",
        "body": "bar",
        "userId": 1
      }
      """
    When method post
    Then status 201
    And match response.title == 'foo'
    And match response.body == 'bar'
    And match response.userId == 1

Karate in CI/CD Pipelines

Karate tests are easy to integrate into your build pipelines using Maven or Gradle. For example, a basic Maven command to run all tests:

mvn test

You can plug this into your CI workflows (GitHub Actions, Jenkins, GitLab CI, etc.) to ensure automated API validation as part of every build.

👉 Check out my next blog: Integrating Karate with Jenkins for CI/CD for a step-by-step guide on setting up Karate in a Jenkins pipeline.

Conclusion

Karate makes API testing both accessible and powerful. Whether you're validating REST endpoints, mocking responses, or setting up performance tests, Karate provides a unified solution that fits seamlessly into modern development workflows.

If you're just getting started with API automation, Karate’s readable syntax and robust features make it one of the best tools to learn and use.

DEV Community: Khushi Singla

🎤 Crack Interviews with AI: Building a Voice-Powered Mock Interview Simulator

What I Built

Features

🎤 Voice Interview Simulation

🧠 Dynamic AI Questioning

📊 Real-Time Performance Scoring

🎭 Emotion Analysis

📈 Interview History Dashboard

🔄 Resume Upload Support

Demo

🌐 Live Applications

Prototype Version

Full Application

📂 GitHub Repository

🎥 Demo Video

tys-tdxs-dvd (2026-04-28 20_17 GMT+5_30).mp4 - Google Drive

Tech Stack

Architecture

Workflow 1 — AI Interview

Workflow 2 — Answer Scorer

How It Works

Challenges Faced

Maintaining Conversational Flow

Real-Time Voice Processing

Consistent Performance Evaluation

Future Improvements

Conclusion

Building a Safe AI Database Assistant with Azure OpenAI, LangChain & Function Calling

🧩 Problem Statement

📊 Dataset

🧱 Architecture Overview

🔹 Part 1: Talking to Azure OpenAI via LangChain

🔹 Part 2: DataFrame Agent (CSV Analysis)

DataFrame Tool

Enforcing Tool Usage

🔹 Part 3: Moving from CSV → SQL (SQLite)

🔹 Part 4: SQL Agent with LangGraph

🔹 Part 5: Function Calling (No Raw SQL)

Example Functions

Function Registry (Critical!)

🔹 Part 6: Azure OpenAI Function Calling (No Assistant API)

🔹 Part 7: Assistant API (Persistent Context)

Creating the Assistant

Assistant Loop (Key Concept)

🧠 Key Takeaways

✅ What This Design Solves

🧩 Mental Model

🎯 When to Use What?

🚀 Final Thoughts

Connect With Me

Predicting Football Player Market Value with a Simple ML Pipeline (Pandas + Scikit-Learn)

Dataset & Setup

Quick EDA

Feature Selection

Train/Test Split + Scaling

Model: Linear Regression

Evaluation: 10-Fold Cross-Validation

What I Learned

Follow-Up Questions

Connect With Me

Seamless API Test Automation: Integrating Karate Framework with Jenkins

What is Karate?

Why Integrate Karate with Jenkins?

Karate and Jenkins Integration & Execution Flow

Integration Steps (Setup Phase)

Execution Steps (Per Build)

Prerequisites

Jenkins + Karate Integration: Step-by-Step

Step 1. Create a New Jenkins Pipeline

Step 2. Configure the Pipeline

Step 3. Add Karate to your pom.xml

Advanced Configurations

1. Passing Environment Variables

2. Publish Karate Test Report in Jenkins

Running the Pipeline

Conclusion

Further Reading

A Developer’s Guide to API Testing with Karate Framework

What is Karate Framework?