<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: CHANTSZCHEUK</title>
    <description>The latest articles on DEV Community by CHANTSZCHEUK (@chantszcheuk).</description>
    <link>https://dev.to/chantszcheuk</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1869540%2F0e4d2726-5d97-488d-a7a3-2b0bd80e3bf6.png</url>
      <title>DEV Community: CHANTSZCHEUK</title>
      <link>https://dev.to/chantszcheuk</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/chantszcheuk"/>
    <language>en</language>
    <item>
      <title>My Journey Learning Artificial Intelligence - Day 15</title>
      <dc:creator>CHANTSZCHEUK</dc:creator>
      <pubDate>Tue, 03 Sep 2024 12:04:29 +0000</pubDate>
      <link>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-15-1jnh</link>
      <guid>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-15-1jnh</guid>
      <description>&lt;p&gt;Today, we focused on implementing the AI-driven resume parsing feature. We used spaCy for named entity recognition (NER) to extract key information from resumes.&lt;/p&gt;

&lt;p&gt;First, we installed spaCy and downloaded the English language model:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;pip install spacy
python -m spacy download en_core_web_sm
Then, we created a new service for resume parsing:
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;





&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;# app/services/resume_parser.py
import spacy

nlp = spacy.load("en_core_web_sm")

def parse_resume(text):
    doc = nlp(text)

    parsed_data = {
        "name": [],
        "email": [],
        "phone": [],
        "skills": [],
        "education": [],
        "experience": []
    }

    for ent in doc.ents:
        if ent.label_ == "PERSON":
            parsed_data["name"].append(ent.text)
        elif ent.label_ == "EMAIL":
            parsed_data["email"].append(ent.text)
        # Add more entity extractions here

    # Custom skill extraction (simplified)
    skill_keywords = ["Python", "JavaScript", "Machine Learning", "Data Analysis"]
    parsed_data["skills"] = [keyword for keyword in skill_keywords if keyword.lower() in text.lower()]

    return parsed_data
We also created a new route to handle resume uploads:
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;





&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;# app/routes/resume.py
from flask import Blueprint, request, jsonify
from app.services.resume_parser import parse_resume

resume_bp = Blueprint('resume', __name__)

@resume_bp.route('/upload', methods=['POST'])
def upload_resume():
    if 'file' not in request.files:
        return jsonify({"error": "No file part"}), 400

    file = request.files['file']
    if file.filename == '':
        return jsonify({"error": "No selected file"}), 400

    if file:
        # Read the file content
        resume_text = file.read().decode('utf-8')

        # Parse the resume
        parsed_data = parse_resume(resume_text)

        return jsonify(parsed_data), 200
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This is a basic implementation and will need further refinement, but it's a good starting point. Tomorrow, we'll work on improving the parsing accuracy and handling different resume formats.&lt;/p&gt;

</description>
    </item>
    <item>
      <title>My Journey Learning Artificial Intelligence - Day 14</title>
      <dc:creator>CHANTSZCHEUK</dc:creator>
      <pubDate>Tue, 03 Sep 2024 12:03:07 +0000</pubDate>
      <link>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-14-4i7k</link>
      <guid>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-14-4i7k</guid>
      <description>&lt;p&gt;Today marked the beginning of our ambitious AI-powered job-seeking platform project. We started by outlining the core features:&lt;/p&gt;

&lt;p&gt;AI-driven resume parsing and analysis&lt;br&gt;
Job recommendation engine&lt;br&gt;
Automated skill matching&lt;br&gt;
Chatbot for user assistance&lt;br&gt;
We set up our development environment, choosing Python for backend development with Flask as our web framework. For the AI components, we decided to use TensorFlow and spaCy for natural language processing tasks.&lt;/p&gt;

&lt;p&gt;Here's our initial project structure:&lt;br&gt;
job_seeker_ai/&lt;br&gt;
├── app/&lt;br&gt;
│   ├── &lt;strong&gt;init&lt;/strong&gt;.py&lt;br&gt;
│   ├── models/&lt;br&gt;
│   ├── routes/&lt;br&gt;
│   ├── services/&lt;br&gt;
│   └── utils/&lt;br&gt;
├── config.py&lt;br&gt;
├── requirements.txt&lt;br&gt;
└── run.py&lt;br&gt;
We ended the day by setting up a basic Flask application and creating our first route:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;from flask import Flask

app = Flask(__name__)

@app.route('/')
def home():
    return "Welcome to AI JobSeeker"

if __name__ == '__main__':
    app.run(debug=True)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Tomorrow, we'll start implementing the resume parsing featu&lt;/p&gt;

</description>
    </item>
    <item>
      <title>My Journey Learning Artificial Intelligence - Day 11</title>
      <dc:creator>CHANTSZCHEUK</dc:creator>
      <pubDate>Tue, 03 Sep 2024 12:01:23 +0000</pubDate>
      <link>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-11-382o</link>
      <guid>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-11-382o</guid>
      <description>&lt;p&gt;Today marked a significant milestone in my AI learning journey as I completed Microsoft's comprehensive course on Generative AI. This intensive program provided a deep dive into the cutting-edge world of generative models and their applications.&lt;/p&gt;

&lt;p&gt;The course started with an introduction to the fundamental concepts of Generative AI, including the architecture of generative models, particularly focusing on transformer-based models like GPT (Generative Pre-trained Transformer). I learned about the self-attention mechanism, which is crucial for these models to understand context in sequences of data.&lt;/p&gt;

&lt;p&gt;One of the most interesting parts was learning about prompt engineering. I now understand how to craft effective prompts to guide AI models in generating desired outputs, whether it's for text completion, summarization, or even code generation. This skill feels like a superpower, allowing me to communicate more effectively with AI systems.&lt;/p&gt;

&lt;p&gt;The course also covered ethical considerations in AI, emphasizing the importance of responsible AI development. We discussed potential biases in training data and how they can affect model outputs, as well as strategies to mitigate these issues.&lt;/p&gt;

&lt;p&gt;Practical sessions included hands-on experience with Azure OpenAI Service. I learned how to set up the service, make API calls, and integrate generative AI capabilities into applications. &lt;br&gt;
&lt;a href="https://microsoft.github.io/generative-ai-for-beginners/#/" rel="noopener noreferrer"&gt;&lt;/a&gt;&lt;/p&gt;

</description>
    </item>
    <item>
      <title>My Journey Learning Artificial Intelligence - Day 10</title>
      <dc:creator>CHANTSZCHEUK</dc:creator>
      <pubDate>Tue, 03 Sep 2024 11:58:50 +0000</pubDate>
      <link>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-13-1865</link>
      <guid>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-13-1865</guid>
      <description>&lt;p&gt;Today I dived into recommendation systems. I learned about collaborative filtering, content-based filtering, and hybrid approaches. Using the Surprise library, I implemented a simple movie recommendation system based on user ratings.&lt;/p&gt;

&lt;p&gt;I was intrigued by how these systems can personalize user experiences across various platforms, from e-commerce to streaming services.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;from surprise import Dataset, Reader, SVD
from surprise.model_selection import train_test_split
from surprise import accuracy

# Load the movielens-100k dataset
data = Dataset.load_builtin('ml-100k')

# Split the data into training and testing sets
trainset, testset = train_test_split(data, test_size=.25)

# Use SVD algorithm
algo = SVD()

# Train the algorithm on the trainset
algo.fit(trainset)

# Predict ratings for the testset
predictions = algo.test(testset)

# Compute and print Root Mean Squared Error
rmse = accuracy.rmse(predictions)
print(f'RMSE: {rmse}')

# Make recommendations for a specific user
user_id = 196  # example user id
item_id = 302  # example item id
pred = algo.predict(user_id, item_id)
print(f'Predicted rating for user {user_id} on item {item_id}: {pred.est}')
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



</description>
    </item>
    <item>
      <title>My Journey Learning Artificial Intelligence - Day 8</title>
      <dc:creator>CHANTSZCHEUK</dc:creator>
      <pubDate>Tue, 03 Sep 2024 11:57:16 +0000</pubDate>
      <link>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-8-5bpb</link>
      <guid>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-8-5bpb</guid>
      <description>&lt;p&gt;Explored the ethical implications of AI today. I learned about bias in AI systems, the importance of fairness in machine learning models, and the potential societal impacts of AI. I also studied some case studies of AI ethics failures and how they could have been prevented.&lt;/p&gt;

&lt;p&gt;This day made me realize the crucial role of responsible AI development and the need for diverse perspectives in the field.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://microsoft.github.io/generative-ai-for-beginners/#/03-using-generative-ai-responsibly/README?wt.mc_id=academic-105485-koreyst" rel="noopener noreferrer"&gt;&lt;/a&gt;&lt;/p&gt;

</description>
    </item>
    <item>
      <title>My Journey Learning Artificial Intelligence - Day 6</title>
      <dc:creator>CHANTSZCHEUK</dc:creator>
      <pubDate>Tue, 03 Sep 2024 11:55:04 +0000</pubDate>
      <link>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-11-4obg</link>
      <guid>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-11-4obg</guid>
      <description>&lt;p&gt;Today was all about working with APIs. I learned how to integrate AI services into applications using APIs. I used the OpenAI API to generate text completions and the Google Cloud Vision API for image analysis.&lt;/p&gt;

&lt;p&gt;This hands-on experience showed me how powerful pre-built AI services can be, allowing developers to add AI capabilities to their applications without building models from scratch.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;import openai

# Set up your API key
openai.api_key = 'your-api-key-here'

def generate_text(prompt):
    response = openai.Completion.create(
      engine="text-davinci-002",
      prompt=prompt,
      max_tokens=100
    )
    return response.choices[0].text.strip()

# Example usage
prompt = "Write a short story about a robot learning to paint:"
generated_text = generate_text(prompt)
print(generated_text)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



</description>
    </item>
    <item>
      <title>My Journey Learning Artificial Intelligence - Day 5</title>
      <dc:creator>CHANTSZCHEUK</dc:creator>
      <pubDate>Tue, 03 Sep 2024 11:52:06 +0000</pubDate>
      <link>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-5-1oel</link>
      <guid>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-5-1oel</guid>
      <description>&lt;p&gt;My Journey Learning Artificial Intelligence - Day 5&lt;/p&gt;

&lt;p&gt;Today was all about natural language processing (NLP). I learned about tokenization, stemming, and lemmatization as fundamental text preprocessing steps. &lt;br&gt;
&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fdlbn1ch8xmcsijkzxy7f.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fdlbn1ch8xmcsijkzxy7f.png" alt="Image description" width="800" height="424"&gt;&lt;/a&gt;&lt;br&gt;
I also explored word embeddings, and how they capture semantic relationships between words in a vector space.&lt;/p&gt;

</description>
    </item>
    <item>
      <title>My Journey Learning Artificial Intelligence - Day 4</title>
      <dc:creator>CHANTSZCHEUK</dc:creator>
      <pubDate>Tue, 03 Sep 2024 11:49:46 +0000</pubDate>
      <link>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-4-253d</link>
      <guid>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-4-253d</guid>
      <description>&lt;p&gt;Today I explored supervised learning, a fundamental concept in machine learning. Supervised learning involves training models on labeled data to make predictions or classifications. I learned about two main types: regression for predicting continuous values, and classification for categorizing data into discrete classes.&lt;/p&gt;

</description>
    </item>
    <item>
      <title>My Journey Learning Artificial Intelligence - Day 3</title>
      <dc:creator>CHANTSZCHEUK</dc:creator>
      <pubDate>Tue, 03 Sep 2024 11:45:56 +0000</pubDate>
      <link>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-3-1iao</link>
      <guid>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day-3-1iao</guid>
      <description>&lt;p&gt;Today, I delved into the fascinating world of neural networks, the backbone of many AI systems. Neural networks are inspired by the human brain's structure and function, consisting of interconnected nodes or "neurons" that process and transmit information.&lt;/p&gt;

&lt;p&gt;I learned about the basic components of a neural network: input layer, hidden layers, and output layer. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fvxwxykqt4fny23azx5we.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fvxwxykqt4fny23azx5we.png" alt="Image description" width="800" height="388"&gt;&lt;/a&gt;&lt;br&gt;
Each connection between neurons has a weight, which is adjusted during the learning process. The key to a neural network's ability to learn lies in its activation functions and the process of backpropagation.&lt;/p&gt;

&lt;p&gt;Activation functions introduce non-linearity into the network, allowing it to learn complex patterns. Common activation functions include ReLU (Rectified Linear Unit), Sigmoid, and Tanh. Backpropagation is the algorithm used to calculate gradients and update weights, enabling the network to minimize errors and improve its predictions.&lt;/p&gt;

&lt;p&gt;I also explored different types of neural networks, such as Convolutional Neural Networks (CNNs) for image processing and Recurrent Neural Networks (RNNs) for sequential data like text or time series.&lt;/p&gt;

&lt;p&gt;The concept of deep learning, which involves neural networks with many hidden layers, particularly intrigued me. It's amazing how these deep networks can automatically learn hierarchical representations of data, leading to breakthroughs in various fields like computer vision and natural language processing.&lt;/p&gt;

</description>
    </item>
    <item>
      <title>My Journey Learning Artificial Intelligence -Day2</title>
      <dc:creator>CHANTSZCHEUK</dc:creator>
      <pubDate>Fri, 02 Aug 2024 11:22:50 +0000</pubDate>
      <link>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day2-1j41</link>
      <guid>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day2-1j41</guid>
      <description>&lt;h3&gt;
  
  
  First time try Azure OpenAI to make API call on python
&lt;/h3&gt;

&lt;p&gt;create new  deployment in Azure OpenAI Studio and set up deploy model to make API calls.&lt;br&gt;
&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fgjikrdsg7no2ta0f43vo.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fgjikrdsg7no2ta0f43vo.png" alt="Image description" width="800" height="415"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;It need to install the necessary libraries and run the following command in cmd.&lt;br&gt;
&lt;strong&gt;pip install -r requirements.txt&lt;/strong&gt;&lt;br&gt;
&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fq05lnnesrmw8yvro5me9.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fq05lnnesrmw8yvro5me9.png" alt="Image description" width="489" height="394"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Create &lt;strong&gt;.env file&lt;/strong&gt; for API Keys.&lt;br&gt;
&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fmxi8i3stns9c3wh4uqd3.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fmxi8i3stns9c3wh4uqd3.png" alt="Image description" width="780" height="300"&gt;&lt;/a&gt;&lt;br&gt;
_A .env file is used to store environment variables, which can be accessed by your application to keep sensitive information like API keys secure and out of your source code. _&lt;/p&gt;

&lt;p&gt;Try to run the &lt;strong&gt;ValidateSetup.py&lt;/strong&gt;&lt;br&gt;
&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fu034t3pokm60ts83yvh9.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fu034t3pokm60ts83yvh9.png" alt="Image description" width="800" height="487"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;There have a error about rate limit, the reason about this problem is because of Azure Student Account Quotas has limit 1000 tokens.&lt;br&gt;
&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ff20ang5r59gd1wwnesux.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ff20ang5r59gd1wwnesux.png" alt="Image description" width="749" height="486"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Luckily ,I find a free API Key on GitHub and active successfully&lt;br&gt;
&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frc29svahs6f95j1e0e9h.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frc29svahs6f95j1e0e9h.png" alt="Image description" width="800" height="792"&gt;&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>learning</category>
    </item>
    <item>
      <title>My Journey Learning Artificial Intelligence -Day1</title>
      <dc:creator>CHANTSZCHEUK</dc:creator>
      <pubDate>Thu, 01 Aug 2024 17:18:07 +0000</pubDate>
      <link>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day1-4i5l</link>
      <guid>https://dev.to/chantszcheuk/my-journey-learning-artificial-intelligence-day1-4i5l</guid>
      <description>&lt;p&gt;The term "Artificial Intelligence" (AI) has become increasingly popular in recent years. Whether in the tech industry, business sector, or everyday life, its presence is ubiquitous. As a student currently studying AI, I would like to share some experiences and the key point from this journey.&lt;/p&gt;

&lt;h2&gt;
  
  
  Foundation Models versus LLMs
&lt;/h2&gt;

&lt;p&gt;The term Foundation Model was coined by Stanford researchers and defined as an AI model that follows some criteria, such as:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;They are trained using unsupervised learning or self-supervised learning,&lt;/strong&gt; meaning they are trained on unlabeled multi-modal data, and they do not require human annotation or labeling of data for their training process.&lt;br&gt;
&lt;strong&gt;They are very large models&lt;/strong&gt;, based on very deep neural networks trained on billions of parameters.&lt;br&gt;
&lt;strong&gt;They are normally intended to serve as a ‘foundation’ for other models&lt;/strong&gt;, meaning they can be used as a starting point for other models to be built on top of, which can be done by fine-tuning.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ftu2oeh09v5pmxxu2vwl7.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ftu2oeh09v5pmxxu2vwl7.png" alt="Image description" width="680" height="381"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h2&gt;
  
  
  Large Language Models (LLMs)
&lt;/h2&gt;

&lt;p&gt;Based on Transformer architecture, trained on vast amounts of unlabeled data.&lt;br&gt;
Capable of generating grammatically correct and creative text responses.&lt;br&gt;
Work with numerical tokens for better performance.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4j4u1qbsx3f75cjel2nd.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4j4u1qbsx3f75cjel2nd.png" alt="Image description" width="800" height="328"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h2&gt;
  
  
  How Large Language Models Work
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Tokenization&lt;/strong&gt;: Process text inputs into numerical tokens.
&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fhyzaez4xsx3dm2y0dhe7.png" alt="Image description" width="800" height="246"&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Prediction&lt;/strong&gt;: Model predicts output tokens based on input tokens.&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Selection&lt;/strong&gt;: Outputs tokens based on probability distribution, introducing randomness for creativity.&lt;br&gt;
&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F5qqebdy3oaqbx6r67cfl.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F5qqebdy3oaqbx6r67cfl.png" alt="Image description" width="652" height="337"&gt;&lt;/a&gt;&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Tuning&lt;/strong&gt;: Model parameter like temperature controls the randomness level.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;Thanks for this youtube video by 3Blue1Brown who help me a lot.&lt;/strong&gt;&lt;br&gt;
&lt;iframe width="710" height="399" src="https://www.youtube.com/embed/wjZofJX0v4M"&gt;
&lt;/iframe&gt;
&lt;/p&gt;

</description>
      <category>ai</category>
      <category>learning</category>
    </item>
  </channel>
</rss>
