<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: A_Lucas</title>
    <description>The latest articles on DEV Community by A_Lucas (@a_lucas).</description>
    <link>https://dev.to/a_lucas</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1505387%2Fe2830f85-77f1-4a80-9c22-8ba03c5622c6.jpg</url>
      <title>DEV Community: A_Lucas</title>
      <link>https://dev.to/a_lucas</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/a_lucas"/>
    <language>en</language>
    <item>
      <title>Embedding and Reranking with Alibaba Cloud AI Service</title>
      <dc:creator>A_Lucas</dc:creator>
      <pubDate>Fri, 28 Feb 2025 03:26:46 +0000</pubDate>
      <link>https://dev.to/a_lucas/embedding-and-reranking-with-alibaba-cloud-ai-service-5541</link>
      <guid>https://dev.to/a_lucas/embedding-and-reranking-with-alibaba-cloud-ai-service-5541</guid>
      <description>&lt;p&gt;Written by:  Elastic &lt;a href="https://www.elastic.co/search-labs/author/tomas-murua" rel="noopener noreferrer"&gt;Tomás Murúa&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;In this article, we'll cover how to integrate &lt;a href="https://www.alibabacloud.com/en/solutions/generative-ai" rel="noopener noreferrer"&gt;Alibaba Cloud AI&lt;/a&gt; features with Elasticsearch to improve relevance in semantic searches.&lt;/p&gt;

&lt;p&gt;Alibaba Cloud AI Search is a solution that integrates advanced AI features with Elasticsearch tools, by leveraging the &lt;a href="https://www.alibabacloud.com/en/solutions/generative-ai/qwen?_p_lc=1" rel="noopener noreferrer"&gt;Qwen LLM/DeepSeek-R1&lt;/a&gt; family to contribute with advanced models for inference and classification. In this article, we'll use descriptions of novels and plays written by the same author to test the Alibaba reranking and sparse embedding endpoints.&lt;/p&gt;

&lt;h2&gt;
  
  
  Steps
&lt;/h2&gt;

&lt;p&gt;1）Configure Alibaba Cloud AI&lt;/p&gt;

&lt;p&gt;2）Create Elasticsearch mappings&lt;/p&gt;

&lt;p&gt;3）Index data into Elasticsearch&lt;/p&gt;

&lt;p&gt;4）Query data&lt;/p&gt;

&lt;p&gt;5）Bonus: Answering questions with completion&lt;/p&gt;

&lt;h2&gt;
  
  
  Configure Alibaba Cloud AI
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Alibaba Cloud AI reranking and embeddings
&lt;/h3&gt;

&lt;p&gt;Open inference Alibaba Cloud offers different services. In this example, we'll use the descriptions of popular books and plays by Agatha Christie to test &lt;strong&gt;Alibaba Cloud&lt;/strong&gt; embeddings and reranking endpoints in semantic search.&lt;/p&gt;

&lt;p&gt;The Alibaba Cloud AI reranking endpoint is a semantic reranking functionality. This type of reranking uses a machine learning model to reorder search results based on their semantic similarity to a query. This allows you to use out-of-the-box semantic search capabilities on existing full-text search indices.&lt;/p&gt;

&lt;p&gt;The sparse embedding endpoint is a type of embedding where most values are zero, making relevant information more prominent.&lt;/p&gt;

&lt;h3&gt;
  
  
  Get Alibaba Cloud API Key
&lt;/h3&gt;

&lt;p&gt;We need a &lt;a href="https://www.alibabacloud.com/help/en/open-search/search-platform/user-guide/api-keys-management" rel="noopener noreferrer"&gt;valid API Key&lt;/a&gt; to integrate Alibaba with Elasticsearch. To get it, follow these steps:&lt;/p&gt;

&lt;p&gt;1）Access the Alibaba Cloud portal from the &lt;a href="https://opensearch.console.aliyun.com/cn-shanghai/rag/server-market" rel="noopener noreferrer"&gt;Service Plaza&lt;/a&gt; section.&lt;/p&gt;

&lt;p&gt;2）Go to the left menu &lt;strong&gt;API Keys&lt;/strong&gt; as shown below.&lt;/p&gt;

&lt;p&gt;3）Generate a new API Key.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F8yezdl85op4c13m8zix9.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F8yezdl85op4c13m8zix9.png" width="800" height="452"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  Configure Alibaba Endpoints
&lt;/h3&gt;

&lt;p&gt;We´ll first configure the sparse embedding endpoint to transform the text descriptions into semantic vectors:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Embeddings endpoint:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;PUT _inference/sparse_embedding/alibabacloud_ai_search_sparse
{
    "service": "alibabacloud-ai-search",
    "service_settings": {
        "api_key": "&amp;lt;api_key&amp;gt;",
        "service_id": "ops-text-sparse-embedding-001",
        "host": "default-j01.platform-cn-shanghai.opensearch.aliyuncs.com",
        "workspace": "default"
    }
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;We´ll then configure the rerank endpoint to reorganize results.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Rerank Endpoint:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;PUT _inference/rerank/alibabacloud_ai_search_rerank
{
    "service": "alibabacloud-ai-search",
    "service_settings": {
        "api_key": "&amp;lt;api_key&amp;gt;",
        "service_id": "ops-bge-reranker-larger",
        "host": "default-j01.platform-cn-shanghai.opensearch.aliyuncs.com",
        "workspace": "default"
    }
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Now that the endpoints are configured, we can prepare the Elasticsearch index.&lt;/p&gt;

&lt;h2&gt;
  
  
  Create Elasticsearch mappings
&lt;/h2&gt;

&lt;p&gt;Let's configure the mappings. For this, we need to organize both the texts with the descriptions as well as the model-generated vectors.&lt;/p&gt;

&lt;p&gt;We'll use the following properties:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;semantic_description&lt;/strong&gt;: to store the embeddings generated by the model and run semantic searches.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;description&lt;/strong&gt;: we'll use a "text" type to store the novels and plays’ descriptions and use them for full-text search.&lt;/p&gt;

&lt;p&gt;We'll include the copy_to parameter so that both the text and the semantic field are available for &lt;strong&gt;hybrid search&lt;/strong&gt;:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;PUT arts
{
  "mappings": {
    "properties": {
      "semantic_description": {
        "type": "semantic_text",
        "inference_id": "alibabacloud_ai_search_sparse"
      },
      "description": {
        "type": "text",
        "copy_to": "semantic_description"
      }
    }
  }
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;With the mappings ready, we can now index the data.&lt;/p&gt;

&lt;h2&gt;
  
  
  Index data into Elasticsearch
&lt;/h2&gt;

&lt;p&gt;Here's the dataset with the descriptions that we'll use for this example. We'll index it using the Elasticsearch Bulk API.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;POST arts/_bulk
{ "index": {} }
{ "description": " Black Coffee is a play by the British crime-fiction author Agatha Christie. In the play, a scientist discovers that someone in his household has stolen the formula for an explosive." }
{ "index": {} }
{ "description": "The Mousetrap is a murder mystery play by Agatha Christie. The play opened in London's West End in 1952 and ran continuously until 16 March 2020." }
{ "index": {} }
{ "description": "The Body in the Murder is a Miss Marple mystery novel published by Agatha Christie in 1942. The case involves the murder of two teenage girls who are similar in appearance." }
{ "index": {} }
{ "description": " Agatha Christie's last published novel before she passed, Curtain: Poirot's Last Case is also her indelible detective's last appearance. Poirot and Hastings return to the very same house from The Mysterious Affairs at Styles over 30 years later." }
{ "index": {} }
{ "description": " Death on the Nile is Agatha Christie's most daring travel mystery novel. The tranquillity of a cruise along the Nile is shattered by the discovery that Linnet Ridgeway has been shot through the head." }
{ "index": {} }
{ "description": " The Murder of Roger Ackroyd was Agatha Christie’s first book to be published by William Collins in the spring of 1926. William Collins became part of HarperCollins and are still Christie’s publishers today." }
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Note that the first two documents, “Black Coffee” and “The Mousetraps” are plays while the others are novels.&lt;/p&gt;

&lt;h2&gt;
  
  
  Query data
&lt;/h2&gt;

&lt;p&gt;To see the different results we can get, we'll run different types of queries, starting with semantic query, then applying reranking, and finally using both. We'll use the same question "Which novel was written by Agatha Christie?" expecting to get the three documents that explicitly say novel, plus the one that says book. The two plays should be the last results.&lt;/p&gt;

&lt;h3&gt;
  
  
  Semantic search
&lt;/h3&gt;

&lt;p&gt;We'll begin querying the semantic_text field to ask: "Which novel was written by Agatha Christie?" Let's see what happens:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;GET /arts/_search
{
  "_source": {
    "includes": [
      "description"
    ]
  },
  "query": {
    "semantic": {
      "field": "semantic_description",
      "query": "Which novel was written by Agatha Christie?"
    }
  }
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Response:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;{
  "took": 1246,
  "timed_out": false,
  "_shards": {
    "total": 1,
    "successful": 1,
    "skipped": 0,
    "failed": 0
  },
  "hits": {
    "total": {
      "value": 6,
      "relation": "eq"
    },
    "max_score": 0.1759066,
    "hits": [
      {
        "_index": "arts",
        "_id": "rdJ4-ZMB36zj9EVTnMgJ",
        "_score": 0.1759066,
        "_source": {
          "description": " Death on the Nile is Agatha Christie's most daring travel mystery novel. The tranquillity of a cruise along the Nile is shattered by the discovery that Linnet Ridgeway has been shot through the head."
        }
      },
      {
        "_index": "arts",
        "_id": "rNJ4-ZMB36zj9EVTnMgJ",
        "_score": 0.17499167,
        "_source": {
          "description": " Agatha Christie's last published novel before she passed, Curtain: Poirot's Last Case is also her indelible detective's last appearance. Poirot and Hastings return to the very same house from The Mysterious Affairs at Styles over 30 years later."
        }
      },
      {
        "_index": "arts",
        "_id": "q9J4-ZMB36zj9EVTnMgJ",
        "_score": 0.16319725,
        "_source": {
          "description": "The Body in the Murder is a Miss Marple mystery novel published by Agatha Christie in 1942. The case involves the murder of two teenage girls who are similar in appearance."
        }
      },
      {
        "_index": "arts",
        "_id": "qtJ4-ZMB36zj9EVTnMgJ",
        "_score": 0.15506727,
        "_source": {
          "description": "The Mousetrap is a murder mystery play by Agatha Christie. The play opened in London's West End in 1952 and ran continuously until 16 March 2020."
        }
      },
      {
        "_index": "arts",
        "_id": "qdJ4-ZMB36zj9EVTnMgJ",
        "_score": 0.14572844,
        "_source": {
          "description": " Black Coffee is a play by the British crime-fiction author Agatha Christie. In the play, a scientist discovers that someone in his household has stolen the formula for an explosive."
        }
      },
      {
        "_index": "arts",
        "_id": "rtJ4-ZMB36zj9EVTnMgJ",
        "_score": 0.13951442,
        "_source": {
          "description": " The Murder of Roger Ackroyd was Agatha Christie’s first book to be published by William Collins in the spring of 1926. William Collins became part of HarperCollins and are still Christie’s publishers today."
        }
      }
    ]
  }
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;In this case, the response prioritized most of the novels, but the document that says “book” appears last. We can still further refine the results with reranking.&lt;/p&gt;

&lt;h3&gt;
  
  
  Refining results with Reranking
&lt;/h3&gt;

&lt;p&gt;In this case, we'll use a &lt;code&gt;_inference/rerank&lt;/code&gt; request to assess the documents we got in the first query and improve their rank in the results.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;POST _inference/rerank/alibabacloud_ai_search_rerank
{
  "query": "Which novel was written by Agatha Christie?",
  "input": [
    "Black Coffee is a play by the British crime-fiction author Agatha Christie. In the play, a scientist discovers that someone in his household has stolen the formula for an explosive.",
    "The Mousetrap is a murder mystery play by Agatha Christie. The play opened in London's West End in 1952 and ran continuously until 16 March 2020.",
    " The Body in the Murder is a Miss Marple mystery novel published by Agatha Christie in 1942. The case involves the murder of two teenage girls who are similar in appearance.",
    " Agatha Christie's last published novel before she passed, Curtain: Poirot's Last Case is also her indelible detective's last appearance. Poirot and Hastings return to the very same house from The Mysterious Affairs at Styles over 30 years later.",
    " Death on the Nile is Agatha Christie's most daring travel mystery novel. The tranquillity of a cruise along the Nile is shattered by the discovery that Linnet Ridgeway has been shot through the head.",
    " The Murder of Roger Ackroyd was Agatha Christie’s first book to be published by William Collins in the spring of 1926. William Collins became part of HarperCollins and are still Christie’s publishers today."
  ]
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Response:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;{
  "rerank": [
    {
      "index": 3,
      "relevance_score": 0.91086304
    },
    {
      "index": 4,
      "relevance_score": 0.8409133
    },
    {
      "index": 2,
      "relevance_score": 0.76838577
    },
    {
      "index": 5,
      "relevance_score": 0.2295352
    },
    {
      "index": 0,
      "relevance_score": 0.13846178
    },
    {
      "index": 1,
      "relevance_score": 0.06620602
    }
  ]
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The response here shows that both plays are now at the bottom of the results.&lt;/p&gt;

&lt;h3&gt;
  
  
  Semantic search and reranking endpoint combined
&lt;/h3&gt;

&lt;p&gt;Using a retriever, we'll combine the semantic query and reranking in just one step:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;POST /arts/_search
{
  "_source": {
    "includes": ["description"]
  },
  "retriever": {
    "text_similarity_reranker": {
      "retriever": {
        "standard": {
          "query": {
            "semantic": {
              "field": "semantic_description",
              "query": "Which novel was written by Agatha Christie?"
            }
          }
        }
      },
      "field": "description",
      "rank_window_size": 10,
      "inference_id": "alibabacloud_ai_search_rerank",
      "inference_text": "Which novel was written by Agatha Christie?"
    }
  }
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Response:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;  "took": 1568,
  "timed_out": false,
  "_shards": {
    "total": 1,
    "successful": 1,
    "skipped": 0,
    "failed": 0
  },
  "hits": {
    "total": {
      "value": 6,
      "relation": "eq"
    },
    "max_score": 0.91086304,
    "hits": [
      {
        "_index": "arts",
        "_id": "rNJ4-ZMB36zj9EVTnMgJ",
        "_score": 0.91086304,
        "_source": {
          "description": " Agatha Christie's last published novel before she passed, Curtain: Poirot's Last Case is also her indelible detective's last appearance. Poirot and Hastings return to the very same house from The Mysterious Affairs at Styles over 30 years later."
        }
      },
      {
        "_index": "arts",
        "_id": "rdJ4-ZMB36zj9EVTnMgJ",
        "_score": 0.8409133,
        "_source": {
          "description": " Death on the Nile is Agatha Christie's most daring travel mystery novel. The tranquillity of a cruise along the Nile is shattered by the discovery that Linnet Ridgeway has been shot through the head."
        }
      },
      {
        "_index": "arts",
        "_id": "q9J4-ZMB36zj9EVTnMgJ",
        "_score": 0.76838577,
        "_source": {
          "description": "The Body in the Murder is a Miss Marple mystery novel published by Agatha Christie in 1942. The case involves the murder of two teenage girls who are similar in appearance."
        }
      },
      {
        "_index": "arts",
        "_id": "rtJ4-ZMB36zj9EVTnMgJ",
        "_score": 0.2295352,
        "_source": {
          "description": " The Murder of Roger Ackroyd was Agatha Christie’s first book to be published by William Collins in the spring of 1926. William Collins became part of HarperCollins and are still Christie’s publishers today."
        }
      },
      {
        "_index": "arts",
        "_id": "qdJ4-ZMB36zj9EVTnMgJ",
        "_score": 0.13846178,
        "_source": {
          "description": " Black Coffee is a play by the British crime-fiction author Agatha Christie. In the play, a scientist discovers that someone in his household has stolen the formula for an explosive."
        }
      },
      {
        "_index": "arts",
        "_id": "qtJ4-ZMB36zj9EVTnMgJ",
        "_score": 0.06620602,
        "_source": {
          "description": "The Mousetrap is a murder mystery play by Agatha Christie. The play opened in London's West End in 1952 and ran continuously until 16 March 2020."
        }
      }
    ]
  }
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The results here differ from the semantic query. We can see that the document with no exact match for "novel" but that says “book” (&lt;em&gt;The Murder of Roger Ackroyd)&lt;/em&gt; appears higher than in the first semantic search. Both plays are still the last results, just like with reranking.&lt;/p&gt;

&lt;h3&gt;
  
  
  Bonus: Answering questions with completion
&lt;/h3&gt;

&lt;p&gt;With embeddings and reranking we can satisfy a search query, but still, the user will see all the search results and not the actual answer.&lt;/p&gt;

&lt;p&gt;With the examples provided, we are one step away from a RAG implementation, where we can provide the top results + the question to an LLM to get the right answer.&lt;/p&gt;

&lt;p&gt;Fortunately, Alibaba Cloud AI Service also provides an endpoint service we can use to achieve this purpose.&lt;/p&gt;

&lt;h4&gt;
  
  
  Let’s create the endpoint
&lt;/h4&gt;

&lt;p&gt;&lt;strong&gt;Completion Endpoint:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;1）Create Completion Endpoints with Alibaba Cloud Qwen LLM&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;PUT _inference/completion/alibabacloud_ai_search_completion
{
    "service": "alibabacloud-ai-search",
    "service_settings": {
        "host" : "default-j01.platform-cn-shanghai.opensearch.aliyuncs.com",
        "api_key": "&amp;lt;api_key&amp;gt;",
        "service_id": "ops-qwen-turbo",
        "workspace" : "default"
    }
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;2）We can also create it with DeepSeek-R1&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;PUT _inference/completion/alibabacloud_ai_search_completion_deepseek_r1
{
    "service": "alibabacloud-ai-search",
    "service_settings": {
        "host" : "default-j01.platform-cn-shanghai.opensearch.aliyuncs.com",
        "api_key": "{{API_KEY}}",
        "service_id": "deepseek-r1",
        "workspace" : "default"
    }
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;And now, send the results and question from the previous query:&lt;/p&gt;

&lt;h4&gt;
  
  
  Query with Alibaba Cloud Qwen LLM
&lt;/h4&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;POST _inference/completion/alibabacloud_ai_search_completion
{
  "input": """
    Answer the following question using the context provided:
    QUESTION: Which novel was written by Agatha Christie?
    CONTEXT:
    DOCUMENT1
    Black Coffee is a play by the British crime-fiction author Agatha Christie. In the play, a scientist discovers that someone in his household has stolen the formula for an explosive.
    DOCUMENT2
    The Mousetrap is a murder mystery play by Agatha Christie. The play opened in London's West End in 1952 and ran continuously until 16 March 2020.
    DOCUMENT3
    The Body in the Murder is a Miss Marple mystery novel published by Agatha Christie in 1942. The case involves the murder of two teenage girls who are similar in appearance.
    DOCUMENT4
    Agatha Christie's last published novel before she passed, Curtain: Poirot's Last Case is also her indelible detective's last appearance. Poirot and Hastings return to the very same house from The Mysterious Affairs at Styles over 30 years later.
    DOCUMENT5
    Death on the Nile is Agatha Christie's most daring travel mystery novel. The tranquillity of a cruise along the Nile is shattered by the discovery that Linnet Ridgeway has been shot through the head."
    DOCUMENT6
    The Murder of Roger Ackroyd was Agatha Christie’s first book to be published by William Collins in the spring of 1926. William Collins became part of HarperCollins and are still Christie’s publishers today.
    ANSWER:
    """
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Response:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;{
  "completion": [
{
      "result": "Agatha Christie wrote several novels, including \"The Body in the Murder,\" \"Curtain: Poirot's Last Case,\" \"Death on the Nile,\" and \"The Murder of Roger Ackroyd.\""
    }
  ]
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  Query with Alibaba Cloud DeepSeek-R1
&lt;/h4&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;POST _inference/completion/alibabacloud_ai_search_completion_deepseek_r1?timeout=180s
{
  "input": "&amp;lt;|system|&amp;gt;
    You are an AI assistant.&amp;lt;/s&amp;gt;
    &amp;lt;|user|&amp;gt;
    CONTEXT:
    Black Coffee is a play by the British crime-fiction author Agatha Christie. In the play, a scientist discovers that someone in his household has stolen the formula for an explosive;

    The Mousetrap is a murder mystery play by Agatha Christie. The play opened in London's West End in 1952 and ran continuously until 16 March 2020;

    The Body in the Murder is a Miss Marple mystery novel published by Agatha Christie in 1942. The case involves the murder of two teenage girls who are similar in appearance;
    Agatha Christie's last published novel before she passed, Curtain: Poirot's Last Case is also her indelible detective's last appearance. Poirot and Hastings return to the very same house from The Mysterious Affairs at Styles over 30 years later;

    Death on the Nile is Agatha Christie's most daring travel mystery novel. The tranquillity of a cruise along the Nile is shattered by the discovery that Linnet Ridgeway has been shot through the head;
    The Murder of Roger Ackroyd was Agatha Christie’s first book to be published by William Collins in the spring of 1926. William Collins became part of HarperCollins and are still Christie’s publishers today;

    QUESTION: 
    Which novela were written by Agatha Christie?&amp;lt;/s&amp;gt;
    &amp;lt;|assistant|&amp;gt;"
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Result:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;{
  "completion": [
    {
      "result": """&amp;lt;think&amp;gt;
Okay, let's see. The user is asking which novels were written by Agatha Christie based on the given context. First, I need to go through each item in the context and determine if it's a novel. The user mentioned "novela," which I think is Spanish for "novel," so they're asking about novels, not plays or other works.
Looking at the context entries one by one:
1. **Black Coffee** is described as a play by Christie. So that's a play, not a novel. Exclude.
2. **The Mousetrap** is a murder mystery play, opened in London's West End. Definitely a play, not a novel. Exclude.
3. **The Body in the Murder** is listed as a Miss Marple mystery novel published in 1942. Wait, the title here might be a bit off. Agatha Christie wrote a novel called "The Body in the Library," which is a Miss Marple story from 1942. Maybe the user made a typo. Assuming it's "The Body in the Library," then yes, that's a novel. But the title given is "The Body in the Murder," which I don't recall. Need to check if that's a real title or a mistake. However, since the context says it's a Miss Marple novel published in 1942, I'll proceed with that, even if the title is slightly wrong. So include as a novel.
4. **Curtain: Poirot's Last Case** is mentioned as her last published novel before she passed. So that's a novel. Include.
5. **Death on the Nile** is described as a travel mystery novel. That's a novel. Include.
6. **The Murder of Roger Ackroyd** was her first book published by William Collins. That's a novel. Include.
So the novels listed here are: The Body in the Murder (assuming typo), Curtain, Death on the Nile, and The Murder of Roger Ackroyd. However, "The Body in the Murder" might actually be "The Body in the Library," which is the correct title. But since the user provided that exact title, I should list it as given, even if there's an error. Alternatively, note the possible typo.
Also, check if there are other works mentioned. The other entries are plays. So the answer should list the four novels mentioned in the context, being careful with the title accuracy.
&amp;lt;/think&amp;gt;
The novels written by Agatha Christie mentioned in the context are:  
1. **The Body in the Murder** (likely a typo for *The Body in the Library*, a Miss Marple novel published in 1942).  
2. **Curtain: Poirot's Last Case** (her final published novel featuring Hercule Poirot).  
3. **Death on the Nile** (a travel mystery novel set on a Nile cruise).  
4. **The Murder of Roger Ackroyd** (her breakthrough novel published in 1926).  
*Note*:  
- *Black Coffee* and *The Mousetrap* are plays, not novels.  
- If "The Body in the Murder" is intended to refer to *The Body in the Library*, the latter is the correct title of Christie's 1942 Miss Marple novel."""
    }
  ]
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;Integrating Alibaba Cloud AI Search with Elasticsearch allows us to easily access completion, embedding, and reranking models to incorporate them into our search pipeline.&lt;/p&gt;

&lt;p&gt;We can use the reranking and embedding endpoints, either separately or together, with the help of a retriever.&lt;/p&gt;

&lt;p&gt;We can also introduce the completion endpoint to finish up a RAG end-to-end implementation.&lt;/p&gt;

&lt;p&gt;Ready to start your journey with elasticsearch on &lt;a href="https://www.alibabacloud.com/en/product/elasticsearch" rel="noopener noreferrer"&gt;Alibaba Cloud&lt;/a&gt;? Explore our tailored Cloud solutions and services to take the first step towards transforming your data into a visual masterpiece.&lt;/p&gt;

&lt;p&gt;Please &lt;a href="https://account.alibabacloud.com/login/login.htm" rel="noopener noreferrer"&gt;Click here&lt;/a&gt;, Embark on Your 30-Day Free Trial &lt;/p&gt;

&lt;p&gt;Original text：&lt;a href="https://www.elastic.co/search-labs/blog/alibaba-cloud-ai-embeddings-reranking" rel="noopener noreferrer"&gt;Embeddings and reranking with Alibaba Cloud AI Service - Elasticsearch Labs&lt;/a&gt;&lt;/p&gt;

</description>
      <category>deepseek</category>
      <category>qwenllm</category>
      <category>alibabacloud</category>
      <category>elasticsearch</category>
    </item>
    <item>
      <title>Building Real-Time Anomaly Detection Systems with Alibaba Cloud Elasticsearch ML Modules</title>
      <dc:creator>A_Lucas</dc:creator>
      <pubDate>Thu, 27 Feb 2025 06:36:09 +0000</pubDate>
      <link>https://dev.to/a_lucas/building-real-time-anomaly-detection-systems-with-alibaba-cloud-elasticsearch-ml-modules-5f9d</link>
      <guid>https://dev.to/a_lucas/building-real-time-anomaly-detection-systems-with-alibaba-cloud-elasticsearch-ml-modules-5f9d</guid>
      <description>&lt;h2&gt;
  
  
  Overview of Alibaba Cloud Elasticsearch ML Modules
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Key Features for Real-Time Anomaly Detection
&lt;/h3&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch ML Modules offer powerful tools for detecting anomalies in real time. These modules use both &lt;a href="https://www.alibabacloud.com/blog/leveraging-alibaba-cloud-elasticsearch-for-intelligent-data-detection-and-prediction_601165" rel="noopener noreferrer"&gt;unsupervised and supervised machine learning techniques&lt;/a&gt; to analyze your data effectively. Unsupervised learning identifies unusual patterns without prior training, making it ideal for detecting anomalies in logging or financial transactions. Supervised learning, on the other hand, trains models on labeled data to classify new information and predict outcomes using regression and classification algorithms.&lt;/p&gt;

&lt;p&gt;These features allow you to process large volumes of logging data efficiently. You can uncover hidden patterns and address potential issues before they disrupt your system. By leveraging these capabilities, you enhance your ability to maintain robust IT operations.&lt;/p&gt;

&lt;h3&gt;
  
  
  Observability Capabilities in Elasticsearch
&lt;/h3&gt;

&lt;p&gt;Observability plays a crucial role in anomaly detection. Alibaba Cloud Elasticsearch integrates advanced observability tools to help you monitor and analyze your system. The &lt;a href="https://www.alibabacloud.com/blog/from-enterprise-rag-to-ai-assistant%E2%80%94alibaba-cloud-elasticsearch-technology-practice-in-ai-search_601952" rel="noopener noreferrer"&gt;AI Assistant uses RAG technology&lt;/a&gt; and large language models to automate data analysis and visualization. It transforms observability data into actionable insights, enabling you to identify root causes of errors quickly.&lt;/p&gt;

&lt;p&gt;This assistant also monitors real-time anomalies and potential failures. It analyzes logging data to detect security threats and proposes defense strategies. These observability capabilities ensure your cloud-native system remains secure and efficient.&lt;/p&gt;

&lt;h3&gt;
  
  
  Why Choose Alibaba Cloud for aiops and Anomaly Detection
&lt;/h3&gt;

&lt;p&gt;Alibaba Cloud stands out as a leader in aiops and anomaly detection. Its Elasticsearch ML Modules combine cloud-native technologies with advanced machine learning to deliver unparalleled performance. The platform supports seamless integration with your IT infrastructure, enabling you to process logging data in real time.&lt;/p&gt;

&lt;p&gt;With Alibaba Cloud, you gain access to cutting-edge tools for observability and aiops. These tools empower you to automate monitoring, reduce downtime, and improve system reliability. By choosing Alibaba Cloud, you ensure your IT operations remain resilient and future-ready.&lt;/p&gt;

&lt;h2&gt;
  
  
  Key Components of a Real-Time Anomaly Detection System
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Data Ingestion and Preprocessing
&lt;/h3&gt;

&lt;p&gt;Data ingestion and preprocessing form the foundation of any real-time anomaly detection system. You need to collect data from diverse sources such as website logs, network traffic, and online databases. This step ensures that all relevant information is available for analysis. The collected data often contains inconsistencies or missing values. Cleaning and formatting the data make it usable for anomaly detection. For instance, estimation techniques can address missing data points effectively.&lt;/p&gt;

&lt;p&gt;The ingestion process involves storing the data in Elasticsearch, where it becomes accessible for further analysis. Preprocessing also includes converting raw data into formats suitable for machine learning algorithms. By ensuring high-quality input, you improve the system's performance and accuracy in detecting anomalies.&lt;/p&gt;

&lt;h3&gt;
  
  
  Machine Learning Models for Anomaly Detection
&lt;/h3&gt;

&lt;p&gt;Machine learning models are the core of anomaly detection and pattern recognition. These models analyze data to identify unusual patterns. Algorithms like Isolation Forest, Local Outlier Factor, and One-Class SVM are particularly effective. Isolation Forest isolates anomalies by dividing data points using decision trees. Local Outlier Factor evaluates the density of data points to detect anomalies based on their neighborhood. One-Class SVM creates boundaries around normal data points, marking those outside as anomalies.&lt;/p&gt;

&lt;p&gt;Selecting the right algorithm depends on your system's requirements. For example, unsupervised algorithms work well when you lack labeled data. These models enable you to uncover hidden anomalies and gain real-time insights into your system's behavior.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Algorithm&lt;/th&gt;
&lt;th&gt;Description&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Isolation Forest&lt;/td&gt;
&lt;td&gt;Divides data points using decision trees to isolate anomalies.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Local Outlier Factor&lt;/td&gt;
&lt;td&gt;Detects anomalies by analyzing the density of data points in their neighborhood.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;One-Class SVM&lt;/td&gt;
&lt;td&gt;Identifies anomalies by creating boundaries around normal data points.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h3&gt;
  
  
  Real-Time Processing and Alerting
&lt;/h3&gt;

&lt;p&gt;Real-time processing ensures that anomalies are detected as they occur. Advanced analytics engines continuously analyze data streams, enabling immediate identification of unusual patterns. This capability is crucial for mitigating risks such as cyberattacks or system failures. For example, detecting anomalies in network traffic in real time allows you to respond quickly to potential security threats.&lt;/p&gt;

&lt;p&gt;Alerting systems play a vital role in this process. They generate context-aware alerts that help you understand the root cause of anomalies. These alerts prioritize critical issues, enabling faster decision-making. By integrating real-time processing and intelligent alerting, you enhance your system's performance and ensure robust monitoring.&lt;/p&gt;

&lt;h3&gt;
  
  
  Observability and Monitoring Tools
&lt;/h3&gt;

&lt;p&gt;Observability and monitoring tools are essential for maintaining the performance of your real-time anomaly detection system. These tools help you track system behavior, identify issues, and ensure smooth operations. By using observability tools, you gain a comprehensive view of your system's health and detect anomalies before they escalate into major problems.&lt;/p&gt;

&lt;p&gt;Monitoring tools continuously collect and analyze data from various sources, such as logs, metrics, and traces. This data provides valuable insights into your system's performance. For example, you can monitor CPU usage, memory consumption, and network traffic to identify unusual patterns. These insights allow you to take immediate action to prevent downtime or security breaches.&lt;/p&gt;

&lt;p&gt;Observability tools go beyond basic monitoring by offering advanced analysis capabilities. They enable you to understand the root cause of anomalies through detailed data visualization and correlation. For instance, you can use dashboards to view real-time metrics and identify trends that indicate potential issues. This level of observability ensures that your system remains reliable and efficient.&lt;/p&gt;

&lt;p&gt;Alerting systems play a critical role in monitoring. They notify you when performance metrics deviate from expected values. These alerts prioritize critical issues, helping you focus on resolving the most urgent problems. By integrating observability and monitoring tools, you enhance your system's performance and maintain its stability.&lt;/p&gt;

&lt;p&gt;Incorporating these tools into your anomaly detection system ensures proactive management. You can address issues before they impact your operations. This approach improves overall system performance and reduces the risk of unexpected failures.&lt;/p&gt;

&lt;h2&gt;
  
  
  Step-by-Step Guide to Building the System
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Setting Up Alibaba Cloud Elasticsearch
&lt;/h3&gt;

&lt;p&gt;To begin, you need to set up an Alibaba Cloud Elasticsearch cluster. Follow these steps to ensure a smooth setup process:&lt;/p&gt;

&lt;p&gt;1）&lt;a href="https://www.alibabacloud.com/blog/leveraging-alibaba-cloud-elasticsearch-for-intelligent-data-detection-and-prediction_601165" rel="noopener noreferrer"&gt;Create an Elasticsearch cluster, preferably version 8.5&lt;/a&gt;, for optimal performance.&lt;/p&gt;

&lt;p&gt;2）Access the Kibana console of your cluster and add sample data for analysis.&lt;/p&gt;

&lt;p&gt;3）Set up an unsupervised machine learning task to analyze behaviors, such as web server access patterns.&lt;/p&gt;

&lt;p&gt;4）Create a supervised learning task to predict outcomes, like flight delays, using historical data.&lt;/p&gt;

&lt;p&gt;5）Evaluate the model's metrics to ensure reliability and accuracy.&lt;/p&gt;

&lt;p&gt;This setup forms the backbone of your anomaly detection system. It enables you to leverage aiops capabilities for real-time insights and efficient automation of monitoring tasks.&lt;/p&gt;

&lt;h3&gt;
  
  
  Configuring Data Ingestion Pipelines
&lt;/h3&gt;

&lt;p&gt;Data ingestion pipelines are essential for feeding your system with high-quality data. To configure these pipelines:&lt;/p&gt;

&lt;p&gt;1）Log in to the GlassFlow WebApp&lt;/p&gt;

&lt;p&gt;2）Create a new pipeline and configure a data source.&lt;/p&gt;

&lt;p&gt;3）Define the transformer to perform necessary data transformations.&lt;/p&gt;

&lt;p&gt;4）Configure a data sink to store the processed data.&lt;/p&gt;

&lt;p&gt;5）Confirm the pipeline and copy the credentials for integration.&lt;/p&gt;

&lt;p&gt;Ensure the data stream matches the expected format and arrives at the predefined rate. This step guarantees that your system processes accurate and consistent data, enhancing the effectiveness of aiops-driven automation.&lt;/p&gt;

&lt;h3&gt;
  
  
  Training and Deploying ML Models
&lt;/h3&gt;

&lt;p&gt;Training and deploying machine learning models is the core of anomaly detection. Use unsupervised learning models, such as Isolation Forest, to identify unusual patterns in your data. For predictive tasks, supervised learning models like regression or classification algorithms work best.&lt;/p&gt;

&lt;p&gt;Once trained, deploy the models within your Elasticsearch cluster. Use the Kibana console to monitor their performance and refine them as needed. This process ensures your system remains adaptive and capable of handling real-time anomaly detection with minimal manual intervention. Automation of these tasks further enhances the efficiency of your aiops strategy.&lt;/p&gt;

&lt;h3&gt;
  
  
  Implementing Real-Time Anomaly Detection
&lt;/h3&gt;

&lt;p&gt;To implement real-time anomaly detection, you need to integrate machine learning models into your system and configure them for continuous analysis. Start by deploying the trained models into your Alibaba Cloud Elasticsearch cluster. Use the Kibana interface to manage and monitor these models effectively.&lt;/p&gt;

&lt;p&gt;Set up real-time data streams to feed the system with live data. Tools like Logstash or Beats can help you collect and forward data from various sources. Ensure the data pipeline is robust and capable of handling high volumes without delays. This step ensures the system processes data in real time, enabling immediate anomaly detection.&lt;/p&gt;

&lt;p&gt;Next, configure detection rules and thresholds. These rules define what constitutes an anomaly in your system. For example, you might set thresholds for CPU usage or network traffic. The machine learning models will analyze incoming data against these rules to identify unusual patterns.&lt;/p&gt;

&lt;p&gt;Finally, integrate alerting mechanisms. Use Elasticsearch's built-in alerting features to notify you of incidents as they occur. Alerts can be sent via email, SMS, or integrated with third-party tools like Slack. This setup ensures you respond to incidents promptly, minimizing potential disruptions.&lt;/p&gt;

&lt;h3&gt;
  
  
  Monitoring and Optimizing the System
&lt;/h3&gt;

&lt;p&gt;Continuous monitoring and performance optimization are essential for maintaining an effective anomaly detection system. Use observability tools in Alibaba Cloud Elasticsearch to track system metrics like latency, throughput, and error rates. These metrics provide insights into the system's health and help you identify areas for improvement.&lt;/p&gt;

&lt;p&gt;Implement predictive and proactive monitoring to anticipate potential issues. For instance, analyze historical data to predict future incidents. This approach allows you to address problems before they impact your operations.&lt;/p&gt;

&lt;p&gt;Optimize the system by refining machine learning models and detection rules. Regularly evaluate model performance using metrics like precision and recall. Adjust thresholds and retrain models as needed to improve accuracy. Additionally, ensure your data pipelines remain efficient by monitoring their performance and addressing bottlenecks.&lt;/p&gt;

&lt;h2&gt;
  
  
  Benefits and Use Cases
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Advantages of Real-Time Anomaly Detection with Alibaba Cloud
&lt;/h3&gt;

&lt;p&gt;Real-time anomaly detection with Alibaba Cloud offers numerous advantages that enhance your IT operations. These benefits ensure your systems remain secure, efficient, and resilient. The following table highlights the key advantages:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Benefit Description&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Real-time Detection: Identifies various attacks and threats as they occur, ensuring timely responses to security risks.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;High-risk Operation Identification: Utilizes intelligent algorithms to pinpoint high-risk operations effectively.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Full Database Audit: Conducts real-time audits of all database activities to maintain security.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Abnormal Access Source Identification: Quickly detects new or unusual access sources to databases.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Custom Performance Dashboards: Supports tailored dashboards for performance monitoring and comparative analysis.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Automatic SQL Query Identification: Detects abnormal SQL queries and manages traffic to maintain business continuity.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Global Load Distribution Review: Automatically reviews SQL queries based on load distribution to enhance database stability.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;These features empower you to address security and threat detection challenges proactively. By leveraging Alibaba Cloud's capabilities, you can maintain robust observability and ensure uninterrupted IT performance.&lt;/p&gt;

&lt;h3&gt;
  
  
  Real-World Use Cases Across Industries
&lt;/h3&gt;

&lt;p&gt;Alibaba Cloud's real-time anomaly detection finds applications across various industries. It helps you tackle industry-specific challenges effectively. The table below illustrates some real-world use cases:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Industry&lt;/th&gt;
&lt;th&gt;Application Description&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Finance&lt;/td&gt;
&lt;td&gt;Detects fraudulent transactions and unusual trading activities.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Healthcare&lt;/td&gt;
&lt;td&gt;Monitors patient vitals for early signs of medical conditions.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Manufacturing&lt;/td&gt;
&lt;td&gt;Identifies equipment malfunctions to prevent downtime.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Cybersecurity&lt;/td&gt;
&lt;td&gt;Detects network intrusions and suspicious activities.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Retail&lt;/td&gt;
&lt;td&gt;Analyzes sales data to spot irregular purchasing patterns.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;These examples demonstrate how Alibaba Cloud supports diverse industries in improving observability and addressing IT challenges. Whether you aim to enhance cybersecurity or optimize manufacturing processes, real-time anomaly detection provides actionable insights.&lt;/p&gt;

&lt;h3&gt;
  
  
  Enhancing aiops with Alibaba Cloud Elasticsearch
&lt;/h3&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch significantly enhances aiops capabilities. Its &lt;a href="https://www.alibabacloud.com/blog/from-enterprise-rag-to-ai-assistant%E2%80%94alibaba-cloud-elasticsearch-technology-practice-in-ai-search_601952" rel="noopener noreferrer"&gt;AI Assistant leverages RAG technology and large language models to improve anomaly monitoring, alert handling, and data analysis.&lt;/a&gt; You can visualize complex data through statistical charts, making it easier to interpret even without prior knowledge of Elasticsearch query syntax.&lt;/p&gt;

&lt;p&gt;The AI Assistant plays a vital role in real-time monitoring. It identifies abnormal conditions, analyzes error logs, and proposes defense strategies against potential threats. These features ensure your IT systems remain secure and efficient. By integrating advanced observability tools, Alibaba Cloud Elasticsearch empowers you to automate monitoring and optimize system performance. This approach strengthens your aiops strategy, enabling you to address challenges with precision and agility.&lt;/p&gt;

&lt;p&gt;Building a real-time anomaly detection system with Alibaba Cloud Elasticsearch ML Modules equips you with powerful tools to monitor, analyze, and secure your IT operations. Observability plays a critical role in this process by offering:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Enhanced system stability&lt;/strong&gt;, which optimizes performance and boosts availability.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Faster troubleshooting&lt;/strong&gt;, reducing downtime through quick issue identification.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Scalability&lt;/strong&gt;, helping you adapt to changes and support growth.&lt;/p&gt;

&lt;p&gt;By leveraging these capabilities, you can ensure resilient and efficient systems. Start exploring Alibaba Cloud Elasticsearch ML Modules today to transform your IT operations with cutting-edge aiops and observability tools.&lt;/p&gt;

&lt;h2&gt;
  
  
  FAQ
&lt;/h2&gt;

&lt;h3&gt;
  
  
  What is real-time anomaly detection, and why is it important?
&lt;/h3&gt;

&lt;p&gt;Real-time anomaly detection identifies unusual patterns or behaviors in data as they occur. It helps you address potential issues immediately, preventing disruptions and ensuring system stability. This capability is crucial for maintaining efficient IT operations and enhancing overall system reliability.&lt;/p&gt;

&lt;h3&gt;
  
  
  How does Alibaba Cloud Elasticsearch support anomaly detection?
&lt;/h3&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch uses advanced machine learning models to analyze time series data. It detects outliers and unusual patterns in real time. The platform integrates seamlessly with observability tools, enabling you to monitor, analyze, and respond to anomalies effectively.&lt;/p&gt;

&lt;h3&gt;
  
  
  Can I automate incident management with Alibaba Cloud Elasticsearch?
&lt;/h3&gt;

&lt;p&gt;Yes, you can automate incident management and automation using Alibaba Cloud Elasticsearch. Its AI-powered tools, such as the AI Assistant, streamline anomaly detection and alerting. These features help you resolve incidents faster and improve system performance.&lt;/p&gt;

&lt;h3&gt;
  
  
  What industries benefit most from anomaly detection systems?
&lt;/h3&gt;

&lt;p&gt;Industries like finance, healthcare, manufacturing, and cybersecurity benefit significantly. For example, anomaly detection helps you identify fraudulent transactions, monitor patient vitals, prevent equipment failures, and detect network intrusions. These applications enhance operational efficiency and security.&lt;/p&gt;

&lt;h3&gt;
  
  
  How do I ensure my anomaly detection system remains effective?
&lt;/h3&gt;

&lt;p&gt;Regularly update your machine learning models and detection rules. Monitor system performance using observability tools. Analyze historical data to predict future incidents. These practices help you maintain an adaptive and reliable anomaly detection system&lt;/p&gt;

&lt;p&gt;If you want to learn more, &lt;a href="https://www.alibabacloud.com/en/product/elasticsearch?_p_lc=1&amp;amp;spm=a3c0i.7911826.6791778070.251.4b7e3870dFxOMi" rel="noopener noreferrer"&gt;please click it&lt;/a&gt;and have &lt;a href="https://account.alibabacloud.com/login/login.htm" rel="noopener noreferrer"&gt;a 30-day free trial&lt;/a&gt;.&lt;/p&gt;

</description>
      <category>machinelearning</category>
      <category>alibabacloud</category>
      <category>elasticsearch</category>
      <category>aiops</category>
    </item>
    <item>
      <title>What is RAG and how Alibaba Cloud Elasticsearch enhances AI search with retrieval-augmented generation</title>
      <dc:creator>A_Lucas</dc:creator>
      <pubDate>Tue, 25 Feb 2025 05:27:00 +0000</pubDate>
      <link>https://dev.to/a_lucas/what-is-rag-and-how-alibaba-cloud-elasticsearch-enhances-ai-search-with-retrieval-augmented-1l7c</link>
      <guid>https://dev.to/a_lucas/what-is-rag-and-how-alibaba-cloud-elasticsearch-enhances-ai-search-with-retrieval-augmented-1l7c</guid>
      <description>&lt;p&gt;Retrieval-augmented generation (RAG) combines the strengths of information retrieval and generative AI to create more accurate and context-aware responses. Unlike traditional generative models, which may rely on static or outdated data, &lt;a href="https://www.alibabacloud.com/en/solutions/generative-ai/rag?_p_lc=1from" rel="noopener noreferrer"&gt;RAG integrates real-time information&lt;/a&gt; diverse sources like articles, databases, and books. This approach ensures that the generated content remains relevant and reliable.&lt;/p&gt;

&lt;p&gt;You can see the impact of RAG in applications like AI search, where it enhances accuracy and relevance. By optimizing information retrieval and enabling real-time data integration, RAG addresses challenges in industries such as healthcare, finance, and customer service. Its ability to provide nuanced, contextually rich results makes it a game-changer for modern search technologies.&lt;/p&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch leverages RAG to redefine AI search capabilities, offering enterprises a powerful tool to meet evolving demands.&lt;/p&gt;

&lt;h2&gt;
  
  
  Understanding Retrieval-Augmented Generation (RAG)
&lt;/h2&gt;

&lt;h3&gt;
  
  
  What is RAG?
&lt;/h3&gt;

&lt;p&gt;Retrieval-augmented generation (RAG) is a cutting-edge approach that combines information retrieval with generative AI to produce accurate and contextually relevant responses. Unlike traditional generative AI models, which rely solely on pre-trained data, RAG retrieves up-to-date information from external sources. This retrieval mechanism ensures that the generated content is grounded in factual and current data, enhancing its reliability and relevance. By integrating retrieved information into the generation process, RAG delivers responses that are both precise and context-aware, making it a powerful tool for modern AI applications.&lt;/p&gt;

&lt;h3&gt;
  
  
  How RAG Works
&lt;/h3&gt;

&lt;h4&gt;
  
  
  The Retrieval Process
&lt;/h4&gt;

&lt;p&gt;The retrieval process in RAG involves several key steps to ensure the system gathers the most relevant information for your query. These steps include:&lt;/p&gt;

&lt;p&gt;1）Receiving your prompt or query.&lt;/p&gt;

&lt;p&gt;2）Searching for relevant information from external sources.&lt;/p&gt;

&lt;p&gt;3）Retrieving the most pertinent data to provide additional context.&lt;/p&gt;

&lt;p&gt;4）Augmenting your prompt with this enriched context.&lt;/p&gt;

&lt;p&gt;5）Submitting the enhanced prompt to a large language model (LLM).&lt;/p&gt;

&lt;p&gt;6）Delivering an improved and contextually accurate response to you.&lt;/p&gt;

&lt;p&gt;This structured process ensures that the retrieved information aligns with your query, enabling the system to provide grounded and reliable responses.&lt;/p&gt;

&lt;h4&gt;
  
  
  The Generation Process
&lt;/h4&gt;

&lt;p&gt;Once the retrieval process is complete, the generation phase begins. The system enhances the original prompt with the retrieved data, creating an enriched input for the LLM. This enriched prompt allows the generative AI to produce responses that are informed by the latest information. Techniques like post-retrieval processing with frozen LLMs and fine-tuning LLMs for RAG further optimize the generation process. These methods ensure that the generated content is natural, contextually relevant, and grounded in factual information, minimizing errors and improving user satisfaction.&lt;/p&gt;

&lt;h3&gt;
  
  
  Applications of RAG
&lt;/h3&gt;

&lt;h4&gt;
  
  
  AI-powered search engines
&lt;/h4&gt;

&lt;p&gt;&lt;a href="https://www.alibabacloud.com/blog/igniting-the-ai-revolution---a-journey-with-qwen-rag-and-langchain_600876?spm=a3c0i.28373366.4822918750.2.25da18318Wp8Lj" rel="noopener noreferrer"&gt;RAG&lt;/a&gt; has revolutionized AI-powered search engines by enabling them to deliver highly accurate and context-aware results. By combining retrieval and generation mechanisms, these search engines can process complex queries and provide responses that go beyond simple keyword matching. This capability makes them invaluable for industries requiring precise and reliable information retrieval, such as healthcare and finance.&lt;/p&gt;

&lt;h4&gt;
  
  
  Customer support and chatbots
&lt;/h4&gt;

&lt;p&gt;In customer support, RAG enhances chatbot functionalities by ensuring responses are based on accurate and relevant data. This approach reduces the risk of incorrect answers and minimizes AI hallucination. Chatbots powered by RAG can handle diverse customer queries effectively, leading to improved interaction quality and higher customer satisfaction. The integration of retrieval-augmented generation also boosts operational efficiency, making it a preferred choice for businesses.&lt;/p&gt;

&lt;h4&gt;
  
  
  Content creation and summarization
&lt;/h4&gt;

&lt;p&gt;RAG excels in content creation and summarization tasks by leveraging both structured and unstructured data. It can summarize lengthy documents, generate detailed reports, and provide responses in various formats, such as summaries or in-depth explanations. This flexibility makes RAG an essential tool for businesses needing comprehensive and accurate content generation.&lt;/p&gt;

&lt;h2&gt;
  
  
  Benefits of Retrieval-Augmented Generation in AI Search
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Access to Current and Relevant Information
&lt;/h3&gt;

&lt;p&gt;RAG ensures you always have access to fresh information by dynamically retrieving data from diverse sources. Unlike traditional methods that rely on static databases, RAG continuously updates its knowledge base. This capability is crucial in fields like healthcare and finance, where timely and accurate information is essential.&lt;/p&gt;

&lt;p&gt;1）RAG's architecture reduces the risk of generating outdated or misleading content.&lt;/p&gt;

&lt;p&gt;2）It retrieves the most recent and relevant material, ensuring responses remain factual and reliable.&lt;/p&gt;

&lt;p&gt;By leveraging this dynamic retrieval process, you can trust that the information provided aligns with the latest developments, enhancing the quality of your AI search results.&lt;/p&gt;

&lt;h3&gt;
  
  
  Improved Accuracy and Contextual Understanding
&lt;/h3&gt;

&lt;p&gt;RAG enhances the contextual quality of AI-generated responses by integrating external knowledge into the generation process. This approach ensures that even small pieces of information maintain their relevance and clarity.&lt;/p&gt;

&lt;p&gt;1）Contextual retrieval prevents the loss of critical details, which often leads to incomplete or inaccurate responses.&lt;/p&gt;

&lt;p&gt;2）By enriching the generative process with external data, RAG delivers results that are both precise and contextually grounded.&lt;/p&gt;

&lt;p&gt;This improved contextual understanding allows you to receive high-quality responses tailored to your specific queries, making your search experience more effective and satisfying.&lt;/p&gt;

&lt;h3&gt;
  
  
  Cost-Effectiveness Compared to Training Large Models
&lt;/h3&gt;

&lt;p&gt;RAG offers a cost-efficient alternative to training large AI models. Instead of investing heavily in computational resources for fine-tuning, you can focus on maintaining a robust retrieval infrastructure.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Approach&lt;/th&gt;
&lt;th&gt;Advantages&lt;/th&gt;
&lt;th&gt;Challenges&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;RAG&lt;/td&gt;
&lt;td&gt;Avoids resource-intensive fine-tuning; Cost mainly for retrieval infrastructure; Scales with evolving data&lt;/td&gt;
&lt;td&gt;Initial setup investment; Querying external databases may incur costs&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Fine-Tuning&lt;/td&gt;
&lt;td&gt;One-time training investment for well-defined tasks&lt;/td&gt;
&lt;td&gt;Requires significant computational resources; High-end GPUs needed; Cumulative costs for new tasks&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;By adopting RAG, you can achieve high-quality results without the financial burden of training and maintaining large-scale models. This makes it an ideal choice for businesses seeking scalable and cost-effective AI search solutions.&lt;/p&gt;

&lt;h3&gt;
  
  
  Scalability for Diverse Applications
&lt;/h3&gt;

&lt;p&gt;Retrieval-augmented generation (RAG) demonstrates exceptional scalability, making it suitable for a wide range of industries and use cases. Its ability to integrate real-time data retrieval with generative AI allows you to adapt it to diverse applications, ensuring efficiency and relevance in your operations.&lt;/p&gt;

&lt;p&gt;Many organizations have already leveraged RAG to address unique challenges. For example:&lt;/p&gt;

&lt;p&gt;1）&lt;strong&gt;Delivery support chatbot&lt;/strong&gt;: DoorDash uses RAG to power chatbots that assist independent contractors, improving response accuracy and resolving issues faster.&lt;/p&gt;

&lt;p&gt;2）&lt;strong&gt;Customer tech support&lt;/strong&gt;: LinkedIn integrates RAG with a knowledge graph to reduce customer service resolution times by 28.6%.&lt;/p&gt;

&lt;p&gt;3）&lt;strong&gt;Internal policies chatbot&lt;/strong&gt;: Bell employs RAG to provide employees with instant access to updated company policies.&lt;/p&gt;

&lt;p&gt;4）&lt;strong&gt;SQL query assistance&lt;/strong&gt;: Pinterest enhances user experience by guiding table selection through RAG-powered SQL query support.&lt;/p&gt;

&lt;p&gt;5）&lt;strong&gt;Industry classification&lt;/strong&gt;: Ramp standardizes customer classification using RAG with NAICS codes.&lt;/p&gt;

&lt;p&gt;6）&lt;strong&gt;Financial research&lt;/strong&gt;: Analysts rely on RAG to retrieve up-to-date market data and research reports efficiently.&lt;/p&gt;

&lt;p&gt;RAG’s scalability also extends to specialized fields like compliance management, technical documentation, and scientific research. Compliance officers can retrieve regulations and guidelines during audits, while developers use RAG to locate technical documentation and code snippets. Researchers benefit from quick access to recent studies and journals tailored to their queries.&lt;/p&gt;

&lt;p&gt;By adopting RAG, you can scale your AI capabilities to meet the demands of various industries. Its adaptability ensures that your business remains competitive, whether you aim to enhance customer support, streamline internal processes, or improve decision-making through data-driven insights. This versatility makes RAG an invaluable tool for modern enterprises.&lt;/p&gt;

&lt;h2&gt;
  
  
  Overview of Alibaba Cloud Elasticsearch
&lt;/h2&gt;

&lt;h3&gt;
  
  
  What is Alibaba Cloud Elasticsearch?
&lt;/h3&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch is a &lt;a href="https://www.alibabacloud.com/en/product/elasticsearch" rel="noopener noreferrer"&gt;fully managed service designed to enhance the capabilities&lt;/a&gt; of the open-source Elasticsearch platform. It offers advanced features that improve performance, cost efficiency, and scalability, making it an ideal choice for modern AI applications. By separating storage from computing, it optimizes kernel performance and accelerates data writing, ensuring stability even during high-concurrency operations.&lt;/p&gt;

&lt;p&gt;Key highlights of Alibaba Cloud Elasticsearch include:&lt;/p&gt;

&lt;p&gt;1）&lt;a href="https://www.alibabacloud.com/blog/what-is-elasticsearch_601014" rel="noopener noreferrer"&gt;Fully managed service with out-of-the-box functionality&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;2）100% compatibility with open-source Elasticsearch features.&lt;/p&gt;

&lt;p&gt;3）Integration with Elastic for advanced capabilities like security and machine learning.&lt;/p&gt;

&lt;p&gt;4）Cost-effective pay-as-you-go pricing model.&lt;/p&gt;

&lt;p&gt;5）Enhanced real-time log analysis and multi-dimensional data querying.&lt;/p&gt;

&lt;p&gt;This robust platform empowers you to handle complex AI search tasks with ease, ensuring high performance and reliability.&lt;/p&gt;

&lt;h3&gt;
  
  
  Key Features Supporting RAG
&lt;/h3&gt;

&lt;h4&gt;
  
  
  Scalability and High Performance
&lt;/h4&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch ensures scalability and high performance through innovative optimizations. By separating storage from computing, it manages resources efficiently, allowing you to scale operations seamlessly. The platform also accelerates high-concurrency data writing, enabling smooth handling of large-scale data.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Optimization Technique&lt;/th&gt;
&lt;th&gt;Description&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Hardware Acceleration&lt;/td&gt;
&lt;td&gt;Utilizes SIMD instructions for performance boosts through projects like Panama.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Quantization Techniques&lt;/td&gt;
&lt;td&gt;Minimizes resource footprint while maintaining quality.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;These features make Alibaba Cloud Elasticsearch a powerful tool for supporting retrieval-augmented generation in AI applications.&lt;/p&gt;

&lt;h4&gt;
  
  
  Real-time Data Retrieval
&lt;/h4&gt;

&lt;p&gt;Real-time data retrieval is a cornerstone of Alibaba Cloud Elasticsearch, enabling you to access the latest information for your RAG-based solutions. This capability ensures that your AI models provide accurate and up-to-date responses, overcoming the limitations of static databases.&lt;/p&gt;

&lt;p&gt;Key benefits include:&lt;/p&gt;

&lt;p&gt;1）Dynamic retrieval of the most recent and relevant material.&lt;/p&gt;

&lt;p&gt;2）Factually correct answers based on current data.&lt;/p&gt;

&lt;p&gt;3）Enhanced applications in fields like healthcare, finance, and legal consulting.&lt;/p&gt;

&lt;p&gt;With real-time retrieval, you can trust that your AI search results remain relevant and reliable.&lt;/p&gt;

&lt;h4&gt;
  
  
  Seamless Integration with AI Models
&lt;/h4&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch simplifies the integration of AI models, allowing you to &lt;a href="https://www.alibabacloud.com/blog/from-enterprise-rag-to-ai-assistant%E2%80%94alibaba-cloud-elasticsearch-technology-practice-in-ai-search_601952" rel="noopener noreferrer"&gt;load custom models directly into the cluster for end-to-end processing&lt;/a&gt;. This feature bridges search, ranking, and AI services, enabling advanced functionalities like semantic search and relevance ranking.&lt;/p&gt;

&lt;p&gt;Additional integration capabilities include:&lt;/p&gt;

&lt;p&gt;1）Support for diversified models and hybrid retrieval technologies.&lt;/p&gt;

&lt;p&gt;2）Management of custom transformer models for context-specific searches.&lt;/p&gt;

&lt;p&gt;3）Transition from traditional search to AI-powered semantic search.&lt;/p&gt;

&lt;p&gt;These integration features enhance the adaptability of your RAG-based solutions, ensuring they meet diverse business needs.&lt;/p&gt;

&lt;h3&gt;
  
  
  Why Alibaba Cloud Elasticsearch is Ideal for RAG
&lt;/h3&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch stands out as an ideal platform for implementing RAG-based solutions. Its &lt;a href="https://www.alibabacloud.com/blog/key-technologies-for-optimization-of-enterprise-level-rag_601723" rel="noopener noreferrer"&gt;high-performance semantic search capabilities&lt;/a&gt; allow you to retrieve and generate accurate, context-aware responses without relying on exact keyword matches. The platform also provides full-text answers to complex questions and enables personalized recommendations, making it suitable for a wide range of applications.&lt;/p&gt;

&lt;p&gt;By leveraging Alibaba Cloud Elasticsearch, you can unlock the full potential of retrieval-augmented generation, enhancing the accuracy, scalability, and efficiency of your AI search solutions.&lt;/p&gt;

&lt;h2&gt;
  
  
  How Alibaba Cloud Elasticsearch Enhances RAG-based AI Search
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Optimizing the Retrieval Process
&lt;/h3&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch optimizes the retrieval process in RAG by employing advanced techniques that improve performance and relevance. These optimizations ensure that your AI-powered applications deliver accurate and contextually rich results. The following table highlights key enhancements:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Optimization Type&lt;/th&gt;
&lt;th&gt;Description&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Hardware Acceleration&lt;/td&gt;
&lt;td&gt;Reduces query response time from 100ms to about 20ms, enabling faster vector retrieval.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Memory Optimization&lt;/td&gt;
&lt;td&gt;Cuts memory usage by 95% through vector quantization, improving indexing speed and efficiency.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Semantic Expansion&lt;/td&gt;
&lt;td&gt;Extends vocabulary with related concepts, enhancing semantic understanding.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Hybrid Search Strategies&lt;/td&gt;
&lt;td&gt;Combines text and vector search for improved relevance and user experience.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Ranking Mechanism&lt;/td&gt;
&lt;td&gt;Uses BM25 for initial ranking and refined models for secondary ranking to ensure top results.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Model Integration&lt;/td&gt;
&lt;td&gt;Allows seamless loading of custom models for end-to-end processing within the cluster.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;These features ensure that your retrieval-augmented generation workflows operate efficiently, providing precise and relevant data for the generation phase.&lt;/p&gt;

&lt;h3&gt;
  
  
  Enabling Real-time Data Updates
&lt;/h3&gt;

&lt;p&gt;Real-time data updates are critical for maintaining the accuracy and relevance of RAG-enabled search applications. Alibaba Cloud Elasticsearch empowers you to build &lt;a href="https://www.alibabacloud.com/blog/elasticsearch-8-how-to-use-hybrid-search-with-rag-technology_601526" rel="noopener noreferrer"&gt;AI-powered solutions that process data dynamically&lt;/a&gt;. Key benefits include:&lt;/p&gt;

&lt;p&gt;1）Automatic updates that enhance responsiveness and ensure data integrity.&lt;/p&gt;

&lt;p&gt;2）&lt;a href="https://www.alibabacloud.com/blog/a-deep-dive-into-elasticsearch-use-cases-on-alibaba-cloud_601222" rel="noopener noreferrer"&gt;Robust access control and security monitoring&lt;/a&gt; to protect sensitive information.&lt;/p&gt;

&lt;p&gt;3）Seamless integration with large language models for improved semantic understanding.&lt;/p&gt;

&lt;p&gt;The platform also supports hybrid retrieval strategies, combining multiple methods to refine search results. This capability ensures that your AI applications remain up-to-date and deliver factually correct responses, even in fast-changing industries like finance and healthcare.&lt;/p&gt;

&lt;h3&gt;
  
  
  Supporting Large-scale AI Applications
&lt;/h3&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch provides the scalability needed to support large-scale AI applications. You can develop &lt;a href="https://www.alibabacloud.com/blog/alibaba-cloud-ai-search-solution-explained-intelligent-search-driven-by-large-language-models-helping-enterprises-in-digital_601971" rel="noopener noreferrer"&gt;AI-driven search solutions that integrate seamlessly&lt;/a&gt; with large language models. These applications benefit from features like automatic updates, robust access control, and security monitoring. Whether you aim to enhance enterprise search capabilities or build intelligent customer service tools, the platform adapts to your needs.&lt;/p&gt;

&lt;p&gt;By leveraging these capabilities, you can scale your retrieval-augmented generation workflows to handle complex queries and vast datasets. This scalability ensures that your AI applications remain efficient and reliable, even as your business grows.&lt;/p&gt;

&lt;h3&gt;
  
  
  Enhancing Search Accuracy and Relevance
&lt;/h3&gt;

&lt;p&gt;Search accuracy and relevance are critical for delivering meaningful results in AI-powered applications. Alibaba Cloud Elasticsearch employs advanced techniques to ensure your retrieval-augmented generation (RAG) workflows consistently produce precise and contextually relevant outputs.&lt;/p&gt;

&lt;p&gt;One of the standout features is its &lt;strong&gt;ranking mechanism&lt;/strong&gt;, which combines traditional and modern approaches to refine search results. Initially, the system uses BM25 to calculate document weights based on term frequency and location. It then applies integrated learning models for secondary ranking, ensuring the most relevant results appear at the top. This dual-layered approach significantly enhances the quality of search outcomes.&lt;/p&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch also leverages a &lt;strong&gt;hybrid retrieval strategy&lt;/strong&gt;. By integrating text-based, sparse, and dense vector indexes, it improves retrieval precision and efficiency. This combination allows you to handle diverse query types, from simple keyword searches to complex semantic queries, with ease.&lt;/p&gt;

&lt;p&gt;Another key capability is &lt;strong&gt;intent understanding&lt;/strong&gt;. The platform analyzes user queries to identify their underlying intent, optimizing result sorting and ensuring the content aligns with user expectations. This feature is particularly valuable in applications like customer support, where accurate responses are essential.&lt;/p&gt;

&lt;p&gt;The platform’s performance improvements further highlight its effectiveness. In knowledge base Q&amp;amp;A scenarios, Alibaba Cloud Elasticsearch achieved a remarkable &lt;a href="https://www.alibabacloud.com/blog/from-enterprise-rag-to-ai-assistant%E2%80%94alibaba-cloud-elasticsearch-technology-practice-in-ai-search_601952" rel="noopener noreferrer"&gt;accuracy increase from 48% to over 95%&lt;/a&gt;. This leap underscores its ability to deliver reliable and relevant results across various use cases.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Technique&lt;/th&gt;
&lt;th&gt;Description&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Ranking Mechanism&lt;/td&gt;
&lt;td&gt;Utilizes BM25 to determine document weights based on frequency and location, followed by a secondary ranking using integrated learning models.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Hybrid Retrieval Strategy&lt;/td&gt;
&lt;td&gt;Combines text, sparse, and dense vector indexes for improved retrieval precision and efficiency.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Intent Understanding&lt;/td&gt;
&lt;td&gt;Analyzes user queries to optimize result sorting, ensuring content relevance.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Performance Improvement&lt;/td&gt;
&lt;td&gt;Achieved a significant increase in accuracy from 48% to over 95% in knowledge base Q&amp;amp;A scenarios through advanced search techniques.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;By adopting these advanced techniques, you can ensure your RAG-based systems deliver accurate, relevant, and context-aware results. This capability empowers your business to meet user expectations and maintain a competitive edge in AI-driven search applications.&lt;/p&gt;

&lt;p&gt;Retrieval-augmented generation (RAG) has transformed AI search by integrating external knowledge for precise, contextually relevant responses. Its ability to retrieve real-time data and provide diverse response formats makes it indispensable for industries requiring accuracy and adaptability. By combining retrieval and generative models, RAG enhances user satisfaction and operational efficiency.&lt;/p&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch amplifies these benefits with its advanced features. A &lt;a href="https://www.alibabacloud.com/blog/from-enterprise-rag-to-ai-assistant%E2%80%94alibaba-cloud-elasticsearch-technology-practice-in-ai-search_601952" rel="noopener noreferrer"&gt;5X improvement in vector performance&lt;/a&gt; reduces query response times, while memory optimization minimizes resource usage without compromising quality. The hybrid search strategy and seamless integration with large language models ensure accurate, scalable, and cost-effective solutions. Businesses can leverage these capabilities to improve search outcomes and streamline operations.&lt;/p&gt;

&lt;p&gt;Explore Alibaba Cloud Elasticsearch to unlock the full potential of RAG and elevate your AI search applications.&lt;/p&gt;

&lt;h2&gt;
  
  
  FAQ
&lt;/h2&gt;

&lt;h3&gt;
  
  
  What is the difference between retrieval-augmented generation and semantic search?
&lt;/h3&gt;

&lt;p&gt;Retrieval-augmented generation combines information retrieval with generative AI to create context-aware responses. Semantic search focuses on understanding user intent and meaning in queries. While both enhance AI search algorithms, retrieval-augmented generation integrates real-time data, whereas semantic search relies on natural language processing to improve relevance.&lt;/p&gt;

&lt;h3&gt;
  
  
  How do large language models contribute to retrieval-augmented generation?
&lt;/h3&gt;

&lt;p&gt;Large language models process enriched prompts created by the retrieval mechanism. They generate responses informed by real-time data, ensuring accuracy and contextual relevance. Their ability to understand and generate human-like text makes them essential for retrieval-augmented generation workflows.&lt;/p&gt;

&lt;h3&gt;
  
  
  Why is real-time data important in AI search algorithms?
&lt;/h3&gt;

&lt;p&gt;Real-time data ensures responses remain accurate and up-to-date. This is crucial for industries like healthcare and finance, where outdated information can lead to errors. Retrieval-augmented generation uses real-time data to enhance natural language processing and improve the relevance of search results.&lt;/p&gt;

&lt;h3&gt;
  
  
  How does Alibaba Cloud Elasticsearch support semantic search?
&lt;/h3&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch integrates advanced natural language processing techniques and hybrid retrieval strategies. It combines text-based and vector-based searches to improve semantic understanding. This ensures accurate and contextually relevant results, making it ideal for applications requiring precise query handling.&lt;/p&gt;

&lt;h3&gt;
  
  
  What industries benefit most from retrieval-augmented generation?
&lt;/h3&gt;

&lt;p&gt;Industries like healthcare, finance, and customer service benefit significantly. Retrieval-augmented generation provides accurate, context-aware responses by leveraging large language models and real-time data. This improves decision-making, operational efficiency, and user satisfaction across diverse applications.&lt;/p&gt;

&lt;p&gt;If you want to learn more, &lt;a href="https://www.alibabacloud.com/en/product/elasticsearch?_p_lc=1&amp;amp;spm=a3c0i.7911826.6791778070.251.4b7e3870dFxOMi" rel="noopener noreferrer"&gt;please click it&lt;/a&gt; and have &lt;a href="https://account.alibabacloud.com/login/login.htm" rel="noopener noreferrer"&gt;a 30-day free trial&lt;/a&gt;.&lt;/p&gt;

</description>
      <category>rag</category>
      <category>elasticsearch</category>
      <category>ai</category>
      <category>alibabacloud</category>
    </item>
    <item>
      <title>Mastering Vector Search: How Alibaba Cloud’s Inference API Enhances Elasticsearch</title>
      <dc:creator>A_Lucas</dc:creator>
      <pubDate>Fri, 21 Feb 2025 06:55:31 +0000</pubDate>
      <link>https://dev.to/a_lucas/mastering-vector-search-how-alibaba-clouds-inference-api-enhances-elasticsearch-57ac</link>
      <guid>https://dev.to/a_lucas/mastering-vector-search-how-alibaba-clouds-inference-api-enhances-elasticsearch-57ac</guid>
      <description>&lt;p&gt;Vector search has revolutionized how modern applications handle data retrieval. By leveraging advanced indexing techniques, it significantly improves both speed and accuracy. Industries like e-commerce, healthcare, and education now rely on vector search to deliver personalized recommendations, and create intelligent learning platforms. &lt;/p&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch takes this innovation further by integrating with the Inference API. This integration enables you to utilize dense and sparse embeddings, enhancing the relevance of search results. With features like semantic reranking and support for complex queries, AlibabaCloud AI Search empowers you to build smarter, AI-driven applications.&lt;/p&gt;

&lt;h2&gt;
  
  
  Understanding Vector Search
&lt;/h2&gt;

&lt;h3&gt;
  
  
  What is Vector Search?
&lt;/h3&gt;

&lt;p&gt;Vector search is a powerful method for retrieving information by analyzing the relationships between data points in a multi-dimensional space. Unlike traditional keyword-based searches, vector search relies on mathematical representations called vectors. These vectors capture the semantic meaning of data, enabling more accurate and context-aware results.&lt;/p&gt;

&lt;p&gt;The process involves several key steps:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Vectorization&lt;/strong&gt;: Models like Word2Vec or BERT convert items into numerical vectors.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Indexing&lt;/strong&gt;: The system organizes these vectors into a structure optimized for efficient searching.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Query Vectorization&lt;/strong&gt;: Search queries are transformed into vectors for comparison.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Similarity Search&lt;/strong&gt;: The system compares query vectors to indexed vectors using distance metrics.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Ranking&lt;/strong&gt;: Results are ordered based on their similarity to the query vector.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Retrieval&lt;/strong&gt;: The top-ranked items are returned as search results.&lt;/p&gt;

&lt;p&gt;This approach powers modern applications like semantic search, recommendation systems, and image retrieval, making it a cornerstone of AI search technologies.&lt;/p&gt;

&lt;h3&gt;
  
  
  Sparse vs. Dense Vector Search
&lt;/h3&gt;

&lt;p&gt;Sparse and dense vector search differ in how they represent and process data. The table below highlights their key distinctions:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Feature&lt;/th&gt;
&lt;th&gt;Sparse Vector Search&lt;/th&gt;
&lt;th&gt;Dense Vector Search&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Dimensionality&lt;/td&gt;
&lt;td&gt;High, with many zero values&lt;/td&gt;
&lt;td&gt;Lower, with few or no zero values&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Information Richness&lt;/td&gt;
&lt;td&gt;Less semantically rich&lt;/td&gt;
&lt;td&gt;Semantically rich, capturing content meaning&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Similarity Measurement&lt;/td&gt;
&lt;td&gt;Based on keyword matches and frequency&lt;/td&gt;
&lt;td&gt;Uses algorithms like cosine similarity&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Dense vector search, supported by platforms like AlibabaCloud AI Search, excels in capturing the deeper semantic meaning of data. This makes it ideal for applications requiring advanced text embedding services and contextual understanding.&lt;/p&gt;

&lt;h3&gt;
  
  
  Applications of Vector Search in Modern Use Cases
&lt;/h3&gt;

&lt;p&gt;Vector search has transformed industries by enabling smarter and more efficient data retrieval. Here are some notable applications:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;E-Commerce&lt;/strong&gt;:&lt;/p&gt;

&lt;p&gt;Personalized product recommendations based on user preferences.&lt;/p&gt;

&lt;p&gt;Visual search capabilities that let users find products using images.&lt;/p&gt;

&lt;p&gt;Enhanced search experiences through semantic understanding of natural language queries.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Healthcare&lt;/strong&gt;:&lt;/p&gt;

&lt;p&gt;Medical image search for diagnostics by comparing new images with historical cases.&lt;/p&gt;

&lt;p&gt;Retrieval of patient information from electronic health records using semantic analysis.&lt;/p&gt;

&lt;p&gt;Clinical decision support systems that recommend tests and treatments.&lt;/p&gt;

&lt;p&gt;By integrating vector search into platforms like Elasticsearch, you can unlock these capabilities and more. AlibabaCloud AI Search further enhances this by providing robust tools for embedding and analysis, making it a leader in AI-driven search solutions.&lt;/p&gt;

&lt;h2&gt;
  
  
  Integrating Alibaba Cloud’s Inference API with Elasticsearch
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Setting Up the Inference API
&lt;/h3&gt;

&lt;p&gt;To begin integrating the Inference API with Elasticsearch, you need to follow a structured setup process. This ensures seamless communication between the two platforms. Here are the steps:&lt;/p&gt;

&lt;p&gt;1）Create an endpoint in Elasticsearch by specifying the service as &lt;code&gt;alibabacloud-ai-search&lt;/code&gt;. Provide essential service settings, including workspace, host, service ID, and API keys.&lt;/p&gt;

&lt;p&gt;2）Use a &lt;code&gt;PUT&lt;/code&gt; request to configure the text embedding endpoint with the required parameters.&lt;/p&gt;

&lt;p&gt;3）Verify the endpoint creation by checking the response from Elasticsearch.&lt;/p&gt;

&lt;p&gt;4）Test the endpoint by issuing a &lt;code&gt;POST&lt;/code&gt; request to perform inference and generate embeddings.&lt;/p&gt;

&lt;p&gt;This setup allows you to leverage the full potential of AlibabaCloud AI Search for advanced vector search capabilities.&lt;/p&gt;

&lt;h3&gt;
  
  
  Creating Inference Endpoints
&lt;/h3&gt;

&lt;p&gt;Creating inference endpoints is a critical step in enabling Elasticsearch to utilize Alibaba’s AI services. Follow these steps to create and test your endpoint:&lt;/p&gt;

&lt;p&gt;1）Define the service as &lt;code&gt;alibabacloud-ai-search&lt;/code&gt; and provide the necessary settings, such as workspace, host, service ID, and API keys.&lt;/p&gt;

&lt;p&gt;2）&lt;a href="https://www.alibabacloud.com/blog/elasticsearch-open-inference-api-adds-support-for-alibaba-cloud-ai-search_601634" rel="noopener noreferrer"&gt;Use the following command to create a text embedding endpoint:&lt;/a&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;PUT _inference/text_embedding/ali_ai_embeddings
{
    "service": "alibabacloud-ai-search",
    "service_settings": {
        "api_key": "&amp;lt;api_key&amp;gt;",
        "service_id": "ops-text-embedding-001",
        "host": "xxxxx.platform-cn-shanghai.opensearch.aliyuncs.com",
        "workspace": "default"
    }
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;3）Confirm the endpoint creation by reviewing the response from Elasticsearch.&lt;/p&gt;

&lt;p&gt;4）Test the endpoint by calling the perform inference API with:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;POST _inference/text_embedding/ali_ai_embeddings
{
    "input": "What is Elastic?"
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;These steps ensure that your Elasticsearch instance is ready to handle advanced AI-driven tasks.&lt;/p&gt;

&lt;h3&gt;
  
  
  Configuring Alibaba Cloud Elasticsearch for Vector Search
&lt;/h3&gt;

&lt;p&gt;After creating an inference endpoint, configure Alibaba Cloud Elasticsearch to optimize it for vector search. Start by ensuring that your Elasticsearch instance supports vector-based indexing. Use the aliyun-knn plugin to enable efficient similarity searches. Configure the dimensions and similarity measures to align with your use case.&lt;/p&gt;

&lt;p&gt;Next, integrate the embeddings generated by the Inference API into your Elasticsearch vector database. This allows you to perform semantic searches, reranking, and other advanced operations. Test the configuration by issuing queries and validating the results. AlibabaCloud AI Search simplifies this process, enabling you to focus on building intelligent applications.&lt;/p&gt;

&lt;p&gt;By following these steps, you can unlock the full potential of Alibaba Cloud Elasticsearch for creating an indexing service instance tailored to your needs.&lt;/p&gt;

&lt;h3&gt;
  
  
  Testing and Validating the Integration
&lt;/h3&gt;

&lt;p&gt;Testing and validating the integration between Alibaba Cloud Elasticsearch and the inference API ensures that your setup functions as expected. This step is crucial for identifying potential issues and optimizing performance.&lt;/p&gt;

&lt;p&gt;1）&lt;strong&gt;Run Sample Queries&lt;/strong&gt;:&lt;br&gt;
Start by executing sample queries against your configured inference endpoint. Use a variety of inputs to test the system's ability to generate embeddings and return relevant results. For example, you can input a query like:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;POST _inference/text_embedding/ali_ai_embeddings
{
    "input": "Find similar products to this item."
}
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Observe the response to ensure the embeddings align with your expectations.&lt;/p&gt;

&lt;p&gt;2）&lt;strong&gt;Validate Search Results&lt;/strong&gt;:&lt;br&gt;
Perform searches using the vector database in Elasticsearch. Compare the returned results with the expected outcomes. If the results lack relevance, revisit your embedding model or similarity measures to refine the configuration.&lt;/p&gt;

&lt;p&gt;3）&lt;strong&gt;Monitor Performance Metrics&lt;/strong&gt;:&lt;br&gt;
Use Elasticsearch's built-in monitoring tools to track key performance indicators like query latency and throughput. These metrics help you identify bottlenecks and ensure the system meets your application's requirements.&lt;/p&gt;

&lt;p&gt;4）&lt;strong&gt;Iterate and Optimize&lt;/strong&gt;:&lt;br&gt;
Based on your findings, fine-tune the embeddings, similarity measures, or indexing configurations. This iterative process ensures your vector search implementation remains robust and efficient.&lt;/p&gt;

&lt;p&gt;By thoroughly testing and validating the integration, you can confidently deploy a reliable and high-performing vector search solution powered by Alibaba Cloud Elasticsearch.&lt;/p&gt;

&lt;h2&gt;
  
  
  Practical Use Cases of Enhanced Vector Search
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Image Search and Retrieval
&lt;/h3&gt;

&lt;p&gt;Enhanced vector search has transformed how you can perform image search and retrieval. By representing visual content as vectors, systems can identify similar patterns with remarkable accuracy. This capability is particularly impactful in e-commerce. For instance:&lt;/p&gt;

&lt;p&gt;1）Users can upload images to find visually similar products, improving their shopping experience.&lt;/p&gt;

&lt;p&gt;2）Online furniture retailers can match uploaded images of couches with similar items in their inventory.&lt;/p&gt;

&lt;p&gt;Platforms like AlibabaCloud AI Search leverage this technology to deliver precise results. Additionally, vector search powers advanced systems like Google Images, enabling them to analyze visual patterns effectively. These applications highlight how vector search enhances user engagement and satisfaction in image-based searches.&lt;/p&gt;

&lt;h3&gt;
  
  
  Geospatial Analysis and Location-Based Applications
&lt;/h3&gt;

&lt;p&gt;Geospatial analysis has become a cornerstone for applications that rely on location-based data. By leveraging vector search, you can unlock advanced geospatial capabilities, enabling precise and efficient handling of geospatial queries. Alibaba Cloud Elasticsearch, with its support for geo_point data types, empowers you to build intelligent systems that extract actionable location-based insights.&lt;/p&gt;

&lt;p&gt;When working with geospatial data, you often need to process and analyze geo_point coordinates. These coordinates represent specific locations on the Earth's surface, such as latitude and longitude. For example, you can use geo_point fields to store the location of a delivery address, a retail store, or a user’s current position. With Alibaba Cloud Elasticsearch, you can index and query this data seamlessly, enabling real-time geospatial analysis.&lt;/p&gt;

&lt;p&gt;Vector search enhances geospatial queries by enabling similarity-based searches. Instead of relying solely on traditional distance calculations, you can use embeddings to capture the contextual relationships between locations. This approach is particularly useful for applications like ride-hailing services, where matching drivers to riders requires analyzing both proximity and contextual factors like traffic patterns or preferred routes.&lt;/p&gt;

&lt;p&gt;Location-based insights derived from geospatial analysis can transform industries. In logistics, you can optimize delivery routes by analyzing geo_point data for warehouses and customer addresses. In retail, you can identify high-performing store locations by correlating sales data with customer demographics. By integrating Alibaba Cloud Elasticsearch, you gain the tools to process location-based data efficiently and extract meaningful insights.&lt;/p&gt;

&lt;p&gt;By combining vector search with geospatial capabilities, you can create innovative solutions that leverage the power of location-based data. Whether you’re building a navigation app or a recommendation system, Alibaba Cloud Elasticsearch provides the foundation for success.&lt;/p&gt;

&lt;h2&gt;
  
  
  Advanced Features and Optimization
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Performance Tuning for Vector Search
&lt;/h3&gt;

&lt;p&gt;Optimizing vector search in Elasticsearch ensures faster and more accurate results. You can achieve this by implementing several key strategies:&lt;/p&gt;

&lt;p&gt;1）&lt;strong&gt;Reduce Vector Dimensions&lt;/strong&gt;: Simplify data structures using techniques like PCA or UMAP. This reduces computational complexity and speeds up searches.&lt;/p&gt;

&lt;p&gt;2）&lt;strong&gt;Index Efficiently&lt;/strong&gt;: Use Approximate Nearest Neighbor (ANN) algorithms such as HNSW or FAISS to enhance indexing performance.&lt;/p&gt;

&lt;p&gt;3）&lt;strong&gt;Batch Queries&lt;/strong&gt;: Process multiple queries in a single request to minimize overhead and improve throughput.&lt;/p&gt;

&lt;p&gt;4）&lt;strong&gt;Use Caching&lt;/strong&gt;: Cache frequently accessed queries to reduce computational load and response times.&lt;/p&gt;

&lt;p&gt;5）&lt;strong&gt;Monitor Performance&lt;/strong&gt;: Regularly analyze metrics like query latency and throughput to identify bottlenecks and optimize configurations.&lt;/p&gt;

&lt;p&gt;These techniques ensure that your vector search implementation remains efficient, even as your dataset grows. Alibaba Cloud Elasticsearch provides robust tools to support these optimizations, enabling you to deliver high-performance search solutions.&lt;/p&gt;

&lt;h3&gt;
  
  
  Leveraging the aliyun-knn Plugin
&lt;/h3&gt;

&lt;p&gt;The &lt;a href="https://www.alibabacloud.com/blog/unlock-the-power-of-vector-search-alibaba-cloud-elasticsearch-with-the-aliyun-knn-plugin_601101" rel="noopener noreferrer"&gt;aliyun-knn plugin&lt;/a&gt;enhances vector search capabilities in Alibaba Cloud Elasticsearch by leveraging the Proxima vector library developed by Alibaba DAMO Academy. This plugin offers several advanced features:&lt;/p&gt;

&lt;p&gt;1）It supports distributed searches with multiple replica shards, ensuring scalability and fault tolerance.&lt;/p&gt;

&lt;p&gt;2）It enables real-time incremental synchronization and near-real-time searches, making it ideal for dynamic datasets.&lt;/p&gt;

&lt;p&gt;3）It supports algorithms like HNSW and Linear Search, which are effective for both small and large datasets.&lt;/p&gt;

&lt;p&gt;The plugin powers applications like image search, video fingerprinting, and recommendation systems. For example, Alibaba uses it in platforms like Pailitao and Taobao to deliver precise and efficient search results. By integrating this plugin, you can unlock advanced functionalities for machine learning and AI-driven applications.&lt;/p&gt;

&lt;h3&gt;
  
  
  Best Practices for Scalability and Efficiency
&lt;/h3&gt;

&lt;p&gt;Scalability and efficiency are critical for handling large-scale vector search implementations. Follow these best practices to ensure optimal performance:&lt;/p&gt;

&lt;p&gt;1）Choose a database that aligns with your application's requirementsand supports seamless integration.&lt;/p&gt;

&lt;p&gt;2）Optimize indexing by using techniques like inverted indexes or KD-trees to improve search operations.&lt;/p&gt;

&lt;p&gt;3）Implement distributed architectures to handle large datasets effectively. Horizontal scaling and sharding allow you to manage growing data volumes.&lt;/p&gt;

&lt;p&gt;4）Use quantization techniquesto reduce vector size and storage requirements.&lt;/p&gt;

&lt;p&gt;5）Cache frequently accessed queries to minimize system load and improve response times.&lt;/p&gt;

&lt;p&gt;6）Regularly monitor performance metrics and optimize indexes to maintain efficiency.&lt;/p&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch simplifies these processes with its built-in tools and support for distributed architectures. By following these strategies, you can build scalable and efficient vector search solutions tailored to your needs.&lt;/p&gt;

&lt;p&gt;Integrating Alibaba Cloud’s Inference API with Elasticsearch unlocks measurable benefits for vector search. You can achieve faster query response times, reduced memory utilization, and enhanced semantic understanding. For instance:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Metric&lt;/th&gt;
&lt;th&gt;Before Integration&lt;/th&gt;
&lt;th&gt;After Integration&lt;/th&gt;
&lt;th&gt;Improvement&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Query Response Time&lt;/td&gt;
&lt;td&gt;100ms&lt;/td&gt;
&lt;td&gt;20ms&lt;/td&gt;
&lt;td&gt;&lt;a href="https://www.alibabacloud.com/blog/from-enterprise-rag-to-ai-assistant%E2%80%94alibaba-cloud-elasticsearch-technology-practice-in-ai-search_601952" rel="noopener noreferrer"&gt;80% reduction&lt;/a&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Memory Utilization&lt;/td&gt;
&lt;td&gt;Original demand&lt;/td&gt;
&lt;td&gt;25% of original&lt;/td&gt;
&lt;td&gt;75% reduction&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Query Speed Improvement&lt;/td&gt;
&lt;td&gt;N/A&lt;/td&gt;
&lt;td&gt;2-5x faster&lt;/td&gt;
&lt;td&gt;N/A&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;This integration also provides a fully-managed service, balancing scale and performance while minimizing operational costs. By leveraging Alibaba Cloud Elasticsearch, you can focus on building innovative applications without worrying about infrastructure management.&lt;/p&gt;

&lt;p&gt;Start implementing this integration today to transform your search capabilities and stay ahead in the AI-driven era.&lt;/p&gt;

&lt;h2&gt;
  
  
  FAQ
&lt;/h2&gt;

&lt;h3&gt;
  
  
  What is the purpose of integrating the Inference API with Elasticsearch?
&lt;/h3&gt;

&lt;p&gt;The integration allows you to enhance search capabilities by leveraging AI-driven embeddings and semantic reranking. It improves the relevance and accuracy of search results, enabling advanced use cases like recommendation systems and semantic text search.&lt;/p&gt;

&lt;h3&gt;
  
  
  How does the aliyun-knn plugin improve vector search performance?
&lt;/h3&gt;

&lt;p&gt;The aliyun-knn plugin optimizes similarity searches by using efficient algorithms like HNSW. It supports distributed architectures and real-time synchronization, ensuring scalability and faster query responses for large datasets.&lt;/p&gt;

&lt;h3&gt;
  
  
  Can you use Alibaba Cloud Elasticsearch for hybrid search?
&lt;/h3&gt;

&lt;p&gt;Yes, you can combine dense and sparse embeddings to implement hybrid search. This approach enables you to handle both keyword-based and semantic queries, delivering more comprehensive search results.&lt;/p&gt;

&lt;h3&gt;
  
  
  What are the prerequisites for setting up the Inference API?
&lt;/h3&gt;

&lt;p&gt;You need an Alibaba Cloud account, an Elasticsearch instance, and API credentials. Ensure your Elasticsearch instance supports vector-based indexing and has the aliyun-knn plugin installed.&lt;/p&gt;

&lt;h3&gt;
  
  
  How do you monitor the performance of vector search?
&lt;/h3&gt;

&lt;p&gt;Use Elasticsearch’s built-in monitoring tools to track metrics like query latency and throughput. Regularly analyze these metrics to identify bottlenecks and optimize configurations for better performanc&lt;/p&gt;

</description>
      <category>vectorsearch</category>
      <category>elasticsearch</category>
      <category>api</category>
      <category>alibabacloud</category>
    </item>
    <item>
      <title>Singapore Meetup Invitation：Decoding the Future of AI Search</title>
      <dc:creator>A_Lucas</dc:creator>
      <pubDate>Wed, 19 Feb 2025 08:30:11 +0000</pubDate>
      <link>https://dev.to/a_lucas/singapore-meetup-invitationdecoding-the-future-of-ai-search-3mg5</link>
      <guid>https://dev.to/a_lucas/singapore-meetup-invitationdecoding-the-future-of-ai-search-3mg5</guid>
      <description>&lt;p&gt;Join Us for an Exclusive AI Search Meetup Hosted by Alibaba Cloud!&lt;/p&gt;

&lt;p&gt;We are excited to extend a warm invitation to you for an exclusive AI Search Meetup hosted by Alibaba Cloud. Dive deep into the latest advancements in AI-powered search technologies at this unique event.&lt;/p&gt;

&lt;p&gt;Our gathering will feature expert speakers from both Elastic and Alibaba Cloud, who will share their insights and experiences on harnessing AI to enhance search capabilities and drive innovation in the industry.&lt;/p&gt;

&lt;p&gt;For detailed information about the meetup, please see the image.&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi2%2FO1CN01CU36ec1eUnn15SipP_%21%216000000003875-0-tps-3126-9237.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi2%2FO1CN01CU36ec1eUnn15SipP_%21%216000000003875-0-tps-3126-9237.jpg" alt="img" width="800" height="2363"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;We look forward to welcoming you to an afternoon of knowledge sharing and networking!&lt;/p&gt;

&lt;p&gt;If you want to learn more, Please click &lt;a href="https://resource.alibabacloud.com/event/detail?id=7816" rel="noopener noreferrer"&gt;here.&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>elasticsearch</category>
      <category>alibabacloud</category>
      <category>todayisearched</category>
    </item>
    <item>
      <title>How to manage clusters in Alibaba Cloud Elasticsearch for beginners</title>
      <dc:creator>A_Lucas</dc:creator>
      <pubDate>Mon, 17 Feb 2025 05:26:36 +0000</pubDate>
      <link>https://dev.to/a_lucas/how-to-manage-clusters-in-alibaba-cloud-elasticsearch-for-beginners-3lm3</link>
      <guid>https://dev.to/a_lucas/how-to-manage-clusters-in-alibaba-cloud-elasticsearch-for-beginners-3lm3</guid>
      <description>&lt;p&gt;Managing clusters in Alibaba Cloud Elasticsearch involves overseeing the resources and configurations that power your search and analytics engine. You ensure that you can effectively manage clusters to operate efficiently, scale seamlessly, and remain secure. Effective cluster management offers several benefits: &lt;a href="https://www.alibabacloud.com/blog/alibaba-cloud-elasticsearch-performance-optimization_597092" rel="noopener noreferrer"&gt;faster query speeds through optimized configurations&lt;/a&gt;, improved processing power with high-spec servers, and elastic scaling for additional disk space or node upgrades. Security also improves as clusters stay isolated in VPCs, access is controlled via whitelists, and role-based access control strengthens identity verification. This tutorial simplifies the process, making it accessible even if you're just starting out.&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;Setting Up Your Alibaba Cloud Elasticsearch Cluster&lt;/strong&gt;
&lt;/h2&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Prerequisites for Cluster Setup&lt;/strong&gt;
&lt;/h3&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Alibaba Cloud account and permissions&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;Before creating an Elasticsearch cluster, you need an &lt;a href="https://www.alibabacloud.com/help/en/es/user-guide/create-an-alibaba-cloud-elasticsearch-cluster" rel="noopener noreferrer"&gt;Alibaba Cloud account&lt;/a&gt;. You can register for one through the official registration page. Ensure that your account has completed real-name verification. This step is essential for accessing Alibaba Cloud services.&lt;/p&gt;

&lt;p&gt;You also need the necessary permissions to manage resources. If you are part of a team, confirm that your account has the required roles assigned.&lt;/p&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Understanding billing options&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch offers &lt;a href="https://www.alibabacloud.com/help/en/es/product-overview/billing-methods/" rel="noopener noreferrer"&gt;two billing methods&lt;/a&gt;:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;strong&gt;Billing Method&lt;/strong&gt;&lt;/th&gt;
&lt;th&gt;&lt;strong&gt;Description&lt;/strong&gt;&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Subscription&lt;/td&gt;
&lt;td&gt;Requires an upfront fee and is more cost-effective  for long-term use.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Pay-as-you-go&lt;/td&gt;
&lt;td&gt;Charged hourly, suitable for short-term use or  testing, and can be released at any time.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Choose the billing method that aligns with your project needs. For example, if you are testing or experimenting with the ELK stack, the pay-as-you-go option provides flexibility.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Steps to Create an Alibaba Cloud Elasticsearch Cluster&lt;/strong&gt;
&lt;/h3&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Choosing the right Elasticsearch version&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;When creating an Elasticsearch instance, select a version compatible with your application. Alibaba Cloud Elasticsearch supports multiple versions, ensuring flexibility for different use cases. Always choose the latest stable version for optimal performance and security.&lt;/p&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Selecting instance types and storage&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;Alibaba Cloud Elasticsearch &lt;a href="https://alibaba-cloud.medium.com/alibaba-cloud-elasticsearch-whats-new-and-latest-features-48ee771cb28" rel="noopener noreferrer"&gt;separates storage from computing&lt;/a&gt;, reducing costs and improving performance. Select an instance type based on your workload. For example, a high-spec instance is ideal for data-intensive tasks.&lt;/p&gt;

&lt;p&gt;The following table shows how cluster configurations affect &lt;a href="https://www.alibabacloud.com/blog/alibaba-cloud-elasticsearch-performance-optimization_597092" rel="noopener noreferrer"&gt;response times&lt;/a&gt;:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;strong&gt;Cluster nodes&lt;/strong&gt;&lt;/th&gt;
&lt;th&gt;&lt;strong&gt;Average RT for 10 concurrent retrievals&lt;/strong&gt;&lt;/th&gt;
&lt;th&gt;&lt;strong&gt;Average RT for 50 concurrent retrievals&lt;/strong&gt;&lt;/th&gt;
&lt;th&gt;&lt;strong&gt;Average RT for 100 concurrent retrievals&lt;/strong&gt;&lt;/th&gt;
&lt;th&gt;&lt;strong&gt;Average RT for 200 concurrent retrievals&lt;/strong&gt;&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;77ms&lt;/td&gt;
&lt;td&gt;459ms&lt;/td&gt;
&lt;td&gt;438ms&lt;/td&gt;
&lt;td&gt;1001ms&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;3&lt;/td&gt;
&lt;td&gt;38ms&lt;/td&gt;
&lt;td&gt;103ms&lt;/td&gt;
&lt;td&gt;162ms&lt;/td&gt;
&lt;td&gt;298ms&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;10&lt;/td&gt;
&lt;td&gt;21ms&lt;/td&gt;
&lt;td&gt;36ms&lt;/td&gt;
&lt;td&gt;48ms&lt;/td&gt;
&lt;td&gt;81ms&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Initial Cluster Configurations&lt;/strong&gt;
&lt;/h3&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Configuring cluster name and region&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;When building an Elasticsearch cluster, assign a unique name to identify it easily. Choose a region close to your users to minimize latency. Specifying a zone is unnecessary during setup, simplifying the process.&lt;/p&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Setting up IP whitelists for access control&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;&lt;a href="https://www.alibabacloud.com/blog/securing-your-elasticsearch-clusters-comprehensive-access-control-methods_601326" rel="noopener noreferrer"&gt;IP whitelists enhance security&lt;/a&gt; by restricting access to specific IP addresses. Add your host's IP address to the whitelist to enable public network access. This step prevents unauthorized access and ensures your data remains secure.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Tip:&lt;/strong&gt; Regularly update your IP whitelist to reflect changes in your network configuration.&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;Managing Clusters in Alibaba Cloud Elasticsearch&lt;/strong&gt;
&lt;/h2&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Adding and Removing Nodes&lt;/strong&gt;
&lt;/h3&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;When and why to scale your cluster&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;Scaling your cluster ensures it can handle increasing workloads or optimize resource usage. You should add nodes when your cluster experiences high traffic, increased data ingestion, or slow query responses. Removing nodes may be necessary to reduce costs or reallocate resources. Maintaining at least two data nodes is essential for reliability. For multi-zone clusters, balance the number of nodes across zones to enhance stability.&lt;/p&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Steps to add or remove nodes&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;Follow these steps to manage nodes effectively:&lt;/p&gt;

&lt;h5&gt;
  
  
  &lt;strong&gt;Adding Nodes&lt;/strong&gt;:
&lt;/h5&gt;

&lt;p&gt;Use the Alibaba Cloud console to add nodes to your cluster.&lt;/p&gt;

&lt;p&gt;Ensure the new nodes have sufficient resources, such as memory and disk space.&lt;/p&gt;

&lt;p&gt;Elasticsearch automatically redistributes shards across the new nodes&lt;/p&gt;

&lt;h5&gt;
  
  
  &lt;strong&gt;Removing Nodes&lt;/strong&gt;:
&lt;/h5&gt;

&lt;p&gt;Verify that the cluster has enough resources to handle the workload after node removal.&lt;/p&gt;

&lt;p&gt;Use the GET _cat/indices?v command to check resource usage.&lt;/p&gt;

&lt;p&gt;Adjust the number of replica shards to avoid allocation errors.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;strong&gt;Consideration&lt;/strong&gt;&lt;/th&gt;
&lt;th&gt;&lt;strong&gt;Details&lt;/strong&gt;&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Resource Management&lt;/td&gt;
&lt;td&gt;&lt;a href="https://www.alibabacloud.com/help/en/es/support/faq-about-alibaba-cloud-elasticsearch-clusters" rel="noopener noreferrer"&gt;Ensure sufficient   resources after node removal to prevent shard allocation errors.&lt;/a&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Shard Allocation&lt;/td&gt;
&lt;td&gt;Verify that replica shards are fewer than data  nodes. Adjust replicas if needed.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Cluster Stability&lt;/td&gt;
&lt;td&gt;Maintain at least two data nodes. Balance nodes  across zones for multi-zone clusters.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Scaling Your Cluster&lt;/strong&gt;
&lt;/h3&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Vertical scaling (upgrading instance types)&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;&lt;a href="https://topic.alibabacloud.com/a/the-instance-displays-the-font-classtopic-s-color00c1deelasticsearchfont-cluster-ecosystem-shards-and-horizontal-scaling-_1_27_32662567.html" rel="noopener noreferrer"&gt;Vertical scaling involves upgrading the hardware of existing nodes.&lt;/a&gt; This method is ideal for consistent high-resource demand. For example, upgrading to a high-spec instance improves performance for data-intensive tasks. Vertical scaling is efficient but may require downtime during the upgrade process.&lt;/p&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Horizontal scaling (adding more nodes)&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;Horizontal scaling adds more nodes to distribute the workload. Elasticsearch's distributed architecture simplifies this process. This method is suitable for applications with varying traffic patterns or high availability needs. Horizontal scaling minimizes downtime and enhances workload distribution.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;strong&gt;Scenario&lt;/strong&gt;&lt;/th&gt;
&lt;th&gt;&lt;strong&gt;Preferred Scaling Type&lt;/strong&gt;&lt;/th&gt;
&lt;th&gt;&lt;strong&gt;Explanation&lt;/strong&gt;&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Traffic patterns&lt;/td&gt;
&lt;td&gt;Horizontal Scaling&lt;/td&gt;
&lt;td&gt;Adapts to varying traffic with the ability to add or  remove resources.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Resource efficiency&lt;/td&gt;
&lt;td&gt;Vertical Scaling&lt;/td&gt;
&lt;td&gt;Efficient for consistent high-resource demand by  boosting existing capabilities.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Application architecture&lt;/td&gt;
&lt;td&gt;Horizontal Scaling&lt;/td&gt;
&lt;td&gt;Suitable for applications designed to run on  multiple servers.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Downtime tolerance&lt;/td&gt;
&lt;td&gt;Horizontal Scaling&lt;/td&gt;
&lt;td&gt;Facilitates less downtime, ideal for high  availability needs.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Workload distribution&lt;/td&gt;
&lt;td&gt;Horizontal Scaling&lt;/td&gt;
&lt;td&gt;Excels in distributing workloads across multiple  nodes.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Configuring Security Settings&lt;/strong&gt;
&lt;/h3&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Updating IP whitelists&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;IP whitelists restrict access to specific IP addresses, enhancing security. Regularly update your whitelist to reflect changes in your network. Use the Alibaba Cloud console to add or remove IP addresses. This step ensures only authorized users can access your cluster.&lt;/p&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Enabling alert features for monitoring&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;Enable alert features to monitor your cluster's health and performance. Alerts notify you of critical events, such as high CPU usage or abnormal cluster statuses. Configure alerts for key metrics like disk usage and cluster health. This proactive approach helps you address issues before they escalate, ensuring your cluster operates efficiently.&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;Monitoring and Maintaining Your Cluster&lt;/strong&gt;
&lt;/h2&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Monitoring Cluster Health&lt;/strong&gt;
&lt;/h3&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Using the Alibaba Cloud console&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;Monitoring your cluster's health ensures optimal performance and stability. The Alibaba Cloud console provides tools to simplify this process:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://trendmicro.com/cloudoneconformity/knowledge-base/alibaba-cloud/AlibabaCloud-ACK/cluster-check.html" rel="noopener noreferrer"&gt;&lt;strong&gt;Cluster Check&lt;/strong&gt;&lt;/a&gt;: Automates health checks     to identify and resolve issues proactively. This feature minimizes downtime and enhances stability.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://www.alibabacloud.com/en/product/cloud-monitor?_p_lc=1" rel="noopener noreferrer"&gt;&lt;strong&gt;CloudMonitor&lt;/strong&gt;&lt;/a&gt;: Tracks metrics and     detects service availability. It enables you to monitor resource usage and health status while setting alarms for critical metrics.&lt;/p&gt;

&lt;p&gt;These tools help you stay informed about your cluster's condition and act promptly when issues arise.&lt;/p&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Key metrics to track (e.g., CPU, memory, disk usage)&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;Tracking key metrics is essential for maintaining your Elasticsearch cluster. Use the following table to understand the most critical metrics:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;strong&gt;Metric&lt;/strong&gt;&lt;/th&gt;
&lt;th&gt;&lt;strong&gt;Description&lt;/strong&gt;&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Cluster Status&lt;/td&gt;
&lt;td&gt;Indicates overall health (Red, Yellow, Green).&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Nodes&lt;/td&gt;
&lt;td&gt;Total number of nodes, including successful and  failed nodes.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;JVM Heap Usage&lt;/td&gt;
&lt;td&gt;Percentage of JVM heap memory used.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;CPU Usage&lt;/td&gt;
&lt;td&gt;Percentage of CPU used by Elasticsearch.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Disk Usage&lt;/td&gt;
&lt;td&gt;Percentage of disk space used.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Active Shards&lt;/td&gt;
&lt;td&gt;Number of active shards in the cluster.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Query Cache Hit Ratio&lt;/td&gt;
&lt;td&gt;Ratio of cache hits to total requests.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Monitoring these metrics helps you identify bottlenecks and optimize resource usage.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Managing Indices&lt;/strong&gt;
&lt;/h3&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Creating, deleting, and optimizing indices&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;Efficient index management improves query performance and reduces resource consumption. Follow these &lt;a href="https://www.alibabacloud.com/blog/alibaba-cloud-elasticsearch-performance-optimization_597092" rel="noopener noreferrer"&gt;best practices&lt;/a&gt;:&lt;/p&gt;

&lt;p&gt;1)Create one type per index and separate indexes for data with different fields. This approach avoids large index queries.&lt;/p&gt;

&lt;p&gt;2)Merge read-only indexes into larger segments to reduce fragmentation and memory usage.&lt;/p&gt;

&lt;p&gt;3)Disable historical data indexes that are not queried. This saves JVM memory.&lt;/p&gt;

&lt;p&gt;4)Use batch requests for better performance. Commit 5 MB to 15 MB of data at a time.&lt;/p&gt;

&lt;p&gt;These practices ensure your indices remain optimized and manageable.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Performing Backups&lt;/strong&gt;
&lt;/h3&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Setting up automated snapshots&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;&lt;a href="https://www.alibabacloud.com/help/en/es/user-guide/data-backup-overview" rel="noopener noreferrer"&gt;Automated snapshots&lt;/a&gt; protect your data and simplify recovery. Follow these steps to set up snapshots:&lt;/p&gt;

&lt;p&gt;1)&lt;a href="https://www.alibabacloud.com/blog/601188" rel="noopener noreferrer"&gt;Register a snapshot repository&lt;/a&gt; on Alibaba Cloud OSS.&lt;/p&gt;

&lt;p&gt;2)Configure the &lt;a href="https://www.alibabacloud.com/help/en/es/user-guide/create-manual-snapshots-and-restore-data-from-manual-snapshots" rel="noopener noreferrer"&gt;OSS bucket      with Standard storage class&lt;/a&gt; and Public Read ACL.&lt;/p&gt;

&lt;p&gt;3)Use Snapshot Lifecycle Management (SLM) to automate snapshot handling     and retention.&lt;/p&gt;

&lt;p&gt;This setup ensures your data remains secure and recoverable.&lt;/p&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Restoring data from backups&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;Restoring data from backups is straightforward. You can restore snapshots to the original cluster or a different one using a shared OSS repository. Follow these steps:&lt;/p&gt;

&lt;p&gt;1)Register a snapshot repository on Alibaba Cloud OSS.&lt;/p&gt;

&lt;p&gt;2)Create automatic or manual snapshots of your data.&lt;/p&gt;

&lt;p&gt;3)Use SLM for automated snapshot handling.&lt;/p&gt;

&lt;p&gt;4)Restore data to the desired cluster.&lt;/p&gt;

&lt;p&gt;This process ensures reliable data recovery and management.&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;Troubleshooting Common Issues in Alibaba Cloud Elasticsearch&lt;/strong&gt;
&lt;/h2&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Resolving Connection Issues&lt;/strong&gt;
&lt;/h3&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Diagnosing access problems&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;Connection issues can disrupt your ability to interact with your Elasticsearch cluster. Identifying the root cause is the first step to resolving these problems. &lt;a href="https://www.alibabacloud.com/help/en/es/support/faq-about-alibaba-cloud-elasticsearch-clusters" rel="noopener noreferrer"&gt;The table below outlines common causes and their solutions&lt;/a&gt;:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;strong&gt;Possible cause&lt;/strong&gt;&lt;/th&gt;
&lt;th&gt;&lt;strong&gt;Solution&lt;/strong&gt;&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;The Elasticsearch cluster cannot be accessed over  the Internet.&lt;/td&gt;
&lt;td&gt;Ensure the IP address of your device is whitelisted  and check network connectivity using ping or telnet commands.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;The Elasticsearch cluster cannot be accessed over an  internal network.&lt;/td&gt;
&lt;td&gt;Verify that the client is in the same VPC and test  connectivity with ping commands.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;The Elasticsearch cluster is unhealthy.&lt;/td&gt;
&lt;td&gt;Check the cluster's health status using the GET  _cat/health?v command and monitor resource usage.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Fixing IP whitelist misconfigurations&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;Misconfigured IP whitelists can block access to your cluster. Follow these steps to fix the issue:&lt;/p&gt;

&lt;p&gt;1)&lt;a href="https://www.alibabacloud.com/blog/securing-your-elasticsearch-clusters-comprehensive-access-control-methods_601326" rel="noopener noreferrer"&gt;Enable public      network access and add your host's IP address to the public IP address      whitelist.&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;2)Add the private IP address of your host to the cluster's private IP     address whitelist for internal access.&lt;/p&gt;

&lt;p&gt;3)Configure a whitelist for your host's IP address for Kibana access,     ensuring both public and private IPs are accounted for.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Tip:&lt;/strong&gt; Regularly update your whitelist to reflect changes in your network configuration.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Addressing Performance Problems&lt;/strong&gt;
&lt;/h3&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Identifying bottlenecks&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;&lt;a href="https://www.alibabacloud.com/blog/alibaba-cloud-elasticsearch-performance-optimization_597092" rel="noopener noreferrer"&gt;Performance bottlenecks often arise from disk read and write limitations.&lt;/a&gt; SSDs provide faster speeds compared to HDDs, making them a better choice for Elasticsearch clusters. The number of nodes and the configuration of indexes and replicas also significantly impact performance. Monitoring these factors helps you pinpoint bottlenecks.&lt;/p&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Solutions for high resource usage&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;High resource usage can slow down your cluster. &lt;a href="https://www.alibabacloud.com/blog/optimizing-high-cpu-usage-for-elasticsearch-on-alibaba-cloud_601213" rel="noopener noreferrer"&gt;Use these strategies to address the issue:&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Expand the cluster by adding more nodes or upgrading existing ones.&lt;/p&gt;

&lt;p&gt;Distribute bulk requests into smaller batches to reduce CPU strain.&lt;/p&gt;

&lt;p&gt;Cancel long-running searches using the tasks management API.&lt;/p&gt;

&lt;p&gt;Avoid resource-intensive searches, such as fuzzy or wildcard queries.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Note:&lt;/strong&gt; Optimizing your cluster's configuration ensures efficient resource utilization.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;Fixing Indexing Errors&lt;/strong&gt;
&lt;/h3&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Common causes of indexing failures&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;&lt;a href="https://www.alibabacloud.com/help/en/es/support/faq-about-common-errors" rel="noopener noreferrer"&gt;Indexing errors can occur due to several reasons:&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Node capacity issues during spikes in queries or write requests.&lt;/p&gt;

&lt;p&gt;Memory overload caused by excessive index cache usage.&lt;/p&gt;

&lt;p&gt;Low-spec cluster configurations.&lt;/p&gt;

&lt;p&gt;Disk usage exceeding 85%, preventing new shard allocation.&lt;/p&gt;

&lt;h4&gt;
  
  
  &lt;strong&gt;Steps to resolve indexing issues&lt;/strong&gt;
&lt;/h4&gt;

&lt;p&gt;Follow these steps to fix indexing problems:&lt;/p&gt;

&lt;p&gt;1)Run the POST /Index name/_cache/clear?fielddata=true command to clear     the cache for indexes.&lt;/p&gt;

&lt;p&gt;2)Use the GET /_cat/indices?v command to check shard distribution.&lt;/p&gt;

&lt;p&gt;3)Reduce write concurrency and delete invalid indexes to free up     resources.&lt;/p&gt;

&lt;p&gt;4)Upgrade the cluster configuration if issues persist..&lt;/p&gt;

&lt;p&gt;Managing clusters in Alibaba Cloud Elasticsearch becomes easier when you follow a structured approach. Start by creating an account, &lt;a href="https://www.alibabacloud.com/blog/how-to-create-an-alibaba-cloud-elasticsearch-cluster-and-log-on-to-the-kibana-console_600201" rel="noopener noreferrer"&gt;configuring your cluster&lt;/a&gt;, and ensuring proper access control. Use elastic scaling to handle growing workloads and &lt;a href="https://www.alibabacloud.com/blog/alibaba-cloud-elasticsearch-performance-optimization_597092" rel="noopener noreferrer"&gt;optimize performance with high-spec servers&lt;/a&gt;. Regular monitoring is essential for maintaining cluster health and preventing downtime. Proactive maintenance saves resources and ensures smooth operations. To deepen your knowledge, explore Alibaba's documentation on creating clusters, managing access, and using API operations. These resources help you master Elasticsearch cluster management and unlock its full potential.&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;FAQ&lt;/strong&gt;
&lt;/h2&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;What is the first step in installing Elasticsearch on Alibaba Cloud?&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;You need to create an Alibaba Cloud account and complete real-name verification. Afterward, access the Alibaba Cloud console to start creating a general-purpose business edition instance. This instance serves as the foundation for installing Elasticsearch and managing your cluster.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;How do you begin working with ELK on Alibaba Cloud?&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;Start by installing the ELK stack. This involves installing Elasticsearch, Logstash, and Kibana. Each component plays a role in data ingestion, storage, and visualization. Ensure you install the required Elasticsearch client to interact with your cluster effectively.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;What are the benefits of installing Kibana with Elasticsearch?&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;Kibana provides a user-friendly interface for visualizing data stored in Elasticsearch. It simplifies data analysis by offering dashboards, charts, and graphs. Installing Kibana enhances your ability to monitor and manage your Elasticsearch cluster.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;How do you troubleshoot issues when accessing the Alibaba instance?&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;Check your IP whitelist settings to ensure your device's IP address is authorized. Verify network connectivity using ping or telnet commands. If problems persist, review your cluster's health status and resource usage through the Alibaba Cloud console.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;strong&gt;What is the role of Logstash in the ELK stack?&lt;/strong&gt;
&lt;/h3&gt;

&lt;p&gt;Logstash processes and transforms data before sending it to Elasticsearch. Installing Logstash allows you to collect data from various sources, filter it, and store it efficiently. This ensures your Elasticsearch cluster receives clean and structured data.&lt;/p&gt;

</description>
      <category>cluster</category>
      <category>elasticsearch</category>
      <category>tutorial</category>
      <category>alibaba</category>
    </item>
    <item>
      <title>Alibaba Cloud AI Search Solution Explained: Intelligent Search Driven by Large Language Models, Helping Enterprises in Digital</title>
      <dc:creator>A_Lucas</dc:creator>
      <pubDate>Sat, 08 Feb 2025 02:09:29 +0000</pubDate>
      <link>https://dev.to/a_lucas/alibaba-cloud-ai-search-solution-explained-intelligent-search-driven-by-large-language-models-h14</link>
      <guid>https://dev.to/a_lucas/alibaba-cloud-ai-search-solution-explained-intelligent-search-driven-by-large-language-models-h14</guid>
      <description>&lt;p&gt;In the past year, the capabilities of large language models have been growing as the technology for large language models has skyrocketed. These have driven the rapid evolution of our search technology to the next generation, the AI search technology. The rapid development of large language models has not only reshaped the foundation of search technology, but also provided strong support for the digital transformation of various industries.&lt;/p&gt;

&lt;h1&gt;
  
  
  &lt;strong&gt;1） AI search technology trend&lt;/strong&gt;
&lt;/h1&gt;

&lt;p&gt;AI search technology has the following significant features:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Reconstruction:&lt;/strong&gt; One is the reconstruction of the technical aspects of AI search. the AI search technology is currently undergoing a comprehensive reconstruction based on the large language model, through which the whole chain of search capabilities have been reorganized, including the redefinition of the capabilities of text parsing, slicing and vectorization. Another is the reconstruction of the way of information access and the shape of the product. While traditional search relies on keyword matching, it now uses more natural language Q&amp;amp;A interactions, which brings new business scenarios, such as virtual digital people, Q&amp;amp;A for enterprise knowledge bases and intelligent customer service for e-commerce platforms.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;AI Infrastructure:&lt;/strong&gt; AI search technology has become an important part of AI native applications, including search vector retrieval, semantic search and retrieval generation technologies, which constitute the infrastructure for many AI applications. This not only improves the efficiency of data processing, but also enhances the user interaction experience with the system, helping enterprises realize more efficient information management and services.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Effectiveness enhancement:&lt;/strong&gt; Currently, the focus on effectiveness has reached an unprecedented level in both academia and industry. The addition of the large language model makes the search effect a qualitative leap compared with the traditional search. Users are able to obtain relevant information more quickly, improving the efficiency and accuracy of decision-making, especially in complex queries and deep information retrieval scenarios.&lt;/p&gt;

&lt;p&gt;However, the introduction of large language models also brings some troubles, especially the problem of phantom rate. In some scenarios that require very high answer accuracy, it becomes difficult to ensure the accuracy of the big model answer, which imposes constraints on enterprises and developers in the landing of AI search services.&lt;/p&gt;

&lt;h1&gt;
  
  
  &lt;strong&gt;2）The efforts and evolution of Alibaba Cloud AI search&lt;/strong&gt;
&lt;/h1&gt;

&lt;p&gt;With the evolution of the AI era, Alibaba Cloud has made corresponding efforts on the cloud. Alibaba Cloud's search products have historically been divided into two engines: one is the open source engine Elasticsearch, which cooperates with Elastic, and the other is Havenask, a self-developed search engine based on years of experience.&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi2%2FO1CN01S5JUIH28Fd8btKr3r_%21%216000000007903-0-tps-2772-1556.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi2%2FO1CN01S5JUIH28Fd8btKr3r_%21%216000000007903-0-tps-2772-1556.jpg" width="800" height="449"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  2.1 The evolution of the open source ecosystem:
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;2017:&lt;/strong&gt; Alibaba Cloud  has a strategic partnership with Elastic and a joint product release. Elasticsearch on Alibaba Cloud is what we are currently using.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2019:&lt;/strong&gt; We continue to target this fully managed ES product on the cloud, continuously improve the operation and maintenance management capabilities, enhance the intelligent elastic expansion and contraction and monitoring and alarm services, to ensure that customers can obtain stable and efficient services in the process of use, and reduce the cost of development. In addition, Alibaba Cloud has developed some new features based on the ES kernel. For example: support for index construction Indexing service to achieve write acceleration. Through self-developed storage engine Openstore, to help customers in the case of massive data, reduce our storage costs.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2022:&lt;/strong&gt; Serverless service release, to help small and medium-sized customers and large customers in the case of large amounts of data to reduce costs by 50%, optimize resource allocation.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2023:&lt;/strong&gt; Alibaba Cloud's products fully enter the field of AI search, starting with version 8.X. The vector search capability has been continuously enhanced. The ability characteristics have also been continuously enhanced.&lt;/p&gt;

&lt;h2&gt;
  
  
  2.2 Self-research program evolution:
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;2008:&lt;/strong&gt; Alibaba Cloud began to self-research an internal open source engine Havenask, is also the history of our support for Taobao, Tmall, including internal double eleven a lot of this more high concurrency, more extreme scenarios of a search engine.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2014:&lt;/strong&gt; Alibaba Cloud went to explore its commercialization in the cloud one after another. Scenario as the core to help courseware construction, to help customers build scenarios of intelligent search, to provide some industry templates and personalized programs. As well as in e-commerce, content education, games and other industries to do some personalized programs and effect enhancement.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2023:&lt;/strong&gt; After entering the era of large language models, Havenask is also the first one-stop intelligent Q&amp;amp;A RAG product in China, as well as multimodal RAG products. Until today, Alibaba Cloud has been continuously optimizing the capabilities of RAG and AI search based on the accumulation of internal depth.&lt;/p&gt;

&lt;h2&gt;
  
  
  2.3 The core focus of search products:
&lt;/h2&gt;

&lt;p&gt;From the past to the present, and even into the future, the core concerns of search products still revolve around three main areas of optimization:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Cost:&lt;/strong&gt;This may also be a very personal concern of many customers. Because in the era of large language models, we have a lot of this resource is mainly based on the GPU to complete the GPU itself is relatively expensive, so how to help customers reduce costs, is the subsequent evolution of the entire product is an important direction.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Performance:&lt;/strong&gt;This is mainly two points, one is the massive AI data writing and processing speed. One is the response speed of online queries. These two speeds should realize the performance of good experience of such a requirement, but also Alibaba Cloud products to help you solve a problem.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Effect:&lt;/strong&gt;Alibaba Cloud and many customers have done online or offline exchanges. Customers have some scenarios where they want the interactive ability of AI, but also want to ensure that the results are 100 percent accurate, and they can't introduce any illusion of a large model, and they can't let the large model do any fabrication. We are introducing the search link, and under the guarantee of accuracy, we are able to apply to the AI's ability to carry out a complete interaction. As for the specific effect optimization is as follows:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi4%2FO1CN01ij9Sxx24k50cMBS0u_%21%216000000007428-0-tps-3466-1854.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi4%2FO1CN01ij9Sxx24k50cMBS0u_%21%216000000007428-0-tps-3466-1854.jpg" width="800" height="427"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h1&gt;
  
  
  3）Alibaba Cloud AI Search Product Introduction
&lt;/h1&gt;

&lt;p&gt;Alibaba Cloud AI Search Open Platform provides five scenario-based products, including LLM Intelligent Q&amp;amp;A Edition, Log Retrieval Serverless Edition, Industry Algorithm Edition, Vector Retrieval Edition, and Graph Retrieval Edition. The bottom layer is based on a very large number of open source search engines, engines including the enterprise version of Elastcsearch, Alibaba's own engine Havenask and Milvus, etc., to facilitate support and docking. Through these products, users are able to realize an end-to-end search solution to quickly obtain the information they need.&lt;/p&gt;

&lt;p&gt;Currently, Alibaba Cloud AI Search provides customers with all the AI search capabilities as shown in the figure below:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi2%2FO1CN01X0ocdD1FwrexlLR4V_%21%216000000000552-0-tps-3534-1922.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi2%2FO1CN01X0ocdD1FwrexlLR4V_%21%216000000000552-0-tps-3534-1922.jpg" width="800" height="435"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  3.1 Alibaba Cloud AI Search Open Platform Product Overview
&lt;/h2&gt;

&lt;p&gt;AI Search Open Platform, as a flagship product of Alibaba Cloud search team, is a precipitated fruit of more than 20 years of experience, which includes almost all services that can be used in all aspects of the current Alibaba Cloud AI search field, aiming to provide users with powerful search capabilities.&lt;/p&gt;

&lt;p&gt;For customers who are familiar with development, using the platform API calls can achieve higher flexibility and speed. And for customers who want to deliver services quickly, Alibaba Cloud provides scenario-based products based on feedback. These products are categorized and developed for different business scenarios, and there is a corresponding product version for each scenario. Users only need to write data into the system to directly access the relevant results, and there is almost no need to write code.&lt;/p&gt;

&lt;h2&gt;
  
  
  3.2 Introduction to Alibaba Cloud ES version 8.15 features and application scenarios
&lt;/h2&gt;

&lt;p&gt;The latest version 8.15 of Alibaba Cloud ES is a vector-enhanced version based on the latest kernel. This version improves performance by more than five times compared to version 8.9, supports data quantization, and can significantly reduce memory storage costs by 75%. In addition, this version natively supports vector search without the need for plug-ins, and supports the multi-way fusion sorting algorithm, which improves the fusion effect of vector search and traditional search results. This version also supports seamless integration with the AI search open platform.&lt;/p&gt;

&lt;p&gt;In terms of search scenarios (RAG), Alibaba Cloud's ES can be widely used in the following scenarios:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Intelligent Customer Service:&lt;/strong&gt; Through natural language processing, intelligent customer service can quickly respond to customer queries and provide accurate information. For example, a customer can ask about refund policy or express delivery status, and the system can instantly give relevant answers to improve customer satisfaction.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Big Data Retriveal:&lt;/strong&gt; Enterprises can use AI search technology to build an internal knowledge base to help employees quickly find the information they need. This approach not only improves work efficiency, but also promotes knowledge sharing.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Vector Search:&lt;/strong&gt; in e-commerce platforms, users can get personalized product recommendations through AI search. The system is able to provide accurate product suggestions based on the user's historical behavior and preferences, increasing the conversion rate.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Smart Q&amp;amp;A:&lt;/strong&gt; In finance and other industries, AI search technology can handle complex queries, users can ask questions such as “how to apply for desktop computer”, the system will be based on historical data in the form of a table to output the relevant results. This ability allows users to compare and analyze data more intuitively, supporting more accurate investment decisions.&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi1%2FO1CN01d71ZKJ1G9EM1zSXY6_%21%216000000000579-0-tps-3326-1870.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi1%2FO1CN01d71ZKJ1G9EM1zSXY6_%21%216000000000579-0-tps-3326-1870.jpg" width="800" height="449"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h1&gt;
  
  
  4）Alibaba Cloud AI Search Program
&lt;/h1&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;4.1 A full-link solution to build a RAG system based on Alibaba Cloud Elastisearch&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;This scheme shows how to build a full-link RAG (Retrieval-Augmented Generation) system by utilizing the modeling services provided by Alibaba Cloud Elasticsearch and AI search open platform.&lt;/p&gt;

&lt;p&gt;First, customer data sources (e.g., PDF, Word documents) are imported into the system, and the information is recognized and extracted by document parsing services, followed by slicing and vectorization of the documents, and ultimately these vector data are stored in Alibaba Cloud ES for index construction.&lt;/p&gt;

&lt;p&gt;Offline data writing mainly involves customers importing documents (e.g., word, PPT, PDF) into the AI search open platform and recognizing and extracting them by calling the document parsing service. For long documents, the system will be cut, such as the use of subheadings or subheadings and other structures for semantic or document structure cut. Then, the sliced text is transformed into dense vectors or sparse vectors and stored in Alibaba Cloud ES to construct indexes, completing the construction of original text indexes and dense vector and sparse data indexes.&lt;/p&gt;

&lt;p&gt;When querying online, the user inputs a question, the internal service understands and expands the customer's question, then the query understanding service is introduced to determine the intent of the question and generate multiple possible question variants. Then, the problem is transformed into vector data, indexes are constructed, and multiplexed recall is performed to return Top N knowledge fragments. Subsequently, after rearranging the model, it is integrated into the larger model in a Prompt engineering manner for aggregation processing to form the final answer to the customer.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi1%2FO1CN01eucNjE1eoxDGTeVSE_%21%216000000003919-0-tps-3392-1802.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi1%2FO1CN01eucNjE1eoxDGTeVSE_%21%216000000003919-0-tps-3392-1802.jpg" width="800" height="425"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;In addition, with version 8.15 of Alibaba Cloud ES, AI semantic search models can be created using the Influence API. The demonstration includes steps such as selecting a generic commercial version, configuring visualization control, setting up access whitelisting, selecting a model service, obtaining configuration information, and creating and debugging the model in ES. This process demonstrates how to seamlessly integrate Alibaba Cloud AI model services, reduce the threshold of model usage, and realize functions such as semantic search.&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;4.2 Effectiveness Evaluation and Optimization&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;To ensure the effectiveness of the AI search solution, Alibaba Cloud also provides a full-link effect evaluation service. The platform can help customers evaluate the relevance, illusion rate and credibility of answer results in a three-dimensional way. This evaluation mechanism not only supports the effect test of a single service, but also supports the synchronized evaluation of multiple services, which improves the decision-making efficiency of developers.&lt;/p&gt;

&lt;p&gt;Through these evaluations, enterprises are able to identify problems and optimize in time to ensure the stability and accuracy of AI search services, thus improving user satisfaction.&lt;/p&gt;

&lt;h1&gt;
  
  
  &lt;strong&gt;Closing&lt;/strong&gt;
&lt;/h1&gt;

&lt;p&gt;Alibaba Cloud AI search solution version 8.15 adds the AI service center column and model management function on the original basis, supports one-click creation of AI open platform space and API, and maintains support for all the functions of version 8.13. 8.15 kernel also emphasizes its vector capability and AI search capability, and also supports basic application scenarios such as logging, which is suitable for customers with the need to reduce costs.&lt;/p&gt;

&lt;p&gt;In terms of billing, it flexibly supports pay-per-volume or annual and monthly packages, and the AI platform is billed on a per-call basis, offering the first 100 free calls.&lt;/p&gt;

&lt;p&gt;In addition, Alibaba Cloud has launched major promotional activities to help enterprises develop.&lt;/p&gt;

&lt;p&gt;Alibaba Cloud Elacticsearch version 8.15 Full specification 50% off on new purchases of annual payment pass-through!&lt;/p&gt;

&lt;p&gt;Alibaba Cloud Elacticsearch non-8.15 version, 2-core/4-core full specification new purchase annual payment pass through 50% off!&lt;/p&gt;

</description>
      <category>llm</category>
      <category>rag</category>
      <category>alibabacloud</category>
      <category>elasticsearch</category>
    </item>
    <item>
      <title>From Enterprise RAG to AI Assistant—Alibaba Cloud Elasticsearch Technology Practice in AI Search</title>
      <dc:creator>A_Lucas</dc:creator>
      <pubDate>Wed, 22 Jan 2025 08:28:33 +0000</pubDate>
      <link>https://dev.to/a_lucas/from-enterprise-rag-to-ai-assistant-alibaba-cloud-elasticsearch-technology-practice-in-ai-search-457e</link>
      <guid>https://dev.to/a_lucas/from-enterprise-rag-to-ai-assistant-alibaba-cloud-elasticsearch-technology-practice-in-ai-search-457e</guid>
      <description>&lt;h1&gt;
  
  
  &lt;strong&gt;01 Challenges in landing AI search&lt;/strong&gt;
&lt;/h1&gt;

&lt;p&gt;Over the past year, rapid iteration of base large language model technology has driven the evolution of AI search in the following ways:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Refactoring of the search technology chain&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The comprehensive reconstruction based on the large language model is reshaping the technology chain of AI search. Every link from data collection, document parsing, and vector retrieval to query analysis, intent recognition, ranking models, and knowledge graphs is undergoing profound changes. New interaction methods such as conversational search, answer summarization, intelligent customer service, enterprise digital employees and virtual people are gradually becoming mainstream, which not only improves the user experience, but also provides the possibility for more application scenarios.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;AI search as infrastructure&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;AI search has become one of the basic technologies for various AI applications. As a popular AI native application, it not only drives the development of knowledge-based AI applications, but also gradually becomes a built-in capability of major basic models. For example, techniques such as vector retrieval, retrieval-enhanced generation (RAG), and semantic search have been widely used in several domains. This integration trend enhances the adaptability and flexibility of AI search in different scenarios.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Bottlenecks Facing Effectiveness Improvement&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Although AI search has made significant progress in terms of effectiveness, the illusion problem is still a major constraint to its wide application, especially in business scenarios that require high knowledge accuracy. In addition, high cost and low privacy security controllability are also important challenges in the implementation process.&lt;/p&gt;

&lt;p&gt;To cope with these issues, Alibaba cloud Elasticsearch has introduced an innovative AI search solution that uses RAG technology to enhance the capabilities of all aspects of retrieval enhancement generation, and deeply integrates the enterprise version of the AI Assistant, applying RAG technology to the field of AIOps.&lt;/p&gt;

&lt;h1&gt;
  
  
  &lt;strong&gt;02 5X Improvement in Elasticsearch Vector Performance&lt;/strong&gt;
&lt;/h1&gt;

&lt;p&gt;The Elasticsearch vector engine continues to be optimized, with particular focus on performance and cost improvements. Initially, due to a common perception bias - that ES vector engines are powerful but may have performance shortcomings, especially for applications in the Java ecosystem - this view is gradually being overturned by its technical evolution. From the initial 8.0 release to the current 8.15 release, Elasticsearch has iterated and made significant progress in performance optimization, including but not limited to the effective integration of hardware acceleration technologies.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6biymc4pbi6yt0fi1c8t.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6biymc4pbi6yt0fi1c8t.png" alt="Image description" width="800" height="725"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Elasticsearch utilizes hardware acceleration technology to achieve significant performance leaps in the field of vector retrieval, especially when dealing with complex similarity computation tasks. This technological innovation is not limited to the theoretical level; it has been proven that the deep integration of hardware gas pedals has increased the efficiency of some computationally intensive operations by several times or more. For example, benchmark data from September 2022 to the present can be visualized that the query response time has been drastically reduced from the initial 100ms to about 20ms now, highlighting the huge performance improvement brought by the iterative upgrading of Elasticsearch's vector retrieval.&lt;/p&gt;

&lt;p&gt;Elasticsearch's memory optimization is also noteworthy. Through vector quantization technology, the memory required is only a quarter of the original demand, which greatly improves resource utilization. In the latest version, BBQ (Better Binary Quantization) brings a quantization leap to Elasticsearch, reducing float32 dimensions to bits, which reduces memory by about 95% while maintaining high ranking quality. BBQ outperforms traditional methods such as Product Quantization (PQ) in terms of indexing speed (20-30x reduction in quantization time) and query speed (2-5x improvement in query speed), with no additional loss of accuracy.&lt;/p&gt;

&lt;h1&gt;
  
  
  &lt;strong&gt;03 Comprehensive Explanation of Elasticsearch for the Enterprise AI Capabilities&lt;/strong&gt;
&lt;/h1&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi4%2FO1CN01Ayu61y1sF8rampCWg_%21%216000000005736-0-tps-3538-1822.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi4%2FO1CN01Ayu61y1sF8rampCWg_%21%216000000005736-0-tps-3538-1822.jpg" alt="image" width="800" height="411"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Semantic expansion and sparse vector representation:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Elasticsearch utilizes techniques such as sparse coding, which not only indexes the original vocabulary, but also effectively extends it to concepts or words related to it, with model-calculated weights attached to each extension, enhancing the depth and breadth of semantic understanding. This is due to the sparse vector technology, which stores information efficiently with a low memory footprint, compared to dense vectors that require full-memory indexing, which significantly improves resource efficiency.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Query efficiency and resource optimization:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The query process benefits from the inverted index structure, which avoids the overhead of vector similarity matching and accelerates the retrieval speed. In addition, Elasticsearch's sparse vectors reduce memory requirements, further optimizing resource utilization.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Hybrid Search Strategy:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Modern search requirements have motivated Elasticsearch to support multimodal queries, combining text, vector search, and rrf hybrid sorting methods to enhance the relevance and coverage of results. This hybrid search strategy recalls more diverse data and enhances the user experience.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Ranking and Relevance Adjustment:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;In order to accurately select the most relevant results from the large amount of data recalled, ES employs a ranking mechanism such as BM25 to initially determine the weights considering factors such as document frequency and location. Subsequently, a secondary ranking of the initially filtered documents is performed by integrated learning or a more refined model (e.g., Rerank stage) to ensure that the top results are highly relevant.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Model integration and native support:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Elasticsearch demonstrates powerful model integration capabilities, allowing users to load custom models directly into the cluster and run them, enabling end-to-end processing from input to output (e.g., word embedding generation) without external preprocessing steps. This not only simplifies the workflow, but also facilitates the seamless integration of machine learning models with the search engine, reinforcing the system's level of intelligence and adaptability.&lt;/p&gt;

&lt;h1&gt;
  
  
  &lt;strong&gt;04 Alibaba Cloud Elasticsearch accuracy has been increased to 95 percent&lt;/strong&gt;
&lt;/h1&gt;

&lt;p&gt;Relying on the powerful Elasticsearch foundation and based on the Alibaba Cloud AI search open platform, the Alibaba cloud Elasticsearch AI search product integrates diversified models and hybrid retrieval technologies to realize the leap from traditional search to AI semantic search. The solution forms a complete and efficient application framework for RAG scenarios through fine data preprocessing, intelligent vectorization, multi-dimensional search recall, and large model-assisted generation.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi2%2FO1CN01pr6Ggx1apdKur8Jfs_%21%216000000003379-0-tps-3558-1806.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi2%2FO1CN01pr6Ggx1apdKur8Jfs_%21%216000000003379-0-tps-3558-1806.jpg" width="800" height="406"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Document parsing and slicing:&lt;/strong&gt; Use self-developed models to identify unstructured data, extract key information, and ensure content integrity and semantic coherence.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Efficient vectorization:&lt;/strong&gt; Adopting parameter-optimized vector models to reduce costs while ensuring effectiveness and achieve efficient execution of the vectorization process.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;RRF Hybrid Retrieval Strategy:&lt;/strong&gt; Combining text, sparse and dense vector indexes to realize multi-way recall and significantly improve retrieval precision and efficiency.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Intent Understanding and Rearrangement Optimization:&lt;/strong&gt; Understand user intent through query analysis model, and work with the rearrangement model to fine-sort the results to ensure the relevance of the content.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Comprehensive Evaluation and Flexible Configuration:&lt;/strong&gt; AI search open platform provides one-stop service, including multiple model components, compatible with open source ecology, helping enterprises quickly build customized search systems.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi2%2FO1CN01dp7CgW1ZW6pte6uQc_%21%216000000003201-0-tps-3522-1928.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi2%2FO1CN01dp7CgW1ZW6pte6uQc_%21%216000000003201-0-tps-3522-1928.jpg" alt="image" width="800" height="437"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Through the comprehensive application of Alibaba cloud Elasticsearch AI search, the customer witnessed remarkable results in the knowledge base Q&amp;amp;A scenario, with the accuracy rate increasing from 48% initially to over 95% eventually. In addition, the combination of three-way hybrid search and re-ranking model further enhances the accuracy of search and guarantees the excellence of search experience.&lt;/p&gt;

&lt;h1&gt;
  
  
  &lt;strong&gt;05 AI Assistant Integrates Qwen LLM to Enable AIOps&lt;/strong&gt;
&lt;/h1&gt;

&lt;p&gt;Elasticsearch Enterprise Edition's AI Assistant incorporates RAG technology and Alibaba cloud's large language model services to provide an AIOps assistant for enterprises. This innovative tool shows excellent application potential in multiple areas such as universal search, observability analysis and security, and not only helps developers make significant progress in anomaly monitoring, alert handling, problem identification and diagnosis, data analysis and modeling, and query performance optimization, but also greatly improves work efficiency through a more intuitive and easy-to-use interactive interface.&lt;/p&gt;

&lt;p&gt;Especially in terms of observability, the AI Assistant can efficiently request, analyze, and visualize your data with the help of an automated function call mechanism, transforming it into information with practical operational value. In addition, a knowledge base based on Elastic Learned Sparse EncodeR (ELSER) support further enriches contextual information and suggestions from private datasets; while RAG technology combined with generalized large models ensures more accurate data understanding and expressiveness.&lt;/p&gt;

&lt;p&gt;After integrating the Qwen Model on the Alibaba Cloud AI Search Open Platform, Elasticsearch's AI Assistant pays special attention to activation function simulation calls to ensure seamless compatibility between different systems. This allows users to flexibly switch between multiple connectors according to specific needs, thus realizing efficient information retrieval and processing processes. Especially in microservice operation and maintenance scenarios, AI Assistant plays a crucial role - it not only monitors abnormal conditions and potential failure points in real time, but also analyzes detailed error logs and quickly locates the root cause of the problem in conjunction with existing operation and maintenance manuals. At the same time, AI Assistant can also effectively integrate all kinds of alarm information, comprehensively analyze the chain of security attacks, and then propose practical defense strategies, significantly improving the speed and quality of problem solving.&lt;/p&gt;

&lt;p&gt;By calling the API interface and automatically generating ESQL query statements, AI Assistant is able to perform complex data analysis tasks and generate intuitive and easy-to-understand statistical charts, even for users who do not know much about Elasticsearch query syntax. Whether it's exploring relationships between fields or interpreting data trends and other data insights, AI Assistant meets the diverse needs of users with high efficiency and easy operation.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fat6i6psn3q538gknxiyu.gif" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fat6i6psn3q538gknxiyu.gif" alt="gif" width="784" height="580"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;Free Trial：&lt;a href="https://www.alibabacloud.com/en/product/elasticsearch?_p_lc=1&amp;amp;spm=a3c0i.7911826.6791778070.252.6a3238708pxD6m" rel="noopener noreferrer"&gt;AlibabaCloud Elasticsearch official website&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Learn More：&lt;a href="https://www.alibabacloud.com/help/en/es/?spm=a2c63.p38356.0.0.24e11c37O7vKzE" rel="noopener noreferrer"&gt;AlibabaCloud Technology Solutions&lt;/a&gt;&lt;/p&gt;

</description>
      <category>productivity</category>
      <category>elasticsearch</category>
      <category>ai</category>
      <category>development</category>
    </item>
    <item>
      <title>Vector search using Alibaba Cloud inference API and semantic text</title>
      <dc:creator>A_Lucas</dc:creator>
      <pubDate>Mon, 20 Jan 2025 08:09:29 +0000</pubDate>
      <link>https://dev.to/a_lucas/vector-search-using-alibaba-cloud-infererence-api-and-semantic-text-26og</link>
      <guid>https://dev.to/a_lucas/vector-search-using-alibaba-cloud-infererence-api-and-semantic-text-26og</guid>
      <description>&lt;p&gt;In the previous post "&lt;a href="https://www.alibabacloud.com/blog/elasticsearch-open-inference-api-adds-support-for-alibaba-cloud-ai-search_601634" rel="noopener noreferrer"&gt;Elasticsearch Open Inference API Adds Support for Alibaba Cloud AI Search&lt;/a&gt;"，It describes in detail how to use the Elastic inference API to display Ali's dense vector model, sparse vector model, re-ranking and completion. In that article, it uses a lot of English examples. However, Ali's model is more suitable for displaying in Chinese. We know that Elastic's out-of-the-box sparse vector model, ELSER, is only suitable for English. It doesn't support Chinese at the moment. Ali's sparse vector model fills this gap. Sparse vectors can be used out of the box. For many developers who are not well versed in AI, this is definitely a boon, and it uses very little resources.&lt;/p&gt;

&lt;p&gt;For more reading, see Elasticsearch's "&lt;a href="https://www.elastic.co/guide/en/elasticsearch/reference/current/infer-service-alibabacloud-ai-search.html" rel="noopener noreferrer"&gt;AliCloud AI Search Inference Service&lt;/a&gt;"&lt;/p&gt;

&lt;h2&gt;
  
  
  Sparse Vector
&lt;/h2&gt;

&lt;p&gt;According to the documentation, we use the following command to create an inference API endpoint for sparse vectors:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;PUT&lt;/span&gt; &lt;span class="n"&gt;_inference&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;sparse_embedding&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;alibabacloud_ai_search_sparse&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
    &lt;span class="s"&gt;"service"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"alibabacloud-ai-search"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
    &lt;span class="s"&gt;"service_settings"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
        &lt;span class="s"&gt;"api_key"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"&amp;lt;api_key&amp;gt;"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
        &lt;span class="s"&gt;"service_id"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"ops-text-sparse-embedding-001"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
        &lt;span class="s"&gt;"host"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"default-j01.platform-cn-shanghai.opensearch.aliyuncs.com"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
        &lt;span class="s"&gt;"workspace"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"default"&lt;/span&gt;
    &lt;span class="o"&gt;}&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;On top of that, we need to get the api_key from AliCloud to do so. Run the above command:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0o3zfufe12a7ow6g72zo.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0o3zfufe12a7ow6g72zo.png" alt="image" width="800" height="410"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Next, We can test it by doing the following (the following information is obtained from the address)&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;POST&lt;/span&gt; &lt;span class="n"&gt;_inference&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;alibabacloud_ai_search_sparse&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
    &lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;input&lt;/span&gt;&lt;span class="err"&gt;"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="nc"&gt;China&lt;/span&gt;&lt;span class="o"&gt;)&lt;/span&gt; &lt;span class="nc"&gt;Limited&lt;/span&gt; &lt;span class="n"&gt;was&lt;/span&gt; &lt;span class="n"&gt;founded&lt;/span&gt; &lt;span class="n"&gt;on&lt;/span&gt; &lt;span class="nc"&gt;March&lt;/span&gt; &lt;span class="mi"&gt;26&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;2007&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;legal&lt;/span&gt; &lt;span class="n"&gt;representative&lt;/span&gt; &lt;span class="nc"&gt;Jiang&lt;/span&gt; &lt;span class="nc"&gt;Fang&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The above command shows the result as:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi1%2FO1CN01uYOZ0Y1CzCpneb7gM_%21%216000000000151-0-tps-2498-1360.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi1%2FO1CN01uYOZ0Y1CzCpneb7gM_%21%216000000000151-0-tps-2498-1360.jpg" alt="image" width="800" height="435"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;We can see from the above displayed result that it is different from our previous Elastic ELSER output. Here it should be unicode.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi3%2FO1CN012pLnWj1I8QtNtEGti_%21%216000000000848-0-tps-1486-710.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi3%2FO1CN012pLnWj1I8QtNtEGti_%21%216000000000848-0-tps-1486-710.jpg" alt="image" width="800" height="382"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The command above is the same as the command below:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;POST&lt;/span&gt; &lt;span class="n"&gt;_inference&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;sparse_embedding&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;alibabacloud_ai_search_sparse&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
  &lt;span class="s"&gt;"input"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"Alibaba (China) Limited was founded on March 26, 2007, legal representative Jiang Fang"&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Because each endpoint is automatically detected and recognized as what type of model it is when it is created by Elasticsearch, the sparse_embedding in the path above can be omitted.&lt;/p&gt;

&lt;h2&gt;
  
  
  Dense Vectors
&lt;/h2&gt;

&lt;p&gt;Similarly, we use the following command to create an inference API endpoint for dense vectors:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;PUT&lt;/span&gt; &lt;span class="n"&gt;_inference&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;text_embedding&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;alibabacloud_ai_search_embeddings&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
    &lt;span class="s"&gt;"service"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"alibabacloud-ai-search"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
    &lt;span class="s"&gt;"service_settings"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
        &lt;span class="s"&gt;"api_key"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"&amp;lt;api_key&amp;gt;"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
        &lt;span class="s"&gt;"service_id"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"ops-text-embedding-001"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
        &lt;span class="s"&gt;"host"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"default-j01.platform-cn-shanghai.opensearch.aliyuncs.com"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
        &lt;span class="s"&gt;"workspace"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"default"&lt;/span&gt;
    &lt;span class="o"&gt;}&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Running the above command, we can see that:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F37ek15pjki48ba0zq6h3.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F37ek15pjki48ba0zq6h3.png" alt="image" width="800" height="407"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;We can use the following command to generate dense vectors:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;POST&lt;/span&gt; &lt;span class="n"&gt;_inference&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;alibabacloud_ai_search_embeddings&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
  &lt;span class="s"&gt;"input"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"Alibaba (China) Limited was founded on March 26, 2007, legal representative Jiang Fang"&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fllg2se4lv5o7bi6t2b66.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fllg2se4lv5o7bi6t2b66.png" alt="image" width="800" height="398"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;A dense vector is an array of floating point numbers. We can still actually scalar quantize it when we generate it, which saves memory consumption and improves the speed of the search.&lt;/p&gt;

&lt;h2&gt;
  
  
  Completion
&lt;/h2&gt;

&lt;p&gt;We can even apply a big model to the search results to get a result of "completion", e.g., we use the following command to generate an inference API endpoint for completion:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;PUT&lt;/span&gt; &lt;span class="n"&gt;_inference&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;completion&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;alibabacloud_ai_search_completion&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
    &lt;span class="s"&gt;"service"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"alibabacloud-ai-search"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
    &lt;span class="s"&gt;"service_settings"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
        &lt;span class="s"&gt;"host"&lt;/span&gt; &lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"default-j01.platform-cn-shanghai.opensearch.aliyuncs.com"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
        &lt;span class="s"&gt;"api_key"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"{{API_KEY}}"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
        &lt;span class="s"&gt;"service_id"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"ops-qwen-turbo"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
        &lt;span class="s"&gt;"workspace"&lt;/span&gt; &lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"default"&lt;/span&gt;
    &lt;span class="o"&gt;}&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fpidsn8lg312a9fz9t7du.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fpidsn8lg312a9fz9t7du.png" alt="image" width="800" height="407"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;We can demonstrate completion with the following example：&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;POST&lt;/span&gt; &lt;span class="n"&gt;_inference&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;completion&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;alibabacloud_ai_search_completion&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
  &lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;input&lt;/span&gt;&lt;span class="err"&gt;"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="nc"&gt;When&lt;/span&gt; &lt;span class="n"&gt;was&lt;/span&gt; &lt;span class="nf"&gt;Alibaba&lt;/span&gt; &lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="nc"&gt;China&lt;/span&gt;&lt;span class="o"&gt;)&lt;/span&gt; &lt;span class="nc"&gt;Limited&lt;/span&gt; &lt;span class="n"&gt;founded&lt;/span&gt;&lt;span class="o"&gt;?&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This result is generated in conjunction with the big model. The big model provides a result based on the results obtained during the training of the big model.&lt;/p&gt;

&lt;h2&gt;
  
  
  Rerank
&lt;/h2&gt;

&lt;p&gt;Similarly, we can generate an inference API endpoint for rerank by following the commands below:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;PUT&lt;/span&gt; &lt;span class="n"&gt;_inference&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;rerank&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;alibabacloud_ai_search_rerank&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
    &lt;span class="s"&gt;"service"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"alibabacloud-ai-search"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
    &lt;span class="s"&gt;"service_settings"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
        &lt;span class="s"&gt;"api_key"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"&amp;lt;api_key&amp;gt;"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
        &lt;span class="s"&gt;"service_id"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"ops-bge-reranker-larger"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
        &lt;span class="s"&gt;"host"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"default-j01.platform-cn-shanghai.opensearch.aliyuncs.com"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
        &lt;span class="s"&gt;"workspace"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"default"&lt;/span&gt;
    &lt;span class="o"&gt;}&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6y1htf9s4omcvsxqp2yq.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6y1htf9s4omcvsxqp2yq.png" alt="image" width="800" height="314"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;In the "input" above, we have listed two documents. We can reorder these two documents using the "rerank" endpoint. Let's say the text we want to search for is "When was Alibaba (China) Limited founded?". Then "rerank" returns the result:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi1%2FO1CN016O03nu1tqzCiuYZcy_%21%216000000005954-0-tps-1530-496.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi1%2FO1CN016O03nu1tqzCiuYZcy_%21%216000000005954-0-tps-1530-496.jpg" alt="image" width="800" height="259"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;From the above, we can see that the second document is more relevant than the first, that is, it is more relevant.&lt;/p&gt;

&lt;h2&gt;
  
  
  RAG Applications
&lt;/h2&gt;

&lt;p&gt;In many cases, our organization has a lot of business data or private data being generated every now and then, and the knowledge of the big model is limited to when it is generated, so a lot of times, there is a lot of knowledge that the big model doesn't know. If we don't limit the answers of the big model, then it may give wrong answers and thus create illusions. In practice, we usually keep business or private data in a vector database like Elasticsearch. When searching, we first search Elasticsearch and send the results of the search to the big model as part of the prompt, which solves the hallucination problem.&lt;/p&gt;

&lt;p&gt;First, let's create a sparse vector index:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;PUT&lt;/span&gt; &lt;span class="n"&gt;alibaba_sparse&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
  &lt;span class="s"&gt;"mappings"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
    &lt;span class="s"&gt;"properties"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
      &lt;span class="s"&gt;"inference_field"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
        &lt;span class="s"&gt;"type"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"semantic_text"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
        &lt;span class="s"&gt;"inference_id"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"alibabacloud_ai_search_sparse"&lt;/span&gt;
      &lt;span class="o"&gt;}&lt;/span&gt;
    &lt;span class="o"&gt;}&lt;/span&gt;
  &lt;span class="o"&gt;}&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Then, we write the following two documents:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;PUT&lt;/span&gt; &lt;span class="n"&gt;alibaba_sparse&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;_bulk&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;&lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;index&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;&lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;_id&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="mi"&gt;1&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;}}&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;&lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;inference_field&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="nc"&gt;China&lt;/span&gt;&lt;span class="o"&gt;)&lt;/span&gt; &lt;span class="nc"&gt;Limited&lt;/span&gt; &lt;span class="n"&gt;was&lt;/span&gt; &lt;span class="n"&gt;founded&lt;/span&gt; &lt;span class="n"&gt;on&lt;/span&gt; &lt;span class="nc"&gt;March&lt;/span&gt; &lt;span class="mi"&gt;26&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;2007&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;legal&lt;/span&gt; &lt;span class="n"&gt;representative&lt;/span&gt; &lt;span class="nc"&gt;Jiang&lt;/span&gt; &lt;span class="nc"&gt;Fang&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;}}&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;&lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;index&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;&lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;_id&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;}}&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;&lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;inference_field&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="nc"&gt;Founded&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="mi"&gt;2009&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;headquartered&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="nc"&gt;Hangzhou&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="nc"&gt;Cloud&lt;/span&gt; &lt;span class="nc"&gt;Computing&lt;/span&gt; &lt;span class="nc"&gt;Co&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nf"&gt;Ltd&lt;/span&gt; &lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="n"&gt;also&lt;/span&gt; &lt;span class="n"&gt;known&lt;/span&gt; &lt;span class="n"&gt;as&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="nc"&gt;Cloud&lt;/span&gt; &lt;span class="nc"&gt;Computing&lt;/span&gt; &lt;span class="nc"&gt;Co&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;Ltd&lt;/span&gt;&lt;span class="o"&gt;)&lt;/span&gt; &lt;span class="n"&gt;is&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;digital&lt;/span&gt; &lt;span class="n"&gt;technology&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;intelligence&lt;/span&gt; &lt;span class="n"&gt;backbone&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="nc"&gt;Group&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;providing&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;full&lt;/span&gt; &lt;span class="n"&gt;range&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;cloud&lt;/span&gt; &lt;span class="n"&gt;services&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;customers&lt;/span&gt; &lt;span class="n"&gt;around&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;globe&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;including&lt;/span&gt; &lt;span class="n"&gt;its&lt;/span&gt; &lt;span class="n"&gt;own&lt;/span&gt; &lt;span class="n"&gt;servers&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;elastic&lt;/span&gt; &lt;span class="n"&gt;computing&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;storage&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;network&lt;/span&gt; &lt;span class="n"&gt;security&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;database&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;big&lt;/span&gt; &lt;span class="n"&gt;data&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;It&lt;/span&gt; &lt;span class="n"&gt;provides&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;full&lt;/span&gt; &lt;span class="n"&gt;range&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;cloud&lt;/span&gt; &lt;span class="n"&gt;services&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;global&lt;/span&gt; &lt;span class="n"&gt;customers&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;including&lt;/span&gt; &lt;span class="n"&gt;its&lt;/span&gt; &lt;span class="n"&gt;own&lt;/span&gt; &lt;span class="n"&gt;servers&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;elastic&lt;/span&gt; &lt;span class="n"&gt;computing&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;storage&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;network&lt;/span&gt; &lt;span class="n"&gt;security&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;database&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;big&lt;/span&gt; &lt;span class="n"&gt;data&lt;/span&gt; &lt;span class="n"&gt;services&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="err"&gt;"&lt;/span&gt;&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;We perform a search with the following command:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;GET&lt;/span&gt; &lt;span class="n"&gt;alibaba_sparse&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;_search&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
  &lt;span class="s"&gt;"query"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
    &lt;span class="s"&gt;"semantic"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
      &lt;span class="s"&gt;"field"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"inference_field"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
      &lt;span class="s"&gt;"query"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"when was Aliyun founded？"&lt;/span&gt;
    &lt;span class="o"&gt;}&lt;/span&gt;
  &lt;span class="o"&gt;}&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The search result above is:&lt;/p&gt;

&lt;p&gt;Alibaba Cloud is in the first place&lt;/p&gt;

&lt;p&gt;"Founded in 2009 and headquartered in Hangzhou, Alibaba Cloud Computing Co. Ltd (also known as Alibaba Cloud Computing Co. Ltd) is the digital technology and intelligence backbone of the Alibaba Group, providing a full range of cloud services to customers around the globe, including its own servers, elastic computing, storage, network security, database and big data. It provides a full range of cloud services to global customers, including its own servers, elastic computing, storage, network security, database and big data services."&lt;/p&gt;

&lt;p&gt;Alibaba is in second place&lt;/p&gt;

&lt;p&gt;"Alibaba (China) Limited was founded on March 26, 2007, legal representative Jiang Fang"&lt;/p&gt;

&lt;p&gt;The document containing Alibaba cloud is in the first position. This is what we call vector search.&lt;/p&gt;

&lt;p&gt;In practice, in many cases, we sometimes want to get a unique answer, or even this answer is reasoned out, not the original document. This time we need to use the big model, that is, use the completion of the reasoning endpoint.&lt;/p&gt;

&lt;p&gt;Let's start by doing the following search:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;GET&lt;/span&gt; &lt;span class="n"&gt;alibaba_sparse&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;_search&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
  &lt;span class="s"&gt;"query"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
    &lt;span class="s"&gt;"semantic"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
      &lt;span class="s"&gt;"field"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"inference_field"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
      &lt;span class="s"&gt;"query"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"Who is the legal person of Alibaba？"&lt;/span&gt;
    &lt;span class="o"&gt;}&lt;/span&gt;
  &lt;span class="o"&gt;}&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwo9yjh6aadl4rvrgoz43.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwo9yjh6aadl4rvrgoz43.png" alt="image" width="800" height="457"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This time only one document is listed. We use the "completion" endpoint below:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;POST&lt;/span&gt; &lt;span class="n"&gt;_inference&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;completion&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;alibabacloud_ai_search_completion&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
  &lt;span class="s"&gt;"input"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"&amp;lt;|system|&amp;gt;
    You are a knowledgeable person.&amp;lt;/s&amp;gt;
    &amp;lt;|user|&amp;gt;
    CONTEXT:
    Alibaba (China) Limited was founded on March 26, 2007, legal representative Jiang Fang

    QUESTION: 
    Who is the legal person of Alibaba？&amp;lt;/s&amp;gt;
    &amp;lt;|assistant|&amp;gt;"&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;In the above, we document the results of the search above "Alibaba (China) Limited was founded on March 26, 2007, legal representative Jiang Fang" as a part of the document，and submit the search question along with it. We use the completion endpoint API to view the results:&lt;/p&gt;

&lt;p&gt;The result will be "Alibaba's legal representative is Jiang Fang"。Obviously this is quite different from the previous approach of using "completion" without "context":&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;POST&lt;/span&gt; &lt;span class="n"&gt;_inference&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;completion&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;alibabacloud_ai_search_completion&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
  &lt;span class="s"&gt;"input"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"Who is the legal person of Alibaba?"&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;It can be understood in this way that with the results from the Elasticsearch vector database (real-time business data or private data) searched and made available to the big model, the big model can get search results that are more closely aligned to the answer based on these contexts. This is used in practice to avoid illusions!&lt;/p&gt;

&lt;h2&gt;
  
  
  How to build a prompt
&lt;/h2&gt;

&lt;p&gt;Following the method of constructing a prompt, let's create an index of a dense vector:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;PUT&lt;/span&gt; &lt;span class="n"&gt;alibaba_dense&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
  &lt;span class="s"&gt;"mappings"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
    &lt;span class="s"&gt;"properties"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
      &lt;span class="s"&gt;"inference_field"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
        &lt;span class="s"&gt;"type"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"semantic_text"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
        &lt;span class="s"&gt;"inference_id"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"alibabacloud_ai_search_embeddings"&lt;/span&gt;
      &lt;span class="o"&gt;}&lt;/span&gt;
    &lt;span class="o"&gt;}&lt;/span&gt;
  &lt;span class="o"&gt;}&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;We follow the same method for writing documents:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;PUT&lt;/span&gt; &lt;span class="n"&gt;alibaba_dense&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;_bulk&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;&lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;index&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;&lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;_id&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="mi"&gt;1&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;}}&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;&lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;inference_field&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="nc"&gt;China&lt;/span&gt;&lt;span class="o"&gt;)&lt;/span&gt; &lt;span class="nc"&gt;Limited&lt;/span&gt; &lt;span class="n"&gt;was&lt;/span&gt; &lt;span class="n"&gt;founded&lt;/span&gt; &lt;span class="n"&gt;on&lt;/span&gt; &lt;span class="nc"&gt;March&lt;/span&gt; &lt;span class="mi"&gt;26&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;2007&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;legal&lt;/span&gt; &lt;span class="n"&gt;representative&lt;/span&gt; &lt;span class="nc"&gt;Jiang&lt;/span&gt; &lt;span class="nc"&gt;Fang&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;}&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;&lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;index&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;&lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;_id&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;}}&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;&lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;inference_field&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="nc"&gt;Aliyun&lt;/span&gt; &lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="nl"&gt;aka:&lt;/span&gt; &lt;span class="nc"&gt;Aliyun&lt;/span&gt; &lt;span class="nc"&gt;Computing&lt;/span&gt; &lt;span class="nc"&gt;Co&lt;/span&gt;&lt;span class="o"&gt;.,&lt;/span&gt; &lt;span class="nc"&gt;Ltd&lt;/span&gt;&lt;span class="o"&gt;.)&lt;/span&gt; &lt;span class="n"&gt;was&lt;/span&gt; &lt;span class="n"&gt;founded&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="mi"&gt;2009&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;headquartered&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="nc"&gt;Hangzhou&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;is&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;digital&lt;/span&gt; &lt;span class="n"&gt;technology&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;intelligence&lt;/span&gt; &lt;span class="n"&gt;backbone&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="nc"&gt;Group&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;providing&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;full&lt;/span&gt; &lt;span class="n"&gt;range&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;cloud&lt;/span&gt; &lt;span class="n"&gt;services&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;customers&lt;/span&gt; &lt;span class="n"&gt;around&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;world&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;}}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Let's search this dense vector index alibaba_dense:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;GET&lt;/span&gt; &lt;span class="n"&gt;alibaba_dense&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;_search&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
  &lt;span class="s"&gt;"query"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
    &lt;span class="s"&gt;"semantic"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
      &lt;span class="s"&gt;"field"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"inference_field"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
      &lt;span class="s"&gt;"query"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"Who is the legal person of Alibaba?"&lt;/span&gt;
    &lt;span class="o"&gt;}&lt;/span&gt;
  &lt;span class="o"&gt;}&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;span class="no"&gt;GET&lt;/span&gt; &lt;span class="n"&gt;alibaba_dense&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;_search&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
  &lt;span class="s"&gt;"query"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
    &lt;span class="s"&gt;"semantic"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
      &lt;span class="s"&gt;"field"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"inference_field"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
      &lt;span class="s"&gt;"query"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"What kind of company is Aliyun?"&lt;/span&gt;
    &lt;span class="o"&gt;}&lt;/span&gt;
  &lt;span class="o"&gt;}&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The results of the search above are:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi1%2FO1CN01MgYwFX1mFFFw82qz8_%21%216000000004924-0-tps-3172-1834.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi1%2FO1CN01MgYwFX1mFFFw82qz8_%21%216000000004924-0-tps-3172-1834.jpg" alt="image" width="800" height="462"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Obviously, documents containing "Aliyun" are ranked first although we searched for "Alibaba".&lt;/p&gt;

&lt;h2&gt;
  
  
  Automatic chunking
&lt;/h2&gt;

&lt;p&gt;We know that the semantic_text field can help us automatically chunk a large piece of text without us having to manually chunk it. This is one of the great benefits of semantic_text. Let's create an index called alibaba_dense_large_text:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;PUT&lt;/span&gt; &lt;span class="n"&gt;alibaba_dense_large_text&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
  &lt;span class="s"&gt;"mappings"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
    &lt;span class="s"&gt;"properties"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
      &lt;span class="s"&gt;"inference_field"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
        &lt;span class="s"&gt;"type"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"semantic_text"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;
        &lt;span class="s"&gt;"inference_id"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="s"&gt;"alibabacloud_ai_search_embeddings"&lt;/span&gt;
      &lt;span class="o"&gt;}&lt;/span&gt;
    &lt;span class="o"&gt;}&lt;/span&gt;
  &lt;span class="o"&gt;}&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;We also create a large document for this index:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;PUT&lt;/span&gt; &lt;span class="n"&gt;alibaba_dense_large_text&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;_doc&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="mi"&gt;1&lt;/span&gt;
&lt;span class="o"&gt;{&lt;/span&gt;
    &lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;inference_field&lt;/span&gt;&lt;span class="err"&gt;"&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt; &lt;span class="err"&gt;”&lt;/span&gt;
    &lt;span class="nc"&gt;About&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="nc"&gt;Group&lt;/span&gt;
&lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="nf"&gt;Group&lt;/span&gt; &lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="nc"&gt;Group&lt;/span&gt;&lt;span class="o"&gt;)&lt;/span&gt; &lt;span class="n"&gt;is&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;globally&lt;/span&gt; &lt;span class="n"&gt;recognized&lt;/span&gt; &lt;span class="n"&gt;technology&lt;/span&gt; &lt;span class="n"&gt;company&lt;/span&gt; &lt;span class="n"&gt;headquartered&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="nc"&gt;Hangzhou&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Zhejiang&lt;/span&gt; &lt;span class="nc"&gt;Province&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;China&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;founded&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="mi"&gt;1999&lt;/span&gt; &lt;span class="n"&gt;by&lt;/span&gt; &lt;span class="nc"&gt;Jack&lt;/span&gt; &lt;span class="nc"&gt;Ma&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;his&lt;/span&gt; &lt;span class="n"&gt;team&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;As&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;technology&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;driven&lt;/span&gt; &lt;span class="n"&gt;company&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;has&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;wide&lt;/span&gt; &lt;span class="n"&gt;range&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;businesses&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;commerce&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;cloud&lt;/span&gt; &lt;span class="n"&gt;computing&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;financial&lt;/span&gt; &lt;span class="n"&gt;technology&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;digital&lt;/span&gt; &lt;span class="n"&gt;media&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;logistics&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;other&lt;/span&gt; &lt;span class="n"&gt;fields&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;has&lt;/span&gt; &lt;span class="n"&gt;become&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;key&lt;/span&gt; &lt;span class="n"&gt;player&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;global&lt;/span&gt; &lt;span class="nc"&gt;Internet&lt;/span&gt; &lt;span class="n"&gt;industry&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;
&lt;span class="nc"&gt;Background&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="nc"&gt;Early&lt;/span&gt; &lt;span class="nc"&gt;Development&lt;/span&gt;
&lt;span class="nc"&gt;The&lt;/span&gt; &lt;span class="n"&gt;birth&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;began&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="mi"&gt;1999&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;when&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="nc"&gt;Internet&lt;/span&gt; &lt;span class="n"&gt;was&lt;/span&gt; &lt;span class="n"&gt;just&lt;/span&gt; &lt;span class="n"&gt;emerging&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="nc"&gt;China&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;With&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;vision&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;empowering&lt;/span&gt; &lt;span class="n"&gt;small&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;medium&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;sized&lt;/span&gt; &lt;span class="nf"&gt;enterprises&lt;/span&gt; &lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="nc"&gt;SMEs&lt;/span&gt;&lt;span class="o"&gt;)&lt;/span&gt; &lt;span class="n"&gt;through&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="nc"&gt;Internet&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Jack&lt;/span&gt; &lt;span class="nc"&gt;Ma&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;his&lt;/span&gt; &lt;span class="n"&gt;team&lt;/span&gt; &lt;span class="n"&gt;launched&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;com&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;global&lt;/span&gt; &lt;span class="n"&gt;wholesale&lt;/span&gt; &lt;span class="n"&gt;marketplace&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;Initially&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;platform&lt;/span&gt; &lt;span class="n"&gt;provided&lt;/span&gt; &lt;span class="nc"&gt;SMEs&lt;/span&gt; &lt;span class="n"&gt;with&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;opportunity&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;showcase&lt;/span&gt; &lt;span class="n"&gt;their&lt;/span&gt; &lt;span class="n"&gt;products&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;find&lt;/span&gt; &lt;span class="n"&gt;international&lt;/span&gt; &lt;span class="n"&gt;buyers&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;helping&lt;/span&gt; &lt;span class="nc"&gt;Chinese&lt;/span&gt; &lt;span class="n"&gt;manufacturers&lt;/span&gt; &lt;span class="n"&gt;reach&lt;/span&gt; &lt;span class="n"&gt;out&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;global&lt;/span&gt; &lt;span class="n"&gt;market&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;After&lt;/span&gt; &lt;span class="n"&gt;several&lt;/span&gt; &lt;span class="n"&gt;years&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;development&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;quickly&lt;/span&gt; &lt;span class="n"&gt;emerged&lt;/span&gt; &lt;span class="n"&gt;as&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;world&lt;/span&gt;&lt;span class="err"&gt;'&lt;/span&gt;&lt;span class="n"&gt;s&lt;/span&gt; &lt;span class="n"&gt;leading&lt;/span&gt; &lt;span class="no"&gt;B2B&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;commerce&lt;/span&gt; &lt;span class="n"&gt;platform&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;
&lt;span class="nc"&gt;In&lt;/span&gt; &lt;span class="mi"&gt;2003&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;launched&lt;/span&gt; &lt;span class="nc"&gt;Taobao&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="no"&gt;C2C&lt;/span&gt; &lt;span class="n"&gt;platform&lt;/span&gt; &lt;span class="k"&gt;for&lt;/span&gt; &lt;span class="n"&gt;individual&lt;/span&gt; &lt;span class="n"&gt;consumers&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;which&lt;/span&gt; &lt;span class="n"&gt;quickly&lt;/span&gt; &lt;span class="n"&gt;attracted&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;large&lt;/span&gt; &lt;span class="n"&gt;number&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;users&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;With&lt;/span&gt; &lt;span class="n"&gt;its&lt;/span&gt; &lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;free&lt;/span&gt; &lt;span class="n"&gt;store&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt; &lt;span class="n"&gt;model&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Taobao&lt;/span&gt; &lt;span class="n"&gt;competed&lt;/span&gt; &lt;span class="n"&gt;with&lt;/span&gt; &lt;span class="n"&gt;other&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;commerce&lt;/span&gt; &lt;span class="n"&gt;platforms&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="kt"&gt;short&lt;/span&gt; &lt;span class="n"&gt;period&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;time&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;achieved&lt;/span&gt; &lt;span class="n"&gt;success&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;To&lt;/span&gt; &lt;span class="n"&gt;enhance&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;user&lt;/span&gt; &lt;span class="n"&gt;experience&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;launched&lt;/span&gt; &lt;span class="nc"&gt;Alipay&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;third&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;party&lt;/span&gt; &lt;span class="n"&gt;payment&lt;/span&gt; &lt;span class="n"&gt;platform&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="mi"&gt;2004&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;solving&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;trust&lt;/span&gt; &lt;span class="n"&gt;problem&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;online&lt;/span&gt; &lt;span class="n"&gt;payment&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;an&lt;/span&gt; &lt;span class="n"&gt;innovation&lt;/span&gt; &lt;span class="n"&gt;that&lt;/span&gt; &lt;span class="n"&gt;laid&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;foundation&lt;/span&gt; &lt;span class="k"&gt;for&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;rapid&lt;/span&gt; &lt;span class="n"&gt;development&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;commerce&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="nc"&gt;China&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;
&lt;span class="nc"&gt;Business&lt;/span&gt; &lt;span class="nc"&gt;Ecology&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="nc"&gt;Strategic&lt;/span&gt; &lt;span class="nc"&gt;Layout&lt;/span&gt;
&lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;has&lt;/span&gt; &lt;span class="n"&gt;gradually&lt;/span&gt; &lt;span class="n"&gt;built&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;diversified&lt;/span&gt; &lt;span class="n"&gt;ecosystem&lt;/span&gt; &lt;span class="n"&gt;covering&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;commerce&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;finance&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;logistics&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;cloud&lt;/span&gt; &lt;span class="n"&gt;computing&lt;/span&gt; &lt;span class="n"&gt;through&lt;/span&gt; &lt;span class="n"&gt;continuous&lt;/span&gt; &lt;span class="n"&gt;business&lt;/span&gt; &lt;span class="n"&gt;expansion&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;
&lt;span class="no"&gt;E&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;commerce&lt;/span&gt;
&lt;span class="nc"&gt;As&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt;&lt;span class="err"&gt;'&lt;/span&gt;&lt;span class="n"&gt;s&lt;/span&gt; &lt;span class="n"&gt;core&lt;/span&gt; &lt;span class="n"&gt;business&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;commerce&lt;/span&gt; &lt;span class="n"&gt;includes&lt;/span&gt; &lt;span class="nc"&gt;Taobao&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Tmall&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="nc"&gt;AliExpress&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;Among&lt;/span&gt; &lt;span class="n"&gt;them&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Taobao&lt;/span&gt; &lt;span class="n"&gt;focuses&lt;/span&gt; &lt;span class="n"&gt;on&lt;/span&gt; &lt;span class="n"&gt;meeting&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;needs&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;individual&lt;/span&gt; &lt;span class="n"&gt;consumers&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="k"&gt;while&lt;/span&gt; &lt;span class="nc"&gt;Tmall&lt;/span&gt; &lt;span class="n"&gt;caters&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;brands&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;enterprises&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;providing&lt;/span&gt; &lt;span class="n"&gt;high&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;quality&lt;/span&gt; &lt;span class="n"&gt;goods&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;services&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;Global&lt;/span&gt; &lt;span class="nc"&gt;Express&lt;/span&gt; &lt;span class="n"&gt;is&lt;/span&gt; &lt;span class="n"&gt;dedicated&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;connecting&lt;/span&gt; &lt;span class="n"&gt;overseas&lt;/span&gt; &lt;span class="n"&gt;buyers&lt;/span&gt; &lt;span class="n"&gt;with&lt;/span&gt; &lt;span class="nc"&gt;Chinese&lt;/span&gt; &lt;span class="n"&gt;sellers&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;is&lt;/span&gt; &lt;span class="n"&gt;gaining&lt;/span&gt; &lt;span class="n"&gt;influence&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;international&lt;/span&gt; &lt;span class="n"&gt;market&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;
&lt;span class="nc"&gt;Cloud&lt;/span&gt; &lt;span class="nc"&gt;Computing&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="nc"&gt;Big&lt;/span&gt; &lt;span class="nc"&gt;Data&lt;/span&gt;
&lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="nc"&gt;Cloud&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;core&lt;/span&gt; &lt;span class="n"&gt;technology&lt;/span&gt; &lt;span class="n"&gt;division&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="nc"&gt;Group&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;was&lt;/span&gt; &lt;span class="n"&gt;founded&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="mi"&gt;2009&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;provide&lt;/span&gt; &lt;span class="n"&gt;cloud&lt;/span&gt; &lt;span class="n"&gt;computing&lt;/span&gt; &lt;span class="n"&gt;services&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;artificial&lt;/span&gt; &lt;span class="n"&gt;intelligence&lt;/span&gt; &lt;span class="n"&gt;solutions&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;big&lt;/span&gt; &lt;span class="n"&gt;data&lt;/span&gt; &lt;span class="n"&gt;technical&lt;/span&gt; &lt;span class="n"&gt;support&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;Today&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="nc"&gt;Cloud&lt;/span&gt; &lt;span class="n"&gt;has&lt;/span&gt; &lt;span class="n"&gt;become&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;world&lt;/span&gt;&lt;span class="err"&gt;'&lt;/span&gt;&lt;span class="n"&gt;s&lt;/span&gt; &lt;span class="n"&gt;leading&lt;/span&gt; &lt;span class="n"&gt;cloud&lt;/span&gt; &lt;span class="n"&gt;service&lt;/span&gt; &lt;span class="n"&gt;provider&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;supporting&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;digital&lt;/span&gt; &lt;span class="n"&gt;transformation&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;enterprises&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;
&lt;span class="nc"&gt;Financial&lt;/span&gt; &lt;span class="nc"&gt;Technology&lt;/span&gt;
&lt;span class="nc"&gt;Ant&lt;/span&gt; &lt;span class="nc"&gt;Group&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;parent&lt;/span&gt; &lt;span class="n"&gt;company&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="nc"&gt;Alipay&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;is&lt;/span&gt; &lt;span class="n"&gt;an&lt;/span&gt; &lt;span class="n"&gt;important&lt;/span&gt; &lt;span class="n"&gt;part&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt;&lt;span class="err"&gt;'&lt;/span&gt;&lt;span class="n"&gt;s&lt;/span&gt; &lt;span class="n"&gt;fintech&lt;/span&gt; &lt;span class="n"&gt;layout&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;Through&lt;/span&gt; &lt;span class="nc"&gt;Alipay&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;provides&lt;/span&gt; &lt;span class="n"&gt;users&lt;/span&gt; &lt;span class="n"&gt;with&lt;/span&gt; &lt;span class="n"&gt;online&lt;/span&gt; &lt;span class="n"&gt;payment&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;wealth&lt;/span&gt; &lt;span class="n"&gt;management&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;loans&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;other&lt;/span&gt; &lt;span class="n"&gt;services&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;promoting&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;development&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;cashless&lt;/span&gt; &lt;span class="n"&gt;society&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="nc"&gt;China&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;
&lt;span class="nc"&gt;Logistics&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="nc"&gt;Supply&lt;/span&gt; &lt;span class="nc"&gt;Chain&lt;/span&gt;
&lt;span class="nc"&gt;Founded&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="mi"&gt;2013&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Cainiao&lt;/span&gt; &lt;span class="nf"&gt;Network&lt;/span&gt; &lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="nc"&gt;Cainiao&lt;/span&gt;&lt;span class="o"&gt;)&lt;/span&gt; &lt;span class="n"&gt;is&lt;/span&gt; &lt;span class="n"&gt;committed&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;optimizing&lt;/span&gt; &lt;span class="n"&gt;logistics&lt;/span&gt; &lt;span class="n"&gt;efficiency&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;building&lt;/span&gt; &lt;span class="n"&gt;an&lt;/span&gt; &lt;span class="n"&gt;intelligent&lt;/span&gt; &lt;span class="n"&gt;logistics&lt;/span&gt; &lt;span class="n"&gt;network&lt;/span&gt; &lt;span class="n"&gt;with&lt;/span&gt; &lt;span class="n"&gt;global&lt;/span&gt; &lt;span class="n"&gt;coverage&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;Cainiao&lt;/span&gt;&lt;span class="err"&gt;'&lt;/span&gt;&lt;span class="n"&gt;s&lt;/span&gt; &lt;span class="n"&gt;technology&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;platform&lt;/span&gt; &lt;span class="n"&gt;capabilities&lt;/span&gt; &lt;span class="n"&gt;enhance&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;speed&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;reliability&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;commerce&lt;/span&gt; &lt;span class="n"&gt;logistics&lt;/span&gt; &lt;span class="n"&gt;services&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;
&lt;span class="nc"&gt;Digital&lt;/span&gt; &lt;span class="nc"&gt;Media&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="nc"&gt;Entertainment&lt;/span&gt;
&lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;has&lt;/span&gt; &lt;span class="n"&gt;also&lt;/span&gt; &lt;span class="n"&gt;been&lt;/span&gt; &lt;span class="n"&gt;actively&lt;/span&gt; &lt;span class="n"&gt;laying&lt;/span&gt; &lt;span class="n"&gt;out&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;digital&lt;/span&gt; &lt;span class="n"&gt;media&lt;/span&gt; &lt;span class="n"&gt;sector&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;including&lt;/span&gt; &lt;span class="nc"&gt;Youku&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;AliFilm&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="nc"&gt;Shrimp&lt;/span&gt; &lt;span class="nc"&gt;Music&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;promote&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;construction&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;content&lt;/span&gt; &lt;span class="n"&gt;ecosystem&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;In&lt;/span&gt; &lt;span class="n"&gt;addition&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="nc"&gt;Group&lt;/span&gt; &lt;span class="n"&gt;has&lt;/span&gt; &lt;span class="n"&gt;further&lt;/span&gt; &lt;span class="n"&gt;enriched&lt;/span&gt; &lt;span class="n"&gt;its&lt;/span&gt; &lt;span class="n"&gt;brand&lt;/span&gt; &lt;span class="n"&gt;by&lt;/span&gt; &lt;span class="n"&gt;investing&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="n"&gt;cultural&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;creative&lt;/span&gt; &lt;span class="n"&gt;industries&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;expanding&lt;/span&gt; &lt;span class="n"&gt;its&lt;/span&gt; &lt;span class="n"&gt;sports&lt;/span&gt; &lt;span class="n"&gt;business&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;
&lt;span class="nc"&gt;Internationalization&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="nc"&gt;Innovation&lt;/span&gt;
&lt;span class="nc"&gt;In&lt;/span&gt; &lt;span class="n"&gt;recent&lt;/span&gt; &lt;span class="n"&gt;years&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;has&lt;/span&gt; &lt;span class="n"&gt;been&lt;/span&gt; &lt;span class="n"&gt;accelerating&lt;/span&gt; &lt;span class="n"&gt;its&lt;/span&gt; &lt;span class="n"&gt;pace&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;internationalization&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;aiming&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;become&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;bridge&lt;/span&gt; &lt;span class="n"&gt;between&lt;/span&gt; &lt;span class="n"&gt;global&lt;/span&gt; &lt;span class="n"&gt;consumers&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;businesses&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;The&lt;/span&gt; &lt;span class="nc"&gt;Group&lt;/span&gt; &lt;span class="n"&gt;has&lt;/span&gt; &lt;span class="n"&gt;expanded&lt;/span&gt; &lt;span class="n"&gt;its&lt;/span&gt; &lt;span class="n"&gt;business&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="n"&gt;markets&lt;/span&gt; &lt;span class="n"&gt;such&lt;/span&gt; &lt;span class="n"&gt;as&lt;/span&gt; &lt;span class="nc"&gt;Southeast&lt;/span&gt; &lt;span class="nc"&gt;Asia&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Europe&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="nc"&gt;North&lt;/span&gt; &lt;span class="nc"&gt;America&lt;/span&gt; &lt;span class="n"&gt;through&lt;/span&gt; &lt;span class="n"&gt;mergers&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;acquisitions&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;partnerships&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;For&lt;/span&gt; &lt;span class="n"&gt;example&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;through&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;acquisition&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="nc"&gt;Lazada&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;has&lt;/span&gt; &lt;span class="n"&gt;enhanced&lt;/span&gt; &lt;span class="n"&gt;its&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;commerce&lt;/span&gt; &lt;span class="n"&gt;presence&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="nc"&gt;Southeast&lt;/span&gt; &lt;span class="nc"&gt;Asia&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;
&lt;span class="nc"&gt;In&lt;/span&gt; &lt;span class="n"&gt;addition&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;has&lt;/span&gt; &lt;span class="n"&gt;been&lt;/span&gt; &lt;span class="n"&gt;actively&lt;/span&gt; &lt;span class="n"&gt;exploring&lt;/span&gt; &lt;span class="k"&gt;new&lt;/span&gt; &lt;span class="n"&gt;technologies&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;including&lt;/span&gt; &lt;span class="n"&gt;artificial&lt;/span&gt; &lt;span class="n"&gt;intelligence&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;blockchain&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="nc"&gt;Internet&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="nc"&gt;Things&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;The&lt;/span&gt; &lt;span class="n"&gt;application&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;these&lt;/span&gt; &lt;span class="n"&gt;technologies&lt;/span&gt; &lt;span class="n"&gt;not&lt;/span&gt; &lt;span class="n"&gt;only&lt;/span&gt; &lt;span class="n"&gt;optimizes&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="nc"&gt;Group&lt;/span&gt;&lt;span class="err"&gt;'&lt;/span&gt;&lt;span class="n"&gt;s&lt;/span&gt; &lt;span class="n"&gt;existing&lt;/span&gt; &lt;span class="n"&gt;business&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;but&lt;/span&gt; &lt;span class="n"&gt;also&lt;/span&gt; &lt;span class="n"&gt;lays&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;foundation&lt;/span&gt; &lt;span class="k"&gt;for&lt;/span&gt; &lt;span class="n"&gt;future&lt;/span&gt; &lt;span class="n"&gt;innovations&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;
&lt;span class="nc"&gt;Corporate&lt;/span&gt; &lt;span class="nc"&gt;Culture&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="nc"&gt;Social&lt;/span&gt; &lt;span class="nc"&gt;Responsibility&lt;/span&gt;
&lt;span class="nc"&gt;Alibaba&lt;/span&gt;&lt;span class="err"&gt;'&lt;/span&gt;&lt;span class="n"&gt;s&lt;/span&gt; &lt;span class="n"&gt;corporate&lt;/span&gt; &lt;span class="n"&gt;culture&lt;/span&gt; &lt;span class="n"&gt;is&lt;/span&gt; &lt;span class="n"&gt;based&lt;/span&gt; &lt;span class="n"&gt;on&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;core&lt;/span&gt; &lt;span class="n"&gt;mission&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="err"&gt;“&lt;/span&gt;&lt;span class="n"&gt;making&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;world&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;business&lt;/span&gt; &lt;span class="n"&gt;without&lt;/span&gt; &lt;span class="n"&gt;difficulty&lt;/span&gt;&lt;span class="err"&gt;”&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;encourages&lt;/span&gt; &lt;span class="n"&gt;employees&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;innovate&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;collaborate&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;Jack&lt;/span&gt; &lt;span class="nc"&gt;Ma&lt;/span&gt; &lt;span class="n"&gt;emphasizes&lt;/span&gt; &lt;span class="n"&gt;corporate&lt;/span&gt; &lt;span class="n"&gt;social&lt;/span&gt; &lt;span class="n"&gt;responsibility&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;advocates&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;use&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;resources&lt;/span&gt; &lt;span class="k"&gt;for&lt;/span&gt; &lt;span class="kd"&gt;public&lt;/span&gt; &lt;span class="n"&gt;welfare&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;For&lt;/span&gt; &lt;span class="n"&gt;example&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="nc"&gt;Foundation&lt;/span&gt; &lt;span class="n"&gt;supports&lt;/span&gt; &lt;span class="n"&gt;environmental&lt;/span&gt; &lt;span class="n"&gt;protection&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;education&lt;/span&gt; &lt;span class="n"&gt;development&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;poverty&lt;/span&gt; &lt;span class="n"&gt;alleviation&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;creating&lt;/span&gt; &lt;span class="kt"&gt;long&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;term&lt;/span&gt; &lt;span class="n"&gt;value&lt;/span&gt; &lt;span class="k"&gt;for&lt;/span&gt; &lt;span class="n"&gt;society&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;
&lt;span class="nc"&gt;Challenges&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="nc"&gt;Future&lt;/span&gt; &lt;span class="nc"&gt;Prospects&lt;/span&gt;
&lt;span class="nc"&gt;Despite&lt;/span&gt; &lt;span class="n"&gt;its&lt;/span&gt; &lt;span class="n"&gt;remarkable&lt;/span&gt; &lt;span class="n"&gt;achievements&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;faces&lt;/span&gt; &lt;span class="n"&gt;challenges&lt;/span&gt; &lt;span class="n"&gt;such&lt;/span&gt; &lt;span class="n"&gt;as&lt;/span&gt; &lt;span class="n"&gt;intensified&lt;/span&gt; &lt;span class="n"&gt;international&lt;/span&gt; &lt;span class="n"&gt;competition&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;increased&lt;/span&gt; &lt;span class="n"&gt;industry&lt;/span&gt; &lt;span class="n"&gt;regulation&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;pressure&lt;/span&gt; &lt;span class="k"&gt;for&lt;/span&gt; &lt;span class="n"&gt;technological&lt;/span&gt; &lt;span class="n"&gt;innovation&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;In&lt;/span&gt; &lt;span class="n"&gt;order&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;maintain&lt;/span&gt; &lt;span class="n"&gt;its&lt;/span&gt; &lt;span class="n"&gt;leading&lt;/span&gt; &lt;span class="n"&gt;position&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;needs&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="k"&gt;continue&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;innovate&lt;/span&gt; &lt;span class="n"&gt;globally&lt;/span&gt; &lt;span class="k"&gt;while&lt;/span&gt; &lt;span class="n"&gt;actively&lt;/span&gt; &lt;span class="n"&gt;adapting&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;regulations&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;cultures&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;different&lt;/span&gt; &lt;span class="n"&gt;markets&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;
&lt;span class="nc"&gt;Looking&lt;/span&gt; &lt;span class="n"&gt;ahead&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;will&lt;/span&gt; &lt;span class="k"&gt;continue&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;promote&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;integration&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;technology&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;commerce&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;help&lt;/span&gt; &lt;span class="nc"&gt;SMEs&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;consumers&lt;/span&gt; &lt;span class="n"&gt;realize&lt;/span&gt; &lt;span class="n"&gt;more&lt;/span&gt; &lt;span class="n"&gt;value&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;At&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;same&lt;/span&gt; &lt;span class="n"&gt;time&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;through&lt;/span&gt; &lt;span class="n"&gt;technology&lt;/span&gt; &lt;span class="n"&gt;development&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;globalization&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;is&lt;/span&gt; &lt;span class="n"&gt;expected&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;play&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;more&lt;/span&gt; &lt;span class="n"&gt;important&lt;/span&gt; &lt;span class="n"&gt;role&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;development&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;global&lt;/span&gt; &lt;span class="n"&gt;digital&lt;/span&gt; &lt;span class="n"&gt;economy&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;
&lt;span class="nc"&gt;Conclusion&lt;/span&gt;
&lt;span class="nc"&gt;As&lt;/span&gt; &lt;span class="n"&gt;one&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;most&lt;/span&gt; &lt;span class="n"&gt;influential&lt;/span&gt; &lt;span class="nc"&gt;Internet&lt;/span&gt; &lt;span class="n"&gt;companies&lt;/span&gt; &lt;span class="n"&gt;in&lt;/span&gt; &lt;span class="nc"&gt;China&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;world&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt;&lt;span class="err"&gt;'&lt;/span&gt;&lt;span class="n"&gt;s&lt;/span&gt; &lt;span class="n"&gt;growth&lt;/span&gt; &lt;span class="n"&gt;history&lt;/span&gt; &lt;span class="n"&gt;has&lt;/span&gt; &lt;span class="n"&gt;demonstrated&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;profound&lt;/span&gt; &lt;span class="n"&gt;transformation&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;technology&lt;/span&gt; &lt;span class="n"&gt;on&lt;/span&gt; &lt;span class="n"&gt;business&lt;/span&gt; &lt;span class="n"&gt;models&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;Through&lt;/span&gt; &lt;span class="n"&gt;continuous&lt;/span&gt; &lt;span class="n"&gt;innovation&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;expansion&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;is&lt;/span&gt; &lt;span class="n"&gt;injecting&lt;/span&gt; &lt;span class="k"&gt;new&lt;/span&gt; &lt;span class="n"&gt;vitality&lt;/span&gt; &lt;span class="n"&gt;into&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;global&lt;/span&gt; &lt;span class="n"&gt;economy&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;creating&lt;/span&gt; &lt;span class="n"&gt;unprecedented&lt;/span&gt; &lt;span class="n"&gt;opportunities&lt;/span&gt; &lt;span class="k"&gt;for&lt;/span&gt; &lt;span class="n"&gt;countless&lt;/span&gt; &lt;span class="n"&gt;businesses&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;consumers&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt; &lt;span class="nc"&gt;In&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;future&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Alibaba&lt;/span&gt; &lt;span class="n"&gt;will&lt;/span&gt; &lt;span class="k"&gt;continue&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;be&lt;/span&gt; &lt;span class="n"&gt;driven&lt;/span&gt; &lt;span class="n"&gt;by&lt;/span&gt; &lt;span class="n"&gt;technology&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;commerce&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;strive&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;build&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;more&lt;/span&gt; &lt;span class="n"&gt;convenient&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;efficient&lt;/span&gt; &lt;span class="n"&gt;digital&lt;/span&gt; &lt;span class="n"&gt;world&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;
    &lt;span class="err"&gt;”&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The above article is about 4500 words long.&lt;/p&gt;

&lt;p&gt;We can view the written document in the following way:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="no"&gt;GET&lt;/span&gt; &lt;span class="n"&gt;alibaba_dense_large_text&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;_search&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi3%2FO1CN01Mwm19E1PxSKGe2UDT_%21%216000000001907-0-tps-974-590.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.alicdn.com%2Fimgextra%2Fi3%2FO1CN01Mwm19E1PxSKGe2UDT_%21%216000000001907-0-tps-974-590.jpg" alt="image" width="800" height="484"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;From above, we can see that the semantic_text field automatically chunks a very long textual document and vectorizes them separately.&lt;/p&gt;




&lt;p&gt;We hope this article will provide you with valuable reference and inspiration in your exploration of vector search using AliCloud and Elasticsearch. In the future, as the technology continues to evolve, we expect to see more intelligent applications based on semantic understanding to land, and push the search technology to a higher level.&lt;/p&gt;

&lt;p&gt;Harness the power of Alibaba Cloud Elasticsearch for your vector search needs. Its robust infrastructure and AI capabilities make it a top choice for businesses seeking to leverage AI for improved search outcomes.&lt;/p&gt;

&lt;p&gt;Ready to start your journey with Elasticsearch on Alibaba Cloud? Explore our tailored Cloud solutions and services to take the first step towards transforming your data into a visual masterpiece. &lt;a href="https://c.tb.cn/F3.bTfFpS" rel="noopener noreferrer"&gt;Click here, Embark on Your 30-Day Free Trial&lt;/a&gt;.&lt;/p&gt;

</description>
      <category>bigdata</category>
      <category>elasticsearch</category>
      <category>tutorial</category>
      <category>api</category>
    </item>
    <item>
      <title>Mastering Elasticsearch with Alibaba Cloud: How to Create an Instance</title>
      <dc:creator>A_Lucas</dc:creator>
      <pubDate>Mon, 29 Jul 2024 01:59:44 +0000</pubDate>
      <link>https://dev.to/a_lucas/mastering-elasticsearch-with-alibaba-cloud-how-to-create-an-instance-36pg</link>
      <guid>https://dev.to/a_lucas/mastering-elasticsearch-with-alibaba-cloud-how-to-create-an-instance-36pg</guid>
      <description>&lt;h2&gt;
  
  
  Introduction
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://www.alibabacloud.com/en/product/elasticsearch" rel="noopener noreferrer"&gt;Alibaba Cloud Elasticsearch&lt;/a&gt; offers a powerful and flexible solution for managing and analyzing vast amounts of data in real-time. This tutorial will guide you through the process of creating an Elasticsearch instance, from understanding the prerequisites to executing API requests. &lt;br&gt;
&lt;a&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h2&gt;
  
  
  Prerequisites
&lt;/h2&gt;

&lt;p&gt;Before delving into the technical details of creating an Elasticsearch instance, ensure that:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;You are well-versed with the billing and pricing standards of Alibaba Cloud Elasticsearch. For more information, see &lt;a href="https://www.alibabacloud.com/en/product/elasticsearch" rel="noopener noreferrer"&gt;Pricing of Alibaba Cloud Elasticsearch&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;You have completed real-name verification.&lt;/li&gt;
&lt;li&gt;You understand that you do not need to specify a zone when creating a cluster; by default, it is deployed in the same zone as the specified VPC.
&lt;a&gt;&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h2&gt;
  
  
  Request Headers
&lt;/h2&gt;

&lt;p&gt;The &lt;code&gt;createInstance&lt;/code&gt; operation does not require operation-specific headers but uses common request headers. For more information, see &lt;a href="https://www.alibabacloud.com/help/doc-detail/29976.htm" rel="noopener noreferrer"&gt;Common Request Headers&lt;/a&gt;.&lt;br&gt;
&lt;a&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h2&gt;
  
  
  Request Syntax
&lt;/h2&gt;

&lt;p&gt;The base syntax for creating an instance is as follows:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;POST /openapi/instances HTTP/1.1
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  Request Parameters
&lt;/h3&gt;

&lt;p&gt;Below are some essential parameters you'll need for the request:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Parameter&lt;/th&gt;
&lt;th&gt;Type&lt;/th&gt;
&lt;th&gt;Required&lt;/th&gt;
&lt;th&gt;Description&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;clientToken&lt;/td&gt;
&lt;td&gt;String&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;Unique token to ensure idempotence of the request.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;nodeAmount&lt;/td&gt;
&lt;td&gt;Integer&lt;/td&gt;
&lt;td&gt;Yes&lt;/td&gt;
&lt;td&gt;The number of data nodes. Valid values: 2 to 50.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;instanceCategory&lt;/td&gt;
&lt;td&gt;String&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;Type of instance, e.g., advanced.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;esAdminPassword&lt;/td&gt;
&lt;td&gt;String&lt;/td&gt;
&lt;td&gt;Yes&lt;/td&gt;
&lt;td&gt;Password for accessing the cluster, must be 8 to 32 characters long and include at least three types of characters.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;esVersion&lt;/td&gt;
&lt;td&gt;String&lt;/td&gt;
&lt;td&gt;Yes&lt;/td&gt;
&lt;td&gt;Version of the Elasticsearch instance.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;networkConfig&lt;/td&gt;
&lt;td&gt;NetworkConfig&lt;/td&gt;
&lt;td&gt;Yes&lt;/td&gt;
&lt;td&gt;The network configurations.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;paymentType&lt;/td&gt;
&lt;td&gt;String&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;Billing method, either &lt;code&gt;postpaid&lt;/code&gt; or &lt;code&gt;prepaid&lt;/code&gt;.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;description&lt;/td&gt;
&lt;td&gt;String&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;The name or description of the instance.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;nodeSpec&lt;/td&gt;
&lt;td&gt;NodeSpec&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;Configurations of data nodes. Example: &lt;code&gt;{"spec":"elasticsearch.sn2ne.xlarge", "disk":50, "diskType":"cloud_ssd"}&lt;/code&gt;.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;warmNodeConfig&lt;/td&gt;
&lt;td&gt;WarmNodeConfig&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;Configurations of warm nodes.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;kibanaConfig&lt;/td&gt;
&lt;td&gt;KibanaNodeConfig&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;Configurations of Kibana nodes' settings.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;For a detailed list of parameters, refer to the &lt;a href="https://www.alibabacloud.com/help/doc-detail/120949.htm" rel="noopener noreferrer"&gt;Full API Documentation&lt;/a&gt;.&lt;br&gt;
&lt;a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Example Request Body
&lt;/h2&gt;

&lt;p&gt;&lt;a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  Creating a General-purpose Business Edition Instance with Both Hot and Cold Nodes
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight json"&gt;&lt;code&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"description"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"xpack-hot-cold"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"nodeAmount"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"esVersion"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"7.10_with_X-Pack"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"instanceCategory"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"x-pack"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"nodeSpec"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"spec"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"elasticsearch.sn2ne.xlarge"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"disk"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;20&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"diskType"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"cloud_ssd"&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;},&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"warmNodeConfiguration"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"spec"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"elasticsearch.sn1ne.xlarge"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"amount"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;3&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"diskType"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"cloud_efficiency"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"disk"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;500&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;},&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"networkConfig"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"type"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"vpc"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"vpcId"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"vpc-8vbfu74bmw4m7m84a****"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"vswitchId"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"vsw-8vbnk6cmurbpump2f****"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"vsArea"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"cn-zhangjiakou-c"&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;},&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"paymentType"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"postpaid"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"esAdminPassword"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"yourPassword"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"kibanaConfiguration"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"spec"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"elasticsearch.sn1ne.large"&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  Creating an Indexing Service Instance with Only Hot Nodes
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight json"&gt;&lt;code&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"description"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"is-hot"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"nodeAmount"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"esVersion"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"7.10_with_X-Pack"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"instanceCategory"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"IS"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"nodeSpec"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"spec"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"elasticsearch.sn1ne.xlarge"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"disk"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;20&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"diskType"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"cloud_ssd"&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;},&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"networkConfig"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"type"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"vpc"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"vpcId"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"vpc-8vbfu74bmw4m7m84a****"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"vswitchId"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"vsw-8vbnk6cmurbpump2f****"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"vsArea"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"cn-zhangjiakou-c"&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;},&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"paymentType"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"postpaid"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"esAdminPassword"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"yourPassword"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"kibanaConfiguration"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"spec"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"elasticsearch.sn1ne.large"&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Response Parameters
&lt;/h2&gt;

&lt;p&gt;Upon successful execution, the response will include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;code&gt;RequestId&lt;/code&gt;: A unique ID for the request.&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;Result.instanceId&lt;/code&gt;: The ID of the created Elasticsearch instance.
&lt;a&gt;&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Sample Success Responses
&lt;/h3&gt;

&lt;p&gt;&lt;a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h4&gt;
  
  
  XML Format
&lt;/h4&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight xml"&gt;&lt;code&gt;HTTP/1.1 200 OK
Content-Type:application/xml
&lt;span class="nt"&gt;&amp;lt;createInstanceResponse&amp;gt;&lt;/span&gt;
    &lt;span class="nt"&gt;&amp;lt;RequestId&amp;gt;&lt;/span&gt;838D9D11-8EEF-46D8-BF0D-BC8FC2B0C2F3&lt;span class="nt"&gt;&amp;lt;/RequestId&amp;gt;&lt;/span&gt;

    &lt;span class="nt"&gt;&amp;lt;Result&amp;gt;&lt;/span&gt;
        &lt;span class="nt"&gt;&amp;lt;instanceId&amp;gt;&lt;/span&gt;es-is-0u2ecp69tt****&lt;span class="nt"&gt;&amp;lt;/instanceId&amp;gt;&lt;/span&gt;

    &lt;span class="nt"&gt;&amp;lt;/Result&amp;gt;&lt;/span&gt;

&lt;span class="nt"&gt;&amp;lt;/createInstanceResponse&amp;gt;&lt;/span&gt;

&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h4&gt;
  
  
  JSON Format
&lt;/h4&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight json"&gt;&lt;code&gt;&lt;span class="err"&gt;HTTP/&lt;/span&gt;&lt;span class="mf"&gt;1.1&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;200&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="err"&gt;OK&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;span class="err"&gt;Content-Type:application/json&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"RequestId"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"838D9D11-8EEF-46D8-BF0D-BC8FC2B0C2F3"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"Result"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"instanceId"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"es-is-0u2ecp69tt****"&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;Creating an Elasticsearch instance on &lt;a href="https://www.alibabacloud.com/en/product/elasticsearch" rel="noopener noreferrer"&gt;Alibaba Cloud&lt;/a&gt; is a streamlined process, ensuring that you can efficiently manage and scale your data analytics capabilities. By following the guidelines and examples provided in this article, you should be well-equipped to get started.&lt;br&gt;Ready to start your journey with Elasticsearch on Alibaba Cloud? Explore our tailored Cloud solutions and services to take the first step towards transforming your data into a visual masterpiece. &lt;br&gt;&lt;a href="https://c.tb.cn/F3.bTfFpS" rel="noopener noreferrer"&gt;Click here to embark on Your 30-Day Free Trial&lt;/a&gt;&lt;/p&gt;

</description>
      <category>webdev</category>
      <category>beginners</category>
      <category>tutorial</category>
      <category>programming</category>
    </item>
    <item>
      <title>Integrate AMAP into Alibaba Cloud Kibana: A Complete Guide</title>
      <dc:creator>A_Lucas</dc:creator>
      <pubDate>Wed, 24 Jul 2024 02:12:44 +0000</pubDate>
      <link>https://dev.to/a_lucas/integrate-amap-into-alibaba-cloud-kibana-a-complete-guide-3ej</link>
      <guid>https://dev.to/a_lucas/integrate-amap-into-alibaba-cloud-kibana-a-complete-guide-3ej</guid>
      <description>&lt;h2&gt;
  
  
  Background Information
&lt;/h2&gt;

&lt;p&gt;By default, Elastic Maps uses data from OpenStreetMap. However, &lt;a href="https://www.alibabacloud.com/en/product/elasticsearch" rel="noopener noreferrer"&gt;Alibaba Cloud Elasticsearch&lt;/a&gt; Kibana does not display this data. Fortunately, you're able to integrate AMAP into Kibana, allowing you to leverage AMAP data directly on the Maps page of the Kibana console.&lt;br&gt;
&lt;a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Prerequisites
&lt;/h2&gt;

&lt;p&gt;1）You need to create an Alibaba Cloud Elasticsearch cluster. For this example, we'll use an Alibaba Cloud Elasticsearch V8.9 cluster.&lt;br&gt;2）Ensure Cross-Origin Resource Sharing (CORS) is configured for your tile server to grant tile requests from the Kibana origin access to your tile server origin.&lt;br&gt;
&lt;a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Step 1: Configure an AMAP Tile URL
&lt;/h2&gt;

&lt;p&gt;Log into the Kibana console of your Elasticsearch cluster via the Access over Internet entry. See &lt;a href="https://www.alibabacloud.com/help/en/elasticsearch" rel="noopener noreferrer"&gt;Log on to the Kibana console&lt;/a&gt; for more information.&lt;br&gt;&lt;strong&gt;Note&lt;/strong&gt;: Ensure your browser can access the Internet when logging into the Kibana console.&lt;br&gt;
&lt;a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  Steps:
&lt;/h3&gt;

&lt;p&gt;1）After logging into the Kibana console, click on the menu icon in the upper-left corner.&lt;/p&gt;

&lt;p&gt;2）Navigate to Analytics &amp;gt; Maps.&lt;/p&gt;

&lt;p&gt;3）(Optional) Click Create map in the upper-right corner of the Maps page.&lt;/p&gt;

&lt;p&gt;4）Click Add layer in the upper-right corner of the Maps page.&lt;/p&gt;

&lt;p&gt;5）In the Add layer panel, select Tile Map Service.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--Gv0-KORQ--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://img.alicdn.com/imgextra/i2/O1CN01Hx2k8Q1R9eR0v8Hj2_%21%216000000002069-0-tps-558-476.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--Gv0-KORQ--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://img.alicdn.com/imgextra/i2/O1CN01Hx2k8Q1R9eR0v8Hj2_%21%216000000002069-0-tps-558-476.jpg" alt="1.png" width="558" height="476"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;6）Enter a tile URL of AMAP in the Url field. Use HTTPS for the URL. Here are some common AMAP tile URL formats:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--wc6ukzs0--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://img.alicdn.com/imgextra/i3/O1CN01KQaeUV25G8fvi8N20_%21%216000000007498-0-tps-1313-787.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--wc6ukzs0--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://img.alicdn.com/imgextra/i3/O1CN01KQaeUV25G8fvi8N20_%21%216000000007498-0-tps-1313-787.jpg" width="800" height="480"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;{1,4} indicates that you can select a number from 1 to 4.&lt;/li&gt;
&lt;li&gt;{x}, {y}, and {z} separately indicate the related layers.&lt;/li&gt;
&lt;li&gt;lang=zh_cn indicates that the language of annotations is Chinese. lang=en indicates that the language of annotations is English.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;7）In the lower part of the Add layer panel, click &lt;strong&gt;Add and close&lt;/strong&gt;.&lt;br&gt;
&lt;a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Step 2: Upload a Geographic Information Dataset
&lt;/h2&gt;

&lt;p&gt;1）Prepare a geographic information dataset. You can use a provincial boundary dataset which can be obtained from &lt;a href="https://datav.aliyun.com/tools/atlas" rel="noopener noreferrer"&gt;DataV Atlas&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--pKjbctLB--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://img.alicdn.com/imgextra/i1/O1CN01U089Gi1cjL4Ndx0rs_%21%216000000003636-0-tps-501-579.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--pKjbctLB--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://img.alicdn.com/imgextra/i1/O1CN01U089Gi1cjL4Ndx0rs_%21%216000000003636-0-tps-501-579.jpg" alt="2.png" width="501" height="579"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;2）Click Add layer in the upper-right corner of the Maps page.&lt;/p&gt;

&lt;p&gt;3）Click Upload file in the Add layer panel.&lt;/p&gt;

&lt;p&gt;4）Select or drag and drop your dataset file. Supported file formats include .json, .geojson, and .shp.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--xagW9mJT--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://img.alicdn.com/imgextra/i1/O1CN01eaTGl51j68Tknv6Bh_%21%216000000004498-0-tps-584-358.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--xagW9mJT--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://img.alicdn.com/imgextra/i1/O1CN01eaTGl51j68Tknv6Bh_%21%216000000004498-0-tps-584-358.jpg" alt="3.png" width="584" height="358"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;5）Click Import file and then Add as document layer.&lt;/p&gt;

&lt;p&gt;6）Enter a name in the Name field and click Keep changes.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--2SY5oBY1--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://img.alicdn.com/imgextra/i2/O1CN01ktdh9M1hPiAxHJvQc_%21%216000000004270-0-tps-1821-729.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--2SY5oBY1--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://img.alicdn.com/imgextra/i2/O1CN01ktdh9M1hPiAxHJvQc_%21%216000000004270-0-tps-1821-729.jpg" alt="p753942.png" width="800" height="320"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;7）Click Save in the upper-right corner.&lt;/p&gt;

&lt;p&gt;With these steps, AMAP data will be integrated into your Alibaba Cloud Kibana Map!&lt;br&gt;
&lt;a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;Conclusion&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Integrating AMAP into Alibaba Cloud Kibana can significantly enhance your data visualizations, providing rich geographic insights directly within your Elasticsearch environment.&lt;br&gt;&lt;a href="https://www.alibabacloud.com/en/product/elasticsearch" rel="noopener noreferrer"&gt;Alibaba Cloud Elasticsearch&lt;/a&gt; offers a robust platform that simplifies these complex integrations. Ready to start your journey with Elasticsearch on Alibaba Cloud? Explore our tailored Cloud solutions and services to take the first step towards transforming your data into a visual masterpiece.&lt;br&gt;&lt;a href="https://c.tb.cn/F3.bTfFpS" rel="noopener noreferrer"&gt;Please Click here, Embark on Your 30-Day Free Trial&lt;/a&gt;&lt;/p&gt;

</description>
      <category>webdev</category>
      <category>beginners</category>
      <category>programming</category>
      <category>tutorial</category>
    </item>
    <item>
      <title>Elevate Your Elasticsearch Experience with Java High Level REST Client (7.x)</title>
      <dc:creator>A_Lucas</dc:creator>
      <pubDate>Wed, 17 Jul 2024 01:45:55 +0000</pubDate>
      <link>https://dev.to/a_lucas/elevate-your-elasticsearch-experience-with-java-high-level-rest-client-7x-348k</link>
      <guid>https://dev.to/a_lucas/elevate-your-elasticsearch-experience-with-java-high-level-rest-client-7x-348k</guid>
      <description>&lt;h2&gt;
  
  
  Introduction
&lt;/h2&gt;

&lt;p&gt;Java High Level REST Client (7.x) is a powerful tool for interacting with Elasticsearch clusters, making server communication more accessible and efficient. In this guide, we will walk you through the steps to use the High Level REST Client to call Elasticsearch Java APIs on an &lt;a href="https://www.alibabacloud.com/en/product/elasticsearch" rel="noopener noreferrer"&gt;Alibaba Cloud Elasticsearch&lt;/a&gt; cluster.&lt;br&gt;
&lt;a&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h2&gt;
  
  
  Preparations
&lt;/h2&gt;

&lt;p&gt;&lt;a&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h3&gt;
  
  
  Step 1: Create an Elasticsearch Cluster
&lt;/h3&gt;

&lt;p&gt;Ensure your cluster version is the same as or newer than the Java High Level REST Client version you plan to use. For step-by-step instructions, see &lt;a href="https://www.alibabacloud.com/en/product/elasticsearch" rel="noopener noreferrer"&gt;Create an Alibaba Cloud Elasticsearch cluster&lt;/a&gt;.&lt;br&gt;
&lt;a&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h3&gt;
  
  
  Step 2: Enable Auto Indexing
&lt;/h3&gt;

&lt;p&gt;Enable the Auto Indexing feature in the YAML configuration file. For details, see &lt;a href="https://www.alibabacloud.com/help/en/elasticsearch/latest/configuration" rel="noopener noreferrer"&gt;Configure the YML file&lt;/a&gt;.&lt;br&gt;
&lt;a&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h3&gt;
  
  
  Step 3: Configure IP Address Whitelist
&lt;/h3&gt;

&lt;p&gt;Ensure proper communication by configuring an IP address whitelist. If you're accessing the cluster over the Internet, allow requests from the required IP addresses by following the guidelines in &lt;a href="https://www.alibabacloud.com/help/en/elasticsearch/latest/configure-a-whitelist" rel="noopener noreferrer"&gt;Configure a public or private IP address whitelist&lt;/a&gt;.&lt;br&gt;
&lt;a&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h3&gt;
  
  
  Step 4: Install JDK
&lt;/h3&gt;

&lt;p&gt;Install Java Development Kit (JDK) version 1.8 or later. For more information, see &lt;a href="https://www.oracle.com/java/technologies/javase-jdk11-downloads.html" rel="noopener noreferrer"&gt;Install a JDK&lt;/a&gt;.&lt;br&gt;
&lt;a&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h3&gt;
  
  
  Step 5: Create a Java Maven Project
&lt;/h3&gt;

&lt;p&gt;Add the necessary dependencies to your &lt;code&gt;pom.xml&lt;/code&gt; file. Change the version number in the dependencies from &lt;code&gt;7.x&lt;/code&gt; to the specific version of the High Level REST Client you are using.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight xml"&gt;&lt;code&gt;&lt;span class="nt"&gt;&amp;lt;dependency&amp;gt;&lt;/span&gt;
    &lt;span class="nt"&gt;&amp;lt;groupId&amp;gt;&lt;/span&gt;org.elasticsearch.client&lt;span class="nt"&gt;&amp;lt;/groupId&amp;gt;&lt;/span&gt;
    &lt;span class="nt"&gt;&amp;lt;artifactId&amp;gt;&lt;/span&gt;elasticsearch-rest-high-level-client&lt;span class="nt"&gt;&amp;lt;/artifactId&amp;gt;&lt;/span&gt;
    &lt;span class="nt"&gt;&amp;lt;version&amp;gt;&lt;/span&gt;7.x&lt;span class="nt"&gt;&amp;lt;/version&amp;gt;&lt;/span&gt;
&lt;span class="nt"&gt;&amp;lt;/dependency&amp;gt;&lt;/span&gt;
&lt;span class="nt"&gt;&amp;lt;dependency&amp;gt;&lt;/span&gt;
    &lt;span class="nt"&gt;&amp;lt;groupId&amp;gt;&lt;/span&gt;org.apache.logging.log4j&lt;span class="nt"&gt;&amp;lt;/groupId&amp;gt;&lt;/span&gt;
    &lt;span class="nt"&gt;&amp;lt;artifactId&amp;gt;&lt;/span&gt;log4j-core&lt;span class="nt"&gt;&amp;lt;/artifactId&amp;gt;&lt;/span&gt;
    &lt;span class="nt"&gt;&amp;lt;version&amp;gt;&lt;/span&gt;2.20.0&lt;span class="nt"&gt;&amp;lt;/version&amp;gt;&lt;/span&gt;
&lt;span class="nt"&gt;&amp;lt;/dependency&amp;gt;&lt;/span&gt;
&lt;span class="nt"&gt;&amp;lt;dependency&amp;gt;&lt;/span&gt;
    &lt;span class="nt"&gt;&amp;lt;groupId&amp;gt;&lt;/span&gt;org.apache.logging.log4j&lt;span class="nt"&gt;&amp;lt;/groupId&amp;gt;&lt;/span&gt;
    &lt;span class="nt"&gt;&amp;lt;artifactId&amp;gt;&lt;/span&gt;log4j-api&lt;span class="nt"&gt;&amp;lt;/artifactId&amp;gt;&lt;/span&gt;
    &lt;span class="nt"&gt;&amp;lt;version&amp;gt;&lt;/span&gt;2.20.0&lt;span class="nt"&gt;&amp;lt;/version&amp;gt;&lt;/span&gt;
&lt;span class="nt"&gt;&amp;lt;/dependency&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Example: Managing an Index
&lt;/h2&gt;

&lt;p&gt;Below is an example of creating and deleting an index using the High Level REST Client. Replace placeholders &lt;code&gt;{}&lt;/code&gt; with your specific parameters.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="nn"&gt;org.apache.http.HttpHost&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="nn"&gt;org.apache.http.auth.AuthScope&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="nn"&gt;org.apache.http.auth.UsernamePasswordCredentials&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="nn"&gt;org.apache.http.client.CredentialsProvider&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="nn"&gt;org.apache.http.impl.client.BasicCredentialsProvider&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="nn"&gt;org.apache.http.impl.nio.client.HttpAsyncClientBuilder&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="nn"&gt;org.elasticsearch.action.delete.DeleteRequest&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="nn"&gt;org.elasticsearch.action.delete.DeleteResponse&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="nn"&gt;org.elasticsearch.action.index.IndexRequest&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="nn"&gt;org.elasticsearch.action.index.IndexResponse&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="nn"&gt;org.elasticsearch.client.*&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;

&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="nn"&gt;java.io.IOException&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="nn"&gt;java.util.HashMap&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="nn"&gt;java.util.Map&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;

&lt;span class="kd"&gt;public&lt;/span&gt; &lt;span class="kd"&gt;class&lt;/span&gt; &lt;span class="nc"&gt;RestClientExample&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
    &lt;span class="kd"&gt;private&lt;/span&gt; &lt;span class="kd"&gt;static&lt;/span&gt; &lt;span class="kd"&gt;final&lt;/span&gt; &lt;span class="nc"&gt;RequestOptions&lt;/span&gt; &lt;span class="no"&gt;COMMON_OPTIONS&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;

    &lt;span class="kd"&gt;static&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
        &lt;span class="nc"&gt;RequestOptions&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;Builder&lt;/span&gt; &lt;span class="n"&gt;builder&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nc"&gt;RequestOptions&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;DEFAULT&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;toBuilder&lt;/span&gt;&lt;span class="o"&gt;();&lt;/span&gt;
        &lt;span class="n"&gt;builder&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;setHttpAsyncResponseConsumerFactory&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;
                &lt;span class="k"&gt;new&lt;/span&gt; &lt;span class="nc"&gt;HttpAsyncResponseConsumerFactory&lt;/span&gt;
                        &lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;HeapBufferedResponseConsumerFactory&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="mi"&gt;30&lt;/span&gt; &lt;span class="o"&gt;*&lt;/span&gt; &lt;span class="mi"&gt;1024&lt;/span&gt; &lt;span class="o"&gt;*&lt;/span&gt; &lt;span class="mi"&gt;1024&lt;/span&gt;&lt;span class="o"&gt;));&lt;/span&gt;
        &lt;span class="no"&gt;COMMON_OPTIONS&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;builder&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;build&lt;/span&gt;&lt;span class="o"&gt;();&lt;/span&gt;
    &lt;span class="o"&gt;}&lt;/span&gt;

    &lt;span class="kd"&gt;public&lt;/span&gt; &lt;span class="kd"&gt;static&lt;/span&gt; &lt;span class="kt"&gt;void&lt;/span&gt; &lt;span class="nf"&gt;main&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="nc"&gt;String&lt;/span&gt;&lt;span class="o"&gt;[]&lt;/span&gt; &lt;span class="n"&gt;args&lt;/span&gt;&lt;span class="o"&gt;)&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
        &lt;span class="kd"&gt;final&lt;/span&gt; &lt;span class="nc"&gt;CredentialsProvider&lt;/span&gt; &lt;span class="n"&gt;credentialsProvider&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;new&lt;/span&gt; &lt;span class="nc"&gt;BasicCredentialsProvider&lt;/span&gt;&lt;span class="o"&gt;();&lt;/span&gt;
        &lt;span class="n"&gt;credentialsProvider&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;setCredentials&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="nc"&gt;AuthScope&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;ANY&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; 
            &lt;span class="k"&gt;new&lt;/span&gt; &lt;span class="nf"&gt;UsernamePasswordCredentials&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="s"&gt;"{Username}"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="s"&gt;"{Password}"&lt;/span&gt;&lt;span class="o"&gt;));&lt;/span&gt;

        &lt;span class="nc"&gt;RestClientBuilder&lt;/span&gt; &lt;span class="n"&gt;builder&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nc"&gt;RestClient&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;builder&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="k"&gt;new&lt;/span&gt; &lt;span class="nc"&gt;HttpHost&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="s"&gt;"{Endpoint of the Elasticsearch cluster}"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;9200&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="s"&gt;"http"&lt;/span&gt;&lt;span class="o"&gt;))&lt;/span&gt;
                &lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;setHttpClientConfigCallback&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="k"&gt;new&lt;/span&gt; &lt;span class="nc"&gt;RestClientBuilder&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;HttpClientConfigCallback&lt;/span&gt;&lt;span class="o"&gt;()&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
                    &lt;span class="nd"&gt;@Override&lt;/span&gt;
                    &lt;span class="kd"&gt;public&lt;/span&gt; &lt;span class="nc"&gt;HttpAsyncClientBuilder&lt;/span&gt; &lt;span class="nf"&gt;customizeHttpClient&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="nc"&gt;HttpAsyncClientBuilder&lt;/span&gt; &lt;span class="n"&gt;httpClientBuilder&lt;/span&gt;&lt;span class="o"&gt;)&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
                        &lt;span class="k"&gt;return&lt;/span&gt; &lt;span class="n"&gt;httpClientBuilder&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;setDefaultCredentialsProvider&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="n"&gt;credentialsProvider&lt;/span&gt;&lt;span class="o"&gt;);&lt;/span&gt;
                    &lt;span class="o"&gt;}&lt;/span&gt;
                &lt;span class="o"&gt;});&lt;/span&gt;

        &lt;span class="nc"&gt;RestHighLevelClient&lt;/span&gt; &lt;span class="n"&gt;highClient&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;new&lt;/span&gt; &lt;span class="nc"&gt;RestHighLevelClient&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="n"&gt;builder&lt;/span&gt;&lt;span class="o"&gt;);&lt;/span&gt;

        &lt;span class="k"&gt;try&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
            &lt;span class="nc"&gt;Map&lt;/span&gt;&lt;span class="o"&gt;&amp;lt;&lt;/span&gt;&lt;span class="nc"&gt;String&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;Object&lt;/span&gt;&lt;span class="o"&gt;&amp;gt;&lt;/span&gt; &lt;span class="n"&gt;jsonMap&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;new&lt;/span&gt; &lt;span class="nc"&gt;HashMap&lt;/span&gt;&lt;span class="o"&gt;&amp;lt;&amp;gt;();&lt;/span&gt;
            &lt;span class="n"&gt;jsonMap&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;put&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="s"&gt;"{field_01}"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="s"&gt;"{value_01}"&lt;/span&gt;&lt;span class="o"&gt;);&lt;/span&gt;
            &lt;span class="n"&gt;jsonMap&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;put&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="s"&gt;"{field_02}"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="s"&gt;"{value_02}"&lt;/span&gt;&lt;span class="o"&gt;);&lt;/span&gt;

            &lt;span class="nc"&gt;IndexRequest&lt;/span&gt; &lt;span class="n"&gt;indexRequest&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;new&lt;/span&gt; &lt;span class="nc"&gt;IndexRequest&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="s"&gt;"{index_name}"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="s"&gt;"_doc"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="s"&gt;"{doc_id}"&lt;/span&gt;&lt;span class="o"&gt;).&lt;/span&gt;&lt;span class="na"&gt;source&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="n"&gt;jsonMap&lt;/span&gt;&lt;span class="o"&gt;);&lt;/span&gt;
            &lt;span class="nc"&gt;IndexResponse&lt;/span&gt; &lt;span class="n"&gt;indexResponse&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;highClient&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;index&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="n"&gt;indexRequest&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="no"&gt;COMMON_OPTIONS&lt;/span&gt;&lt;span class="o"&gt;);&lt;/span&gt;
            &lt;span class="kt"&gt;long&lt;/span&gt; &lt;span class="n"&gt;version&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;indexResponse&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;getVersion&lt;/span&gt;&lt;span class="o"&gt;();&lt;/span&gt;
            &lt;span class="nc"&gt;System&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;out&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;println&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="s"&gt;"Index document successfully! "&lt;/span&gt; &lt;span class="o"&gt;+&lt;/span&gt; &lt;span class="n"&gt;version&lt;/span&gt;&lt;span class="o"&gt;);&lt;/span&gt;

            &lt;span class="nc"&gt;DeleteRequest&lt;/span&gt; &lt;span class="n"&gt;deleteRequest&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;new&lt;/span&gt; &lt;span class="nc"&gt;DeleteRequest&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="s"&gt;"{index_name}"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="s"&gt;"_doc"&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="s"&gt;"{doc_id}"&lt;/span&gt;&lt;span class="o"&gt;);&lt;/span&gt;
            &lt;span class="nc"&gt;DeleteResponse&lt;/span&gt; &lt;span class="n"&gt;deleteResponse&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;highClient&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;delete&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="n"&gt;deleteRequest&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="no"&gt;COMMON_OPTIONS&lt;/span&gt;&lt;span class="o"&gt;);&lt;/span&gt;
            &lt;span class="nc"&gt;System&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;out&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;println&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="s"&gt;"Delete document successfully! \n"&lt;/span&gt; &lt;span class="o"&gt;+&lt;/span&gt; &lt;span class="n"&gt;deleteResponse&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;toString&lt;/span&gt;&lt;span class="o"&gt;());&lt;/span&gt;

            &lt;span class="n"&gt;highClient&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;close&lt;/span&gt;&lt;span class="o"&gt;();&lt;/span&gt;
        &lt;span class="o"&gt;}&lt;/span&gt; &lt;span class="k"&gt;catch&lt;/span&gt; &lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="nc"&gt;IOException&lt;/span&gt; &lt;span class="n"&gt;ioException&lt;/span&gt;&lt;span class="o"&gt;)&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
            &lt;span class="n"&gt;ioException&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;printStackTrace&lt;/span&gt;&lt;span class="o"&gt;();&lt;/span&gt;
        &lt;span class="o"&gt;}&lt;/span&gt;
    &lt;span class="o"&gt;}&lt;/span&gt;
&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  High-Concurrency Configuration
&lt;/h2&gt;

&lt;p&gt;For high-concurrency scenarios, increase the number of client connections:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="n"&gt;httpClientBuilder&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;setMaxConnTotal&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="mi"&gt;500&lt;/span&gt;&lt;span class="o"&gt;);&lt;/span&gt;
&lt;span class="n"&gt;httpClientBuilder&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;setMaxConnPerRoute&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="mi"&gt;300&lt;/span&gt;&lt;span class="o"&gt;);&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Sample code snippet:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight java"&gt;&lt;code&gt;&lt;span class="nc"&gt;String&lt;/span&gt; &lt;span class="n"&gt;host&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="s"&gt;"127.0.0.1"&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="kt"&gt;int&lt;/span&gt; &lt;span class="n"&gt;port&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="mi"&gt;9200&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="nc"&gt;String&lt;/span&gt; &lt;span class="n"&gt;username&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="s"&gt;"elastic"&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="nc"&gt;String&lt;/span&gt; &lt;span class="n"&gt;password&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="s"&gt;"passwd"&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="kd"&gt;final&lt;/span&gt; &lt;span class="kt"&gt;int&lt;/span&gt; &lt;span class="n"&gt;max_conn_total&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="mi"&gt;500&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;
&lt;span class="kd"&gt;final&lt;/span&gt; &lt;span class="kt"&gt;int&lt;/span&gt; &lt;span class="n"&gt;max_conn_per_route&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="mi"&gt;300&lt;/span&gt;&lt;span class="o"&gt;;&lt;/span&gt;

&lt;span class="nc"&gt;RestHighLevelClient&lt;/span&gt; &lt;span class="n"&gt;restHighLevelClient&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;new&lt;/span&gt; &lt;span class="nc"&gt;RestHighLevelClient&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;
  &lt;span class="nc"&gt;RestClient&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;builder&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="k"&gt;new&lt;/span&gt; &lt;span class="nc"&gt;HttpHost&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="n"&gt;host&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;port&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="s"&gt;"http"&lt;/span&gt;&lt;span class="o"&gt;)).&lt;/span&gt;&lt;span class="na"&gt;setHttpClientConfigCallback&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="k"&gt;new&lt;/span&gt; &lt;span class="nc"&gt;RestClientBuilder&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;HttpClientConfigCallback&lt;/span&gt;&lt;span class="o"&gt;()&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
    &lt;span class="kd"&gt;public&lt;/span&gt; &lt;span class="nc"&gt;HttpAsyncClientBuilder&lt;/span&gt; &lt;span class="nf"&gt;customizeHttpClient&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="nc"&gt;HttpAsyncClientBuilder&lt;/span&gt; &lt;span class="n"&gt;httpClientBuilder&lt;/span&gt;&lt;span class="o"&gt;)&lt;/span&gt; &lt;span class="o"&gt;{&lt;/span&gt;
      &lt;span class="n"&gt;httpClientBuilder&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;setMaxConnTotal&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="n"&gt;max_conn_total&lt;/span&gt;&lt;span class="o"&gt;);&lt;/span&gt;
      &lt;span class="n"&gt;httpClientBuilder&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;setMaxConnPerRoute&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="n"&gt;max_conn_per_route&lt;/span&gt;&lt;span class="o"&gt;);&lt;/span&gt;
      &lt;span class="k"&gt;return&lt;/span&gt; &lt;span class="n"&gt;httpClientBuilder&lt;/span&gt;&lt;span class="o"&gt;.&lt;/span&gt;&lt;span class="na"&gt;setDefaultCredentialsProvider&lt;/span&gt;&lt;span class="o"&gt;(&lt;/span&gt;&lt;span class="n"&gt;credentialsProvider&lt;/span&gt;&lt;span class="o"&gt;);&lt;/span&gt;
    &lt;span class="o"&gt;}&lt;/span&gt;
  &lt;span class="o"&gt;})&lt;/span&gt;
&lt;span class="o"&gt;);&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;For more details on features and configurations, see the official &lt;a href="https://www.elastic.co/guide/en/elasticsearch/client/java-rest/current/index.html" rel="noopener noreferrer"&gt;Java High Level REST Client documentation&lt;/a&gt;.&lt;br&gt;
&lt;a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;Using the Java High Level REST Client ensures efficient interaction with your &lt;a href="https://www.alibabacloud.com/en/product/elasticsearch" rel="noopener noreferrer"&gt;Alibaba Cloud Elasticsearch&lt;/a&gt; cluster. Follow this guide to make the most out of your Elasticsearch setup.&lt;br&gt;Ready to start your journey with Elasticsearch on Alibaba Cloud? Explore our tailored Cloud solutions and services to transform your data into a visual masterpiece. &lt;br&gt;&lt;br&gt;
&lt;a href="https://c.tb.cn/F3.bTfFpS" rel="noopener noreferrer"&gt;Click here to embark on Your 30-Day Free Trial&lt;/a&gt;&lt;/p&gt;

</description>
      <category>javascript</category>
      <category>tutorial</category>
      <category>productivity</category>
      <category>webdev</category>
    </item>
  </channel>
</rss>
