<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Edwin Lisowski</title>
    <description>The latest articles on DEV Community by Edwin Lisowski (@edwin_lisowski).</description>
    <link>https://dev.to/edwin_lisowski</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F2463335%2F82ed05dd-b503-4f2c-b955-8c2188fddbd7.jpg</url>
      <title>DEV Community: Edwin Lisowski</title>
      <link>https://dev.to/edwin_lisowski</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/edwin_lisowski"/>
    <language>en</language>
    <item>
      <title>СontextCheck: LLM &amp; RAG Evaluation Framework</title>
      <dc:creator>Edwin Lisowski</dc:creator>
      <pubDate>Wed, 27 Nov 2024 08:51:21 +0000</pubDate>
      <link>https://dev.to/edwin_lisowski/sontextcheck-llm-rag-evaluation-framework-59a9</link>
      <guid>https://dev.to/edwin_lisowski/sontextcheck-llm-rag-evaluation-framework-59a9</guid>
      <description>&lt;p&gt;Hi all! We open-sourced a framework for testing LLMs, RAGs, and chatbots. The tool automates query generation, completion requests, regression detection, penetration testing, and hallucination assessment. Designed for developers, researchers, and businesses. And we are looking for contributors! Feel free to try it out for yourself and share your feedback!&lt;/p&gt;

&lt;p&gt;&lt;a href="https://github.com/Addepto/contextcheck" rel="noopener noreferrer"&gt;Repo on Github&lt;/a&gt;&lt;/p&gt;

</description>
      <category>aiops</category>
    </item>
    <item>
      <title>ContextCheck: An open-source framework for testing and evaluating LLMs, RAGs, Chatbots</title>
      <dc:creator>Edwin Lisowski</dc:creator>
      <pubDate>Thu, 21 Nov 2024 10:20:15 +0000</pubDate>
      <link>https://dev.to/edwin_lisowski/contextcheck-an-open-source-framework-for-testing-and-evaluating-llms-rags-chatbots-1mpa</link>
      <guid>https://dev.to/edwin_lisowski/contextcheck-an-open-source-framework-for-testing-and-evaluating-llms-rags-chatbots-1mpa</guid>
      <description>&lt;p&gt;Hey devs!&lt;/p&gt;

&lt;p&gt;We just open-sourced ContextCheck, a framework for testing and evaluating LLMs, RAGs, and chatbots 🚀&lt;/p&gt;

&lt;p&gt;What it does:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Generates queries and handles completions&lt;/li&gt;
&lt;li&gt;Detects regressions and hallucinations&lt;/li&gt;
&lt;li&gt;Runs penetration tests&lt;/li&gt;
&lt;li&gt;Works in CI pipelines (YAML-configurable)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;We built it while developing our AI Knowledge Base Assistant to solve real headaches with testing and validating LLMs. Now it’s out there for you to use, break, and improve.&lt;/p&gt;

&lt;p&gt;Try it out and let us know what you think! ➡️ &lt;a href="https://github.com/Addepto/contextcheck" rel="noopener noreferrer"&gt;Github repo&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
    </item>
    <item>
      <title>ContextCheck: An open-source framework for testing and evaluating LLMs, RAGs, Chatbots</title>
      <dc:creator>Edwin Lisowski</dc:creator>
      <pubDate>Thu, 21 Nov 2024 10:04:20 +0000</pubDate>
      <link>https://dev.to/edwin_lisowski/contextcheck-an-open-source-framework-for-testing-and-evaluating-llms-rags-chatbots-3hkn</link>
      <guid>https://dev.to/edwin_lisowski/contextcheck-an-open-source-framework-for-testing-and-evaluating-llms-rags-chatbots-3hkn</guid>
      <description>&lt;p&gt;Hey everyone!&lt;/p&gt;

&lt;p&gt;I’m one of the co-founders of Addepto, and I’m excited to share ContextCheck—a new open-source framework we’ve developed for testing and evaluating LLMs, RAGs, and chatbots.&lt;/p&gt;

&lt;p&gt;ContextCheck offers tools to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Automatically generate queries and request completions&lt;/li&gt;
&lt;li&gt;Detect regressions and assess hallucinations&lt;/li&gt;
&lt;li&gt;Perform penetration testing&lt;/li&gt;
&lt;li&gt;Ensure the robustness and reliability of AI systems&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It’s fully configurable via YAML and integrates seamlessly into CI pipelines for automated testing.&lt;/p&gt;

&lt;p&gt;We built ContextCheck during the development of our AI-powered Knowledge Base Assistant to solve the challenges we faced with testing and validating Large Language Models. It’s a tool designed by developers for developers to tackle real-world issues.&lt;/p&gt;

&lt;p&gt;We’d love for you to try it out, contribute, and share your feedback!&lt;/p&gt;

&lt;p&gt;&lt;a href="https://github.com/Addepto/contextcheck" rel="noopener noreferrer"&gt;Github repo&lt;/a&gt;&lt;/p&gt;

</description>
      <category>showdev</category>
      <category>github</category>
      <category>opensource</category>
      <category>llm</category>
    </item>
  </channel>
</rss>
