<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Jannik Maierhoefer</title>
    <description>The latest articles on DEV Community by Jannik Maierhoefer (@jannik_maierhoefer).</description>
    <link>https://dev.to/jannik_maierhoefer</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F2450649%2Fde694cef-66da-45d9-9a98-5d297fe0def8.jpg</url>
      <title>DEV Community: Jannik Maierhoefer</title>
      <link>https://dev.to/jannik_maierhoefer</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/jannik_maierhoefer"/>
    <language>en</language>
    <item>
      <title>Langfuse Launch Week #2</title>
      <dc:creator>Jannik Maierhoefer</dc:creator>
      <pubDate>Tue, 19 Nov 2024 19:32:28 +0000</pubDate>
      <link>https://dev.to/jannik_maierhoefer/langfuse-launch-week-2-5ged</link>
      <guid>https://dev.to/jannik_maierhoefer/langfuse-launch-week-2-5ged</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fs8t5icdjlti3u7i16n5s.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fs8t5icdjlti3u7i16n5s.png" alt="Langfuse Launch Week Header Image" width="800" height="523"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://langfuse.com" rel="noopener noreferrer"&gt;Langfuse&lt;/a&gt;, the open-source LLM engineering platform, is excited to announce its &lt;strong&gt;second Launch Week&lt;/strong&gt;, starting on &lt;strong&gt;Monday, November 18, 2024&lt;/strong&gt;. This week-long event will feature daily platform updates, culminating in a &lt;strong&gt;Product Hunt launch&lt;/strong&gt; on Friday and a &lt;strong&gt;Virtual Town Hall&lt;/strong&gt; on Wednesday.&lt;/p&gt;




&lt;h2&gt;
  
  
  Focus of Launch Week
&lt;/h2&gt;

&lt;p&gt;Langfuse's second Launch Week is all about supporting the &lt;strong&gt;next generation of AI models&lt;/strong&gt; and integrating the platform more deeply into developer workflows. The updates aim to deliver &lt;strong&gt;end-to-end prompt engineering tools&lt;/strong&gt; specifically designed for product teams, enhancing the &lt;strong&gt;robustness&lt;/strong&gt; and &lt;strong&gt;versatility&lt;/strong&gt; of AI applications.&lt;/p&gt;




&lt;h2&gt;
  
  
  🔻 Day 0: Prompt Management for Vercel AI SDK
&lt;/h2&gt;

&lt;p&gt;On the first day, Langfuse introduced &lt;strong&gt;&lt;a href="https://langfuse.com/changelog/2024-11-17-vercel-ai-sdk-prompt-mgmt" rel="noopener noreferrer"&gt;native integration&lt;/a&gt;&lt;/strong&gt; of its Prompt Management with the &lt;strong&gt;Vercel AI SDK&lt;/strong&gt;. This integration enables developers to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Version and release prompts&lt;/strong&gt; directly in Langfuse.&lt;/li&gt;
&lt;li&gt;Utilize prompts via the &lt;strong&gt;Vercel AI SDK&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Seamlessly monitor metrics like latency, costs, and usage.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This update answers critical questions for developers:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Which prompt version caused a specific bug?&lt;/li&gt;
&lt;li&gt;What’s the cost and latency impact of each prompt version?&lt;/li&gt;
&lt;li&gt;Which prompt versions are most used?&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  🆚 Day 1: Dataset Experiment Run Comparison View
&lt;/h2&gt;

&lt;p&gt;The second day brought a new &lt;strong&gt;&lt;a href="https://langfuse.com/changelog/2024-11-18-dataset-runs-comparison-view" rel="noopener noreferrer"&gt;comparison view for dataset experiment runs&lt;/a&gt;&lt;/strong&gt; within Langfuse Datasets. This powerful feature allows teams to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Analyze &lt;strong&gt;multiple experiment runs side-by-side&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Compare application performance across test dataset experiments.&lt;/li&gt;
&lt;li&gt;Explore metrics like latency and costs.&lt;/li&gt;
&lt;li&gt;Drill down into individual dataset items.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This enhancement is particularly valuable for testing different prompts, models, or application configurations, making it a must-have tool for teams working on &lt;strong&gt;AI-powered products&lt;/strong&gt;.&lt;/p&gt;




&lt;h2&gt;
  
  
  ⚖️ Day 2: LLM-as-a-Judge Evaluations for Datasets
&lt;/h2&gt;

&lt;p&gt;Day 2 of Launch Week 2 brings managed &lt;strong&gt;&lt;a href="https://langfuse.com/changelog/2024-11-19-llm-as-a-judge-for-datasets" rel="noopener noreferrer"&gt;LLM-as-a-judge&lt;/a&gt; evaluators&lt;/strong&gt; to dataset experiments. Assign evaluators to your datasets and they will automatically run on new experiment runs, scoring your outputs based on your evaluation criteria.&lt;/p&gt;

&lt;p&gt;You can run any &lt;strong&gt;LLM-as-a-judge&lt;/strong&gt; prompt, Langfuse comes with templates for the following evaluation criteria: Hallucination, Helpfulness, Relevance, Toxicity, Correctness, Contextrelevance, Contextcorrectness, Conciseness.&lt;/p&gt;

&lt;p&gt;Langfuse LLM-as-a-judge works with any LLM that supports tool/function calling that is accessible via the following APIs: OpenAI, Azure OpenAI, Anthropic, AWS Bedrock. Via LLM gateways such as LiteLLM, virtually any popular LLM can be used via the OpenAI connector.&lt;/p&gt;




&lt;h2&gt;
  
  
  🎨 Day 3: Full multi-modal support, including audio, images, and attachments
&lt;/h2&gt;

&lt;p&gt;We're excited that Langfuse now offers &lt;a href="https://langfuse.com/changelog/2024-11-20-full-multi-modal-images-audio-attachments" rel="noopener noreferrer"&gt;full multi-modal support&lt;/a&gt;, including images, audio files, and attachments! This highly requested feature allows you to integrate media such as images (PNG, JPG, WEBP), audio files (MPEG, MP3, WAV), and documents (PDF, plain text) directly into your traces, enhancing your development and monitoring workflow in Langfuse.&lt;/p&gt;

&lt;p&gt;Getting started is easy—simply upgrade to the latest version of the Langfuse SDK. Our SDKs now automatically handle base64 encoded media, extracting and uploading them independently while referencing them in your traces. For more control or different media types, you can use the new LangfuseMediaclass to wrap your media before inclusion.&lt;/p&gt;




&lt;h2&gt;
  
  
  📚 Day 4: All new Datasets and Evaluations documentation
&lt;/h2&gt;

&lt;p&gt;Today we're highlighting documentation - an often overlooked but critical element of great Developer Experience. Alongside major updates to our Datasets and Evaluations features, we've completely rebuilt their documentation to be more thorough and user-friendly than ever before. The new docs better explain how and when to use these features, introduce core data models, and provide end-to-end examples as Jupyter Notebooks. We've also revamped the &lt;code&gt;/docs&lt;/code&gt; start page to reflect Langfuse's comprehensive platform scope, and added &lt;code&gt;llms.txt&lt;/code&gt; for better LLM tool integration. Documentation is product at Langfuse - we take it seriously and have built many features to help users get the most value from it.&lt;/p&gt;

&lt;p&gt;See the &lt;a href="https://dev.to/changelog/2024-11-21-all-new-datasets-and-evals-documentation"&gt;changelog&lt;/a&gt; for more details. It also includes a summary of all the features we added to the documentation over the last year to make it truly awesome.&lt;/p&gt;




&lt;h2&gt;
  
  
  🧪 Day 5: Prompt Experiments
&lt;/h2&gt;

&lt;p&gt;Prompt Experiments are the final piece of the launch week theme of "closing the development loop". They allow you to test prompt versions from Langfuse Prompt Management on datasets of test inputs and expected outputs. You can optionally use LLM-as-a-Judge evaluators to automatically evaluate responses based on expected outputs, and compare results in the new side-by-side experiment comparison view. This powerful combination speeds up the feedback loop when working on prompts and prevents regressions when making rapid prompt changes&lt;/p&gt;

&lt;p&gt;See the &lt;a href="https://langfuse.com/changelog/2024-11-22-prompt-experimentation" rel="noopener noreferrer"&gt;changelog&lt;/a&gt; for more details or watch the video above for a walkthrough.&lt;/p&gt;




&lt;h2&gt;
  
  
  🍒 Extra Goodies
&lt;/h2&gt;

&lt;p&gt;List of additional features that were released this week:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;a href="https://langfuse.com/changelog/2024-11-17-llms-txt" rel="noopener noreferrer"&gt;&lt;code&gt;llms.txt&lt;/code&gt;&lt;/a&gt;: Easily use the Langfuse documentation in Cursor and other LLM editors via the new &lt;code&gt;llms.txt&lt;/code&gt; file.&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://dev.to/docs"&gt;&lt;code&gt;/docs&lt;/code&gt;&lt;/a&gt;: New documentation start page with a simplified overview of all Langfuse features.&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://langfuse.com/pricing-self-host" rel="noopener noreferrer"&gt;Self-hosted Pro Plan&lt;/a&gt;: Get access to additional features without the need for a sales call or enterprise pricing. All core Langfuse features are OSS without limitations, see &lt;a href="https://dev.to/pricing-self-host"&gt;comparison&lt;/a&gt; for more details.&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://langfuse.com/docs/deployment/v3/overview" rel="noopener noreferrer"&gt;Developer Preview of v3 (self-hosted)&lt;/a&gt;: v3 is the biggest release in Langfuse history. After running large parts of it on Langfuse Cloud for a while, an initial developer preview for self-hosted users is now available.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Stay Updated
&lt;/h2&gt;

&lt;p&gt;Stay connected with Langfuse during Launch Week:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;🌟 &lt;strong&gt;Star the project on GitHub&lt;/strong&gt; to show your support.&lt;/li&gt;
&lt;li&gt;Follow Langfuse on &lt;a href="https://twitter.com/langfuse" rel="noopener noreferrer"&gt;Twitter&lt;/a&gt; and &lt;a href="https://linkedin.com/company/langfuse" rel="noopener noreferrer"&gt;LinkedIn&lt;/a&gt; for updates.&lt;/li&gt;
&lt;li&gt;Subscribe to the &lt;a href="https://langfuse.com" rel="noopener noreferrer"&gt;Langfuse mailing list&lt;/a&gt; to receive daily updates throughout the week.&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;&lt;strong&gt;Learn more:&lt;/strong&gt; &lt;a href="https://langfuse.com/blog/2024-11-17-launch-week-2" rel="noopener noreferrer"&gt;Langfuse Blog&lt;/a&gt;&lt;/p&gt;

</description>
    </item>
  </channel>
</rss>
