<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Nexdata AI</title>
    <description>The latest articles on DEV Community by Nexdata AI (@nexdata).</description>
    <link>https://dev.to/nexdata</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F2957126%2Fa680d98b-32dc-48ca-b2d0-7761d47661d8.png</url>
      <title>DEV Community: Nexdata AI</title>
      <link>https://dev.to/nexdata</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/nexdata"/>
    <language>en</language>
    <item>
      <title>Free Registration, Free Dataset, and $20K Prize Pool: Join the 2nd MLC-SLM Challenge 2026</title>
      <dc:creator>Nexdata AI</dc:creator>
      <pubDate>Wed, 29 Apr 2026 07:15:49 +0000</pubDate>
      <link>https://dev.to/nexdata/free-registration-free-dataset-and-20k-prize-pool-join-the-2nd-mlc-slm-challenge-2026-4nni</link>
      <guid>https://dev.to/nexdata/free-registration-free-dataset-and-20k-prize-pool-join-the-2nd-mlc-slm-challenge-2026-4nni</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4sizttwu7fjwola4hpm5.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4sizttwu7fjwola4hpm5.png" alt=" " width="800" height="403"&gt;&lt;/a&gt;The &lt;strong&gt;2nd Multilingual Conversational Speech Language Models Challenge 2026&lt;/strong&gt; is now open for registration.&lt;/p&gt;

&lt;p&gt;This year’s challenge focuses on advancing &lt;strong&gt;Speech Large Language Models&lt;/strong&gt; for real-world multilingual conversational speech, with tasks covering &lt;strong&gt;speaker diarization, speech recognition, and conversational speech understanding.&lt;/strong&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Why join?
&lt;/h2&gt;

&lt;p&gt;The 2nd MLC-SLM Challenge offers:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Free registration&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Free access to a large-scale multilingual conversational speech dataset&lt;/strong&gt; for registered participants, featuring around &lt;strong&gt;2,100 hours of data across 14 languages&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;A total prize pool of &lt;strong&gt;USD 20,000&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;Support for both academic and industry teams, as well as individual researchers&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The first MLC-SLM Challenge attracted 78 teams from 13 countries and regions, with 489 valid leaderboard submissions and 14 technical reports. &lt;strong&gt;Its summary paper has also been accepted by ICASSP 2026.&lt;/strong&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Challenge tasks
&lt;/h2&gt;

&lt;p&gt;Participants can work on two tracks:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Task 1: Multilingual Conversational Speech Diarization and Recognition&lt;/strong&gt;&lt;br&gt;
Build systems that identify who is speaking when and transcribe multilingual conversational speech. No oracle segmentation or speaker labels will be provided during evaluation.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Task 2: Multilingual Conversational Speech Understanding&lt;/strong&gt;&lt;br&gt;
Build systems that understand multilingual conversations through acoustic and semantic information. Evaluation will be based on multiple-choice questions about the full conversation.&lt;/p&gt;

&lt;p&gt;Both pipeline-based and end-to-end Speech LLM systems are welcome. External datasets and pretrained models are allowed, as long as they are freely accessible and clearly reported.&lt;/p&gt;

&lt;h2&gt;
  
  
  Dataset highlights
&lt;/h2&gt;

&lt;p&gt;The challenge dataset contains around &lt;strong&gt;2,100 hours&lt;/strong&gt; of two-speaker conversational speech across &lt;strong&gt;14 languages.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;It also includes diverse regional accents, such as &lt;strong&gt;Canadian French, Mexican Spanish, Brazilian Portuguese, British English, American English, Australian English, Indian English, and Philippine English.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This makes the challenge a valuable testbed for researchers working on multilingual ASR, speaker diarization, Speech LLMs, and spoken language understanding.&lt;/p&gt;

&lt;h2&gt;
  
  
  Registration
&lt;/h2&gt;

&lt;p&gt;Registration is now open.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Participation is free, and the dataset will be provided free of charge to registered participants.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Registration Link: &lt;a href="https://forms.gle/jfAZ95abGy4ZiNHo7" rel="noopener noreferrer"&gt;https://forms.gle/jfAZ95abGy4ZiNHo7&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;More Details: &lt;a href="https://www.nexdata.ai/competition/mlc-slm" rel="noopener noreferrer"&gt;https://www.nexdata.ai/competition/mlc-slm&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Contact Email: &lt;a href="mailto:mlc-slmw@nexdata.ai"&gt;mlc-slmw@nexdata.ai&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Join the challenge and help advance the next generation of multilingual Speech LLMs.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>machinelearning</category>
      <category>nlp</category>
    </item>
    <item>
      <title>Interspeech 2025 Multilingual Conversational Speech Language Model (MLC-SLM) Challenge</title>
      <dc:creator>Nexdata AI</dc:creator>
      <pubDate>Thu, 20 Mar 2025 08:11:26 +0000</pubDate>
      <link>https://dev.to/nexdata/interspeech-2025-multilingual-conversational-speech-language-model-mlc-slm-challenge-47mm</link>
      <guid>https://dev.to/nexdata/interspeech-2025-multilingual-conversational-speech-language-model-mlc-slm-challenge-47mm</guid>
      <description>&lt;p&gt;The Multilingual Conversational Speech LLM (MLC-SLM) Challenge is now open as a satellite event of Interspeech 2025!&lt;/p&gt;

&lt;p&gt;Hosted by Meta, Google, Samsung Electronics, NAVER Corp, China Mobile, Northwestern Polytechnical University and Nexdata, this challenge aims to advance multilingual conversational AI by developing cutting-edge speech language models and providing access to a real-world multilingual conversational speech dataset.&lt;/p&gt;

&lt;p&gt;The challenge consists of two tasks, both of which require participants to explore the development of speech language models (SLMs):&lt;/p&gt;

&lt;p&gt;Task I: Multilingual Conversational Speech Recognition&lt;/p&gt;

&lt;p&gt;Objective: Develop a multilingual LLM-based ASR model. Participants will be provided with oracle segmentation and speaker labels for each conversation.&lt;/p&gt;

&lt;p&gt;Task II: Multilingual Conversational Speech Diarization and Recognition&lt;/p&gt;

&lt;p&gt;Objective: Develop a system for both speaker diarization (identifying who is speaking when) and recognition (transcribing speech to text). No prior or oracle information, such as pre-segmented utterances or speaker labels, will be provided during evaluation. Both pipeline-based and end-to-end systems are encouraged, which leaves participants free to choose their own system design and implementation.&lt;/p&gt;

&lt;p&gt;The training set (Train) covers 11 languages: English (en), French (fr), German (de), Italian (it), Portuguese (pt), Spanish (es), Japanese (jp), Korean (ko), Russian (ru), Thai (th), and Vietnamese (vi).&lt;/p&gt;

&lt;p&gt;Important Dates (AoE Time)&lt;/p&gt;

&lt;p&gt;March 10, 2025: Registration opens&lt;/p&gt;

&lt;p&gt;March 15, 2025: Training data release&lt;/p&gt;

&lt;p&gt;April 1, 2025: Development set and baseline system release&lt;/p&gt;

&lt;p&gt;May 15, 2025: Evaluation set release and Leaderboard open&lt;/p&gt;

&lt;p&gt;May 30, 2025: Leaderboard freeze and paper submission portal opens (CMT system)&lt;/p&gt;

&lt;p&gt;June 15, 2025: Paper submission deadline&lt;/p&gt;

&lt;p&gt;July 1, 2025: Notification of acceptance&lt;/p&gt;

&lt;p&gt;August 18, 2025: Workshop date&lt;/p&gt;

&lt;p&gt;We have set a prize pool of $20,000 for the winners. Based on performance, the top three teams in each track will be awarded:&lt;/p&gt;

&lt;p&gt;1st Prize: $5,000&lt;/p&gt;

&lt;p&gt;2nd Prize: $3,000&lt;/p&gt;

&lt;p&gt;3rd Prize: $2,000&lt;/p&gt;

&lt;p&gt;🔗 Join now: &lt;a href="https://lnkd.in/gwR8dvVp" rel="noopener noreferrer"&gt;https://lnkd.in/gwR8dvVp&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;📩 Register here: &lt;a href="https://lnkd.in/gUYs9M4Y" rel="noopener noreferrer"&gt;https://lnkd.in/gUYs9M4Y&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;For inquiries: &lt;a href="mailto:mlc-slmw@nexdata.ai"&gt;mlc-slmw@nexdata.ai&lt;/a&gt;&lt;/p&gt;

</description>
      <category>interspeech</category>
      <category>datascience</category>
      <category>llm</category>
    </item>
  </channel>
</rss>
