The 2nd MLC-SLM Challenge 2026 Opens Registration with a USD 20,000 Prize Pool

CALIFORNIA, CA, UNITED STATES, April 30, 2026 /EINPresswire.com/ — The ๐Ÿ๐ง๐ ๐Œ๐ฎ๐ฅ๐ญ๐ข๐ฅ๐ข๐ง๐ ๐ฎ๐š๐ฅ ๐‚๐จ๐ง๐ฏ๐ž๐ซ๐ฌ๐š๐ญ๐ข๐จ๐ง๐š๐ฅ ๐’๐ฉ๐ž๐ž๐œ๐ก ๐‹๐š๐ง๐ ๐ฎ๐š๐ ๐ž ๐Œ๐จ๐๐ž๐ฅ๐ฌ ๐‚๐ก๐š๐ฅ๐ฅ๐ž๐ง๐ ๐ž ๐Ÿ๐ŸŽ๐Ÿ๐Ÿ”, also known as the ๐Œ๐‹๐‚-๐’๐‹๐Œ ๐‚๐ก๐š๐ฅ๐ฅ๐ž๐ง๐ ๐ž ๐Ÿ๐ŸŽ๐Ÿ๐Ÿ”, is now open for registration. This yearโ€™s challenge features a ๐ญ๐จ๐ญ๐š๐ฅ ๐ฉ๐ซ๐ข๐ณ๐ž ๐ฉ๐จ๐จ๐ฅ ๐จ๐Ÿ ๐”๐’๐ƒ ๐Ÿ๐ŸŽ,๐ŸŽ๐ŸŽ๐ŸŽ and invites academic teams, industry teams, and individual researchers to advance Speech Large Language Models for real-world multilingual conversational speech.

The 2nd MLC-SLM Challenge focuses on key capabilities required for next-generation Speech LLMs, including ๐ฌ๐ฉ๐ž๐š๐ค๐ž๐ซ ๐๐ข๐š๐ซ๐ข๐ณ๐š๐ญ๐ข๐จ๐ง, ๐ฌ๐ฉ๐ž๐ž๐œ๐ก ๐ซ๐ž๐œ๐จ๐ ๐ง๐ข๐ญ๐ข๐จ๐ง, ๐š๐ง๐ ๐œ๐จ๐ง๐ฏ๐ž๐ซ๐ฌ๐š๐ญ๐ข๐จ๐ง๐š๐ฅ ๐ฌ๐ฉ๐ž๐ž๐œ๐ก ๐ฎ๐ง๐๐ž๐ซ๐ฌ๐ญ๐š๐ง๐๐ข๐ง๐ . Participants will work with multilingual, multi-speaker conversational speech data designed to reflect real-world dialogue scenarios.

๐Ÿ๐ง๐ ๐Œ๐‹๐‚-๐’๐‹๐Œ ๐œ๐ก๐š๐ฅ๐ฅ๐ž๐ง๐ ๐ž ๐จ๐Ÿ๐Ÿ๐ž๐ซ๐ฌ:
โ€ข๐…๐ซ๐ž๐ž ๐ซ๐ž๐ ๐ข๐ฌ๐ญ๐ซ๐š๐ญ๐ข๐จ๐ง
โ€ขFree access to a large-scale multilingual conversational speech dataset for registered participants, featuring around ๐Ÿ,๐Ÿ๐ŸŽ๐ŸŽ ๐ก๐จ๐ฎ๐ซ๐ฌ ๐จ๐Ÿ ๐๐š๐ญ๐š ๐š๐œ๐ซ๐จ๐ฌ๐ฌ ๐Ÿ๐Ÿ’ ๐ฅ๐š๐ง๐ ๐ฎ๐š๐ ๐ž๐ฌ
โ€ขA total prize pool of ๐”๐’๐ƒ ๐Ÿ๐ŸŽ,๐ŸŽ๐ŸŽ๐ŸŽ
Support for both academic and industry teams, as well as individual researchers

Following the success of the first MLC-SLM Challenge, which attracted 78 teams from 13 countries and regions, the 2026 edition introduces a larger and more diverse dataset. The first challenge also received 489 valid leaderboard submissions and 14 technical reports, and ๐ข๐ญ๐ฌ ๐ฌ๐ฎ๐ฆ๐ฆ๐š๐ซ๐ฒ ๐ฉ๐š๐ฉ๐ž๐ซ ๐ก๐š๐ฌ ๐›๐ž๐ž๐ง ๐š๐œ๐œ๐ž๐ฉ๐ญ๐ž๐ ๐›๐ฒ ๐ˆ๐‚๐€๐’๐’๐ ๐Ÿ๐ŸŽ๐Ÿ๐Ÿ”.

๐‚๐ก๐š๐ฅ๐ฅ๐ž๐ง๐ ๐ž ๐“๐š๐ฌ๐ค๐ฌ
Participants can join two tracks:
๐“๐š๐ฌ๐ค ๐Ÿ: ๐Œ๐ฎ๐ฅ๐ญ๐ข๐ฅ๐ข๐ง๐ ๐ฎ๐š๐ฅ ๐‚๐จ๐ง๐ฏ๐ž๐ซ๐ฌ๐š๐ญ๐ข๐จ๐ง๐š๐ฅ ๐’๐ฉ๐ž๐ž๐œ๐ก ๐ƒ๐ข๐š๐ซ๐ข๐ณ๐š๐ญ๐ข๐จ๐ง ๐š๐ง๐ ๐‘๐ž๐œ๐จ๐ ๐ง๐ข๐ญ๐ข๐จ๐ง
Participants are required to build systems that can identify who is speaking when and transcribe multilingual conversational speech. During evaluation, no oracle segmentation or speaker labels will be provided, making the task closer to real-world speech processing scenarios.

๐“๐š๐ฌ๐ค ๐Ÿ: ๐Œ๐ฎ๐ฅ๐ญ๐ข๐ฅ๐ข๐ง๐ ๐ฎ๐š๐ฅ ๐‚๐จ๐ง๐ฏ๐ž๐ซ๐ฌ๐š๐ญ๐ข๐จ๐ง๐š๐ฅ ๐’๐ฉ๐ž๐ž๐œ๐ก ๐”๐ง๐๐ž๐ซ๐ฌ๐ญ๐š๐ง๐๐ข๐ง๐ 
Participants are required to build systems that understand multilingual conversations using both acoustic and semantic information. Evaluation will be based on multiple-choice questions about the full conversation, testing the modelโ€™s ability to capture meaning, context, and speaker-level information.

Both pipeline-based systems and end-to-end Speech LLM systems are welcome. External datasets and pretrained models are allowed, as long as they are freely accessible and clearly reported.

๐ƒ๐š๐ญ๐š๐ฌ๐ž๐ญ ๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ
The challenge dataset contains around ๐Ÿ,๐Ÿ๐ŸŽ๐ŸŽ ๐ก๐จ๐ฎ๐ซ๐ฌ ๐จ๐Ÿ ๐ญ๐ฐ๐จ-๐ฌ๐ฉ๐ž๐š๐ค๐ž๐ซ ๐œ๐จ๐ง๐ฏ๐ž๐ซ๐ฌ๐š๐ญ๐ข๐จ๐ง๐š๐ฅ ๐ฌ๐ฉ๐ž๐ž๐œ๐ก ๐š๐œ๐ซ๐จ๐ฌ๐ฌ ๐Ÿ๐Ÿ’ ๐ฅ๐š๐ง๐ ๐ฎ๐š๐ ๐ž๐ฌ, including English, French, German, Italian, Portuguese, Spanish, Japanese, Korean, Russian, Thai, Vietnamese, Tagalog, Urdu, and Turkish.

The dataset also includes diverse regional accents, such as Canadian French, Mexican Spanish, Brazilian Portuguese, British English, American English, Australian English, Indian English, and Philippine English.

This makes the MLC-SLM Challenge a valuable benchmark for researchers working on multilingual ASR, speaker diarization, Speech LLMs, and spoken language understanding.

๐‘๐ž๐ ๐ข๐ฌ๐ญ๐ซ๐š๐ญ๐ข๐จ๐ง
Registration is now open. Participation is free, and the dataset will be provided free of charge to registered participants.
๐‘๐ž๐ ๐ข๐ฌ๐ญ๐ซ๐š๐ญ๐ข๐จ๐ง ๐‹๐ข๐ง๐ค*: https://forms.gle/jfAZ95abGy4ZiNHo7
๐Œ๐จ๐ซ๐ž ๐ƒ๐ž๐ญ๐š๐ข๐ฅ๐ฌ: https://www.nexdata.ai/competition/mlc-slm
๐‚๐จ๐ง๐ญ๐š๐œ๐ญ ๐„๐ฆ๐š๐ข๐ฅ: mlc-slmw@nexdata.ai

Join the 2nd MLC-SLM Challenge 2026 and help advance the next generation of multilingual Speech Large Language Models.

MLC-SLM Organizing Committee
NEXDATA TECHNOLOGY INC.
+1 760-410-2223
kris.y@nexdata.ai
Visit us on social media:
LinkedIn
Facebook
YouTube
X
Other

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability
for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this
article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Media gallery