👏 Welcome to the Speech-to-Speech (S2S) Model Evaluation!
In this evaluation, you will assess the performance of different S2S models, such as ChatGPT-4o,
FunAudioLLM, SpeechGPT, and Mini-Omni.
🎯 Goal: Test how well these models handle speech tasks across different domains.
🌰 Example:
🎙️ Speech: Partially followed the instruction on speed.
🧾 Semantics: Accurately followed the instruction, with no semantic deviation or missing information.
🎙️ Speech: Partially followed the instruction on speed.
🧾 Semantics: Accurately followed the instruction, with no semantic deviation or missing information.
🎙️ Speech: Did not follow the instruction on speed.
🧾 Semantics: Partially followed the instruction, with minor semantic deviation and missing information.
🎙️ Speech: Did not follow the instruction on speed.
🧾 Semantics: Did not follow the instruction, with significant semantic deviation and missing information.
After making your choice, you'll proceed to the next round. 🔄
💡 Please enter your username to start!
🤔 Question: Which model performs better?
🤖 Model A:
🤖 Model B:
✅ Your Choice: 😃