代码库
Your faithful, impartial partner for audio evaluation — know yourself, know your rivals. 真实评测,知己知彼。
Python
evaluationspeech-recognitionspeech-to-speechspeech-to-text
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
Python
minicpmminicpm-vmulti-modal
Official PyTorch+CUDA Full-functional Web Demo for MiniCPM-o 4.5
Python
ChatDev 2.0: Dev All through LLM-powered Multi-Agent Collaboration
Python
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
Python
audiodeeplearningminicpmmultilingualpythonpytorchspeechspeech-synthesistext-to-speechttstts-modelvoice-cloningvoice-designvoxcpm
AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.
Python
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks
Jupyter Notebook
EdgeClaw: Edge-Cloud Collaborative Personal AI Assistant based on OpenClaw
TypeScript