Open Source Voice Recognition

News

18h

Release of the LLaSO Framework: Defining New Benchmarks for LSLM Research in Open Source Voice Models and Accelerating AI Voice Innovation

The 'ImageNet Moment' for LSLM Research? In the context of the flourishing development of large language models (LLMs), significant progress has been made in multimodal AI, particularly in the field ...

11d

Breaking the Bottleneck of Voice Interaction, StepFun Launches Open Source SOTA-Level Voice Model Step-Audio 2 mini

In today's rapidly evolving technology landscape, voice interaction has become a mainstream method of human-computer communication. On September 1, StepFun officially launched Step-Audio 2 mini, a ...

InfoQ · 1mon

Mistral Voxtral is an Open-Weights Competitor to OpenAI Whisper and Other ASR Tools

Digi Times · 1mon

Xiaomi open-sources voice AI model to enter automotive and smart home markets

Xiaomi has launched a 7-billion-parameter version of its open-source voice model, MiDashengLM, which incorporates Alibaba's open-source Qwen 2.5 series. This model focuses on in-car systems and smart ...

WinBuzzer · 11d

Microsoft Releases VibeVoice Open-Source AI Model for Generating Multi-Speaker Podcasts

Microsoft has launched VibeVoice, a new open-source AI model capable of generating up to 90 minutes of multi-speaker audio ...

Geeky Gadgets · 2mon

Professional Quality Voice Cloning: Open Source vs ElevenLabs

What if you could replicate a voice so convincingly that even the closest of listeners couldn’t tell the difference? The rise of professional-quality voice cloning has made this a reality, ...

Voice AI Needs an Accurate Evaluation Layer. Podonos Just Raised $2.4M to Build It

Podonos, a startup building the infrastructure layer for evaluating voice AI, has raised $2.4 million in pre-seed funding to bring structure and speed to one of the most overlooked parts of voice AI ...

Business Wire · 1mon

SignalWire Unveils Beta Open Source SDK & Reference App for Next-Gen Communications and Voice AI

PALO ALTO, Calif.--(BUSINESS WIRE)--SignalWire, the leader in Programmable Unified Communications (PUC), today announced the open beta of its fully open source Call Fabric SDK and Reference App. This ...
