News
The 'ImageNet Moment' for LSLM Research? In the context of the flourishing development of large language models (LLMs), significant progress has been made in multimodal AI, particularly in the field ...
In today's rapidly evolving technology landscape, voice interaction has become the mainstream method of human-computer communication. On September 1, Jumps Star officially launched Step-Audio 2 mini—a ...
A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...
Xiaomi has launched a 7-billion-parameter version of its open-source voice model, MiDashengLM, which incorporates Alibaba's open-source Qwen 2.5 series. This model focuses on in-car systems and smart ...
Microsoft has launched VibeVoice, a new open-source AI model capable of generating up to 90 minutes of multi-speaker audio ...
What if you could replicate a voice so convincingly that even the closest of listeners couldn’t tell the difference? The rise of professional-quality voice cloning has made this a reality, ...
Podonos, a startup building the infrastructure layer for evaluating voice AI, has raised $2.4 million in pre-seed funding to bring structure and speed to one of the most overlooked parts of voice AI ...
PALO ALTO, Calif.--(BUSINESS WIRE)--SignalWire, the leader in Programmable Unified Communications (PUC), today announced the open beta of its fully open source Call Fabric SDK and Reference App. This ...
Results that may be inaccessible to you are currently showing.
Hide inaccessible results