Realtime audio analysis in Python, using PyAudio and NumPy to extract and visualize FFT features from streaming audio.
Updated Apr 30, 2024 · Python
Join the community on Discord for more discussions around Neutone! https://discord.gg/VHSMzb8Wqp
🎵 strongly-timed musical programming language
Rapida is an open-source, end-to-end voice AI orchestration platform for building real-time conversational voice agents with audio streaming, STT, TTS, VAD, multi-channel integration, agent state management, and observability.
Guide on how to set up Linux and Docker for real-time applications using the Ubuntu realtime-kernel/PREEMPT_RT patch, with a focus on robotics with ROS and ROS 2
Example applications that use the OpenTok iOS SDK
FreeSWITCH module to stream audio to a WebSocket and receive responses
ppooll (formerly lloopp) is an audio & video performance environment written in Max/MSP.
🔥 Modular shader engine designed for simplicity and speed
Realtime-safe OSC packet serialization and dispatch
A realtime scripted modular audio engine for video games and musical applications.
An open-source harmonizer implementation leveraging the DISTRHO Plugin Framework.
Rust Agent Development Kit (ADK-Rust): a flexible framework for building AI agents in Rust from modular components for models, tools, memory, real-time voice, and more. Model-agnostic, deployment-agnostic, and optimized for frontier AI models.
STatic (LLVM) Object file Analysis Tool
Xiaozhi WebSocket protocol implemented in Go. Set up your own xiaozhi-server by routing requests to services that speak the OpenAI Realtime API protocol, such as the Stepfun API
Syntactic sugar for the OpenTok iOS SDK, with audio/video communication including screen sharing
Coffee Chat Voice Assistant is a voice-driven ordering system powered by Azure OpenAI GPT-4o Realtime API, simulating the experience of ordering coffee with a café barista. It supports natural conversations, live order updates, and real-time transcription, showcasing the power of AI for seamless customer interactions.
🦀 Rust powered LLM, Whisper, Embedding inference, backed by 🤗 candle from HuggingFace
LiveKit plugin for TEN VAD: low-latency voice activity detection for real-time streaming, integrated with livekit-agents