This project implements a speech-controlled UR10 robot using OpenAI's Whisper model for speech recognition. The robot listens to spoken commands and assembles vehicles accordingly.
- Speech Recognition: Utilizes OpenAI's Whisper model for accurate speech-to-text conversion.
- Robot Control: Controls the UR10 robot using the URBasic library.
- Flask Server: A Flask-based server processes audio inputs and sends commands to the robot.
Ensure you have Python 3.8+ installed, then install the required libraries:
pip install -r requirements.txtStart the server to process speech and control the UR10 robot:
python scripts/server.pyRun the send_audio.py script to record and send speech commands:
python scripts/send_audio.pytorchtransformersflasknumpysounddeviceURBasicwhisper
- Improve recognition accuracy with more training data.
- Support additional languages.
