speech_recognition_robot

Overview

This project implements a speech-controlled UR10 robot using OpenAI's Whisper model for speech recognition. The robot listens for spoken commands and performs the corresponding vehicle-assembly tasks.


Features

  • Speech Recognition: Utilizes OpenAI's Whisper model for accurate speech-to-text conversion.
  • Robot Control: Controls the UR10 robot using the URBasic library.
  • Flask Server: A Flask-based server processes audio inputs and sends commands to the robot.
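The command-handling step — turning a Whisper transcription into a robot action — can be sketched as a simple keyword lookup. This is an illustrative sketch only: the function name and command vocabulary below are assumptions, not the repository's actual command set.

```python
# Hypothetical sketch: map a Whisper transcription to a UR10 action name.
# The keywords and action names are illustrative, not the project's real set.
from typing import Optional

COMMANDS = {
    "pick": "pick_part",
    "place": "place_part",
    "assemble": "assemble_vehicle",
    "stop": "halt",
}

def parse_command(transcription: str) -> Optional[str]:
    """Return the first robot action whose keyword appears in the text."""
    text = transcription.lower()
    for keyword, action in COMMANDS.items():
        if keyword in text:
            return action
    return None  # unrecognized speech -> no robot motion
```

For example, `parse_command("Please pick up the wheel")` would return `"pick_part"`, while unrecognized speech returns `None` so the robot stays idle.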

Setup Instructions

1. Install Dependencies

Ensure you have Python 3.8+ installed, then install the required libraries:

pip install -r requirements.txt

2. Run the Flask Server

Start the server to process speech and control the UR10 robot:

python scripts/server.py
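The overall shape of such a server can be sketched as below. The route name, payload format, and port are assumptions rather than the repository's actual API, and the Whisper call is stubbed out so the request/response flow is visible without downloading a model:

```python
# Minimal sketch of a Flask server that accepts audio and returns a
# transcription. Route name and payload format are assumptions; the real
# scripts/server.py may differ.
from flask import Flask, jsonify, request

app = Flask(__name__)

def transcribe(audio_bytes: bytes) -> str:
    # In the real script this step would run Whisper, e.g.:
    #   model = whisper.load_model("base")
    #   result = model.transcribe(audio_path)
    return "pick up the wheel"  # placeholder transcription for the sketch

@app.route("/command", methods=["POST"])
def command():
    audio = request.get_data()   # raw audio bytes from the client
    text = transcribe(audio)     # speech -> text
    # Here the transcription would be mapped to a UR10 motion via URBasic.
    return jsonify({"transcription": text})

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=5000)
```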

3. Record and Send Audio

Run the send_audio.py script to record and send speech commands:

python scripts/send_audio.py
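The client side — record a short clip and POST it to the server — can be sketched as follows. The endpoint URL, sample rate, and recording duration are assumptions; `sounddevice` is imported only inside `record()` so the rest of the sketch works without audio hardware:

```python
# Sketch of a send_audio-style client: capture microphone audio, wrap it in a
# WAV container, and POST it to the server. URL and parameters are assumptions.
import io
import urllib.request
import wave

SERVER_URL = "http://localhost:5000/command"  # assumed endpoint
SAMPLE_RATE = 16000                           # Whisper expects 16 kHz mono

def to_wav_bytes(samples: bytes, sample_rate: int = SAMPLE_RATE) -> bytes:
    """Wrap raw 16-bit mono PCM samples in a WAV container."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(samples)
    return buf.getvalue()

def record(seconds: float = 3.0) -> bytes:
    """Capture microphone audio as raw 16-bit PCM (requires sounddevice)."""
    import sounddevice as sd  # imported here: needs audio hardware
    frames = sd.rec(int(seconds * SAMPLE_RATE), samplerate=SAMPLE_RATE,
                    channels=1, dtype="int16")
    sd.wait()
    return frames.tobytes()

def send(wav_bytes: bytes) -> bytes:
    """POST the WAV payload to the server and return its response body."""
    req = urllib.request.Request(SERVER_URL, data=wav_bytes,
                                 headers={"Content-Type": "audio/wav"})
    with urllib.request.urlopen(req) as resp:
        return resp.read()

if __name__ == "__main__":
    print(send(to_wav_bytes(record())))
```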

Dependencies

  • torch
  • transformers
  • flask
  • numpy
  • sounddevice
  • URBasic
  • whisper

Future Improvements

  • Improve recognition accuracy by fine-tuning the model on domain-specific command audio.
  • Support additional languages.

About

This project integrates OpenAI's Whisper model with a UR10 robotic arm to enable speech-controlled automation. Users give voice commands, which are transcribed using Whisper and processed by a Flask server to control the UR10 robot. The robot then performs specific assembly tasks based on the detected command.
