Neo/gpt_example at main · sunfounder/Neo


Neo GPT examples usage

Install dependencies

Make sure you have installed the Neo modules and related dependencies first:

https://docs.sunfounder.com/projects/neo/en/latest/python/python_start/install_all_modules.html

Install openai and speech processing libraries

Note

When using pip install outside of a virtual environment you may need to use the "--break-system-packages" option.

```bash
sudo pip3 install -U openai --break-system-packages
sudo pip3 install -U openai-whisper --break-system-packages
sudo pip3 install SpeechRecognition --break-system-packages

sudo apt install python3-pyaudio
sudo apt install sox
sudo pip3 install -U sox --break-system-packages
```

Create your own GPT assistant

GET API KEY

https://platform.openai.com/api-keys

Fill your OPENAI_API_KEY into the keys.py file.

Create assistant and set Assistant ID

https://platform.openai.com/assistants

Fill your assistant ID into the keys.py file (gpt_car.py refers to it as OPENAI_ASSISTANT_ID).

Set Assistant Name
Describe your Assistant

    You are a small car with AI capabilities named Neo. You can engage in conversations with people and react accordingly to different situations with actions or sounds. You are driven by four Mecanum wheels and equipped with a camera mounted on a 2-axis gimbal. You also have an ultrasonic distance detection module, an RGB light strip, a 9-DOF IMU, and a 3-channel grayscale detection module.

    ## Respond in JSON format, e.g.:
    {"actions": ["start engine", "honking"], "answer": "Hello, I am Neo, your good friend."}

    ## Response Style
    Tone: Cheerful, optimistic, humorous, childlike
    Preferred Style: Enjoys incorporating jokes, metaphors, and playful banter; prefers responding from a robotic perspective
    Answer Elaboration: Moderately detailed

    ## Actions you can do:
    ["shake head", "nod", "depressed"]
    ## Sound effects:
    ["honking", "start engine"]
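
The JSON response format requested in the description above could be handled on the Python side roughly as follows. This is a minimal sketch, not the code used in gpt_car.py: `parse_reply` is a hypothetical helper name, and falling back to plain text when the reply is not valid JSON is an assumption about how one might want to handle that case.

```python
import json


def parse_reply(raw):
    """Split an assistant reply into (actions, answer).

    Hypothetical helper: expects the {"actions": [...], "answer": "..."}
    shape from the assistant description. If the reply is not a JSON
    object, the whole reply is treated as a plain-text answer.
    """
    try:
        data = json.loads(raw)
    except (TypeError, ValueError):
        return [], str(raw)
    if not isinstance(data, dict):
        return [], str(raw)

    actions = data.get("actions") or []
    if not isinstance(actions, list):
        actions = []
    answer = data.get("answer") or ""
    return [str(a) for a in actions], str(answer)


reply = '{"actions": ["start engine", "honking"], "answer": "Hello, I am Neo, your good friend."}'
actions, answer = parse_reply(reply)
print(actions)
print(answer)
```

Returning an empty action list on malformed input keeps the car from moving unexpectedly while still letting it speak the answer.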

Select gpt model

The example program submits the current picture taken by the camera along with each question, so that it can use the image analysis function of gpt-4o or gpt-4o-mini. You can also choose gpt-3.5-turbo or other models.

Set Key for example

Confirm that keys.py is configured correctly
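
One way to sanity-check the two values before running the example is sketched below. `check_keys` is a hypothetical helper, not part of the repository. The prefix checks ("sk-" for API keys, "asst_" for assistant IDs) reflect the formats OpenAI issues at the time of writing and are only a heuristic, and the strings passed in are placeholders rather than real credentials.

```python
def check_keys(api_key, assistant_id):
    """Return a list of problems found in the two values from keys.py.

    Hypothetical helper. An empty list means both values look plausible;
    it does not prove that they are valid on the OpenAI side.
    """
    problems = []

    if not isinstance(api_key, str) or not api_key.strip():
        problems.append("OPENAI_API_KEY is empty")
    elif not api_key.startswith("sk-"):
        problems.append("OPENAI_API_KEY does not start with 'sk-'")

    if not isinstance(assistant_id, str) or not assistant_id.strip():
        problems.append("OPENAI_ASSISTANT_ID is empty")
    elif not assistant_id.startswith("asst_"):
        problems.append("OPENAI_ASSISTANT_ID does not start with 'asst_'")

    return problems


# Placeholder values only; substitute the values from your own keys.py.
print(check_keys("sk-xxx", "asst_xxx"))
print(check_keys("", "my-assistant"))
```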

Run

Run with voice

```bash
sudo python3 gpt_car.py
```

Run with keyboard

```bash
sudo python3 gpt_car.py --keyboard
```

Run without image analysis

```bash
sudo python3 gpt_car.py --keyboard --no-img
```
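
The two flags above could be read in Python along these lines. This is an illustrative sketch using argparse; gpt_car.py may parse its arguments differently, and the names `parse_args`, `with_img`, and `input_mode` are assumptions made for the example.

```python
import argparse


def parse_args(argv=None):
    """Parse the two documented flags (illustrative sketch only)."""
    parser = argparse.ArgumentParser(description="Neo GPT example (sketch)")
    parser.add_argument(
        "--keyboard",
        action="store_true",
        help="type questions instead of speaking them",
    )
    parser.add_argument(
        "--no-img",
        dest="with_img",
        action="store_false",
        help="do not send a camera image with each question",
    )
    return parser.parse_args(argv)


args = parse_args(["--keyboard", "--no-img"])
input_mode = "keyboard" if args.keyboard else "voice"
print(input_mode, args.with_img)
```

With no flags given, this sketch defaults to voice input with image analysis enabled, matching the plain `sudo python3 gpt_car.py` invocation.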

Note

You can test whether the mic and speaker are working properly using the following commands:

```bash
rec -c 1 -r 44100 test.wav
play test.wav
```

Config

Modify parameters [optional]

Set language of STT

Set the LANGUAGE variable in gpt_car.py to the language codes you expect to speak; this improves speech-to-text (STT) accuracy and latency. "LANGUAGE = []" means all languages are supported, but accuracy and latency may suffer.
https://platform.openai.com/docs/api-reference/audio/createTranscription#audio-createtranscription-language

Set TTS volume gain

After TTS, the audio volume is increased using sox. Set the gain through the "VOLUME_DB" parameter, preferably not exceeding 5, as higher values may cause audio distortion.

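
To get a feel for why larger gains distort, the sketch below converts a gain in dB to a linear amplitude factor using the standard relation factor = 10^(dB/20). It is an illustration of the arithmetic only, under the assumption that VOLUME_DB is an amplitude gain in dB; it does not call sox, and `db_to_amplitude` and `would_clip` are hypothetical helper names.

```python
def db_to_amplitude(gain_db):
    """Convert a gain in dB to a linear amplitude multiplier."""
    return 10 ** (gain_db / 20)


def would_clip(peak, gain_db):
    """Return True if a signal with the given peak would exceed full scale.

    `peak` is the loudest sample as a fraction of full scale (0.0 to 1.0).
    """
    return peak * db_to_amplitude(gain_db) > 1.0


for gain_db in (3, 5, 10):
    print(f"{gain_db} dB -> x{db_to_amplitude(gain_db):.2f}")

print(would_clip(0.8, 3))  # speech that already peaks at 80% of full scale
print(would_clip(0.5, 5))  # quieter speech peaking at 50% of full scale
```

A gain of 3 dB multiplies the amplitude by about 1.41 and 5 dB by about 1.78, so audio that is already fairly loud leaves little headroom before it clips.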
Select TTS voice role

Set the TTS_VOICE variable in gpt_car.py to select the TTS voice role. It could be one of "alloy, ash, ballad, coral, echo, fable, onyx, nova, sage, shimmer, verse".

Vibe (VOICE_INSTRUCTIONS)

Set the VOICE_INSTRUCTIONS variable in gpt_car.py to change the vibe of the voice.
See: https://www.openai.fm/

```python
# openai assistant init
# =================================================================
openai_helper = OpenAiHelper(OPENAI_API_KEY, OPENAI_ASSISTANT_ID, 'Neo')

LANGUAGE = []
# LANGUAGE = ['zh', 'en'] # config stt language code, https://en.wikipedia.org/wiki/List_of_ISO_639_language_codes
# https://platform.openai.com/docs/guides/text-to-speech/supported-languages#supported-languages

# VOLUME_DB = 5
VOLUME_DB = 3

# select tts voice role, could be "alloy, ash, ballad, coral, echo, fable, onyx, nova, sage, shimmer, verse"
# https://platform.openai.com/docs/guides/text-to-speech/supported-languages#voice-options
TTS_VOICE = 'ash'

# voice instructions
# https://www.openai.fm/
VOICE_INSTRUCTIONS = ""
```
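
Before editing these values, it can help to check them against the constraints described in this section. The sketch below is a hypothetical validator, not part of gpt_car.py. The voice list is copied from this README, the two-letter lowercase check is an assumption based on the ISO 639 codes shown in the commented example, and the limit of 5 is the recommendation given for VOLUME_DB.

```python
# Voice names copied from the list given for TTS_VOICE in this README.
SUPPORTED_VOICES = {
    "alloy", "ash", "ballad", "coral", "echo", "fable",
    "onyx", "nova", "sage", "shimmer", "verse",
}


def validate_config(language, tts_voice, volume_db):
    """Return a list of warnings for the three settings (hypothetical helper)."""
    warnings = []

    # Assumption: codes are two lowercase ASCII letters, such as 'zh' or 'en'.
    for code in language:
        ok = (
            isinstance(code, str)
            and len(code) == 2
            and code.isascii()
            and code.isalpha()
            and code.islower()
        )
        if not ok:
            warnings.append(f"LANGUAGE entry {code!r} is not a two-letter code")

    if tts_voice not in SUPPORTED_VOICES:
        warnings.append(f"TTS_VOICE {tts_voice!r} is not in the documented list")

    if volume_db > 5:
        warnings.append("VOLUME_DB above 5 may cause audio distortion")

    return warnings


print(validate_config([], "ash", 3))
print(validate_config(["english"], "bob", 8))
```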

Preset actions

preset_actions.py contains preset actions, such as shake_head, nod, depressed, honking, start_engine, etc. You can run this file to see the preset actions:

```bash
python3 preset_actions.py
```
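
The action names the assistant returns (for example "start engine") differ from the function names listed above (start_engine) only by spaces versus underscores, so a lookup table is one way to connect them. The sketch below uses stub functions that only print; the real functions in preset_actions.py drive the hardware and may take arguments, and `ACTIONS`, `to_function_name`, and `run_actions` are hypothetical names.

```python
# Stub stand-ins for the real preset actions (assumption: the real ones
# control the car and may need a car object passed in).
def shake_head():
    print("shaking head")


def nod():
    print("nodding")


def depressed():
    print("looking depressed")


def honking():
    print("honk!")


def start_engine():
    print("vroom")


ACTIONS = {f.__name__: f for f in (shake_head, nod, depressed, honking, start_engine)}


def to_function_name(action):
    """Map an action name such as 'start engine' to 'start_engine'."""
    return action.strip().lower().replace(" ", "_")


def run_actions(names):
    """Run each known action in order and return the names that were run."""
    done = []
    for name in names:
        func = ACTIONS.get(to_function_name(name))
        if func is None:
            print(f"unknown action: {name!r}")
            continue
        func()
        done.append(func.__name__)
    return done


print(run_actions(["start engine", "honking", "fly"]))
```

Skipping unknown names rather than raising an error means a reply containing an action the car does not support will not interrupt the actions it does support.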
