TsgcAIOpenAITranslator

To build a Translator with voice commands, the following steps are required:

The Microphone Audio must be captured, so a speech to text system is needed to get the text that will be sent to OpenAI.
1. Capturing the Microphone Audio is done using the component TsgcAudioRecorderMCI.
2. Once we've captured the audio, this is sent to the OpenAI whisper api to convert the audio file to text.
Once we get the speech to text, now we send the text to OpenAI using the ChatCompletion API.
The response from OpenAI must be converted now to Speech using one of the following components:
1. TsgcTextToSpeechSystem: (currently only for Windows) uses the Windows Speech To Text from Operating System.
2. TsgcTextToSpeechGoogle: sends the response from OpenAI to the Google Cloud Servers and an mp3 file is returned which is played by the TsgcAudioPlayerMCI.
3. TsgcTextToSpeechAmazon: ends the response from OpenAI to the Amazon AWS Servers and an mp3 file is returned which is played by the TsgcAudioPlayerMCI.

Properties

OpenAIOptions: configure here the OpenAI properties.
- ApiKey: an API key is required to interactuate with the OpenAI APIs.
- LogOptions
  - Enabled: if set to true, the API requests will be log into a text file.
  - FileName: the filename of the log.
- Organization: an optional OpenAI API field.

TranslatorOptions: configure here the Translator properties.
- Translation: configure here the OpenAI Translation API settings.
  - Model: by default whisper-1

AudioRecorder: assign a TsgcAudioRecorder component to capture the microphone audio.

TextToSpeech: assign a TsgcTextToSpeech component to listen the response from OpenAI.

Events

OnAudioStart: the event is called when the Audio Starts to being recorded.
OnAudioStop: the event is called after the Audio Stops Recording.
OnTranslation: the event is called when receiving a response from OpenAI Translation API with the translation result.

Code Example

Create a new Translator, using the default Text-To-Speech from Microsoft Windows. Use Start to Start the recording of the audio and Stop to Stop the recording and send the audio to the OpenAI API and translate it.


// ... create the translator component
TsgcAIOpenAITranslator *sgcTranslator = new TsgcAIOpenAITranslator(NULL);
sgcTranslator->OpenAIOptions->ApiKey = "your_openapi_api_key";
// ... create audio recorder and text-to-speech
TsgcAudioRecorderMCI *sgcAudioRecorder = new TsgcAudioRecorderMCI(NULL);
TsgcTextToSpeechSystem *sgcTextToSpeech = new TsgcTextToSpeechSystem(NULL);
// ... assign audio components to translator
sgcTranslator->AudioRecorder = sgcAudioRecorder;
sgcTranslator->TextToSpeech = sgcTextToSpeech;
// ... start the translator, speak with a microphone to capture the audio, and stop to translate it
sgcTranslator->Start();
// ... speak
sgcTranslator->Stop();