SynthiaGPT leverages Google's Gemini & the Hugging Face Transformers library to interact with a pre-trained GPT-2 model. It employs TensorFlow for model management and AutoTokenizer for efficient tokenization. The script enables users to input prompts interactively, generating text responses from the GPT-2 model. To improve the user experience, unnecessary warning messages related to the transformers library are effectively suppressed.
- main.py: The selection menu for SynthiaGPT
- app.py: Run SynthiaGPT as a web-server
- redirect.py: Redirect to main.py
- gemini.py: Run SynthiaGPT with Gemini
- gemini_t2s.py: Run SynthiaGPT with Gemini and text-to-speech functionality
- transformer.py: Run SynthiaGPT
- transformer_t2s.py: Run SynthiaGPT with text-to-speech functionality
- transformer_s2t2s.py: Run SynthiaGPT with speech-to-text-to-speech functionality
- transformer_webcam.py: Run SynthiaGPT with webcam functionality
- install_dependencies.py: Install dependencies
- generate_text.py: The GPT text generator
- greeting.py: Greets the user
- clean_text.py: Clean text from recorded voice audio
- stt.py: Speech-to-text
- tts.py: Text-to-speech
- capture_photo.py: Capture photo with a webcam
- play_audio.py: Play generated audio
transformers==4.37.1
tensorflow==2.14.0
torch==2.1.1
torchvision==0.16.1
torchaudio==2.1.1
flask==3.0.0
gtts==2.5.0
pyaudio==0.2.14
pydub==0.25.1
beautifulsoup4==4.10.0
SpeechRecognition==3.10.1
pygame==2.5.2
google-generativeai==0.3.2
pip install -r requirements.txt
pip install torch torchvision torchaudio
pip install -q -U google-generativeai
python main.py
python app.py
SynthWomb / Cursed Entertainment 2024