Skip to content

sandroamiridze/amy-voice-assistant

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation


Voice Assistant App (Soon Video)

Welcome to the Voice Assistant App repository! This project is a voice assistant application that leverages the power of various technologies to provide a seamless voice interaction experience. The app integrates OpenAI's GPT model, FFmpeg, Vosk, and Aplay to deliver an efficient and effective voice assistant experience.

Table of Contents

Features

  • Voice Recognition: Uses Vosk for accurate speech-to-text conversion.
  • Voice Synthesis: Utilizes Aplay to handle audio playback.
  • Natural Language Processing: Integrates GPT for understanding and generating responses.
  • Audio Processing: Employs FFmpeg for handling audio files and streams.

Technologies Used

  • TGPT (OpenAI): For natural language understanding and generation.
  • FFmpeg: For audio processing and manipulation.
  • Vosk: For speech recognition and converting speech to text.
  • Aplay: For audio playback.

Installation

Follow these steps to get your development environment set up and running:

  1. Clone the repository:

    git clone https://github.com/your-username/voice-assistant-app.git
    cd voice-assistant-app
  2. Install dependencies:

    • Make sure you have Python 3.8+ installed. You can check your Python version with:

      python --version
    • Install FFmpeg. You can download it from FFmpeg official site or use a package manager:

      sudo apt-get install ffmpeg
    • Install Vosk. You can do this with:

      pip install vosk
    • Ensure Aplay is installed (usually comes with the ALSA-utils package):

      sudo apt-get install alsa-utils
  3. Set up your environment variables:

    Create a .env file in the root directory and add your API keys and configuration details. For example:

    OPENAI_API_KEY=your_openai_api_key

Usage

To start the voice assistant, run the following command:

chmod +x amy, voice

This will initialize the voice assistant and wait for voice commands. Make sure your microphone and speakers are properly set up. also u want to make sure that you've downloaded vosk and piper models

Configuration

You can customize various aspects of the voice assistant by modifying the configuration settings. Edit the config.json file to adjust parameters such as:

  • Audio Input Device
  • Audio Output Device
  • Speech Recognition Language
  • Response Parameters

Contributing

We welcome contributions to improve the Voice Assistant App! If you have any ideas or fixes, please follow these guidelines:

  1. Fork the repository.
  2. Create a new branch (git checkout -b feature/YourFeature).
  3. Make your changes.
  4. Commit your changes (git commit -am 'Add new feature').
  5. Push to the branch (git push origin feature/YourFeature).
  6. Open a Pull Request.

Please make sure to follow the coding standards and include appropriate tests with your contributions.

License

This project is licensed under the MIT License. See the LICENSE file for details.


Feel free to reach out with any questions or feedback. Enjoy using the Voice Assistant App!

About

this is voice assistant using vosk ffmepg, aplay and piper

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages