... | ... | @@ -32,12 +32,23 @@ Since unCaptcha2 has to go to specific coordinates on the screen, you'll need to |
|
|
$ xdotool getmouselocation --shell
|
|
|
```
|
|
|
|
|
|
Next you'll need to configure credentials for whichever speech-to-text API you choose. Google's, Microsoft's, and IBM's speech-to-text systems are already included in `queryAPI.py` (set the username and password as required).
|
|
|
Next you'll need to configure credentials for whichever speech-to-text API you choose. Google's, Microsoft's, and IBM's speech-to-text systems are already included in `queryAPI.py` (set the API key or username/password in this file as required).
|
|
|
|
|
|
For Google's API, you'll have to set the `GOOGLE_APPLICATION_CREDENTIALS` environment variable to the name of a file containing your Google application credentials.
|
|
|
|
|
|
Ensure that Firefox is configured to download MP3 files (rather than play them in browser) by navigating to `about:config` and setting `media.play_stand_alone` to `false`.
|
|
|
|
|
|
Finally, install the dependencies:
|
|
|
|
|
|
```sh
|
|
|
$ pip install -r dependencies.txt
|
|
|
```
|
|
|
|
|
|
**Note**: FFmpeg is required for converting the audio file from MP3 to WAV format.
|
|
|
|
|
|
## Usage
|
|
|
After browser coordinates and colors are configured properly and all necessary dependencies are installed, you may run the script:
|
|
|
|
|
|
```sh
|
|
|
$ python run.py
|
|
|
``` |
|
|
\ No newline at end of file |