Google Cloud Speech-to-Text

Hey @Johocen_ai

It looks like GCS is the only option available as the source of the files for Google Cloud Speech-to-Text. This shouldn’t be a problem though as you can use Google Cloud Storage node to upload the file from Google Drive.

The doc section explaining how the output is handled is Konstruksi permintaan Speech-to-Text  |  Dokumentasi Cloud Speech-to-Text  |  Google Cloud.

For a start, it depends on what mode you use synchronous or asynchronous. If it is a synchronous mode the output is expected to be in the form

{
  "results": [
    {
      "alternatives": [
        {
          "confidence": 0.98267895,
          "transcript": "YOUR_TRANSCRIPT_GOES_HERE"
        }
      ]
    }
  ]
}

They also point out

If no speech from the supplied audio could be recognized, then the returned results list will contain no items. Unrecognized speech is commonly the result of very poor-quality audio, or from language code, encoding, or sample rate values that do not match the supplied audio.

Each synchronous Speech-to-Text API response returns a list of results, rather than a single result containing all recognized audio. The list of recognized audio (within the transcript elements) will appear in contiguous order.

The asynchronous output has a different form

{
  "name": "operation_name",
  "metadata": {
    "@type": "type.googleapis.com/google.cloud.speech.v1.LongRunningRecognizeMetadata"
    "progressPercent": 34,
    "startTime": "2016-08-30T23:26:29.579144Z",
    "lastUpdateTime": "2016-08-30T23:26:29.826903Z"
  }
}

HTTP Request node won’t be able to return the actual transcript output in that case.

The actual binary file has to be available in the node immediately preceding the node where this binary is required. Otherwise it will fail. If Google Drive node with the downloaded binary does not immediately precede OpenAI, you need to bring it in to the node that does. See the solutions demoed in [onedrive via Graph HTTP] why binary don't upload if not exactly previous node? - #2 by ihortom.