I think it would be beneficial to add this because:
Are you willing to work on this?
As we always do. We get one primary model and only one fallback model options. It would be good if we can add multiple fallback models. So that if my fallback model exhausts or gives error we can have another backup. BASICALLY (BACKUP FOR BACKUP!)
Another request is If we can reduce the delay from one model to another. For example if I’m using a free model as primary and it’s quota exhausts. It automatically switches to fallback BUT TAKES 2 MINUTS DELAY and continues that way THIS IS ANNOYING FOR MY CHATBOT CONVERSATIONS. PLEASE FIND A WAY TO REDUCE THIS DELAY FROM SWITCHING FROM PRIMARY TO FALLBACK WHEN THE PRIMARY DOSEN’T WORKS