I've recently noticed that the API for Bing in the EdgeGPT repository is performing significantly slower than the local server method. Is it possible to implement a similar approach for Bing, leveraging the fastapi library to improve its performance?
I believe such an implementation would yield significant speed improvements and overall enhance the experience for users.