You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi.
I'm trying to rewrite this project in C++ in search of better interoperability, better user friendliness and better performance.
Now I successfully implemented MRCG extraction and get a huge quality boost as well as a small memory usage. However I have some problem understanding the scripts that does the prediction. This script involves lots of array allocating and I want to know the purpose of every single line in order to write better implementation.
So, could you please kindly give an explanation of the bdnn_transform function?
Excellent! thank you for your interest and contributions!
Because it has been a long time since I implemented it, I can't exactly remember it in detail. However, the purpose is, implementing equation (7) in [1]. Also, it will be helpful to refer Fig. 2 in [1].
If there is some spare time for me, I can analyze the written code in detail, however, these day, I'm too busy. Thank you!
[1] X. Zhang and D. Wang, "Boosting Contextual Information for Deep Neural Network Based Voice Activity Detection," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 24, no. 2, pp. 252-264, Feb. 2016.
Hi.
I'm trying to rewrite this project in C++ in search of better interoperability, better user friendliness and better performance.
Now I successfully implemented MRCG extraction and get a huge quality boost as well as a small memory usage. However I have some problem understanding the scripts that does the prediction. This script involves lots of array allocating and I want to know the purpose of every single line in order to write better implementation.
So, could you please kindly give an explanation of the bdnn_transform function?
Thanks in advance.
The text was updated successfully, but these errors were encountered: