Tags: amd/blis
Tags
Implemented f32tobf16 reorder function Description: aocl_reorder_f32obf16 function is implemented to reorder input weight matrix of data type float to bfloat16. The reordering is done to match the input requirements of API aocl_gemm_bf16bf16f32o<f32|bf16>. The objective of the API is to convert a model/matrix of type f32 to bf16 and process when machine supports bf16 FMA instruction _mm512_dpbf16_ps but the model is still in float Change-Id: Ib7c743d52d01a1ac09e84ac120577ec9e02f90f5
Added "NOTICES" file. Change-Id: I3022942f46983a7b80c2b1ca39ce6b013e303768 (cherry picked from commit 62da6e6163158be78b2425f2083a6389811c345f)
PreviousNext