Skip to content

Tags: amd/blis

Tags

AOCL-Nov2024-b1

Toggle AOCL-Nov2024-b1's commit message
Implemented f32tobf16 reorder function

Description:
aocl_reorder_f32obf16 function is implemented to
reorder input weight matrix of data type float to
bfloat16.

The reordering is done to match the input requirements
of API aocl_gemm_bf16bf16f32o<f32|bf16>.

The objective of the API is to convert a model/matrix
of type f32 to bf16 and process when machine supports
bf16 FMA instruction _mm512_dpbf16_ps but the model
is still in float

Change-Id: Ib7c743d52d01a1ac09e84ac120577ec9e02f90f5

AOCL-Oct2024

Toggle AOCL-Oct2024's commit message
Added "NOTICES" file.

Change-Id: I3022942f46983a7b80c2b1ca39ce6b013e303768
(cherry picked from commit 62da6e6163158be78b2425f2083a6389811c345f)

5.0

Toggle 5.0's commit message
AOCL-BLAS 5.0 Release

4.2

Toggle 4.2's commit message
AOCL-BLAS Release 4.2

4.1

Toggle 4.1's commit message
AOCL-BLAS Release 4.1

AOCL-3.0-rc6

Toggle AOCL-3.0-rc6's commit message
BLIS 3.0 release having DGEMM performance fix

4.0

Toggle 4.0's commit message
AOCL-BLIS Release 4.0

3.2

Toggle 3.2's commit message
Merged AOCL BLIS 3.2

3.1

Toggle 3.1's commit message
Blis AOCL 3.1 Release

3.0.1

Toggle 3.0.1's commit message
Merge branch 'master' of https://github.com/amd/blis

BLIS release 3.0.1