Laurent Mazare
|
2653002f29
|
Gumbel-Softmax sampling. (#2894)
* Gumbel-Softmax sampling.
* Add a sampling test.
* Share the gumbel-softmax bits.
|
2025-04-14 15:42:42 +02:00 |
Laurent Mazare
|
27996a1a9e
|
Remove the old MFA gemm kernels. (#2742)
* Remove the old MFA gemm kernels.
* Use bf16 in helium on metal.
|
2025-01-26 20:36:31 +01:00 |
Laurent Mazare
|
efd0e6822f
|
Fix the helium weights download. (#2717)
|
2025-01-13 18:21:37 +01:00 |
Laurent Mazare
|
158817f230
|
Helium repo update. (#2716)
|
2025-01-13 18:04:14 +01:00 |
Laurent Mazare
|
309cd0f7c7
|
Add the helium model. (#2715)
|
2025-01-13 17:39:49 +01:00 |