Commit Graph

5 Commits

Author SHA1 Message Date
Laurent Mazare 2653002f29
Gumbel-Softmax sampling. (#2894)
* Gumbel-Softmax sampling.

* Add a sampling test.

* Share the gumbel-softmax bits.
2025-04-14 15:42:42 +02:00
Laurent Mazare 27996a1a9e
Remove the old MFA gemm kernels. (#2742)
* Remove the old MFA gemm kernels.

* Use bf16 in helium on metal.
2025-01-26 20:36:31 +01:00
Laurent Mazare efd0e6822f
Fix the helium weights download. (#2717) 2025-01-13 18:21:37 +01:00
Laurent Mazare 158817f230
Helium repo update. (#2716) 2025-01-13 18:04:14 +01:00
Laurent Mazare 309cd0f7c7
Add the helium model. (#2715) 2025-01-13 17:39:49 +01:00