candle/candle-nn
Laurent Mazare d9904a3baf
Update to cudarc 0.14 (breaking change). (#2858)
* Start updating to cudarc 0.14.

* Adapt a couple more things.

* And a couple more fixes.

* More tweaks.

* And a couple more fixes.

* Bump the major version number.

* Proper module system for the cuda kernels.

* Proper ptx loading.

* Launch the sort kernel.

* Custom op.

* Start using the builder pattern.

* More builder.

* More builder.

* Get candle-core to compile.

* Get the tests to pass.

* Get candle-nn to work too.

* Support for custom cuda functions.

* cudnn fixes.

* Get flash attn to run.

* Switch the crate versions to be alpha.

* Bump the ug dependency.
2025-04-03 09:12:19 +02:00
benches     Metal: Improved reduce and softmax (#1819)                2025-02-08 07:27:01 +01:00
examples    Optimize the cat operation on contiguous tensors (#1855)  2024-03-17 10:49:13 +01:00
src         Update to cudarc 0.14 (breaking change). (#2858)          2025-04-03 09:12:19 +02:00
tests       upgrade half library to fix rand (#2806)                  2025-03-14 09:01:54 +01:00
Cargo.toml  Make the metal sdpa tests deterministic. (#2750)          2025-01-28 09:05:24 +01:00
README.md   Add some missing readme files. (#304)                     2023-08-02 10:57:12 +01:00

README.md

candle-nn