candle

Laurent Mazare 52ae332910 Use llama v3 by default + add to readme. (#2094 )	2024-04-20 16:11:24 +02:00
..
bert	Speed up bert with approx gelu (#1410 )	2023-12-06 17:46:37 +01:00
bigcode	Adapt more examples to the updated safetensor api. (#947 )	2023-09-23 21:26:03 +01:00
blip	Quantized GGUF style (#1523 )	2024-01-17 10:27:58 +01:00
chatglm	Add the custom tokenizer. (#1686 )	2024-02-09 17:36:50 +01:00
clip	CLIP model implementation with example (#1950 )	2024-03-28 13:44:12 +01:00
convmixer	Move the image tensor to the appropriate device. (#1856 )	2024-03-16 22:25:46 +01:00
convnext	Move the image tensor to the appropriate device. (#1856 )	2024-03-16 22:25:46 +01:00
custom-ops	Cuda acceleration for quantized model. (#1754 )	2024-02-25 18:11:47 +01:00
dinov2	Move the image tensor to the appropriate device. (#1856 )	2024-03-16 22:25:46 +01:00
distilbert	Distibert (#1366 )	2023-11-24 15:09:14 +00:00
efficientnet	Move the image tensor to the appropriate device. (#1856 )	2024-03-16 22:25:46 +01:00
efficientvit	Move the image tensor to the appropriate device. (#1856 )	2024-03-16 22:25:46 +01:00
encodec	Microphone support for the encodec example. (#1866 )	2024-03-18 11:19:46 +01:00
falcon	Use the new hub helper function. (#1484 )	2023-12-26 09:44:30 +01:00
gemma	Add the code-gemma models. (#2038 )	2024-04-10 21:19:21 +02:00
jina-bert	Use the hub model file when possible. (#1190 )	2023-10-26 20:00:50 +01:00
llama	Use llama v3 by default + add to readme. (#2094 )	2024-04-20 16:11:24 +02:00
llama2-c	Explicit caching in llama2.c.	2024-02-22 10:22:03 +01:00
llama_multiprocess	Only download the weights in the main process (and not in the child processes). (#2093 )	2024-04-20 13:01:23 +02:00
mamba	Support alternative dtypes for mamba (#2036 )	2024-04-10 18:10:01 +02:00
mamba-minimal	Improved mamba model optimized for inference (#1694 )	2024-02-11 17:04:57 +01:00
marian-mt	Add a KV cache to marian decoding. (#1226 )	2023-10-31 08:47:44 +00:00
metavoice	Quantized version of the metavoice model. (#1824 )	2024-03-09 11:06:04 +01:00
mistral	Include topk sampling in the quantized example. (#2005 )	2024-04-04 09:27:54 +02:00
mixtral	Use the tokenizer-output-stream in the llama example. (#1715 )	2024-02-15 16:47:33 +01:00
mnist-training	Allow for different behavior between training and eval (#1213 )	2023-10-29 07:53:09 +01:00
mobileone	Move the image tensor to the appropriate device. (#1856 )	2024-03-16 22:25:46 +01:00
moondream	Optimize copy-2d for metal. (#2024 )	2024-04-07 12:34:16 +02:00
musicgen	Support more modes in the encodec example. (#1777 )	2024-02-28 09:22:33 +01:00
onnx	Update docs to reflect current usage of example (#1610 )	2024-02-04 11:59:47 +01:00
phi	Quantized GGUF style (#1523 )	2024-01-17 10:27:58 +01:00
quantized	Add support for llama3 on the quantized example (#2086 )	2024-04-18 22:52:00 +02:00
quantized-t5	Move the tensor-tools binary in a separate crate. (#1969 )	2024-03-30 15:49:37 +01:00
qwen	Readme fix. (#1961 )	2024-03-28 23:24:46 +01:00
recurrent-gemma	Add a quantized version of recurrent-gemma. (#2054 )	2024-04-13 20:07:01 +02:00
reinforcement-learning	Update for pyo3 0.21. (#1985 )	2024-04-01 17:07:02 +02:00
replit-code	Quantized GGUF style (#1523 )	2024-01-17 10:27:58 +01:00
repvgg	Move the image tensor to the appropriate device. (#1856 )	2024-03-16 22:25:46 +01:00
resnet	Move the image tensor to the appropriate device. (#1856 )	2024-03-16 22:25:46 +01:00
rwkv	Fix the model path for rwkv. (#1825 )	2024-03-09 11:21:48 +01:00
segformer	Update the image crate + use the re-exported version. (#1893 )	2024-03-21 10:56:41 +01:00
segment-anything	Add negative prompts to segment-anything. (#1000 )	2023-09-30 06:17:42 +01:00
stable-diffusion	Utilize batches in Stable Diffusion (#2071 )	2024-04-16 06:49:04 +02:00
stable-lm	Avoid copying the data on squeeze and unsqueeze. (#1884 )	2024-03-20 13:04:36 +01:00
starcoder2	Add the StarCoder2 model. (#1779 )	2024-02-28 21:02:41 +01:00
t5	Expose the t5 config fields + allow t5-large. (#1987 )	2024-04-01 20:58:34 +02:00
trocr	Move image tensor to device in trocr example (#2063 )	2024-04-15 06:50:32 +02:00
vgg	Move the image tensor to the appropriate device. (#1856 )	2024-03-16 22:25:46 +01:00
vit	Move the image tensor to the appropriate device. (#1856 )	2024-03-16 22:25:46 +01:00
whisper	whisper readme (#1899 )	2024-03-21 12:54:09 +01:00
whisper-microphone	feat: support microphone whisper streaming (#1678 )	2024-02-12 18:01:21 +01:00
wuerstchen	Remove some unusued bits. (#1067 )	2023-10-09 19:49:57 +01:00
yi	Fix token generation in bilingual models (non-English outputs) (#1668 )	2024-02-06 12:03:53 +01:00
yolo-v3	Fix clippy lints for 1.76. (#1682 )	2024-02-08 16:48:47 +01:00
yolo-v8	Support alternative dtypes for mamba (#2036 )	2024-04-10 18:10:01 +02:00
onnx_basics.rs	[ONNX] Support a couple more ops. (#1284 )	2023-11-06 22:44:58 +01:00