.. |
bert
|
Speed up bert with approx gelu (#1410)
|
2023-12-06 17:46:37 +01:00 |
bigcode
|
Adapt more examples to the updated safetensor api. (#947)
|
2023-09-23 21:26:03 +01:00 |
blip
|
Quantized GGUF style (#1523)
|
2024-01-17 10:27:58 +01:00 |
chatglm
|
Add the custom tokenizer. (#1686)
|
2024-02-09 17:36:50 +01:00 |
clip
|
CLIP model implementation with example (#1950)
|
2024-03-28 13:44:12 +01:00 |
convmixer
|
Move the image tensor to the appropriate device. (#1856)
|
2024-03-16 22:25:46 +01:00 |
convnext
|
Move the image tensor to the appropriate device. (#1856)
|
2024-03-16 22:25:46 +01:00 |
custom-ops
|
Cuda acceleration for quantized model. (#1754)
|
2024-02-25 18:11:47 +01:00 |
dinov2
|
Move the image tensor to the appropriate device. (#1856)
|
2024-03-16 22:25:46 +01:00 |
distilbert
|
Distibert (#1366)
|
2023-11-24 15:09:14 +00:00 |
efficientnet
|
Move the image tensor to the appropriate device. (#1856)
|
2024-03-16 22:25:46 +01:00 |
efficientvit
|
Move the image tensor to the appropriate device. (#1856)
|
2024-03-16 22:25:46 +01:00 |
encodec
|
Microphone support for the encodec example. (#1866)
|
2024-03-18 11:19:46 +01:00 |
falcon
|
Use the new hub helper function. (#1484)
|
2023-12-26 09:44:30 +01:00 |
gemma
|
Add the code-gemma models. (#2038)
|
2024-04-10 21:19:21 +02:00 |
jina-bert
|
Use the hub model file when possible. (#1190)
|
2023-10-26 20:00:50 +01:00 |
llama
|
Use llama v3 by default + add to readme. (#2094)
|
2024-04-20 16:11:24 +02:00 |
llama2-c
|
Explicit caching in llama2.c.
|
2024-02-22 10:22:03 +01:00 |
llama_multiprocess
|
Only download the weights in the main process (and not in the child processes). (#2093)
|
2024-04-20 13:01:23 +02:00 |
mamba
|
Support alternative dtypes for mamba (#2036)
|
2024-04-10 18:10:01 +02:00 |
mamba-minimal
|
Improved mamba model optimized for inference (#1694)
|
2024-02-11 17:04:57 +01:00 |
marian-mt
|
Add a KV cache to marian decoding. (#1226)
|
2023-10-31 08:47:44 +00:00 |
metavoice
|
Quantized version of the metavoice model. (#1824)
|
2024-03-09 11:06:04 +01:00 |
mistral
|
Include topk sampling in the quantized example. (#2005)
|
2024-04-04 09:27:54 +02:00 |
mixtral
|
Use the tokenizer-output-stream in the llama example. (#1715)
|
2024-02-15 16:47:33 +01:00 |
mnist-training
|
Allow for different behavior between training and eval (#1213)
|
2023-10-29 07:53:09 +01:00 |
mobileone
|
Move the image tensor to the appropriate device. (#1856)
|
2024-03-16 22:25:46 +01:00 |
moondream
|
Optimize copy-2d for metal. (#2024)
|
2024-04-07 12:34:16 +02:00 |
musicgen
|
Support more modes in the encodec example. (#1777)
|
2024-02-28 09:22:33 +01:00 |
onnx
|
Update docs to reflect current usage of example (#1610)
|
2024-02-04 11:59:47 +01:00 |
phi
|
Quantized GGUF style (#1523)
|
2024-01-17 10:27:58 +01:00 |
quantized
|
Add support for llama3 on the quantized example (#2086)
|
2024-04-18 22:52:00 +02:00 |
quantized-t5
|
Move the tensor-tools binary in a separate crate. (#1969)
|
2024-03-30 15:49:37 +01:00 |
qwen
|
Readme fix. (#1961)
|
2024-03-28 23:24:46 +01:00 |
recurrent-gemma
|
Add a quantized version of recurrent-gemma. (#2054)
|
2024-04-13 20:07:01 +02:00 |
reinforcement-learning
|
Update for pyo3 0.21. (#1985)
|
2024-04-01 17:07:02 +02:00 |
replit-code
|
Quantized GGUF style (#1523)
|
2024-01-17 10:27:58 +01:00 |
repvgg
|
Move the image tensor to the appropriate device. (#1856)
|
2024-03-16 22:25:46 +01:00 |
resnet
|
Move the image tensor to the appropriate device. (#1856)
|
2024-03-16 22:25:46 +01:00 |
rwkv
|
Fix the model path for rwkv. (#1825)
|
2024-03-09 11:21:48 +01:00 |
segformer
|
Update the image crate + use the re-exported version. (#1893)
|
2024-03-21 10:56:41 +01:00 |
segment-anything
|
Add negative prompts to segment-anything. (#1000)
|
2023-09-30 06:17:42 +01:00 |
stable-diffusion
|
Utilize batches in Stable Diffusion (#2071)
|
2024-04-16 06:49:04 +02:00 |
stable-lm
|
Avoid copying the data on squeeze and unsqueeze. (#1884)
|
2024-03-20 13:04:36 +01:00 |
starcoder2
|
Add the StarCoder2 model. (#1779)
|
2024-02-28 21:02:41 +01:00 |
t5
|
Expose the t5 config fields + allow t5-large. (#1987)
|
2024-04-01 20:58:34 +02:00 |
trocr
|
Move image tensor to device in trocr example (#2063)
|
2024-04-15 06:50:32 +02:00 |
vgg
|
Move the image tensor to the appropriate device. (#1856)
|
2024-03-16 22:25:46 +01:00 |
vit
|
Move the image tensor to the appropriate device. (#1856)
|
2024-03-16 22:25:46 +01:00 |
whisper
|
whisper readme (#1899)
|
2024-03-21 12:54:09 +01:00 |
whisper-microphone
|
feat: support microphone whisper streaming (#1678)
|
2024-02-12 18:01:21 +01:00 |
wuerstchen
|
Remove some unusued bits. (#1067)
|
2023-10-09 19:49:57 +01:00 |
yi
|
Fix token generation in bilingual models (non-English outputs) (#1668)
|
2024-02-06 12:03:53 +01:00 |
yolo-v3
|
Fix clippy lints for 1.76. (#1682)
|
2024-02-08 16:48:47 +01:00 |
yolo-v8
|
Support alternative dtypes for mamba (#2036)
|
2024-04-10 18:10:01 +02:00 |
onnx_basics.rs
|
[ONNX] Support a couple more ops. (#1284)
|
2023-11-06 22:44:58 +01:00 |