Skip to content

Actions: aphrodite-engine/aphrodite-engine

Deploy to GitHub Pages

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
702 workflow runs
702 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

chore: add deepseek_v2 torch.compile support
Deploy to GitHub Pages #702: Commit bb84dad pushed by AlpinDale
April 25, 2025 02:24 44s main
April 25, 2025 02:24 44s
chore: test & support torch.compile on more MoE models (#1385)
Deploy to GitHub Pages #701: Commit b8ebb9b pushed by AlpinDale
April 25, 2025 02:22 41s main
April 25, 2025 02:22 41s
feat: add multi-video support (#1384)
Deploy to GitHub Pages #700: Commit 9300c2e pushed by AlpinDale
April 25, 2025 02:17 42s main
April 25, 2025 02:17 42s
fix: ray existing instance detection issue (#1383)
Deploy to GitHub Pages #699: Commit 3aa421f pushed by AlpinDale
April 25, 2025 01:44 42s main
April 25, 2025 01:44 42s
feat: torch.compile support for MoE models (#1382)
Deploy to GitHub Pages #698: Commit 237a05f pushed by AlpinDale
April 25, 2025 01:42 44s main
April 25, 2025 01:42 44s
core: improve cuda graph memory usage with tensor weakref (#1381)
Deploy to GitHub Pages #697: Commit 1a5214c pushed by AlpinDale
April 24, 2025 23:57 44s main
April 24, 2025 23:57 44s
model: add support for Qwen2ForSequenceClassification arch (#1380)
Deploy to GitHub Pages #696: Commit 2664d75 pushed by AlpinDale
April 24, 2025 21:54 43s main
April 24, 2025 21:54 43s
API: add support for bad_words sampling (#1379)
Deploy to GitHub Pages #695: Commit 2d139d0 pushed by AlpinDale
April 24, 2025 21:19 58s main
April 24, 2025 21:19 58s
(4/N) Triton Backend: Triton Flash Attention backend (#1376)
Deploy to GitHub Pages #694: Commit a91e570 pushed by AlpinDale
April 22, 2025 02:27 50s main
April 22, 2025 02:27 50s
chore: add openvino platform selector (#1375)
Deploy to GitHub Pages #693: Commit e4cb3ab pushed by AlpinDale
April 22, 2025 02:13 50s main
April 22, 2025 02:13 50s
fix: disable continuous_usage_stats by default (#1374)
Deploy to GitHub Pages #692: Commit 39a0827 pushed by AlpinDale
April 22, 2025 02:10 41s main
April 22, 2025 02:10 41s
fix: compressed_tensors_moe bad config strategy (#1373)
Deploy to GitHub Pages #691: Commit 01001a5 pushed by AlpinDale
April 22, 2025 01:07 52s main
April 22, 2025 01:07 52s
V1: add sliding window support for Flash Attention (#1372)
Deploy to GitHub Pages #690: Commit c0ef719 pushed by AlpinDale
April 22, 2025 00:13 41s main
April 22, 2025 00:13 41s
chore: layer lora module for GraniteMoE (#1371)
Deploy to GitHub Pages #689: Commit 96065cb pushed by AlpinDale
April 22, 2025 00:11 44s main
April 22, 2025 00:11 44s
fix: remove xformers as a requirement for pixtral (#1370)
Deploy to GitHub Pages #688: Commit 348165b pushed by AlpinDale
April 22, 2025 00:10 49s main
April 22, 2025 00:10 49s
readme: add sponsors section (#1369)
Deploy to GitHub Pages #687: Commit 1633ef7 pushed by AlpinDale
April 22, 2025 00:07 42s main
April 22, 2025 00:07 42s
kernel: implement an efficient MoE sum kernel (#1368)
Deploy to GitHub Pages #686: Commit 4a99476 pushed by AlpinDale
April 22, 2025 00:03 49s main
April 22, 2025 00:03 49s
fix: default value check for image_url.detail (#1367)
Deploy to GitHub Pages #685: Commit b8edabf pushed by AlpinDale
April 21, 2025 21:57 51s main
April 21, 2025 21:57 51s
VLM: compute llava_next max_tokens/dummy_data from grid points (#1366)
Deploy to GitHub Pages #684: Commit d58e029 pushed by AlpinDale
April 21, 2025 21:56 41s main
April 21, 2025 21:56 41s
fix: disable post_norm for llava models (#1365)
Deploy to GitHub Pages #683: Commit 628cc8f pushed by AlpinDale
April 21, 2025 21:54 44s main
April 21, 2025 21:54 44s
torch.compile: all-gather compilation + more model support (#1364)
Deploy to GitHub Pages #682: Commit 97afdd9 pushed by AlpinDale
April 21, 2025 21:52 41s main
April 21, 2025 21:52 41s
torch.compile: add support for more models (#1363)
Deploy to GitHub Pages #681: Commit 5bc1475 pushed by AlpinDale
April 21, 2025 21:33 53s main
April 21, 2025 21:33 53s
core: simplify sequence group code (#1362)
Deploy to GitHub Pages #680: Commit bbe0c8c pushed by AlpinDale
April 21, 2025 21:27 45s main
April 21, 2025 21:27 45s
V1: clean up requests when aborted (#1361)
Deploy to GitHub Pages #679: Commit 79b8218 pushed by AlpinDale
April 21, 2025 21:04 53s main
April 21, 2025 21:04 53s
fix: PP for ChatGLM and Molmo (#1360)
Deploy to GitHub Pages #678: Commit faaff47 pushed by AlpinDale
April 21, 2025 18:30 49s main
April 21, 2025 18:30 49s