Skip to content

Actions: IBM/text-generation-inference

Actions

Test

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
355 workflow runs
355 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Free blocks in KVCacheManager upon error
Test #362: Pull request #96 opened by tdoublep
May 14, 2024 14:38 7m 55s tpa-cleanup-blocks
May 14, 2024 14:38 7m 55s
TGIS gRPC adapter for lm-eval
Test #361: Pull request #90 synchronize by maxdebayser
May 13, 2024 14:48 6m 44s lm-eval
May 13, 2024 14:48 6m 44s
May 10, 2024 20:13 8m 35s
fix: check for tokenizer eos_token in ModelInfo response
Test #359: Pull request #93 synchronize by tjohnson31415
May 10, 2024 18:03 7m 25s fix-modelinfo-eos
May 10, 2024 18:03 7m 25s
May 10, 2024 17:52 7m 56s
feat: deprecate TRANSFORMERS_CACHE, use HF_HUB_CACHE everywhere
Test #357: Pull request #89 synchronize by tjohnson31415
May 10, 2024 15:53 8m 12s cache-env-vars
May 10, 2024 15:53 8m 12s
May 10, 2024 13:33 7m 47s
TGIS gRPC adapter for lm-eval
Test #352: Pull request #90 opened by maxdebayser
May 9, 2024 16:06 6m 46s lm-eval
May 9, 2024 16:06 6m 46s
Fix llama gqa attention bias (#88)
Test #347: Commit e87d462 pushed by njhill
May 8, 2024 17:38 7m 47s main
May 8, 2024 17:38 7m 47s
Fix llama gqa attention bias
Test #346: Pull request #88 opened by njhill
May 8, 2024 16:32 8m 7s fix-attn-bias
May 8, 2024 16:32 8m 7s
✨ allow single-shard paged attention
Test #345: Pull request #86 synchronize by joerunde
May 7, 2024 20:30 7m 35s paged-attn
May 7, 2024 20:30 7m 35s
Log number of KVCacheManager blocks at init (#87)
Test #344: Commit f091ad5 pushed by tdoublep
May 7, 2024 19:27 5m 53s main
May 7, 2024 19:27 5m 53s
May 6, 2024 21:18 7m 53s
✨ allow single-shard paged attention
Test #341: Pull request #86 opened by joerunde
May 6, 2024 20:52 7m 15s paged-attn
May 6, 2024 20:52 7m 15s
added mlp and attn bias option to flash and paged llama models
Test #340: Pull request #85 synchronize by joerunde
May 6, 2024 18:56 7m 32s attn_mlp_bias
May 6, 2024 18:56 7m 32s
Added attn and mlp bias
Test #338: Pull request #84 synchronize by JRosenkranz
May 6, 2024 15:32 7m 14s added_attn_mlp_bias
May 6, 2024 15:32 7m 14s