Commit 9a87476
authored
Kv cache transfer support duplicate heads (#4929)
Signed-off-by: Chuang Zhu <[email protected]>1 parent 947571c commit 9a87476
File tree
9 files changed
+226
-95
lines changed- cpp
- tensorrt_llm
- batch_manager
- executor/cache_transmission
- tests/batch_manager
9 files changed
+226
-95
lines changedLarge diffs are not rendered by default.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
96 | 96 | | |
97 | 97 | | |
98 | 98 | | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
99 | 104 | | |
100 | 105 | | |
101 | 106 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
66 | 66 | | |
67 | 67 | | |
68 | 68 | | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
69 | 74 | | |
70 | 75 | | |
71 | 76 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
185 | 185 | | |
186 | 186 | | |
187 | 187 | | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
188 | 191 | | |
189 | 192 | | |
190 | | - | |
| 193 | + | |
| 194 | + | |
| 195 | + | |
| 196 | + | |
| 197 | + | |
| 198 | + | |
| 199 | + | |
191 | 200 | | |
192 | 201 | | |
193 | 202 | | |
194 | 203 | | |
195 | 204 | | |
| 205 | + | |
| 206 | + | |
196 | 207 | | |
197 | 208 | | |
198 | 209 | | |
199 | | - | |
200 | 210 | | |
201 | 211 | | |
202 | 212 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
41 | 41 | | |
42 | 42 | | |
43 | 43 | | |
44 | | - | |
| 44 | + | |
45 | 45 | | |
46 | 46 | | |
47 | 47 | | |
| |||
469 | 469 | | |
470 | 470 | | |
471 | 471 | | |
| 472 | + | |
472 | 473 | | |
473 | 474 | | |
474 | 475 | | |
475 | 476 | | |
476 | 477 | | |
477 | | - | |
| 478 | + | |
478 | 479 | | |
479 | 480 | | |
480 | 481 | | |
481 | 482 | | |
| 483 | + | |
482 | 484 | | |
483 | 485 | | |
484 | 486 | | |
| |||
487 | 489 | | |
488 | 490 | | |
489 | 491 | | |
| 492 | + | |
490 | 493 | | |
491 | 494 | | |
492 | 495 | | |
493 | 496 | | |
494 | 497 | | |
495 | 498 | | |
496 | 499 | | |
| 500 | + | |
497 | 501 | | |
498 | 502 | | |
499 | 503 | | |
500 | 504 | | |
501 | 505 | | |
| 506 | + | |
502 | 507 | | |
503 | 508 | | |
504 | 509 | | |
505 | 510 | | |
| 511 | + | |
506 | 512 | | |
507 | 513 | | |
508 | 514 | | |
509 | 515 | | |
510 | 516 | | |
| 517 | + | |
511 | 518 | | |
512 | 519 | | |
513 | 520 | | |
514 | 521 | | |
515 | 522 | | |
| 523 | + | |
516 | 524 | | |
517 | 525 | | |
518 | 526 | | |
519 | 527 | | |
520 | 528 | | |
521 | | - | |
| 529 | + | |
522 | 530 | | |
523 | 531 | | |
524 | 532 | | |
525 | 533 | | |
526 | 534 | | |
| 535 | + | |
527 | 536 | | |
528 | 537 | | |
529 | 538 | | |
530 | 539 | | |
531 | 540 | | |
| 541 | + | |
532 | 542 | | |
533 | 543 | | |
534 | 544 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
72 | 72 | | |
73 | 73 | | |
74 | 74 | | |
75 | | - | |
| 75 | + | |
76 | 76 | | |
77 | | - | |
| 77 | + | |
78 | 78 | | |
79 | 79 | | |
80 | 80 | | |
| |||
Lines changed: 29 additions & 42 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
85 | 85 | | |
86 | 86 | | |
87 | 87 | | |
| 88 | + | |
| 89 | + | |
88 | 90 | | |
89 | 91 | | |
90 | 92 | | |
| |||
112 | 114 | | |
113 | 115 | | |
114 | 116 | | |
115 | | - | |
| 117 | + | |
| 118 | + | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
116 | 131 | | |
117 | 132 | | |
118 | 133 | | |
119 | 134 | | |
120 | 135 | | |
121 | | - | |
122 | | - | |
123 | | - | |
124 | | - | |
125 | | - | |
126 | | - | |
127 | | - | |
128 | | - | |
129 | | - | |
130 | | - | |
131 | | - | |
132 | | - | |
133 | | - | |
134 | | - | |
135 | | - | |
136 | | - | |
137 | | - | |
138 | | - | |
139 | | - | |
140 | | - | |
141 | | - | |
142 | | - | |
143 | | - | |
144 | | - | |
145 | | - | |
146 | | - | |
147 | | - | |
148 | | - | |
149 | | - | |
150 | | - | |
151 | | - | |
152 | | - | |
153 | | - | |
154 | | - | |
155 | | - | |
156 | | - | |
157 | | - | |
158 | | - | |
159 | | - | |
| 136 | + | |
160 | 137 | | |
161 | 138 | | |
162 | 139 | | |
| |||
791 | 768 | | |
792 | 769 | | |
793 | 770 | | |
| 771 | + | |
| 772 | + | |
| 773 | + | |
| 774 | + | |
794 | 775 | | |
795 | 776 | | |
796 | 777 | | |
| |||
840 | 821 | | |
841 | 822 | | |
842 | 823 | | |
843 | | - | |
| 824 | + | |
| 825 | + | |
844 | 826 | | |
845 | 827 | | |
846 | 828 | | |
| |||
1017 | 999 | | |
1018 | 1000 | | |
1019 | 1001 | | |
| 1002 | + | |
| 1003 | + | |
| 1004 | + | |
| 1005 | + | |
1020 | 1006 | | |
1021 | 1007 | | |
1022 | 1008 | | |
| |||
1064 | 1050 | | |
1065 | 1051 | | |
1066 | 1052 | | |
1067 | | - | |
| 1053 | + | |
| 1054 | + | |
1068 | 1055 | | |
1069 | 1056 | | |
1070 | 1057 | | |
| |||
Lines changed: 2 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
36 | 36 | | |
37 | 37 | | |
38 | 38 | | |
| 39 | + | |
| 40 | + | |
39 | 41 | | |
40 | 42 | | |
41 | 43 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
606 | 606 | | |
607 | 607 | | |
608 | 608 | | |
609 | | - | |
| 609 | + | |
| 610 | + | |
610 | 611 | | |
611 | 612 | | |
612 | 613 | | |
613 | 614 | | |
614 | 615 | | |
615 | | - | |
| 616 | + | |
| 617 | + | |
| 618 | + | |
| 619 | + | |
| 620 | + | |
| 621 | + | |
| 622 | + | |
616 | 623 | | |
617 | 624 | | |
618 | 625 | | |
| 626 | + | |
619 | 627 | | |
620 | 628 | | |
621 | 629 | | |
| |||
656 | 664 | | |
657 | 665 | | |
658 | 666 | | |
659 | | - | |
| 667 | + | |
660 | 668 | | |
661 | 669 | | |
662 | 670 | | |
| |||
806 | 814 | | |
807 | 815 | | |
808 | 816 | | |
809 | | - | |
| 817 | + | |
810 | 818 | | |
811 | 819 | | |
812 | 820 | | |
| |||
906 | 914 | | |
907 | 915 | | |
908 | 916 | | |
909 | | - | |
| 917 | + | |
910 | 918 | | |
911 | 919 | | |
912 | 920 | | |
| |||
970 | 978 | | |
971 | 979 | | |
972 | 980 | | |
973 | | - | |
| 981 | + | |
974 | 982 | | |
975 | 983 | | |
976 | 984 | | |
| |||
1063 | 1071 | | |
1064 | 1072 | | |
1065 | 1073 | | |
| 1074 | + | |
1066 | 1075 | | |
1067 | 1076 | | |
1068 | 1077 | | |
| |||
1343 | 1352 | | |
1344 | 1353 | | |
1345 | 1354 | | |
| 1355 | + | |
| 1356 | + | |
| 1357 | + | |
| 1358 | + | |
| 1359 | + | |
| 1360 | + | |
| 1361 | + | |
| 1362 | + | |
| 1363 | + | |
| 1364 | + | |
| 1365 | + | |
| 1366 | + | |
| 1367 | + | |
| 1368 | + | |
| 1369 | + | |
| 1370 | + | |
| 1371 | + | |
| 1372 | + | |
| 1373 | + | |
| 1374 | + | |
| 1375 | + | |
| 1376 | + | |
1346 | 1377 | | |
1347 | 1378 | | |
1348 | 1379 | | |
| |||
0 commit comments