File tree
758 files changed
+1273212
-832844
lines changed- benchmarks
- cpp
- python
- cpp
- include/tensorrt_llm
- batch_manager
- common
- executor
- plugins/api
- runtime
- utils
- micro_benchmarks
- tensorrt_llm
- batch_manager
- aarch64-linux-gnu
- x86_64-linux-gnu
- x86_64-windows-msvc
- common
- cutlass_extensions/include/cutlass_extensions
- gemm/threadblock
- executor
- aarch64-linux-gnu
- x86_64-linux-gnu
- x86_64-windows-msvc
- kernels
- beamSearchKernels
- contextFusedMultiHeadAttention
- cubin
- cutlass_kernels
- fp8_rowwise_gemm
- decoderMaskedMultiheadAttention
- cubin
- decoderXQAImplJIT
- nvrtcWrapper
- aarch64-linux-gnu
- x86_64-linux-gnu
- x86_64-windows-msvc
- internal_cutlass_kernels
- aarch64-linux-gnu
- include
- x86_64-linux-gnu
- x86_64-windows-msvc
- lora
- mixtureOfExperts
- speculativeDecoding
- unfusedAttentionKernels
- weightOnlyBatchedGemv
- layers
- plugins
- api
- common
- eaglePlugin
- fp8RowwiseGemmPlugin
- gemmPlugin
- gptAttentionCommon
- gptAttentionPlugin
- identityPlugin
- lowLatencyGemmPlugin
- lowLatencyGemmSwigluPlugin
- ncclPlugin
- qserveGemmPlugin
- quantizePerTokenPlugin
- rmsnormQuantizationPlugin
- topkLastDimPlugin
- weightOnlyGroupwiseQuantMatmulPlugin
- weightOnlyQuantMatmulPlugin
- pybind
- batch_manager
- common
- executor
- runtime
- utils
- runtime
- utils
- thop
- tests
- common
- kernels
- weightOnly
- layers
- resources
- data
- scripts
- runtime
- docker
- common
- docs
- source
- advanced
- architecture
- blogs
- commands
- installation
- llm-api-examples
- media
- performance
- reference
- examples
- apps
- baichuan
- bert
- bloom
- chatglm
- commandr
- cpp/executor
- dbrx
- deepseek_v1
- deepseek_v2
- draft_target_model
- eagle
- enc_dec
- exaone
- falcon
- gemma
- gptj
- gptneox
- gpt
- grok
- internlm2
- internlm
- jais
- llama
- llm-api
- lookahead
- mamba
- medusa
- mixtral
- mllama
- model_api
- mpt
- multimodal
- nemotron_nas
- nemotron
- opt
- phi
- prompt_lookup
- python_plugin
- plugin_lib
- quantization
- qwenvl
- qwen
- recurrentgemma
- redrafter
- skywork
- smaug
- whisper
- scripts
- tensorrt_llm
- auto_parallel
- tensor_parallel/plugin_nodes
- bench
- benchmark
- build
- utils
- commands
- hlapi
- layers
- llmapi
- models
- baichuan
- chatglm
- commandr
- deepseek_v1
- deepseek_v2
- eagle
- enc_dec
- falcon
- gemma
- gptj
- gpt
- llama
- mamba
- medusa
- mllama
- nemotron_nas
- phi3
- phi
- qwen
- recurrentgemma
- redrafter
- plugin
- quantization
- runtime
- serve
- tools
- tests
- attention
- bindings
- functional
- hlapi
- apps
- llmapi
- _perf_evaluator
- apps
- model
- eagle
- python_plugin
- quantization
- utils
- windows
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
758 files changed
+1273212
-832844
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
29 | 29 | | |
30 | 30 | | |
31 | 31 | | |
32 | | - | |
| 32 | + | |
33 | 33 | | |
34 | 34 | | |
35 | 35 | | |
36 | 36 | | |
37 | | - | |
| 37 | + | |
38 | 38 | | |
39 | 39 | | |
40 | 40 | | |
| |||
55 | 55 | | |
56 | 56 | | |
57 | 57 | | |
| 58 | + | |
| 59 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
6 | 6 | | |
7 | 7 | | |
8 | 8 | | |
9 | | - | |
10 | | - | |
11 | | - | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
12 | 12 | | |
13 | 13 | | |
14 | | - | |
| 14 | + | |
15 | 15 | | |
16 | 16 | | |
17 | 17 | | |
18 | 18 | | |
19 | 19 | | |
20 | | - | |
21 | | - | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
22 | 24 | | |
23 | | - | |
| 25 | + | |
24 | 26 | | |
25 | 27 | | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
26 | 44 | | |
27 | 45 | | |
28 | 46 | | |
| |||
35 | 53 | | |
36 | 54 | | |
37 | 55 | | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
38 | 59 | | |
39 | 60 | | |
40 | 61 | | |
| |||
61 | 82 | | |
62 | 83 | | |
63 | 84 | | |
64 | | - | |
65 | | - | |
66 | | - | |
67 | 85 | | |
68 | 86 | | |
69 | 87 | | |
| |||
125 | 143 | | |
126 | 144 | | |
127 | 145 | | |
| 146 | + | |
128 | 147 | | |
129 | 148 | | |
130 | 149 | | |
| |||
0 commit comments