Enable TensorIndexer with all C++ tests #5724
base: main
!test
Review updated until commit 0823fe7

Description

| Relevant files |
|---|
| Bug fix |
| Tests |
PR Reviewer Guide
Here are some key observations to aid the review process:
- 🧪 PR contains tests
- ⚡ Recommended focus areas for review: Code Movement Impact
Test failures
- (High, 189) CUDA driver version insufficient for runtime on dlcluster_h100 (nvFuser test suites)

  | Test Name | H100 | Source |
  |---|---|---|
  | ArgsortParameterizedWithBlockAndBatch.SharedMemoryRequirement/128_1_0_0 | ❌ | Link |
  | ArgsortParameterizedWithBlockAndBatch.SharedMemoryRequirement/4096_2_1_1 | ❌ | Link |
  | ArgsortParameterizedWithBlockAndBatch.SharedMemoryRequirement/512_1_0_0 | ❌ | Link |
  | ArgsortParameterizedWithBlockAndBatch.SharedMemoryRequirement/512_1_1_1 | ❌ | Link |
  | ArgsortTest.ZeroDimensionalInput | ❌ | Link |
  | BlackwellMatmulTest.EpilogueSiluPersistentBroadcastInputs | ❌ | Link |
  | BlockSizeAndItemsPerThread/ArgSortComprehensiveTest.ComprehensiveValidation/BlockSize128_ItemsPerThread4 | ❌ | Link |
  | ClusterReductionTest.SimpleFusionAllReduce/cluster_10_dtype_float | ❌ | Link |
  | ClusterReductionTest.SimpleFusionAllReduce/cluster_6_dtype_double | ❌ | Link |
  | ClusterReductionTest.SimpleFusionAllReduce/cluster_9_dtype___bfloat | ❌ | Link |

  ... with 179 more test failures omitted. Check internal logs.

- (High, 44) NCCL NVLS multicast memory bind failures across multidevice/nvfuser test suites on dlcluster_viking_ci

  | Test Name | H100 (dist.) | Source |
  |---|---|---|
  | tests.python.multidevice.test_communication.test_allgather | ❌ | |
  | tests.python.multidevice.test_communication.test_allgather_expanded_broadcast | ❌ | |
  | tests.python.multidevice.test_communication.test_allreduce | ❌ | |
  | tests.python.multidevice.test_communication.test_reduce_scatter | ❌ | |
  | tests.python.multidevice.test_communication.test_reduce_scatter_noncontiguous | ❌ | |
  | tests.python.multidevice.test_dtensor.test_column_parallel_linear | ❌ | |
  | tests.python.multidevice.test_dtensor.test_plus_one | ❌ | |
  | tests.python.multidevice.test_dtensor.test_row_parallel_linear | ❌ | |
  | tests.python.multidevice.test_expert_parallel.test_dispatch_and_combine | ❌ | |
  | tests.python.multidevice.test_matmul.test_column_parallel_grouped_mm | ❌ | |

  ... with 34 more test failures omitted. Check internal logs.

- (Medium, 1) NCCL invalid usage error in multidevice overlap tests (tests/python/multidevice/test_overlap.py)

  | Test Name | H100 (dist.) | Source |
  |---|---|---|
  | tests.python.multidevice.test_overlap.test_overlap_allgather_matmul_shard_outermost[backend_type=CommunicatorBackend.cuda] | ❌ | |
!test
No description provided.