The simulator seems to work as expected for MAC operations that take `__half` datatypes as input but it aborts due to an assert failure in ptx_ir.h Are there any plans to support/model INT8 operations of TCUs?