Maximum recursion depth exceeded when resolving schedule from trace #9 ...
Using the example code at https://github.com/NVIDIA/Model-Optimizer/tree/main/examples/gpt-oss, we are trying to apply QAT after SFT training. When using QATSFTTrainer with a ...