trunk/37753b6117a7447c4137a4afb62d99f16fa4ccc0: Revert "[SymmMem] put_signal and wait_signal (#174034)"
A new PyTorch feature caused internal crashes, forcing a swift reversal.
Deep Dive
A recent change to PyTorch, a popular AI framework, added two synchronization operations, 'put_signal' and 'wait_signal', to its symmetric memory (SymmMem) component (#174034). The change caused internal crashes involving a type conversion error. The development team has now reverted it, removing the new code to restore stability. The episode shows how large open-source projects depend on continuous testing and fast reverts to stay reliable for their users.
Why It Matters
The revert restores stability for developers and companies that rely on PyTorch for critical AI applications.