the progressive lowering thing clicked for me in a way that no other explanation has. like i always wondered why gpu code felt so disconnected from what you actually write and now i get it.
curious about one thing though - you mentioned both torch.export and TorchScript for capturing the graph. has the community mostly moved to one over the other? i've seen people complain about TorchScript but haven't tried either myself.