Triton interpreter cannot handle parameters that alias #5791
Comments
Yeah, I acknowledge the problem and thank you for reporting it. I think we should first check the storage of all tensors to determine if any tensor is a slice of or identical to another tensor in the input arguments. These child tensors should then be excluded from the copy process and instead take a slice from the copied "parent" tensors. When storing data back to the GPU, a similar process will be performed. It's not a priority since most kernels do not have such a case.
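A rough sketch of that idea (my own illustration, not code from Triton): group the input tensors by their underlying storage, treat the first tensor seen for each storage as the "parent" to be copied, and rebuild later tensors that share the storage as views of the parent's copy.

```python
# Illustrative sketch only, not Triton's actual implementation.
import torch

def split_by_storage(args):
    parents, children = {}, []
    for i, t in enumerate(args):
        key = t.untyped_storage().data_ptr()
        if key in parents:
            children.append((i, key))   # shares storage with an earlier argument
        else:
            parents[key] = (i, t)       # first tensor seen for this storage
    return parents, children

# A child could then be rebuilt as a view of the copied parent, e.g. (assuming the
# parent starts at storage offset 0 and covers the child's elements):
#   child_copy = parent_copy.as_strided(child.size(), child.stride(), child.storage_offset())
```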
Let me know if you want to propose a fix and I will assign the issue to you.
So the reason I filed this as a bug instead of just opening a PR is that this actually seems nontrivial to solve ;) Figuring out if two parameters alias seems like a hard problem, because you have to perform an intersection of two arbitrarily-strided tensors, and I don't know if there is any API or non-annoying way to compute this. I can keep thinking about it but if you have ideas I'd be happy to hear them.
It won't be that difficult to solve. Any tensor that is a view of another will share the same underlying storage.
Well, you can have aliasing occur from any pointer really, not just views into the same tensor. Like, you can wrap a tensor around an existing allocation, and you'll run into issues if there is any overlap. The following two tensors partially overlap for example:
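The concrete snippet appears to be missing from this copy of the thread; a hypothetical construction with the overlap pattern described below might look like this (the 0x1000 base address is illustrative, not a real allocation):

```python
# Hypothetical illustration (addresses are made up): two tensors over the same
# int16 buffer whose element sets partially intersect.
import numpy as np

buf = np.zeros(8, dtype=np.int16)  # suppose the buffer happens to start at 0x1000
a = np.lib.stride_tricks.as_strided(buf, shape=(5,), strides=(2,))
# a touches bytes 0x1000, 0x1002, 0x1004, 0x1006, 0x1008
b = np.lib.stride_tricks.as_strided(buf[1:], shape=(2,), strides=(6,))
# b touches bytes 0x1002 and 0x1008
```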
They overlap at 0x1002 and 0x1008. I think the general version of this would require computing some sort of LCM stride for all the parameters and then figuring out what the layout of the overlap is.
In PyTorch, to see if two tensors are views of the same tensor, you can do:
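(The snippet itself seems to be missing from this copy; presumably it was something along the lines of comparing the underlying storages, e.g.:)

```python
# One plausible version of the check (an assumption, since the original snippet is
# missing): views of the same tensor share an underlying storage, so their storage
# data pointers compare equal.
import torch

x = torch.arange(10)
y = x[2:]  # a view of x
print(x.untyped_storage().data_ptr() == y.untyped_storage().data_ptr())  # True
```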
Right, but my point is that you can form a tensor around any memory address, including one owned by someone else, without it needing to be a view. I'm fine with going "yeah, but that is dumb and we are not going to do aliasing checks for that", but ideally that is something we decide on rather than just missing that case by accident :)
Can you show me how you create overlapped tensors with different storages?
Here is a minimal example:

```python
import numpy as np
import torch

a = np.random.randn(100)
b = torch.from_numpy(a)
c = torch.from_numpy(a[1:])
assert b.untyped_storage().data_ptr() != c.untyped_storage().data_ptr()
```

I'm inclined to say this is too much of an edge case to focus on though.
Fair enough example; in that case, detecting the maximum memory range of each input argument from its strides and sizes would be required.
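A minimal sketch of what that could look like (my own illustration, assuming non-negative strides, which PyTorch tensors always have):

```python
import torch

def byte_extent(t: torch.Tensor):
    # Byte range [start, end) touched by a tensor, derived from its data pointer,
    # sizes, strides, and element size.
    start = t.data_ptr()
    if t.numel() == 0:
        return start, start
    last_elem = sum((size - 1) * stride for size, stride in zip(t.shape, t.stride()))
    return start, start + (last_elem + 1) * t.element_size()

def may_overlap(a: torch.Tensor, b: torch.Tensor) -> bool:
    # Conservative check: overlapping byte ranges do not guarantee that elements
    # actually collide, but disjoint ranges guarantee that they do not.
    a0, a1 = byte_extent(a)
    b0, b1 = byte_extent(b)
    return a0 < b1 and b0 < a1
```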
I think that we can do something that works on a best effort basis, and that would already cover 99% of the use cases. Then we can leave a note somewhere mentioning that if you twist the interpreter's arm a bit too much with views + funny constructors (not regular PyTorch ops) it might struggle. |
Agreed. |
Alright, I’m happy to take it in that case. At the very least this will let me clear out some of the workarounds we have in our code :) |
Describe the bug
When invoking the interpreter, Triton makes a copy of the tensors passed in so that they can be operated on directly (e.g. by copying them to the host). Unfortunately, the straightforward approach (give each input tensor its own host copy, run the kernel, then copy the results back) is subtly incorrect when parameters alias. In particular, a kernel like this:
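(The original repro snippet is missing from this copy of the issue; a hypothetical kernel in the spirit of the description, run with TRITON_INTERPRET=1, might look like this:)

```python
# Hypothetical repro (names and details are illustrative, not the original snippet):
# buffer and buffer2 alias the same tensor; the kernel writes through buffer only.
import torch
import triton
import triton.language as tl

@triton.jit
def aliasing_kernel(buffer, buffer2):
    tl.store(buffer, 1)  # write 1 through the first alias

t = torch.zeros(1, dtype=torch.int32, device="cuda")
aliasing_kernel[(1,)](t, t)  # the same tensor is passed for both parameters
print(t.item())
```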
should print "1", but it prints "0". This is because buffer's host copy is written back after the kernel runs, and then buffer2's host copy (which isn't written to) is copied back over the same memory, overwriting it with the original value.
Environment details
Triton: built from main
GPU: H100