Named tensors #5048
Comments
Have you started working on this, @juliuskunze?
We are actually working on something that will pretty much realize the plan that @juliuskunze has outlined here, with some additional benefits too (e.g. making it very easy to shard those programs with named axes over multiple accelerators).
Am I correct in assuming that this evolved into named axes, or is there another module I did not find?
That's correct.
@apaszke @froystig That looks awesome! Rad choice not taking the order of named axes into account and broadcasting by name! That's semantically cleaner and probably more future-proof than I expected. (: The thing that I thought would make this impractical is that it's hard to optimize misaligned axes for dot products and similar ops where implicit transposes are needed on device. I guess the performance hit is not so bad, or axis-order optimization could/should be automated in the future anyway? Curious about your thoughts on this. +1 for allowing arrays and operations with named axes outside of
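(The named-axes work referenced in this thread shipped as JAX's experimental `xmap` transform, which was later deprecated. The following is a minimal sketch of the by-name semantics discussed above, assuming a JAX version where `jax.experimental.maps.xmap` is still available.)

```python
# Minimal named-axes sketch using the experimental (since-deprecated) xmap transform.
import jax
import jax.numpy as jnp
from jax.experimental.maps import xmap

def demean(x):
    # Inside xmap, the 'batch' axis is not a positional dimension of x;
    # collectives address it by name instead.
    return x - jax.lax.pmean(x, 'batch')

f = xmap(demean,
         in_axes=['batch', ...],   # axis 0 of the argument is named 'batch'
         out_axes=['batch', ...])  # put 'batch' back at position 0 of the output

x = jnp.arange(6.0).reshape(3, 2)
print(f(x))  # each column now has zero mean over the 'batch' axis
```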
A more powerful implementation is to use first-class dimensions; torchdim uses objects as dimension "variables".
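(For context, a small sketch of the first-class-dimensions approach using torchdim's `functorch.dim` module; this illustrates the idea in PyTorch and is not a JAX API.)

```python
# First-class dimensions with torchdim: dimension objects replace positional axes,
# so reductions and alignment are done by object identity rather than by position.
import torch
from functorch.dim import dims

batch, feature = dims(2)        # two first-class dimension objects
x = torch.randn(4, 3)

xb = x[batch, feature]          # bind positional axes 0 and 1 to the dim objects
y = (xb * xb).sum(feature)      # reduce over 'feature' by object, not by position
print(y.order(batch).shape)     # back to a positional tensor: torch.Size([4])
```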
@apaszke Perhaps it could be made further independent of axis position by utilizing the `named_shape`, for example:

```python
# tensor.named_shape == {'batch': 32, 'time': 100, 'hidden': 200}

# Select the sub-tensor at time=0 and hidden=0, and set it to 1000 (with broadcasting).
tensor[{'time': 0, 'hidden': 0}] = 1000

# JAX would automatically permute dimensions for operations,
# e.g. (batch, time, hidden) -> (time, batch, hidden), so that each slice has
# t.named_shape == {'batch': 32, 'hidden': 200}.
for t in tensor['time']:
    ...
```
@Bit0r I built your suggestion into some code I wrote for operating on a
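(A hypothetical sketch of how the dict-based selection proposed above could be emulated on a plain `jax.numpy` array plus a separate name-to-axis mapping; `select` and the explicit `named_shape` dict are illustrative only, not an existing JAX API, and not the code referenced in the previous comment.)

```python
# Emulate selection by axis name on a plain jax.numpy array; illustrative helper only.
import jax.numpy as jnp

def select(x, named_shape, index):
    """Index axes by name, e.g. select(x, {'batch': 32, ...}, {'time': 0})."""
    axes = list(named_shape)              # axis names in positional order
    idx = [slice(None)] * x.ndim
    for name, i in index.items():
        idx[axes.index(name)] = i
    return x[tuple(idx)]

x = jnp.zeros((32, 100, 200))
named_shape = {'batch': 32, 'time': 100, 'hidden': 200}

row = select(x, named_shape, {'time': 0, 'hidden': 0})
print(row.shape)  # (32,): only the 'batch' axis remains
```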
PyTorch has experimental support for named tensors achieving some compelling design goals while keeping existing code compatible. For example, binop broadcasting is still based on dimension order (unlike in xarray), consistent with standard NumPy/JAX/... semantics, but checks that aligned dimension names match.
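(To make that concrete, a small example with PyTorch's experimental named tensor API: alignment is still positional from the right, but the names of aligned dimensions must match.)

```python
# PyTorch named tensors: broadcasting stays positional, but aligned names are checked.
import torch

x = torch.randn(2, 3, names=('N', 'C'))
y = torch.randn(3, names=('C',))
print((x + y).names)        # ('N', 'C'): the aligned dims are both named 'C'

w = torch.randn(3, names=('W',))
try:
    x + w                   # aligned dims are named 'C' and 'W'
except RuntimeError as err:
    print('name mismatch:', err)
```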
It would be great to have named tensors that work both in op-by-op execution and under function transformations in JAX.
@shoyer In #1565 you mentioned that this could be done by wrapping JAX based on #611. According to my current understanding, this means:

1. Implement an `eval_names` transform.
2. Implement `NamedDeviceArray`, a subtype of `DeviceArray` that adds a `names` property.
3. Make ops applicable to `NamedDeviceArray`s. For that, mirror `jax.numpy`, wrapping each op with the `named` transform.
4. Implement NumPy's public API on `NamedDeviceArray` using #611 (Implement overrides of NumPy's public API on JAX arrays; +1 for merging). Alternatively, one could rewrite `jax.numpy` using `numpy_dispatch.get_array_module` from #4076 (Add experimental `__array_module__` method), which appears cumbersome.
5. Make sure `jit`ted functions propagate names when applied to `NamedDeviceArray`s.

Is this plan sound? @shoyer @mattjj Would you update (and merge, if successful) #611 just for this application? In that case, I'd be interested in prototyping a named tensor library for JAX, with a good amount of passion, in accordance with #1565. (:
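(Not the actual proposal, but a minimal sketch of the kind of wrapper the plan above describes; `NamedArray` and `named` are hypothetical names and do not exist in JAX.)

```python
# Hypothetical prototype: pair a jax.numpy array with per-axis names and wrap
# elementwise ops so they check and propagate those names.
import jax.numpy as jnp

class NamedArray:
    """A plain array paired with one name per axis."""
    def __init__(self, data, names):
        assert data.ndim == len(names)
        self.data = data
        self.names = tuple(names)

    def __repr__(self):
        return f"NamedArray(shape={self.data.shape}, names={self.names})"

def named(op):
    """Wrap an elementwise jax.numpy op so it checks and propagates axis names."""
    def wrapped(*args):
        named_args = [a for a in args if isinstance(a, NamedArray)]
        names = named_args[0].names
        if any(a.names != names for a in named_args):
            raise ValueError("axis names of the operands do not match")
        raw = [a.data if isinstance(a, NamedArray) else a for a in args]
        return NamedArray(op(*raw), names)
    return wrapped

add = named(jnp.add)
x = NamedArray(jnp.ones((2, 3)), ('batch', 'feature'))
y = NamedArray(jnp.zeros((2, 3)), ('batch', 'feature'))
print(add(x, y))  # NamedArray(shape=(2, 3), names=('batch', 'feature'))
```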