Ungraded Lab 1: Use of tie_in

Elemento · January 23, 2023, 6:39am

Hey Guys,
In ungraded lab 1 of Week 4, I see frequent use of the tie_in function from jax.lax, but I don’t understand its use case. As per the documentation here, it takes x and y as parameters and returns y. Why would we use this function anywhere instead of simply using y in the first place? It seems pretty redundant to me.

Cheers,
Elemento

arvyzukai · January 23, 2023, 8:41am

Hi @Elemento

This is a good question. I had the same one but could not find a detailed, definitive answer. My conclusion is that this is how the gradient graph needs to be maintained.

According to the explanation in the Notebook:

tie_in: Some non-numeric operations must be invoked during backpropagation. Normally, the gradient compute graph would determine invocation but these functions are not included. To force re-evaluation, they are ‘tied’ to other numeric operations using tie_in.

To me this suggests that the gradient of y is “tied” back to x so that the gradient compute graph is not broken. I think that is the reason for using this function instead of “simply using y”: it keeps these operations in the gradient graph, or re-includes them. Most of the time these non-numeric operations are related to masks, so it makes sense, but I could not find the details of the mechanism.

Maybe someone will elaborate more?

Cheers

Elemento · January 23, 2023, 11:19am

Thanks a lot @arvyzukai for your valuable inputs on this.

Cheers,
Elemento

cmosguy · March 20, 2023, 1:50pm

Thanks @Elemento and @arvyzukai for this discussion. I too was wondering about this function tie_in. When you go to the source: jax._src.lax.lax — JAX documentation

It says this is deprecated. It sounds like an important function, so why would they remove it? Here is the current definition:

def tie_in(x: Any, y: T) -> T:
  """Deprecated. Ignores ``x`` and returns ``y``."""
  return y
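
To make the quoted definition concrete, here is a minimal sketch in plain Python. It does not import JAX; it uses a local copy of the deprecated definition shown above. The names mask and activations are hypothetical stand-ins for the kind of values the lab passes in, not the lab's actual variables.

```python
from typing import Any, TypeVar

T = TypeVar("T")


def tie_in(x: Any, y: T) -> T:
    """Local copy of the deprecated definition quoted above: ignores x, returns y."""
    return y


# Hypothetical stand-ins, chosen only for illustration.
activations = [0.5, 0.2, 0.9]  # the value y used to be "tied" to
mask = [1, 0, 1]               # the non-numeric value being passed through

tied = tie_in(activations, mask)

# With this definition the call is a pure pass-through:
# the result is the very same object as the second argument.
print(tied is mask)
```

If that definition is what the installed version provides, then writing tie_in(x, y) and writing y behave the same, which matches the intuition in the original question.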

arvyzukai · March 21, 2023, 7:27am

@cmosguy

Don’t quote me on this, I’m just loosely speculating. If I had to guess, they were cleaning up the library, and tie_in seems to have been a “hack” of some kind to keep the gradient.

The previous docstring stated:

  """Gives ``y`` a fake data dependence on ``x``.
  When staging to XLA (e.g. running under jit or pmap), values that don't depend
  on computation inputs are computed op-by-op, and folded into the XLA
  computation as constants.
  ``tie_in`` provides a way to explicitly stage values into the computation.
  When staging to XLA and ``x`` is already staged, then the result of ``tie_in``
  is ``y``, but staged to XLA. Downstream use of the result will also be staged
  to XLA.
  """

Maybe they found another way around this, or it might even have been some kind of security flaw. We can only speculate, or we could contact the JAX developers and try to find out, but I guess the world moves on and some things are left to be forgotten.
