Optimizing an opaque function

Hi,

I was playing around with optimization and gradient descent to understand them fully when I came to try to optimize a function that is opaque (i.e. a function for which I don’t know the implementation). I’m curious to know more about this case.
An example of this kind of function would be a function from a 3rd-party package that is configurable with some variables, and I want to find the best values of these variables for my training set.

So let’s call this function f. f has one variable w to configure its behavior, and takes a parameter x. I have a cost function J that I want to minimize by finding the best value of w.

So if I don’t know f, then I don’t know its analytical derivative either. That means there is no easy way to calculate the derivative of J. Right?

So the only way I can think of doing that is to “manually” estimate the derivative of f by changing w by a tiny amount and measuring the difference in the output. However, that would be very inefficient, since it adds a lot of extra computation compared to the case where the derivative is known analytically.
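To make the cost concrete, here is a minimal sketch of what I mean (the f, the data, and the learning rate are all made up for illustration); a central finite difference needs two extra evaluations of f per parameter, per gradient step:

```python
import numpy as np

def f(w, x):
    # Stand-in for the opaque 3rd-party function.
    return np.sin(w * x)

def J(w, x, y):
    # Cost: mean squared error between f's output and the targets.
    return np.mean((f(w, x) - y) ** 2)

def grad_J(w, x, y, eps=1e-6):
    # Central finite difference: dJ/dw ~= (J(w + eps) - J(w - eps)) / (2 * eps),
    # i.e. two evaluations of f just to estimate one derivative.
    return (J(w + eps, x, y) - J(w - eps, x, y)) / (2 * eps)

x = np.linspace(0.0, 1.0, 100)
y = np.sin(1.7 * x)             # pretend the "true" w is 1.7
w = 0.5
for _ in range(200):
    w -= 0.5 * grad_J(w, x, y)  # plain gradient descent step
print(w)                        # moves toward 1.7
```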

So I’m wondering: are there other, more efficient algorithms to optimize this kind of function?

Thanks!


If you use a framework like TensorFlow, you don’t need to calculate derivatives manually, because it offers automatic differentiation, which computes derivatives for you. This feature can be helpful when optimizing these types of functions, since it lets you determine the gradient of the function with respect to the parameters.
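For instance, here is a minimal sketch with tf.GradientTape (the toy cost is made up for illustration). One caveat, though: autodiff only works when the function is built from TensorFlow ops so the framework can trace it; a truly opaque black box can’t be differentiated this way.

```python
import tensorflow as tf

x = tf.linspace(0.0, 1.0, 100)
y = tf.sin(1.7 * x)             # made-up targets for illustration
w = tf.Variable(0.5)

def f(w, x):
    # Only differentiable because it is expressed with TensorFlow ops.
    return tf.sin(w * x)

with tf.GradientTape() as tape:
    loss = tf.reduce_mean((f(w, x) - y) ** 2)   # the cost J
grad = tape.gradient(loss, w)                   # dJ/dw, no manual math needed
print(grad.numpy())
```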

I’ve also heard about the Bayesian optimization algorithm, but I don’t know much about it.
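From what I understand, the appeal is that it only needs evaluations of the cost, never its gradient. A minimal sketch with scikit-optimize (just one library choice on my part; the toy cost is made up) would be something like this:

```python
import numpy as np
from skopt import gp_minimize   # pip install scikit-optimize

def objective(params):
    # Only evaluations of the opaque cost are needed, no derivatives;
    # gp_minimize fits a Gaussian-process surrogate of the objective
    # and decides where to evaluate next.
    w = params[0]
    x = np.linspace(0.0, 1.0, 100)
    return float(np.mean((np.sin(w * x) - np.sin(1.7 * x)) ** 2))

result = gp_minimize(objective, dimensions=[(0.0, 3.0)], n_calls=30, random_state=0)
print(result.x, result.fun)     # best w found and its cost
```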


Thanks for the answer. I’ll read about automatic differentiation and Bayesian optimization. That seems to be exactly the kind of answer I was looking for :slight_smile:


You don’t need to know the derivatives. If you know the outputs of the function for a range of input values, you can use those as a training set and learn a model of the function.
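A small sketch of that surrogate idea (the sampling grid and polynomial degree are arbitrary choices for illustration): evaluate the opaque cost on a coarse grid of w values, fit a cheap model to those samples, and then search the model instead of the expensive function.

```python
import numpy as np

def J(w):
    # Stand-in for the expensive/opaque cost as a function of w.
    x = np.linspace(0.0, 1.0, 100)
    return np.mean((np.sin(w * x) - np.sin(1.7 * x)) ** 2)

w_samples = np.linspace(0.0, 3.0, 15)             # a handful of real evaluations
J_samples = np.array([J(w) for w in w_samples])

coeffs = np.polyfit(w_samples, J_samples, deg=4)  # cheap surrogate model of J
w_fine = np.linspace(0.0, 3.0, 1000)
w_best = w_fine[np.argmin(np.polyval(coeffs, w_fine))]
print(w_best)                                     # the surrogate's guess at the best w
```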

Yes! In addition to @Mujassim_Jamal’s reply:

JAX also offers great auto-diff functionality: The Autodiff Cookbook — JAX documentation

This is super helpful, e.g. if you want to model highly complex relationships in high-dimensional parameter spaces and compute the gradients efficiently.
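A tiny example of what that looks like with jax.grad (the toy cost is made up, just to show the API), with the same caveat as for TensorFlow: the function has to be written with JAX ops so autodiff can trace it:

```python
import jax
import jax.numpy as jnp

x = jnp.linspace(0.0, 1.0, 100)
y = jnp.sin(1.7 * x)            # made-up targets for illustration

def J(w):
    # Cost as a pure function of w, expressed with JAX ops.
    return jnp.mean((jnp.sin(w * x) - y) ** 2)

dJ_dw = jax.grad(J)             # a new function that returns dJ/dw
print(dJ_dw(0.5))               # gradient evaluated at w = 0.5
```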

That being said, I have mainly worked with that framework in the context of physics-informed AI and adjoint sensitivities.

Here, autodiff helps to align the deep learning parameters with the physical effects and keep them in sync during an optimization that is highly complex!

Best regards
Christian

I stumbled upon the notion of “derivative-free optimization” (Derivative-free optimization - Wikipedia), which includes the approaches from the previous answers.
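For example, SciPy ships a few of these; a minimal sketch with the derivative-free Nelder-Mead simplex method (reusing the toy cost from earlier in the thread) would look like this:

```python
import numpy as np
from scipy.optimize import minimize

def J(params):
    # Only evaluations of the cost are used; no derivative is ever needed.
    w = params[0]
    x = np.linspace(0.0, 1.0, 100)
    return float(np.mean((np.sin(w * x) - np.sin(1.7 * x)) ** 2))

result = minimize(J, x0=[0.5], method="Nelder-Mead")
print(result.x, result.fun)     # best w found and its cost
```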

Thanks for sharing! These threads might be interesting for you:

and also this paper discussed here:

Best regards
Christian
