Added layer for learnable eltwise y=kx+b #2996

Closed
wants to merge 2 commits into from

ducha-aiki
Contributor

This layer could be used as a building block for custom learnable activation functions. The PR is ready for review.
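
As a rough illustration of what such a layer computes, here is a minimal sketch of the forward pass y = k * x + b in plain C++. The function name `affine_forward`, the flat NCHW layout, and the choice of one (k, b) pair per channel are assumptions made for illustration; this is not the code from the PR.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of Caffe): applies y = k[c] * x + b[c]
// to a tensor stored flat in NCHW order, with `spatial` = height * width.
std::vector<float> affine_forward(const std::vector<float>& x,
                                  const std::vector<float>& k,
                                  const std::vector<float>& b,
                                  std::size_t num, std::size_t channels,
                                  std::size_t spatial) {
  std::vector<float> y(x.size());
  for (std::size_t n = 0; n < num; ++n) {
    for (std::size_t c = 0; c < channels; ++c) {
      // Start of channel c of image n in the flat buffer.
      const std::size_t base = (n * channels + c) * spatial;
      for (std::size_t i = 0; i < spatial; ++i) {
        y[base + i] = k[c] * x[base + i] + b[c];
      }
    }
  }
  return y;
}
```

Because k and b are ordinary parameters, both can receive gradients during backpropagation, which is what makes the transform learnable.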

@siddharthm83

@ducha-aiki, as per #3229 (comment), attached are:

  1. Sample output when this layer is used on CPU and GPU
  2. Prototxt file used (caffenet +bn +ea)

eltwiseaffine_gpu_bug.txt
train_val_prototxt.txt

@ducha-aiki
Contributor Author

@siddharthm83 Thanks, I will take a look at it.

@siddharthm83

@ducha-aiki, the log link seems broken. When you say that it is gone after restarting training, what exactly do you mean? Are you restarting from a saved solverstate?

@ducha-aiki
Contributor Author

@siddharthm83 sorry, the filenames changed after a repo update.

But it doesn't matter: I ran 100 restarts in a script and the code does seem to be broken, since it recovers only 2 times :( So it is time for me to debug.
By restart I mean starting from the same initialization: I initialize the net in Python and save the weights.

@siddharthm83

@ducha-aiki , any luck in identifying the issue? No hurry, just checking.

@ducha-aiki
Contributor Author

@siddharthm83 sorry, ICLR rebuttal deadline. Going to fix next week.

@itijyou

itijyou commented Jan 11, 2016

@ducha-aiki, I think the problem might be here: it should be CblasNoTrans, not CblasTrans.

// Gradient with respect to bias
for (int n = 0; n < num; ++n) {
  caffe_gpu_gemv<Dtype>(CblasNoTrans, channels, height * width, (Dtype)1.,
      top_diff + top[0]->offset(n), bias_multiplier_.gpu_data(), (Dtype)1.,
      bias_diff);
}
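
To see why the transpose flag matters: under standard BLAS semantics, gemv with CblasNoTrans computes y = alpha * A * x + beta * y for a row-major M x N matrix A. In the call above, A is one image's top_diff viewed as a channels x (height * width) matrix and x is the multiplier vector, presumably all ones, so each bias_diff[c] accumulates the sum of that channel's gradients. With CblasTrans the same call would instead compute A^T * x, which expects an input of length channels and an output of length height * width, so the shapes no longer match. Below is a minimal plain C++ sketch of the intended reduction; `bias_grad_accumulate` is a hypothetical name, not a Caffe function, and one bias per channel is assumed.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the per-image gemv call with CblasNoTrans,
// alpha = 1 and beta = 1, using a multiplier vector of ones:
//   bias_diff[c] += sum_i top_diff[n][c][i]
// `spatial` = height * width; top_diff is stored flat in NCHW order.
void bias_grad_accumulate(const std::vector<float>& top_diff,
                          std::size_t num, std::size_t channels,
                          std::size_t spatial,
                          std::vector<float>& bias_diff) {
  for (std::size_t n = 0; n < num; ++n) {
    for (std::size_t c = 0; c < channels; ++c) {
      const std::size_t base = (n * channels + c) * spatial;
      float sum = 0.0f;
      for (std::size_t i = 0; i < spatial; ++i) {
        sum += top_diff[base + i];  // row c of A times the ones vector
      }
      bias_diff[c] += sum;  // beta = 1 keeps the previously accumulated value
    }
  }
}
```

Note that the result is added to whatever bias_diff already holds, mirroring the beta = 1 argument in the snippet above.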

@ducha-aiki
Contributor Author

@wuzifeng thanks! Looks like it was the cause. I will finish training caffenet with it and update PR.

@siddharthm83

awesome, thanks @ducha-aiki and @wuzifeng

@ducha-aiki
Contributor Author

Travis crashes before it tries to compile anything, at the package download stage.

@jeffdonahue
Contributor

Replaced and merged in #3591. Thanks again for this work and your benchmarks @ducha-aiki!
