Tired: loss minimalists. Wired: loss maximalists.
by @sharifshameem :)
A dataset fed through a data loader that doesn't reshuffle after each epoch? Generously contributed by @richardgalvez.
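A minimal sketch of the fix, assuming PyTorch and a standard map-style Dataset (the toy tensors below are hypothetical stand-ins for the contributor's data): with shuffle=True the DataLoader re-draws the sample order at the start of every epoch, which removes the telltale periodic pattern in the loss.

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Hypothetical toy dataset standing in for the contributor's data.
dataset = TensorDataset(torch.randn(1024, 10), torch.randint(0, 2, (1024,)))

# Without shuffling, every epoch visits the samples in the same order,
# so the loss repeats the same bumps epoch after epoch.
loader_fixed_order = DataLoader(dataset, batch_size=32, shuffle=False)

# With shuffle=True, the order is re-drawn each epoch.
loader_shuffled = DataLoader(dataset, batch_size=32, shuffle=True)
```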
An educational post! We're looking at the validation accuracy of a model as a function of the dropout rate we train with. This trend is consistent with my overall experience: models with less dropout train faster at first, but models with higher dropout win out eventually. One model's dropout is quite extreme (0.85), but it is still gaining on the others! What's going to happen as we train longer? #soexciting
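A minimal sketch of the kind of sweep the post is describing, assuming PyTorch and a small hypothetical MLP classifier (the layer sizes are illustrative, not from the post); the plotted curves would be validation accuracy per epoch for each dropout rate.

```python
import torch
import torch.nn as nn

def make_mlp(dropout_rate: float) -> nn.Module:
    """Hypothetical classifier whose only difference across runs is the dropout rate."""
    return nn.Sequential(
        nn.Linear(784, 256),
        nn.ReLU(),
        nn.Dropout(p=dropout_rate),  # active under model.train(), disabled under model.eval()
        nn.Linear(256, 10),
    )

# One model per dropout setting, including the extreme 0.85 from the post.
models = {p: make_mlp(p) for p in (0.0, 0.25, 0.5, 0.85)}
```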
A Spatial Transformer Network identifying right whales, with plots of the L2 regularization term and the loss.
Contributed by @robibok
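A minimal sketch of the two pieces the caption mentions, assuming PyTorch; the SimpleSTN module below is a generic illustration, not the contributor's whale model. The localization head predicts a 2x3 affine matrix that resamples the input, and the L2 regularization comes in through the optimizer's weight_decay term.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SimpleSTN(nn.Module):
    """Predicts a 2x3 affine matrix and resamples the input with it."""
    def __init__(self):
        super().__init__()
        self.loc = nn.Sequential(
            nn.Conv2d(1, 8, kernel_size=7), nn.MaxPool2d(2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(4), nn.Flatten(),
            nn.Linear(8 * 4 * 4, 2 * 3),
        )
        # Initialize the final layer to the identity transform so training starts stable.
        self.loc[-1].weight.data.zero_()
        self.loc[-1].bias.data.copy_(torch.tensor([1, 0, 0, 0, 1, 0], dtype=torch.float))

    def forward(self, x):
        theta = self.loc(x).view(-1, 2, 3)
        grid = F.affine_grid(theta, x.size(), align_corners=False)
        return F.grid_sample(x, grid, align_corners=False)

model = SimpleSTN()
# weight_decay applies an L2 penalty to the weights, i.e. the "L2 reg" in the caption.
optimizer = torch.optim.Adam(model.parameters(), weight_decay=1e-4)
```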