Mor Shpigel Nacson
PhD Student, Technion
Verified email at campus.technion.ac.il
Title
Cited by
Year
The implicit bias of gradient descent on separable data
D Soudry, E Hoffer, MS Nacson, S Gunasekar, N Srebro
Journal of Machine Learning Research 19 (70), 1-57, 2018
Cited by: 894, Year: 2018
Convergence of gradient descent on separable data
MS Nacson, J Lee, S Gunasekar, PHP Savarese, N Srebro, D Soudry
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
Cited by: 151, Year: 2019
Stochastic gradient descent on separable data: Exact convergence with a fixed learning rate
MS Nacson, N Srebro, D Soudry
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
Cited by: 86, Year: 2019
On the implicit bias of initialization shape: Beyond infinitesimal mirror descent
S Azulay, E Moroshko, MS Nacson, BE Woodworth, N Srebro, ...
International Conference on Machine Learning, 468-477, 2021
Cited by: 64, Year: 2021
Lexicographic and depth-sensitive margins in homogeneous and non-homogeneous deep models
MS Nacson, S Gunasekar, J Lee, N Srebro, D Soudry
International Conference on Machine Learning, 4683-4692, 2019
Cited by: 61, Year: 2019
Implicit bias of the step size in linear diagonal neural networks
MS Nacson, K Ravichandran, N Srebro, D Soudry
International Conference on Machine Learning, 16270-16295, 2022
Cited by: 38, Year: 2022
TAEN: temporal aware embedding network for few-shot action recognition
R Ben-Ari, MS Nacson, O Azulai, U Barzelay, D Rotman
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021
Cited by: 24, Year: 2021
At Stability's Edge: How to Adjust Hyperparameters to Preserve Minima Selection in Asynchronous Training of Neural Networks?
N Giladi, MS Nacson, E Hoffer, D Soudry
arXiv preprint arXiv:1909.12340, 2019
Cited by: 18, Year: 2019
Gradient descent monotonically decreases the sharpness of gradient flow solutions in scalar networks and beyond
I Kreisler, MS Nacson, D Soudry, Y Carmon
International Conference on Machine Learning, 17684-17744, 2023
Cited by: 6, Year: 2023
The implicit bias of minima stability in multivariate shallow ReLU networks
MS Nacson, R Mulayoff, G Ongie, T Michaeli, D Soudry
arXiv preprint arXiv:2306.17499, 2023
Cited by: 4, Year: 2023
Action recognition using limited data
R Ben-Ari, O Azulai, U Barzelay, MS Nacson
US Patent App. 17/219,322, 2022
Cited by: 1, Year: 2022
How Uniform Random Weights Induce Non-uniform Bias: Typical Interpolating Neural Networks Generalize with Narrow Teachers
G Buzaglo, I Harel, MS Nacson, A Brutzkus, N Srebro, D Soudry
arXiv preprint arXiv:2402.06323, 2024
Year: 2024
How Learning Rate and Delay Affect Minima Selection in Asynchronous Training of Neural Networks
N Giladi, MS Nacson, E Hoffer, D Soudry