Algorithmic Aspects of Learning General ReLU Neural Networks.
- Material Type
- Dissertation
- 0017161536
- Date and Time of Latest Transaction
- 20250211151409
- ISBN
- 9798382762982
- DDC
- 004
- Author
- Tang, Alex.
- Title/Author
- Algorithmic Aspects of Learning General ReLU Neural Networks.
- Publish Info
- [S.l.] : Northwestern University, 2024
- Publish Info
- Ann Arbor : ProQuest Dissertations & Theses, 2024
- Material Info
- 152 p.
- General Note
- Source: Dissertations Abstracts International, Volume: 85-11, Section: B.
- General Note
- Advisor: Vijayaraghavan, Aravindan.
- Dissertation Note
- Thesis (Ph.D.)--Northwestern University, 2024.
- Abstracts/Etc
- Summary: Non-linear activation functions enable deep neural networks to represent arbitrary relations. The Rectified Linear Unit (ReLU) is one of the most commonly used activation functions in modern neural network systems. Despite its empirical success, most of the theoretical principles governing ReLU-activated neural networks, including tractable learning algorithms, training dynamics, and convergence rates, remain unknown. To bridge this gap, we first present a convergence analysis of gradient descent for the problem of agnostically learning a single ReLU neuron with potentially non-zero bias under Gaussian distributions. Building on these results, we generalize our analysis from a single neuron to neural networks by presenting polynomial-time and sample-efficient algorithms for learning an unknown depth-2 feedforward ReLU-activated neural network with non-zero bias, under mild non-degeneracy assumptions. Using these ideas, we establish identifiability of the neural network parameters, as well as polynomial-time learnability of 2-layer ReLU-activated neural networks in the smoothed analysis framework under minimal assumptions.
- Subject Added Entry-Topical Term
- Computer science.
- Subject Added Entry-Topical Term
- Statistics.
- Subject Added Entry-Topical Term
- Mathematics.
- Index Term-Uncontrolled
- Learning algorithms
- Index Term-Uncontrolled
- Machine learning
- Index Term-Uncontrolled
- Neural networks
- Index Term-Uncontrolled
- Gaussian distributions
- Added Entry-Corporate Name
- Northwestern University Computer Science
- Host Item Entry
- Dissertations Abstracts International. 85-11B.
- Electronic Location and Access
- This material is available after logging in.
- Control Number
- joongbu:658204