-
Notifications
You must be signed in to change notification settings - Fork 186
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
16 changed files
with
84 additions
and
35 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,10 +1,10 @@ | ||
- [第0章 序言](chapter0/chapter0.md) | ||
- [第1章 预备知识](chapter1/chapter1.md) | ||
- [第2章 可学性](chapter2/chapter2.md) | ||
- [第3章 复杂度](chapter3/chapter3.md) | ||
- [第4章 泛化界](chapter4/chapter4.md) | ||
- [第5章 稳定性](chapter5/chapter5.md) | ||
- [第6章 一致性](chapter6/chapter6.md) | ||
- [第7章 收敛率](chapter7/chapter7.md) | ||
- [第8章 遗憾界](chapter8/chapter8.md) | ||
|
||
- [第0章 序言](chapter0.md) | ||
- [第1章 预备知识](chapter1.md) | ||
- [第2章 可学性](chapter2.md) | ||
- [第3章 复杂度](chapter3.md) | ||
- [第4章 泛化界](chapter4.md) | ||
- [第5章 稳定性](chapter5.md) | ||
- [第6章 一致性](chapter6.md) | ||
- [第7章 收敛率](chapter7.md) | ||
- [第8章 遗憾界](chapter8.md) | ||
- [参考文献](references.md) |
File renamed without changes.
File renamed without changes.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
File renamed without changes.
File renamed without changes.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes
File renamed without changes
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,49 @@ | ||
# 参考文献 | ||
|
||
Abernethy, Jacob, et al. "Optimal strategies and minimax lower bounds for online convex games." Proceedings of the 21st annual conference on learning theory. 2008. | ||
|
||
Auer, Peter. "Using confidence bounds for exploitation-exploration trade-offs." Journal of Machine Learning Research 3.Nov (2002): 397-422. | ||
|
||
Bouneffouf, Djallel. "Finite-time analysis of the multi-armed bandit problem with known trend." 2016 IEEE Congress on Evolutionary Computation (CEC). IEEE, 2016. | ||
|
||
Bubeck, Sébastien, Ronen Eldan, and Yin Tat Lee. "Kernel-based methods for bandit convex optimization." Journal of the ACM (JACM) 68.4 (2021): 1-35. | ||
|
||
Boyd, Stephen, and Lieven Vandenberghe. Convex optimization. Cambridge university press, 2004. | ||
|
||
Cesa-Bianchi, N., Conconi, A., and Gentile, C. "On the Generalization Ability of On-Line Learning Algorithms." IEEE Transactions on Information Theory, vol. 50, no. 9, 2004, pp. 2050-2057, doi:10.1109/TIT.2004.833339. | ||
|
||
Devroye, Luc, László Györfi, and Gábor Lugosi. A probabilistic theory of pattern recognition. Vol. 31. Springer Science & Business Media, 2013. | ||
|
||
Feller, William. "An introduction to probability theory and its applications." (1971). | ||
|
||
Flaxman, Abraham D., Adam Tauman Kalai, and H. Brendan McMahan. "Online convex optimization in the bandit setting: gradient descent without a gradient." arXiv preprint cs/0408007 (2004). | ||
|
||
Hazan, Elad, Amit Agarwal, and Satyen Kale. "Logarithmic regret algorithms for online convex optimization." Machine Learning 69.2 (2007): 169-192. | ||
|
||
Kakade, Sham M., and Ambuj Tewari. "On the generalization ability of online strongly convex programming algorithms." Advances in neural information processing systems 21 (2008). | ||
|
||
Kearns, Michael J., and Umesh Vazirani. An introduction to computational learning theory. MIT press, 1994. | ||
|
||
Lai, Tze Leung, and Herbert Robbins. "Asymptotically efficient adaptive allocation rules." Advances in applied mathematics 6.1 (1985): 4-22. | ||
|
||
McAllester, David A. "PAC-Bayesian stochastic model selection." Machine Learning 51.1 (2003): 5-21. | ||
|
||
Mohri, Mehryar. "Foundations of machine learning." (2018). | ||
|
||
Nakkiran, Preetum, et al. "Deep double descent: Where bigger models and more data hurt." Journal of Statistical Mechanics: Theory and Experiment 2021.12 (2021): 124003. | ||
|
||
Penot, Jean-Paul. "On regularity conditions in mathematical programming." Optimality and Stability in Mathematical Programming (1982): 167-199. | ||
|
||
Robbins, Herbert. "Some aspects of the sequential design of experiments." (1952): 527-535. | ||
|
||
Thompson, William R. "On the likelihood that one unknown probability exceeds another in view of the evidence of two samples." Biometrika 25.3-4 (1933): 285-294. | ||
|
||
Wainwright, Martin J. High-dimensional statistics: A non-asymptotic viewpoint. Vol. 48. Cambridge university press, 2019. | ||
|
||
Wang, Guanghui, Shiyin Lu, and Lijun Zhang. "Adaptivity and optimality: A universal algorithm for online convex optimization." Uncertainty in Artificial Intelligence. PMLR, 2020. | ||
|
||
Zhang, Lijun, Shiyin Lu, and Zhi-Hua Zhou. "Adaptive online learning in dynamic environments." Advances in neural information processing systems 31 (2018) | ||
|
||
Zhang, Lijun, Tie-Yan Liu, and Zhi-Hua Zhou. "Adaptive regret of convex and smooth functions." International Conference on Machine Learning. PMLR, 2019. | ||
|
||
Zinkevich, Martin. "Online convex programming and generalized infinitesimal gradient ascent." Proceedings of the 20th international conference on machine learning (icml-03). 2003. |