Posted 2025-03-26Updated 2025-04-01AI / RL5 minutes read (About 809 words)RL-规划与学习Planing and Learning都学了一半了才来理概念,怎么想的Read more
Posted 2025-03-25Updated 2025-04-01AI / RL15 minutes read (About 2178 words)RL-时序差分Temporal DifferenceRead more
Posted 2025-03-24Updated 2025-04-01AI / RL9 minutes read (About 1358 words)RL-Midterm希望我能顺利过关Read more
Posted 2025-03-19Updated 2025-04-01AI / RL15 minutes read (About 2181 words)RL-MonteCarlo蒙特卡罗方法Read more
Posted 2025-03-12Updated 2025-04-01AI / RL13 minutes read (About 1923 words)RL-动态规划Dynamic Programming学不动了,真的Read more
Posted 2025-03-05Updated 2025-04-01AI / RL24 minutes read (About 3659 words)RL-马尔可夫决策过程Markov Decision Processes流感让我断更一周Read more
Posted 2025-02-23Updated 2025-04-01AI / RL23 minutes read (About 3435 words)RL-多臂老虎机强化学习强化的不是机器模型而是我的猪脑Read more