- CS285
- sliding video q-former
- CNN
- Artificial Intelligence
- memory bank
- tensorflow
- Github
- jmeter
- vision-language-action
- Server
- Kaggle
- leetcode
- hackerrank
- Terminology
- MySQL
- Coding test
- autogluon
- multimodal machine learning
- long video understanding
- quantification
- transference
- 백준
- Python
- ma-lmm
- error
- Anaconda
- deeprl
- LeNet-5
- Linux
- Reinforcement Learning
List: Robotics (3)
Juni_DEV
Goal of this course: train an agent to perform useful tasks.

What is PyTorch? A Python library for:
- Defining neural networks
- Automatically computing gradients
- And more! (datasets, optimizers, GPUs, etc.)

Numpy: http://colab.research.google.com/drive/12nQiv6aZHXNuCfAAuTjJenDWKQbIt2Mz#scrollTo=U5rl_7Kx5vk8
PyTorch Basics: https://colab.research.google.com/drive/1hIVRi1fb7baLKoPw9PNqbkhpjLmv9DaA#scrollTo=xkOqj3t3ov..

Terminology & notation

Markov property (Very Very Important!!!): if you know the state S2 and need to figure out the state S3, then S1 gives you no additional information. In other words, S3 is conditionally independent of S1 given S2. If you know the current state, the past states do not matter, because the current state already tells you everything about the state of the world. The present fully represents the entire past → the future..
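
The conditional-independence statement above can be checked numerically on a toy example. The sketch below is not from the lecture: the two-state chain, its probabilities, and all function names are hypothetical, chosen only to show that P(S3 | S2, S1) equals P(S3 | S2) when the joint distribution factorizes as a Markov chain.

```python
from itertools import product

# Hypothetical two-state Markov chain (states 0 and 1).
# All numbers are illustrative assumptions, not taken from the course.
INITIAL = [0.6, 0.4]          # P(S1 = s)
TRANSITION = [[0.9, 0.1],     # P(next = . | current = 0)
              [0.3, 0.7]]     # P(next = . | current = 1)
STATES = (0, 1)


def joint(s1, s2, s3):
    """Joint probability P(S1, S2, S3) under the Markov factorization."""
    return INITIAL[s1] * TRANSITION[s1][s2] * TRANSITION[s2][s3]


def p_s3_given_s2_s1(s3, s2, s1):
    """P(S3 | S2, S1), computed from the joint distribution."""
    denom = sum(joint(s1, s2, x) for x in STATES)
    return joint(s1, s2, s3) / denom


def p_s3_given_s2(s3, s2):
    """P(S3 | S2), with S1 marginalized out of the joint distribution."""
    num = sum(joint(a, s2, s3) for a in STATES)
    denom = sum(joint(a, s2, x) for a in STATES for x in STATES)
    return num / denom


def markov_property_holds(tol=1e-12):
    """True if knowing S1 never changes the prediction of S3 once S2 is known."""
    return all(
        abs(p_s3_given_s2_s1(s3, s2, s1) - p_s3_given_s2(s3, s2)) < tol
        for s1, s2, s3 in product(STATES, repeat=3)
    )


if __name__ == "__main__":
    print(markov_property_holds())
```

Both conditionals are derived from the same joint table rather than read off `TRANSITION` directly, so the check actually exercises the claim that conditioning on S1 adds nothing.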

I’ve finally picked up the CS285 reinforcement learning lectures again, after putting them off for a while... Let’s get it!

What is reinforcement learning?
- Mathematical formalism for learning-based decision making
- Approach for learning decision making and control from experience

How is this different from other machine learning topics?
Standard (supervised) machine learning usually assumes: i.i.d. data, know..