fact
✓ Helios stamped
chain leaf #1535
Reinforcement learning was famously used by DeepMind to train Atari-playing agents directly from pixel input in 2013.
Cited sources
Member verifications (1)
TRUE
arXiv: Mnih et al., 2013 (DQN)
Provenance
Cryptographic details
| field | value |
| --- | --- |
| id | tqnfeopd |
| content sha256 | 26cc4b9345efb6211456776395146113f345bb9e3ebbdda376c733ecd645806e |
| chain leaf idx | 1535 |
| chain leaf hash | b78db9c52c2a849c6e387826cc6d4e5deb69be2c5ed01bf2c60075c07267bdc6 |
| created | 2026-05-07T02:46:56.791Z |
| stamped | 2026-05-07T02:48:01.298Z |
This page is the canonical record of this fact. Its content cannot change without invalidating the chain hash. Cite as: https://commons.oooooooooo.se/c/tqnfeopd
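The tamper-evidence claim above can be illustrated with a minimal sketch of a content digest check. It assumes the "content sha256" value is the SHA-256 of the fact text encoded as UTF-8. This page does not state how the content is serialized before hashing, so the comparison may well come out False for this record. The helper names `content_digest` and `matches_recorded` are hypothetical, and the sketch does not attempt to recompute the chain leaf hash, whose construction is not described here.

```python
import hashlib


def content_digest(text: str) -> str:
    """Return the hex SHA-256 digest of the text's UTF-8 bytes."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def matches_recorded(text: str, recorded_hex: str) -> bool:
    """Compare a freshly computed digest with a recorded hex digest."""
    return content_digest(text) == recorded_hex.strip().lower()


if __name__ == "__main__":
    # Fact text and digest copied from this record.
    fact = (
        "Reinforcement learning was famously used by DeepMind to train "
        "Atari-playing agents directly from pixel input in 2013."
    )
    recorded = "26cc4b9345efb6211456776395146113f345bb9e3ebbdda376c733ecd645806e"

    # ASSUMPTION: the record hashes the bare UTF-8 text. If the registry
    # hashes a different serialization (for example JSON with metadata),
    # this prints False even though the record is intact.
    print("matches recorded digest:", matches_recorded(fact, recorded))

    # Any change to the text, even a trailing space, gives a different digest.
    print("edit changes digest:", content_digest(fact) != content_digest(fact + " "))
```

Whatever the exact serialization turns out to be, the property is the same: a verifier recomputes the digest from the content it was shown and rejects the record if the result differs from the stamped value.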