@_wenlixiao
What if robots could improve themselves by learning from their own failures in the real-world? Introducing ๐ฃ๐๐ (๐ฃ๐ฟ๐ผ๐ฏ๐ฒ, ๐๐ฒ๐ฎ๐ฟ๐ป, ๐๐ถ๐๐๐ถ๐น๐น) โ a recipe that enables Vision-Language-Action (VLA) models to self-improve for high-precision manipulation tasks. PLD couples real-world residual reinforcement learning with standard supervised fine-tuning โ letting robots discover, recover, and distill their own data flywheel. Quick ๐งต