@suchenzang
made the mistake of continuing down this rabbit 🕳️ apparently babistories itself is a synthetically generated dataset from the memory mosaics paper (https://t.co/lYKd4dPsBH), which is a synthetic copy of tinystories, which itself is also synthetically generated so the "strong… https://t.co/gN4bUzwJDT