@steverab
๐ฃ Excited to share my first work @Princeton : ๐ง๐ผ๐๐ฎ๐ฟ๐ฑ๐ ๐ฎ ๐ฆ๐ฐ๐ถ๐ฒ๐ป๐ฐ๐ฒ ๐ผ๐ณ ๐๐ ๐๐ด๐ฒ๐ป๐ ๐ฅ๐ฒ๐น๐ถ๐ฎ๐ฏ๐ถ๐น๐ถ๐๐ AI agents keep getting more capable. But are they actually reliable? ๐ Paper: https://t.co/1CvygFLdct ๐ Dashboard: https://t.co/C1EfoMyaS8 ๐งต๐ https://t.co/KvPJSVgl76