@vanstriendaniel
Can LLMs help with real-world GitHub issues? š¢ SWE-bench Dataset from @princeton-nlp is on @huggingface Hub šÆ Focus: Automated GitHub issue resolution š Content: 2,294 Issue-PR pairs from 12 Python repos š Evaluation: Unit test verification using post-PR behavior https://t.co/z0eQFmVp3v