@LiorOnAI
You can now transform LLMs into diffusion models. dLLM released an open recipe that converts any autoregressive model into a diffusion LLM. How the conversion works: 1. Remove the causal mask and enable bidirectional attention 2. Mask random tokens and train the model to fill the gaps 3.Add light supervised training to stabilize outputs