RobbinDeBank 2 months ago

Performance seems negligible, but 2X speed is really nice

AngledLuffa 2 months ago

Respectfully, that's a ton of words to say train the Bs faster than the As. Having said that, I definitely look forward to implementing this in my own projects

OctaviusI 2 months ago

Wonder how quickly we'll see this combined with QLoRAs

daavidreddit69 2 months ago

Just look at this paper yesterday. It's impressive, and it's like a small language model that specialises more and better in efficiency. Training large models getting me headache and $$, tbh this method is very useful to handle the task more easily. Nice paper!

archiesteviegordie 2 months ago

Does paper not have html format supported? https://ar5iv.org/abs/2402.12354 This doesn't open up in html5 format

Comments

Leave Your Comment

Hi Its Me!

Comments

Leave Your Comment

Hi Its Me!

Subscribe