AI Character Consistency x Memory - 25
References
HCI
- Chatbots + Speech Processing
Some parellel Attention calculation
- 📍 MEMORY - Transformers vs. RNN / LSTM
- Add Reflection - 2024 - You Only Cache Once: Decoder-Decoder Architectures for Language Models
- RetNet - Retention Network -> Gated Retention
- 2023 - RetNet: Retinal Disease Detection using Convolutional Neural Network
- DeltaNet - 2025 - Parallelizing Linear Transformers with the Delta Rule over Sequence Length
Enjoy Reading This Article?
Here are some more articles you might like to read next: