memorism's Highlights on '97% Accuracy with ViT on 90 Animal Dataset; A comparative study Vision Transformers vs.' | Glasp