- Attention Is All You Need
  Paper • 1706.03762 • Published • 122
- FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
  Paper • 2307.08691 • Published • 9
- Mixtral of Experts
  Paper • 2401.04088 • Published • 160
- Mistral 7B
  Paper • 2310.06825 • Published • 58
Snehasish Barman
sbarman25
AI & ML interests
Machine Learning for Health, AI, Distributed Systems
Recent Activity
upvoted an article 18 days ago
ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use
updated a collection about 1 month ago
Audio Stuff
liked a model about 1 month ago
RoyalCities/Foundation-1
Organizations
None yet