← Back to Blog
Soham Sharma
Writer Profile

Soham Sharma

1 published article across 1 category

About

Soham Sharma contributes practical engineering and product insights on automation, AI, and modern web systems at Botmartz.

Contact: sohamnsharma@gmail.com

Published

1

Live articles

Categories

1

Covered topics

Latest Post

April 17, 2026

Last published date

Published Posts

Latest insights and articles by Soham Sharma

Flash Attention 2: IO-Aware Exact Attention, Memory Math, and PyTorch Implementation
Featured by Writer

Flash Attention 2: IO-Aware Exact Attention, Memory Math, and PyTorch Implementation

Flash Attention 2 doesn't approximate attention — it reorders computation to minimize GPU memory reads and writes. Here's the math, the memory analysis, and a working implementation.

April 17, 2026