Writer Profile

Soham Sharma

17 published articles across 10 categories

About

Soham Sharma contributes practical engineering and product insights on automation, AI, and modern web systems at Botmartz.

Contact: sohamnsharma@gmail.com

Published Posts

Latest insights and articles by Soham Sharma

Featured by Writer

Research Paper Deep Dive: RoPE (Rotary Position Embeddings) — Better Position Information

Standard position embeddings are added to the token embeddings and generalize poorly over long ranges. RoPE instead encodes position by rotation: the query (Q) and key (K) vectors are multiplied by position-dependent rotation matrices. This supports context windows of 100K+ tokens.

June 12, 2026

LLMs

Language Model Architectures: Transformers, Attention, and the Path from GPT-1 to GPT-4

Modern LLMs are Transformers. Understand the evolution: self-attention, positional encoding, scaling laws, and how each architectural change improved performance.

June 12, 2026

Published Posts

Research Paper Deep Dive: RoPE (Rotary Position Embeddings) — Better Position Information

Language Model Architectures: Transformers, Attention, and the Path from GPT-1 to GPT-4

AI Agent Fundamentals: Decision-Making Loops, Tools, and Agentic vs. Procedural Reasoning

Mamba: State Space Models and the Alternative to Transformer Attention

Research Paper Deep Dive: Flash Attention 2 — Optimizing Transformer Attention

Optimizer Comparison: SGD, Momentum, Adam, RMSprop, and When Each Shines

Vision Transformers (ViT): Image Classification with Pure Transformers

Custom Training Loops with GradientTape: Manual Forward and Backward Passes in TensorFlow

Rotary Positional Embeddings (RoPE): How It Works and Why It Beats Learned Embeddings

PyTorch Custom Dataset and DataLoader: __getitem__, __len__, collate_fn, and num_workers

Working with LLMs and Chat Models in LangChain: OpenAI, Anthropic, and Local Models via Ollama

Keras Sequential vs Functional vs Subclassing: When to Use Which API

PyTorch Autograd Internals: Computation Graphs, retain_graph, grad_fn Chain, and detach

LangChain Prompt Templates and Output Parsers: PromptTemplate, ChatPromptTemplate, and Pydantic Parsers

TensorFlow 2.x Architecture: Eager Execution, tf.function, AutoGraph, and Graphs

PyTorch Tensors Deep Dive: dtypes, Device Movement, Memory Layout, and Broadcasting

LangChain Architecture Overview: Chains, Runnables, LCEL, and the New vs Old API
