Botmartz Logo
Weekly
Soham Sharma
Writer Profile

Soham Sharma

17 published articles across 10 categories

About

Soham Sharma contributes practical engineering and product insights on automation, AI, and modern web systems at Botmartz.

Contact: sohamnsharma@gmail.com

Published

17

Live articles

Categories

10

Covered topics

Latest Post

June 12, 2026

Last published date

Published Posts

Latest insights and articles by Soham Sharma

Research Paper Deep Dive: RoPE (Rotary Position Embeddings) — Better Position Information
Featured by Writer

Research Paper Deep Dive: RoPE (Rotary Position Embeddings) — Better Position Information

Standard position embeddings are additive and have poor long-range generalization. RoPE embeds positions via rotation: multiply Q, K by rotation matrices. Enables 100K+ token context.

June 12, 2026

Language Model Architectures: Transformers, Attention, and the Path from GPT-1 to GPT-4
LLMs

Language Model Architectures: Transformers, Attention, and the Path from GPT-1 to GPT-4

Modern LLMs are Transformers. Understand the evolution: self-attention, positional encoding, scaling laws, and how each architectural change improved performance.

June 12, 2026

Read article →
AI Agent Fundamentals: Decision-Making Loops, Tools, and Agentic vs. Procedural Reasoning
Agents

AI Agent Fundamentals: Decision-Making Loops, Tools, and Agentic vs. Procedural Reasoning

Agents make autonomous decisions by reasoning, planning, and calling tools. Understand the perception-decision-action loop and when to use agents vs. deterministic workflows.

June 12, 2026

Read article →
Mamba: State Space Models and the Alternative to Transformer Attention
Advanced Models

Mamba: State Space Models and the Alternative to Transformer Attention

Transformers require O(n²) attention. Mamba uses state space models for O(n) complexity with better scaling. Understand selective SSMs and why Mamba matches transformer quality at 1/5 the memory.

June 12, 2026

Read article →
Research Paper Deep Dive: Flash Attention 2 — Optimizing Transformer Attention
Research Explained

Research Paper Deep Dive: Flash Attention 2 — Optimizing Transformer Attention

Flash Attention achieves 2-4× speedup on attention by changing memory access patterns. Understand I/O complexity, tiling, and how to optimize matrix operations on GPUs.

June 8, 2026

Read article →
Optimizer Comparison: SGD, Momentum, Adam, RMSprop, and When Each Shines
Optimization

Optimizer Comparison: SGD, Momentum, Adam, RMSprop, and When Each Shines

Different optimizers suit different problems. SGD is stable, Momentum accelerates, Adam is adaptive. Understand why each optimizer works and pick the right one.

June 8, 2026

Read article →
Vision Transformers (ViT): Image Classification with Pure Transformers
Models

Vision Transformers (ViT): Image Classification with Pure Transformers

Vision Transformers apply Transformers to image classification. Patch embeddings convert images to sequences, enabling the same architecture as NLP models.

June 8, 2026

Read article →
Custom Training Loops with GradientTape: Manual Forward and Backward Passes in TensorFlow
TensorFlow

Custom Training Loops with GradientTape: Manual Forward and Backward Passes in TensorFlow

model.fit() hides the training loop. GradientTape exposes it. Use it when you need per-batch gradient manipulation, custom loss combinations, or training dynamics that Keras callbacks can't express.

June 3, 2026

Read article →
Rotary Positional Embeddings (RoPE): How It Works and Why It Beats Learned Embeddings
Research

Rotary Positional Embeddings (RoPE): How It Works and Why It Beats Learned Embeddings

RoPE encodes position by rotating query and key vectors in complex space. It extrapolates beyond training length, transfers across fine-tuning, and adds zero parameters — here's the math.

June 3, 2026

Read article →
PyTorch Custom Dataset and DataLoader: __getitem__, __len__, collate_fn, and num_workers
PyTorch

PyTorch Custom Dataset and DataLoader: __getitem__, __len__, collate_fn, and num_workers

DataLoader is more than a loop — it's a parallel data pipeline. Build a correct Dataset, write a proper collate_fn, and understand num_workers to eliminate training bottlenecks.

June 3, 2026

Read article →
Working with LLMs and Chat Models in LangChain: OpenAI, Anthropic, and Local Models via Ollama
LangChain

Working with LLMs and Chat Models in LangChain: OpenAI, Anthropic, and Local Models via Ollama

LangChain wraps every LLM provider behind the same Runnable interface. Swap OpenAI for Claude or a local Llama model without changing a line of your chain logic.

June 2, 2026

Read article →
Keras Sequential vs Functional vs Subclassing: When to Use Which API
TensorFlow

Keras Sequential vs Functional vs Subclassing: When to Use Which API

Keras gives you three model-building APIs. Sequential is a dead end for anything non-trivial. Functional handles 90% of production architectures. Subclassing gives you full control when you need it.

April 27, 2026

Read article →
PyTorch Autograd Internals: Computation Graphs, retain_graph, grad_fn Chain, and detach
PyTorch

PyTorch Autograd Internals: Computation Graphs, retain_graph, grad_fn Chain, and detach

Autograd is not magic — it's a directed acyclic graph of Function nodes. Understand how gradients flow, when retain_graph matters, and how detach prevents gradient leaks.

April 27, 2026

Read article →
LangChain Prompt Templates and Output Parsers: PromptTemplate, ChatPromptTemplate, and Pydantic Parsers
LangChain

LangChain Prompt Templates and Output Parsers: PromptTemplate, ChatPromptTemplate, and Pydantic Parsers

Prompt templates and output parsers are the input and output contracts of your LLM pipeline. Build them wrong and your chain breaks on every edge case.

April 27, 2026

Read article →
TensorFlow 2.x Architecture: Eager Execution, tf.function, AutoGraph, and Graphs
TensorFlow

TensorFlow 2.x Architecture: Eager Execution, tf.function, AutoGraph, and Graphs

TensorFlow 2.x made eager execution the default, but tf.function and AutoGraph still power production deployments. Understand when and how graphs take over.

April 20, 2026

Read article →
PyTorch Tensors Deep Dive: dtypes, Device Movement, Memory Layout, and Broadcasting
PyTorch

PyTorch Tensors Deep Dive: dtypes, Device Movement, Memory Layout, and Broadcasting

Tensors are the foundation of every PyTorch model. Master dtypes, device movement, memory layout, and broadcasting rules to eliminate hours of debugging.

April 20, 2026

Read article →
LangChain Architecture Overview: Chains, Runnables, LCEL, and the New vs Old API
LangChain

LangChain Architecture Overview: Chains, Runnables, LCEL, and the New vs Old API

LangChain's architecture changed fundamentally with v0.1. Learn chains, runnables, and LCEL so you build on the current API — not the deprecated one.

April 20, 2026

Read article →
Stay Ahead

The Intelligence
Briefing

Weekly dispatches on AI automation, technical deep-dives, and perspectives from the frontier—delivered straight to your inbox.

No spam, ever. Unsubscribe in one click.