Transformer experiments

In this post, we collect our experiments with tranformer based models [repo]

Exploring TypeScript and Kotlin

In this post, we will explore TypeScript and Kotlin. TypeScript is a superset of JavaScript allowing static data types as an optional choice. Kotlin is a modern Android development language [repo]

Exploring LangChain and LlamaIndex

In this post, we will explore LangChain and LlamaIndex - two popular frameworks to help with application development using large language models in Python [repo]

Exploring Transformers and Diffusers

In this post, we will explore Transformers and Diffusers - two popular generative AI libraries by HuggingFace, both based on the transformer architecture in Python [repo]

AIreflections#7 - transformer architecture

Let us take a deep dive into the transformer architecture - key to the recent success of large language models.

Exploring NLTK and spaCy

In this post, we will explore NLTK (Natural Language Toolkit) and spaCy - two traditional libraries for statistics-based natural language processing in Python [repo]

Exploring scikit-learn, XGBoost, and pandas

In this post, we will explore scikit-learn, XGBoost, and pandas - three traditional machine learning libraries in Python [repo]

AIreflections#6 - bias-variance tradeoff

Let us revisit the bias-variance tradeoff - a topic of timeless intrigue in statistics and machine learning.

AIreflections#5 - bagging and boosting

Let us understand the concepts of bagging and boosting in machine learning.

AIreflections#4 - parameters and hyperparameters

Let us revisit the distinction between parameters and hyperparameters in ML models.

Exploring distributed training and inference - CUDA, DDP vs FSDP, ONNX, TensorRT, Triton

In this post, we will explore: i) Distributed training on GPUs based on torch.cuda and torch.distributed, ii) Model optimization based on onnx and torch_tensorrt, and iii) Distributed inference with the nvidia triton inference server [repo]

AIreflections#3 - implicit schema in databases

Let us appreciate a nice explanation of implicit schema in databases as one might explain to a 5-year-old.

Understanding non-relational and vector databases

In this post, we will try to gain an intuitive understanding of MongoDB, Elasticsearch, and Qdrant with examples [repo]

Practical SQL use in relational databases

In this post, we will learn how to use SQL to query a plant disease dataset loaded both as MySQL and PostgreSQL databases. We will also compare the capabilities of each database in general and specific to our example task.

Understanding Kafka, Terraform, and Jenkins

In this post, we will learn about Kafka, Terraform, and Jenkins - three well recognized tools to automate and accelerate software deployment pipelines.

Map of AWS, GCP, and Azure

In this post, we will learn about three popular cloud service platforms: Amazon Web Services (AWS), Google Cloud Platform (GCP), and Microsoft Azure. We hope to build up this post asynchronously as we explore more of this ecosystem.

AIreflections#2 - why are GPUs expensive?

Let us take a look at key factors that contribute to the high prices of GPUs.

Understanding API calling

In this post, we will see how APIs work. We will learn about two distinct architectural styles: REST and GraphQL. We will also see code examples of i) how REST APIs are implemented in Python using the Django REST Framework (DRF), and ii) how GraphQL APIs are implemented in Python with the Graphene library [repo]

Understanding Docker and Kubernetes

In this post, we hope to concisely introduce basic concepts in Docker and Kubernetes. We hope to get new learners interested in the exciting world of containerization and distributed systems.

AIreflections#1 - future of database search

Let us understand the prospects and challenges of using vector embeddings in database search. In the context of language, this is often called semantic search, although Google products today are evidence that semantic search is certainly not limited to text!

Exploring PyTorch and JAX

In this post, we hope to concisely introduce PyTorch and JAX - two prominent frameworks for deep learning [colab] [repo]

Understanding backpropogation

In this post, we will implement backpropogation using elementary arithmetic operations [colab] [repo]

Computational physics class

This term, I am taking a graduate class titled “Special Topics in Computational Physics”. I look forward to it, and hopefully sharing some nice things I learn.

First post

Welcome to my blog. I hope to communicate things I find interesting here.