AI

From CNNs to Vision Transformers: The Future of Image Recognition
·6015 words·29 mins
Vision Transformers (ViTs) are redefining image recognition by using Transformer models to capture global context, unlike traditional Convolutional Neural Networks (CNNs) that focus on local features. ViTs excel with large datasets and show impressive scalability and performance.
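As a taste of the core idea, here is a minimal sketch (illustrative, not code from the post) of how a ViT flattens an image into a sequence of patch tokens, so self-attention can relate any patch to any other regardless of distance:

```python
import numpy as np

# Toy image: 224x224 RGB, split into 16x16 patches (the standard ViT-Base setting).
img = np.random.rand(224, 224, 3)
P = 16
H, W, C = img.shape

# Reshape into a sequence of flattened patches: (196, 768).
patches = img.reshape(H // P, P, W // P, P, C).swapaxes(1, 2).reshape(-1, P * P * C)
print(patches.shape)  # (196, 768) -- every patch token can attend to every other
```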
ImageNet: Computer Vision Backbone
·1065 words·5 mins
ImageNet is more than just a dataset. Its sheer scale, combined with its detailed labeling, made it essentially the backbone of computer vision.
Transformers & Attention
·866 words·5 mins
This blog post explains how self-attention and softmax function in Transformer models, crucial for modern NLP. It breaks down how self-attention helps models understand relationships between tokens and how softmax ensures efficient computation and numerical stability.
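For a concrete picture, here is a minimal NumPy sketch of single-head scaled dot-product self-attention, including the max-subtraction trick that keeps softmax numerically stable (an illustrative implementation with hypothetical weight shapes, not code from the post):

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row max before exponentiating for numerical stability.
    z = x - x.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X, Wq, Wk, Wv):
    # X: (seq_len, d_model); Wq/Wk/Wv: (d_model, d_k) projection matrices.
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[-1])  # how strongly each token attends to each other token
    return softmax(scores) @ V

rng = np.random.default_rng(0)
X = rng.normal(size=(5, 8))                    # 5 tokens, d_model = 8
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)     # (5, 8)
```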
Diffusion vs. Auto-Regressive Models
·1085 words·6 mins
Generative AI has come a long way, producing stunning images from simple text prompts. But how do Diffusion and Auto-Regressive models work, and why are diffusion models preferred?
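As a hint of how the diffusion side works, here is a sketch of the closed-form DDPM-style forward (noising) step; `alpha_bar_t` stands for the cumulative noise-schedule term, an assumption of this sketch rather than notation from the post:

```python
import numpy as np

def noise_sample(x0, alpha_bar_t, rng):
    # One closed-form forward step: x_t = sqrt(a_bar)*x_0 + sqrt(1 - a_bar)*eps.
    eps = rng.normal(size=x0.shape)  # Gaussian noise
    return np.sqrt(alpha_bar_t) * x0 + np.sqrt(1 - alpha_bar_t) * eps

rng = np.random.default_rng(0)
x0 = rng.random((32, 32, 3))          # toy "image"
x_half = noise_sample(x0, 0.5, rng)   # partially noised; the model learns to reverse this
```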
AlexNet Revolution
·1304 words·7 mins
In 2012, the field of artificial intelligence witnessed a seismic shift. The catalyst for this transformation was a deep learning model known as AlexNet.
Generative Adversarial Network
·753 words·4 mins
A neural network is like a highly sophisticated, multi-layered calculator that learns from data. It consists of numerous “neurons” (tiny calculators) connected in layers, with each layer performing a unique function to help the network make predictions or decisions.
Variational-Auto-Encoder
·729 words·4 mins
The beauty of VAEs lies in their ability to generate new samples: randomly sample vectors from a known region of the latent space and pass them through the decoder, the generator part of the model.
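In code, that generation step is strikingly small. A hedged sketch, assuming a trained `decoder` network and a 2-D latent space (both hypothetical here):

```python
import torch
import torch.nn as nn

# Hypothetical decoder: maps a latent vector z back to data space (784 = flattened 28x28).
decoder = nn.Sequential(nn.Linear(2, 128), nn.ReLU(), nn.Linear(128, 784), nn.Sigmoid())

# Generation: sample z from the standard normal prior the VAE was trained
# to match, then decode it into a new sample.
z = torch.randn(16, 2)    # 16 random points in the latent space
samples = decoder(z)      # 16 new samples
```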
Auto-Encoder
·545 words·3 mins
An autoencoder begins its journey by compressing input data into a lower dimension. It then endeavors to reconstruct the original input from this compressed representation.
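A minimal PyTorch sketch of that compress-then-reconstruct loop (illustrative dimensions, assuming flattened 28x28 inputs):

```python
import torch
import torch.nn as nn

class AutoEncoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(784, 32), nn.ReLU())     # compress to 32 dims
        self.decoder = nn.Sequential(nn.Linear(32, 784), nn.Sigmoid())  # reconstruct the input

    def forward(self, x):
        return self.decoder(self.encoder(x))

model = AutoEncoder()
x = torch.rand(8, 784)                        # batch of flattened inputs
loss = nn.functional.mse_loss(model(x), x)    # reconstruction error drives training
```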
Softmax
·1713 words·9 mins
Softmax stands as a pivotal component in neural network architectures, offering a means to convert raw scores into interpretable probabilities.
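For reference, the standard definition (not quoted from the post):

$$\operatorname{softmax}(z)_i = \frac{e^{z_i}}{\sum_j e^{z_j}}$$

so raw scores $(2.0,\ 1.0,\ 0.1)$ map to probabilities of roughly $(0.66,\ 0.24,\ 0.10)$.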