BERT — Bidirectional Encoder Representations

SBERT stands for Bidirectional Encoder Representations from Transformers.

Saman .E

5 Min read

BERT uses a type of deep learning architecture called Transformers, which allows it to understand the context and meaning of words in a sentence or passage. The model is pre-trained on large amounts of text data, such as Wikipedia articles, to learn the relationships between words and sentences. ...

Self-Attention in Machine learning

Self-attention is what you need!

Saman .E

5 Min read

Self-attention is a mechanism that allows a neural network to selectively weigh the importance of different parts of an input sequence when making predictions or generating outputs. It is a key component of transformer models, which have revolutionized the field of natural language processing (NLP) in recent years. However, ...

All about loss functions in machine learning

In machine learning, a loss function is a mathematical function that measures how well a machine learning model is able to make predictions.

Saman .E

7 Min read

The loss function compares the predicted output of the model to the true output and produces a score that indicates how different the two are. The goal of a machine learning model is to minimize this difference, or “loss”, in order to make accurate predictions. The choice of a loss ...

Introducing Rock-Paper- Scissors Dataset

The rock-paper-scissors dataset is a classic dataset used for computer vision and machine learning purposes.

Saman .E

5 Min read

An image dataset is a collection of digital images that are organized and labeled for use in machine learning and computer vision tasks. These datasets are used to train and test computer vision algorithms, such as object recognition, image classification, and segmentation. The images in an image ...

Image datasets for developing ML algorithms

Introducing well-known Image datasets for developing machine learning algorithms.

Saman .E

3 Min read

Image data refers to a collection of digital images that are stored and processed by computers. MNIST is a commonly used dataset for image classification tasks in machine learning and computer vision. It consists of 70,000 grayscale images of handwritten digits (0 9) and their corresponding labels. These images are pre-processed and ...