
Albumentations


Albumentations
Original author(s)	Alexander Buslaev, Alex Parinov, Vladimir I. Iglovikov, Eugene Khvedchenya, Mikhail Druzhinin
Initial release	June 2018^[1]
Repository	github.com/albumentations-team/albumentations
Written in	Python
Operating system	Linux, macOS, Windows
Available in	English
Type	Library for machine learning and deep learning
License	MIT^[2]
Website	albumentations.ai


Albumentations is an open-source image augmentation library created in June 2018 by a group of researchers and engineers, including Alexander Buslaev, Vladimir Iglovikov, and Alex Parinov. It was designed to provide a flexible and efficient framework for data augmentation in computer vision tasks.

Data augmentation is a technique that artificially expands a dataset by creating new images from existing ones through transformations such as rotation, scaling, flipping, and color adjustments. Training on these varied examples helps machine learning models perform better on unseen data.
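
As a rough illustration of the idea (not of Albumentations itself), the sketch below expands a one-image dataset using two of the transformations named above: flipping and a brightness adjustment. Images are plain nested lists of grayscale values, and the helper names `horizontal_flip`, `adjust_brightness`, and `augment` are hypothetical.

```python
def horizontal_flip(image):
    """Mirror each row left to right."""
    return [row[::-1] for row in image]


def adjust_brightness(image, delta):
    """Add delta to every pixel, clamping to the 0-255 range."""
    return [[max(0, min(255, px + delta)) for px in row] for row in image]


def augment(dataset):
    """Return each original image plus a flipped and a brightened copy."""
    expanded = []
    for image in dataset:
        expanded.append(image)
        expanded.append(horizontal_flip(image))
        expanded.append(adjust_brightness(image, 40))
    return expanded


# A single 2x3 grayscale "image" (hypothetical toy data)
dataset = [[[10, 20, 30], [200, 220, 240]]]
expanded = augment(dataset)
print(len(expanded))  # 3
```

One source image yields three training examples here. A real pipeline would usually apply such transformations randomly at training time rather than storing every variant.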

Built on top of OpenCV, a widely used computer vision library, Albumentations provides high-performance implementations of various image processing functions. It also offers a rich set of image transformation functions and a simple API for combining them, allowing users to create custom augmentation pipelines tailored to their specific needs.^[3]

Adoption

Since its introduction in 2018, Albumentations has been widely adopted in academic research, open-source projects, and machine learning competitions.

The library's research paper, "Albumentations: Fast and Flexible Image Augmentations", had received over 1,000 citations as of March 2023.^[4] At the same date, over 12,000 packages were listed as depending on the library on its GitHub dependents page.^[5]

Albumentations has also been used in many winning solutions for computer vision competitions, including the Deepfake Detection Challenge on Kaggle, which carried a prize of 1 million dollars.^[6]

Example

The following program declares an augmentation pipeline and applies it to an image read with OpenCV:

import albumentations as A
import cv2

# Declare an augmentation pipeline
transform = A.Compose([
    A.RandomCrop(width=256, height=256),
    A.HorizontalFlip(p=0.5),
    A.RandomBrightnessContrast(p=0.2),
])

# Read an image with OpenCV and convert it to the RGB colorspace
image = cv2.imread("image.jpg")
image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)

# Augment an image
transformed = transform(image=image)
transformed_image = transformed["image"]
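
The `p` arguments in the pipeline above are probabilities: on each call, a transform with `p=0.5` is applied with probability 0.5 and skipped otherwise. The pure-Python sketch below imitates that behaviour on a toy nested-list image. It is an illustration under simplifying assumptions, not the library's implementation, and `ToyCompose`, `flip`, and `invert` are hypothetical names.

```python
import random


def flip(image):
    """Mirror each row left to right."""
    return [row[::-1] for row in image]


def invert(image):
    """Invert 8-bit pixel values."""
    return [[255 - px for px in row] for row in image]


class ToyCompose:
    """Apply (transform, p) pairs in order, each with probability p."""

    def __init__(self, steps, seed=None):
        self.steps = steps
        self.rng = random.Random(seed)  # private generator, optional seed

    def __call__(self, image):
        for transform, p in self.steps:
            # random() is in [0, 1), so p=1.0 always applies, p=0.0 never
            if self.rng.random() < p:
                image = transform(image)
        # Mimic the dict-style result used in the example above
        return {"image": image}


image = [[0, 50, 100], [150, 200, 250]]

always = ToyCompose([(flip, 1.0), (invert, 1.0)])
print(always(image)["image"])  # [[155, 205, 255], [5, 55, 105]]

never = ToyCompose([(flip, 0.0), (invert, 0.0)])
print(never(image)["image"] == image)  # True
```

Because each step is gated independently, repeated calls on the same input can produce different outputs, which is what gives an augmentation pipeline its variety.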

References

1. ^ "First Commit". GitHub. 5 June 2018.

2. ^ "MIT License". GitHub.

3. ^ Alexander Buslaev; Vladimir Iglovikov; Alex Parinov; Eugene Khvedchenya; Alexandr A. Kalinin (2020). "Albumentations: Fast and Flexible Image Augmentations". Information. 11 (2). MDPI: 125. arXiv:1809.06839. doi:10.3390/info11020125.

4. ^ "Google Scholar - Albumentations: Fast and Flexible Image Augmentations". Google Scholar. Retrieved 2023-03-31.

5. ^ "Albumentations GitHub Dependents". GitHub. Retrieved 2023-03-31.

6. ^ "Albumentations - Who's Using?". Albumentations. Retrieved 2023-03-31.

