Xu, 2025 - Google Patents

Image colorization based on transformer

Document ID: 1136735553153182542
Author: Xu M
Publication year: 2025
Publication venue: Scientific Reports

Snippet

This paper presents a transformer-based method for colorizing grayscale images. By employing a deep architecture with stacked encoder-decoder layers, the model effectively captures intricate features, significantly improving its expressive capacity. The encoder …

Continue reading at www.nature.com

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
- G06T3/40—Scaling the whole image or part thereof
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes

Similar Documents

Publication	Publication Date	Title
Zhao et al.	2020	Pixelated semantic colorization
Mirzaei et al.	2024	Watch your steps: Local image and scene editing by text instructions
Ganin et al.	2016	Deepwarp: Photorealistic image resynthesis for gaze manipulation
Fu et al.	2024	Low-light image enhancement base on brightness attention mechanism generative adversarial networks
Li et al.	2023	Image super-resolution reconstruction based on multi-scale dual-attention
Yan et al.	2022	PCNet: Partial convolution attention mechanism for image inpainting
Li et al.	2022	D2c-sr: A divergence to convergence approach for real-world image super-resolution
Kim et al.	2018	A multi-purpose convolutional neural network for simultaneous super-resolution and high dynamic range image reconstruction
Jin et al.	2023	Image colorization using deep convolutional auto-encoder with multi-skip connections
Stival et al.	2023	Survey on video colorization: concepts, methods and applications
Wang et al.	2025	Optimized UNet framework with a joint loss function for underwater image enhancement
Xu	2025	Image colorization based on transformer
Karthikeyan et al.	2025	Attention-based lightweight deep hybrid CNN framework for image restoration
Bugeau et al.	2023	Influence of color spaces for deep learning image colorization
Hu et al.	2023	Prediction of broken areas in murals based on MLP-fused long-range semantics
CN118608437A (en)	2024-09-06	A Generative Adversarial Network Underwater Image Enhancement Model Based on Improved Swin Transformer
Zhu et al.	2024	Photorealistic attention style transfer network for architectural photography photos
Wang et al.	2023	Flow learning based dual networks for low-light image enhancement
Park et al.	2023	Dual-color space network with global priors for photo retouching
Yu et al.	2024	AGG: attention-based gated convolutional GAN with prior guidance for image inpainting
Gong et al.	2025	SA-LUT: spatial adaptive 4D look-up table for photorealistic style transfer
Liu et al.	2021	Deliberation on object-aware video style transfer network with long–short temporal and depth-consistent constraints
Reyes-Saldana et al.	2022	Deep Variational Method with Attention for High-Definition Face Generation
Perla et al.	2023	Low Light Image Illumination Adjustment Using Fusion of MIRNet and Deep Illumination Curves
Tang et al.	2025	Ipdm: identity preserving diffusion model for face sketch and photo synthesis