Image colorization based on transformer
Xu, 2025
- Document ID
- 1136735553153182542
- Author
- Xu M
- Publication year
- 2025
- Publication venue
- Scientific Reports
Snippet
This paper presents a transformer-based method for colorizing grayscale images. By employing a deep architecture with stacked encoder-decoder layers, the model captures intricate features, significantly improving its expressive capacity. The encoder …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
- G06T3/40—Scaling the whole image or part thereof
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Zhao et al. | Pixelated semantic colorization | |
| Mirzaei et al. | Watch your steps: Local image and scene editing by text instructions | |
| Ganin et al. | Deepwarp: Photorealistic image resynthesis for gaze manipulation | |
| Fu et al. | Low-light image enhancement base on brightness attention mechanism generative adversarial networks | |
| Li et al. | Image super-resolution reconstruction based on multi-scale dual-attention | |
| Yan et al. | PCNet: Partial convolution attention mechanism for image inpainting | |
| Li et al. | D2c-sr: A divergence to convergence approach for real-world image super-resolution | |
| Kim et al. | A multi-purpose convolutional neural network for simultaneous super-resolution and high dynamic range image reconstruction | |
| Jin et al. | Image colorization using deep convolutional auto-encoder with multi-skip connections | |
| Stival et al. | Survey on video colorization: concepts, methods and applications | |
| Wang et al. | Optimized UNet framework with a joint loss function for underwater image enhancement | |
| Xu | Image colorization based on transformer | |
| Karthikeyan et al. | Attention-based lightweight deep hybrid CNN framework for image restoration | |
| Bugeau et al. | Influence of color spaces for deep learning image colorization | |
| Hu et al. | Prediction of broken areas in murals based on MLP-fused long-range semantics | |
| CN118608437A (en) | A Generative Adversarial Network Underwater Image Enhancement Model Based on Improved Swin Transformer | |
| Zhu et al. | Photorealistic attention style transfer network for architectural photography photos | |
| Wang et al. | Flow learning based dual networks for low-light image enhancement | |
| Park et al. | Dual-color space network with global priors for photo retouching | |
| Yu et al. | AGG: attention-based gated convolutional GAN with prior guidance for image inpainting | |
| Gong et al. | SA-LUT: spatial adaptive 4D look-up table for photorealistic style transfer | |
| Liu et al. | Deliberation on object-aware video style transfer network with long–short temporal and depth-consistent constraints | |
| Reyes-Saldana et al. | Deep Variational Method with Attention for High-Definition Face Generation | |
| Perla et al. | Low Light Image Illumination Adjustment Using Fusion of MIRNet and Deep Illumination Curves | |
| Tang et al. | Ipdm: identity preserving diffusion model for face sketch and photo synthesis |