[go: up one dir, main page]

Xu, 2025 - Google Patents

Image colorization based on transformer

Xu, 2025

View HTML
Document ID
1136735553153182542
Author
Xu M
Publication year
Publication venue
Scientific Reports

External Links

Snippet

This paper presents a transformer-based method for colorizing grayscale image. By employing a deep architecture with stacked encoder-decoder layers, the model effectively captures intricate features, significantly improving its expressive capacity. The encoder …
Continue reading at www.nature.com (HTML) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30244Information retrieval; Database structures therefor; File system structures therefor in image databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30861Retrieval from the Internet, e.g. browsers
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30781Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F17/30784Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
    • G06F17/30799Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
    • G06T3/40Scaling the whole image or part thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00221Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46Extraction of features or characteristics of the image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/002D [Two Dimensional] image generation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00General purpose image data processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes

Similar Documents

Publication Publication Date Title
Zhao et al. Pixelated semantic colorization
Mirzaei et al. Watch your steps: Local image and scene editing by text instructions
Ganin et al. Deepwarp: Photorealistic image resynthesis for gaze manipulation
Fu et al. Low-light image enhancement base on brightness attention mechanism generative adversarial networks
Li et al. Image super-resolution reconstruction based on multi-scale dual-attention
Yan et al. PCNet: Partial convolution attention mechanism for image inpainting
Li et al. D2c-sr: A divergence to convergence approach for real-world image super-resolution
Kim et al. A multi-purpose convolutional neural network for simultaneous super-resolution and high dynamic range image reconstruction
Jin et al. Image colorization using deep convolutional auto-encoder with multi-skip connections
Stival et al. Survey on video colorization: concepts, methods and applications
Wang et al. Optimized UNet framework with a joint loss function for underwater image enhancement
Xu Image colorization based on transformer
Karthikeyan et al. Attention-based lightweight deep hybrid CNN framework for image restoration
Bugeau et al. Influence of color spaces for deep learning image colorization
Hu et al. Prediction of broken areas in murals based on MLP-fused long-range semantics
CN118608437A (en) A Generative Adversarial Network Underwater Image Enhancement Model Based on Improved Swin Transformer
Zhu et al. Photorealistic attention style transfer network for architectural photography photos
Wang et al. Flow learning based dual networks for low-light image enhancement
Park et al. Dual-color space network with global priors for photo retouching
Yu et al. AGG: attention-based gated convolutional GAN with prior guidance for image inpainting
Gong et al. SA-LUT: spatial adaptive 4D look-up table for photorealistic style transfer
Liu et al. Deliberation on object-aware video style transfer network with long–short temporal and depth-consistent constraints
Reyes-Saldana et al. Deep Variational Method with Attention for High-Definition Face Generation
Perla et al. Low Light Image Illumination Adjustment Using Fusion of MIRNet and Deep Illumination Curves
Tang et al. Ipdm: identity preserving diffusion model for face sketch and photo synthesis