Lu et al., 2022 - Google Patents

Frozen pretrained transformers as universal computation engines

Lu et al., 2022

Document ID: 17880037628825399557
Author: Lu K; Grover A; Abbeel P; Mordatch I
Publication year: 2022
Publication venue: Proceedings of the AAAI conference on artificial intelligence

External Links

Cited by

Snippet

We investigate the capability of a transformer pretrained on natural language to generalize to other modalities with minimal finetuning--in particular, without finetuning of the self- attention and feedforward layers of the residual blocks. We consider such a model, which we …

Continue reading at ojs.aaai.org (PDF) (other versions)

102000004169 proteins and genes 0 abstract description 9

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6232—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
- G06K9/6247—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G06N3/063—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
- G06N3/0635—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means using analogue means
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis

Similar Documents

Publication	Publication Date	Title
Lu et al.	2022	Frozen pretrained transformers as universal computation engines
Gao et al.	2022	Adamixer: A fast-converging query-based object detector
Sarwar et al.	2017	Gabor filter assisted energy efficient fast learning convolutional neural networks
Khalil et al.	2022	Designing novel AAD pooling in hardware for a convolutional neural network accelerator
Pu et al.	2024	Fine-grained recognition with learnable semantic data augmentation
Venugopalan et al.	2021	Applying deep neural networks for the automatic recognition of sign language words: A communication aid to deaf agriculturists
US20220036231A1 (en)	2022-02-03	Method and device for processing quantum data
Chen et al.	2018	Learning linear regression via single-convolutional layer for visual object tracking
Nie et al.	2019	A multi-stage convolution machine with scaling and dilation for human pose estimation
Wu et al.	2024	An economic framework for 6-dof grasp detection
Zhang et al.	2024	SiT-MLP: A simple MLP with point-wise topology feature learning for skeleton-based action recognition
Chen et al.	2023	Deep data augmentation for weed recognition enhancement: A diffusion probabilistic model and transfer learning based approach
Hu et al.	2024	Demystify transformers & convolutions in modern image deep networks
Cao et al.	2023	Ghostvit: Expediting vision transformers via cheap operations
Jin et al.	2025	Groupwise label enhancement broad learning system for image classification
Qiu et al.	2023	Semantic-visual guided transformer for few-shot class-incremental learning
Mai et al.	2024	From efficient multimodal models to world models: A survey
Hossain et al.	2021	Convolutional neural network based skin cancer detection (Malignant vs Benign)
Zand et al.	2023	Flow-based spatio-temporal structured prediction of motion dynamics
Dumortier et al.	2022	Petribert: Augmenting bert with tridimensional encoding for inverse protein folding and design
Zhou et al.	2024	Training-free transformer architecture search with zero-cost proxy guided evolution
Zhang et al.	2022	Bidirectional parallel feature pyramid network for object detection
Zahidi et al.	2021	Active learning for crop-weed discrimination by image classification from convolutional neural network’s feature pyramid levels
Im et al.	2024	Blend AutoAugment: Automatic Data Augmentation for Image Classification Using Linear Blending
Dehban et al.	2022	Learning deep features for robotic inference from physical interactions