Tu et al., 2023
Consistent 3d hand reconstruction in video via self-supervised learning
- Document ID
- 7326407718636395328
- Author
- Tu Z
- Huang Z
- Chen Y
- Kang D
- Bao L
- Yang B
- Yuan J
- Publication year
- 2023
- Publication venue
- IEEE Transactions on Pattern Analysis and Machine Intelligence
Snippet
We present a method for reconstructing accurate and consistent 3D hands from a monocular video. We observe that the detected 2D hand keypoints and the image texture provide important cues about the geometry and texture of the 3D hand, which can reduce or even …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/40—3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
- G06T15/20—Perspective computation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding, e.g. from bit-mapped to non bit-mapped
- G06T9/001—Model-based coding, e.g. wire frame
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Tu et al. | | Consistent 3d hand reconstruction in video via self-supervised learning |
| Chen et al. | | Model-based 3d hand reconstruction via self-supervised learning |
| Romero et al. | | Embodied hands: Modeling and capturing hands and bodies together |
| Li et al. | | Monocular real-time volumetric performance capture |
| Zhang et al. | | Object-occluded human shape and pose estimation from a single color image |
| Zhang et al. | | Learning 3D human shape and pose from dense body parts |
| Ge et al. | | 3d convolutional neural networks for efficient and robust hand pose estimation from single depth images |
| Yang et al. | | Weakly-supervised disentangling with recurrent transformations for 3d view synthesis |
| Tewari et al. | | Learning complete 3d morphable face models from images and videos |
| Wang et al. | | A progressive quadric graph convolutional network for 3D human mesh recovery |
| Gou et al. | | Cascade learning from adversarial synthetic images for accurate pupil detection |
| Peng et al. | | Implicit neural representations with structured latent codes for human body modeling |
| Zhou et al. | | Hemlets posh: Learning part-centric heatmap triplets for 3d human pose and shape estimation |
| Li et al. | | Detailed 3D human body reconstruction from multi-view images combining voxel super-resolution and learned implicit representation |
| Huang et al. | | Object-occluded human shape and pose estimation with probabilistic latent consistency |
| Chen et al. | | Autosweep: Recovering 3d editable objects from a single photograph |
| Kang et al. | | Competitive learning of facial fitting and synthesis using uv energy |
| Li et al. | | Image-guided human reconstruction via multi-scale graph transformation networks |
| Hu et al. | | Personalized graph generation for monocular 3D human pose and shape estimation |
| Gan et al. | | Fine-grained multi-view hand reconstruction using inverse rendering |
| Cai et al. | | Automatic generation of Labanotation based on human pose estimation in folk dance videos |
| Ren et al. | | Pyramid deep fusion network for two-hand reconstruction from RGB-d images |
| Caselles et al. | | Implicit shape and appearance priors for few-shot full head reconstruction |
| Su et al. | | Omnidirectional depth estimation with hierarchical deep network for multi-fisheye navigation systems |
| Yang et al. | | Innovative AI techniques for photorealistic 3D clothed human reconstruction from monocular images or videos: a survey |