TW201407538A - Image capturing device and method for image processing by voice recognition - Google Patents
Image capturing device and method for image processing by voice recognition Download PDFInfo
- Publication number
- TW201407538A TW201407538A TW102127336A TW102127336A TW201407538A TW 201407538 A TW201407538 A TW 201407538A TW 102127336 A TW102127336 A TW 102127336A TW 102127336 A TW102127336 A TW 102127336A TW 201407538 A TW201407538 A TW 201407538A
- Authority
- TW
- Taiwan
- Prior art keywords
- image
- controller
- module
- special effect
- voice recognition
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims description 25
- 238000012545 processing Methods 0.000 title claims description 19
- 230000000694 effects Effects 0.000 claims abstract description 81
- 239000002131 composite material Substances 0.000 claims description 29
- 239000000758 substrate Substances 0.000 claims description 3
- 230000006870 function Effects 0.000 description 7
- 230000005540 biological transmission Effects 0.000 description 5
- 238000010586 diagram Methods 0.000 description 5
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 4
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000001914 filtration Methods 0.000 description 3
- 229910052737 gold Inorganic materials 0.000 description 3
- 239000010931 gold Substances 0.000 description 3
- 229910052709 silver Inorganic materials 0.000 description 3
- 239000004332 silver Substances 0.000 description 3
- 239000003086 colorant Substances 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000003475 lamination Methods 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000036651 mood Effects 0.000 description 1
- 238000012805 post-processing Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N1/00—Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
- H04N1/0035—User-machine interface; Control console
- H04N1/00352—Input means
- H04N1/00403—Voice input means, e.g. voice commands
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/76—Television signal recording
- H04N5/765—Interface circuits between an apparatus for recording and another apparatus
- H04N5/77—Interface circuits between an apparatus for recording and another apparatus between a recording apparatus and a television camera
- H04N5/772—Interface circuits between an apparatus for recording and another apparatus between a recording apparatus and a television camera the recording apparatus and the television camera being placed in the same enclosure
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N2201/00—Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
- H04N2201/32—Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
- H04N2201/3201—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
- H04N2201/3225—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of data relating to an image, a page or a document
- H04N2201/3242—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of data relating to an image, a page or a document of processing required or performed, e.g. for reproduction or before recording
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N2201/00—Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
- H04N2201/32—Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
- H04N2201/3201—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
- H04N2201/3225—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of data relating to an image, a page or a document
- H04N2201/3245—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of data relating to an image, a page or a document of image modifying data, e.g. handwritten addenda, highlights or augmented reality information
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N2201/00—Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
- H04N2201/32—Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
- H04N2201/3201—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
- H04N2201/3261—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of multimedia information, e.g. a sound signal
- H04N2201/3263—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of multimedia information, e.g. a sound signal of a graphical motif or symbol, e.g. Christmas symbol, logo
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Image Processing (AREA)
- Editing Of Facsimile Originals (AREA)
Abstract
Description
本發明是有關於一種影像擷取裝置,特別是有關於一種具有語音辨識功能的影像擷取裝置。 The present invention relates to an image capturing device, and more particularly to an image capturing device having a voice recognition function.
傳統的照相手機或數位相機除了基本的拍照功能外,多數還另具有效果編輯器,用來對影像進行後製處理,例如改變色彩成灰階模式、懷舊模式等,以創造一張獨一無二的相片,彰顯個人精采的風格。 In addition to the basic camera functions, traditional camera phones or digital cameras also have an effect editor for post-processing of images, such as changing the color to grayscale mode, nostalgic mode, etc., to create a unique photo. , highlighting the individual style.
習知的效果編輯器係於螢幕中顯示一系列選項,使用者根據當時心情從中挑選喜歡的樣式來執行設定,其操作過程繁瑣,要從龐大的資料庫中一一過濾找出喜歡的樣式,在出遊或者聚會等場合,如此的操作過程將會變得十分地不合宜。因此,如何設計出一種可提升搜尋效率,讓使用者可快速地找出喜好模式來進行影像後製的方法與其相關產品,即為相機產業的發展目標之一。 The conventional effect editor displays a series of options on the screen. The user selects the favorite styles according to the mood at that time to perform the setting. The operation process is cumbersome, and it is necessary to filter out the favorite styles from the huge database. In the case of travel or gatherings, such operations will become extremely inappropriate. Therefore, how to design a method and related products that can improve the search efficiency and allow users to quickly find the preferred mode for image post-production is one of the development goals of the camera industry.
另外,若是照相手機或數位相機搭配近來發展的隨身相片印表機,使用者可以攜帶這些裝置,在出遊或者聚 會等場合,拍照完馬上印製相片,但是整個從拍照到印製照片的流程將會變得更加複雜。因此,為了要能夠在如此複雜的流程中提供使用者最佳的便利感受,減少繁瑣的操作過程,將會變得更加重要。 In addition, if it is a camera phone or a digital camera with a recently developed portable photo printer, users can carry these devices, travel or gather In the occasion, photos will be printed immediately after the photo is taken, but the entire process from photographing to printing will become more complicated. Therefore, it will become more important to be able to provide users with the best convenience in such a complicated process and reduce the cumbersome operation process.
本發明之一技術態樣是在提供一種影像擷取裝置,藉由語音辨識功能,減少其操作過程而增加便利性。 One aspect of the present invention provides an image capture device that reduces the operation process by voice recognition function and increases convenience.
根據本發明一實施方式,一種影像擷取裝置,包含記憶模組、影像擷取模組、語音辨識模組以及控制器。記憶模組儲存複數個影像特別效果。影像擷取模組為用來擷取並產生影像,並儲存於記憶模組。語音辨識模組為用來接收語音訊號。控制器分別電連接於影像擷取模組、語音辨識模組以及記憶模組。控制器透過語音辨識模組對語音訊號進行解析,並連線到記憶模組,依據解析結果搜尋相對應之至少一影像特別效果,並於搜尋結果中選擇所欲之影像特別效果,之後控制器連線到記憶模組並選取影像,控制器合成選取之影像特別效果與影像。 According to an embodiment of the invention, an image capturing device includes a memory module, an image capturing module, a voice recognition module, and a controller. The memory module stores a plurality of image special effects. The image capture module is used to capture and generate images and store them in the memory module. The voice recognition module is used to receive voice signals. The controller is electrically connected to the image capturing module, the voice recognition module and the memory module. The controller analyzes the voice signal through the voice recognition module, and connects to the memory module, searches for at least one image special effect according to the analysis result, and selects the desired image special effect in the search result, and then the controller Connect to the memory module and select the image, the controller synthesizes the selected image special effects and images.
於本發明之一或多個實施方式中,控制器係用以透過圖層疊合的方式合成選取之影像特別效果與影像。 In one or more embodiments of the present invention, the controller is configured to synthesize the selected image special effects and images by means of layer stacking.
於本發明之一或多個實施方式中,控制器係用以將影像特別效果作為特殊標記標記在影像的方式合成選取之影像特別效果與影像,形成合成圖像。 In one or more embodiments of the present invention, the controller is configured to synthesize the selected image special effects and images in a manner of image special effects as a special mark to form a composite image.
根據本發明另一實施方式,一種影像處理系統包含 前述之影像擷取裝置以及影像輸出裝置。當影像擷取裝置指示影像輸出裝置輸出合成圖像,影像輸出裝置輸出一合成圖片,合成圖片包含根據影像印製的圖片基底以及根據特殊標記印製的圖片特殊效果。 According to another embodiment of the present invention, an image processing system includes The image capturing device and the image output device described above. When the image capturing device instructs the image output device to output a composite image, the image output device outputs a composite image comprising a picture substrate printed according to the image and a picture special effect printed according to the special mark.
根據本發明再一實施方式,一種利用語音辨識進行影像處理的方法包含語音辨識模組接收語音訊號,而控制器透過語音辨識模組對語音訊號進行解析,接著依據解析結果,控制器連線到記憶模組搜尋相對應之至少一影像特別效果,然後於搜尋結果中選擇所欲之影像特別效果,之後控制器連線到記憶模組並選取至少一影像,最後控制器合成選取之影像特別效果與影像。 According to still another embodiment of the present invention, a method for performing image processing by using voice recognition includes a voice recognition module receiving a voice signal, and the controller parses the voice signal through the voice recognition module, and then, according to the analysis result, the controller is connected to The memory module searches for at least one image special effect, and then selects a desired image special effect in the search result, and then the controller connects to the memory module and selects at least one image, and finally the controller synthesizes the selected image special effect. With images.
於本發明之一或多個實施方式中,控制器以圖層疊合的方式合成選取之影像特別效果與影像。 In one or more embodiments of the present invention, the controller synthesizes the selected image special effects and images in a layered manner.
於本發明之一或多個實施方式中,控制器以將影像特別效果作為特殊標記標記在影像的方式合成選取之影像特別效果與影像,形成合成圖像。 In one or more embodiments of the present invention, the controller synthesizes the selected image special effects and images by displaying the image special effects as special marks on the image to form a composite image.
於本發明之一或多個實施方式中,利用語音辨識進行影像處理的方法更包含提供影像輸出裝置,當影像擷取裝置指示影像輸出裝置輸出合成圖像,影像輸出裝置輸出合成圖片,合成圖片包含根據影像印製的圖片基底以及根據特殊標記印製的圖片特殊效果。 In one or more embodiments of the present invention, the method for performing image processing by using voice recognition further includes providing an image output device. When the image capturing device instructs the image output device to output a composite image, the image output device outputs a composite image, and the composite image is output. Contains image bases that are printed from images and special effects that are printed according to special marks.
於本發明之一或多個實施方式中,圖片特殊效果的顏色為金屬色。 In one or more embodiments of the present invention, the color of the picture special effect is metallic.
本發明上述實施方式藉由整合語音辨識以及影像 擷取裝置,讓使用者在出遊或者聚會等場合,照完照片後可以十分輕鬆方便地進行影像後製,即使在搭配隨身相片印表機時,從拍照到印製照片的流程變得更加複雜的情況下,仍能帶給使用者最佳的便利感受。 The above embodiment of the present invention integrates speech recognition and image The capture device allows the user to perform image post-production very easily and conveniently after taking photos, even when using a photo printer, the process from photographing to printing is more complicated. In the case, it still gives the user the best convenience.
10‧‧‧影像擷取裝置 10‧‧‧Image capture device
12‧‧‧影像擷取模組 12‧‧‧Image capture module
14‧‧‧語音辨識模組 14‧‧‧Voice recognition module
16‧‧‧記憶模組 16‧‧‧Memory Module
18‧‧‧控制器 18‧‧‧ Controller
20‧‧‧影像輸出裝置 20‧‧‧Image output device
200~212‧‧‧步驟 200~212‧‧‧Steps
第1圖繪示依照本發明一實施方式之影像擷取裝置的方塊圖。 FIG. 1 is a block diagram of an image capturing device according to an embodiment of the invention.
第2圖繪示本發明之利用語音辨識進行影像處理的方法一實施方式的流程示意圖。 FIG. 2 is a schematic flow chart of an embodiment of a method for performing image processing using voice recognition according to the present invention.
第3圖繪示依照本發明另一實施方式之影像擷取裝置的方塊圖。 FIG. 3 is a block diagram of an image capturing device according to another embodiment of the present invention.
第4圖繪示本發明之利用語音辨識進行影像處理的方法另一實施方式的流程示意圖。 FIG. 4 is a schematic flow chart showing another embodiment of a method for performing image processing using voice recognition according to the present invention.
以下將以圖式揭露本發明之複數個實施方式,為明確說明起見,許多實務上的細節將在以下敘述中一併說明。然而,應瞭解到,這些實務上的細節不應用以限制本發明。也就是說,在本發明部分實施方式中,這些實務上的細節是非必要的。此外,為簡化圖式起見,一些習知慣用的結構與元件在圖式中將以簡單示意的方式繪示之。 The embodiments of the present invention are disclosed in the following drawings, and the details of However, it should be understood that these practical details are not intended to limit the invention. That is, in some embodiments of the invention, these practical details are not necessary. In addition, some of the conventional structures and elements are shown in the drawings in a simplified schematic manner in order to simplify the drawings.
第1圖繪示依照本發明一實施方式之影像擷取裝 置的方塊圖。本實施方式之影像擷取裝置10可為照相手機或數位相機。也就是說,影像擷取裝置10為具有處理電子資訊能力的影像擷取裝置,且影像擷取裝置10主要適用於可攜式裝置。 FIG. 1 is a diagram of an image capture device according to an embodiment of the invention. Set the block diagram. The image capturing device 10 of the present embodiment may be a camera phone or a digital camera. That is to say, the image capturing device 10 is an image capturing device having the capability of processing electronic information, and the image capturing device 10 is mainly applicable to a portable device.
如第1圖所示,影像擷取裝置10包含影像擷取模組12、語音辨識模組14、記憶模組16以及控制器18。記憶模組16儲存複數個影像特別效果。影像擷取模組12為用來擷取並產生影像,並儲存於記憶模組16。語音辨識模組14為用來接收語音訊號。控制器18分別電連接於影像擷取模組12、語音辨識模組14以及記憶模組16。控制器18透過語音辨識模組14對語音訊號進行解析,並連線到記憶模組16,依據解析結果搜尋相對應之至少一影像特別效果,並於搜尋結果中選擇所欲之影像特別效果,之後控制器18連線到記憶模組16並選取影像,控制器18合成選取之影像特別效果與影像。 As shown in FIG. 1 , the image capturing device 10 includes an image capturing module 12 , a voice recognition module 14 , a memory module 16 , and a controller 18 . The memory module 16 stores a plurality of image special effects. The image capturing module 12 is configured to capture and generate images and store them in the memory module 16 . The voice recognition module 14 is configured to receive voice signals. The controller 18 is electrically connected to the image capturing module 12, the voice recognition module 14, and the memory module 16, respectively. The controller 18 analyzes the voice signal through the voice recognition module 14, and connects to the memory module 16, searches for the corresponding image special effect according to the analysis result, and selects the desired image special effect in the search result. The controller 18 then connects to the memory module 16 and selects an image, and the controller 18 synthesizes the selected image special effects and images.
影像擷取裝置10更可包含顯示模組、輸入模組以及資訊傳輸與接收模組。顯示模組用來顯示影像擷取裝置10提供給使用者的資訊,比如說圖案、照片或是文字訊息。顯示模組可為液晶顯示器。輸入模組用來提供使用者輸入控制指令,輸入模組可為鍵盤或是觸控螢幕。資訊傳輸與接收模組用來將影像或是合成之影像特別效果與影像傳輸給其他電子裝置,或是接收影像擷取裝置10的韌體更新檔等資訊。資訊傳輸與接收模組可為有線模組,比如說通用序列匯流排(Universal Serial Bus,USB)連線模組,或是無線 模組,比如說藍牙模組或是使用射頻無線電的數據機。 The image capturing device 10 further includes a display module, an input module, and an information transmission and receiving module. The display module is used to display information provided by the image capturing device 10 to the user, such as a pattern, a photo or a text message. The display module can be a liquid crystal display. The input module is used to provide a user input control command, and the input module can be a keyboard or a touch screen. The information transmission and receiving module is used to transmit image or synthesized image special effects and images to other electronic devices, or to receive information such as firmware update files of the image capturing device 10. The information transmission and receiving module can be a wired module, such as a universal serial bus (USB) connection module, or wireless Modules, such as Bluetooth modules or modems that use RF radios.
影像擷取模組12可包含鏡頭、快門、機身以及影像感應器,其詳細結構配置將不詳述。影像感應器可為電荷耦合元件(Charge-coupled Device,CCD)或是互補式金屬氧化物半導體(Complementary Metal-Oxide-Semiconductor,CMOS)。影像可為照片或是任何電子圖像。 The image capturing module 12 can include a lens, a shutter, a body, and an image sensor, and the detailed structural configuration thereof will not be described in detail. The image sensor can be a Charge-coupled Device (CCD) or a Complementary Metal-Oxide-Semiconductor (CMOS). The image can be a photo or any electronic image.
語音辨識模組14可具有接收器,接收器可用來接收來自外部環境的語音訊號,並可選擇性地對語音訊號進行濾波以及解析的功能,其中濾波為濾除雜訊的功能,解析為放大主訊號的功能。舉例來說,語音辨識模組14可進行濾波,藉由一判斷準則判斷出語音訊號的其中一部份為雜訊,刪去不要的雜訊之後,再進行放大訊號,之後可以再重複上述步驟數次。藉由以上功能將能有效辨識出語音訊號所包含的控制指令。語音辨識模組14可為硬體或韌體。接收器可為麥克風。 The voice recognition module 14 can have a receiver, the receiver can be used to receive voice signals from the external environment, and can selectively filter and analyze the voice signals, wherein the filtering is a function of filtering out noise, and is analyzed to be amplified. The function of the main signal. For example, the voice recognition module 14 can perform filtering, and determine a part of the voice signal as noise by a judgment criterion, delete the unnecessary noise, and then amplify the signal, and then repeat the above steps. Several times. With the above functions, the control commands included in the voice signal can be effectively recognized. The speech recognition module 14 can be a hardware or a firmware. The receiver can be a microphone.
記憶模組16可為隨機存取記憶體,應了解到,以上所舉之記憶模組16的具體實施方式僅為例示,而非用以限制本發明,本發明所屬技術領域中具有通常知識者,可依實際需要,彈性選擇記憶模組16的具體實施方式。 The memory module 16 can be a random access memory. It should be understood that the specific implementation of the above-mentioned memory module 16 is merely illustrative and not intended to limit the present invention, and those having ordinary knowledge in the technical field of the present invention. The specific implementation manner of the memory module 16 can be flexibly selected according to actual needs.
儲存於記憶模組16的影像特別效果可為用來對於照片進行後製的特別效果。具體而言,影像特別效果可為色彩轉換效果,例如灰階模式、懷舊效果、負片效果、曝光效果、色調分離效果等效果,影像特別效果亦可為加入邊框於影像、在影像上塗鴉文字或是在影像上疊合特殊圖 樣,影像特別效果並不限於前述列舉。影像特別效果可以為內建於記憶模組16、透過外接記憶體而儲存於記憶模組16或者藉由資訊傳輸與接收模組透過網際網路下載到記憶模組16。影像特別效果的來源並不限於前述方式。 The image special effects stored in the memory module 16 can be a special effect for post-production of the photos. Specifically, the image special effect may be a color conversion effect, such as a grayscale mode, a nostalgic effect, a negative film effect, an exposure effect, a color separation effect, etc., and the image special effect may also be a border image, a graffiti text on the image, or Is to overlay a special map on the image As such, the image special effects are not limited to the foregoing list. The image special effects can be built into the memory module 16 , stored in the memory module 16 through the external memory or downloaded to the memory module 16 through the Internet through the information transmission and receiving module. The source of the image special effects is not limited to the foregoing.
控制器18可為中央處理器(Central Processing Unit,CPU)或應用處理器(Application Processor),應了解到,以上所舉之控制器18的具體實施方式僅為例示,而非用以限制本發明,本發明所屬技術領域中具有通常知識者,可依實際需要,彈性選擇控制器18的具體實施方式。 The controller 18 can be a central processing unit (CPU) or an application processor. It should be understood that the specific implementation of the controller 18 is merely illustrative and not limiting. Those skilled in the art to which the present invention pertains can flexibly select a specific embodiment of the controller 18 according to actual needs.
第2圖繪示本發明之利用語音辨識進行影像處理的方法一實施例的流程示意圖。對於前述之影像擷取裝置10,操作步驟如下。步驟200為語音辨識模組14接收語音訊號,其中語音辨識模組14可藉由接收器來接收來自外部環境的語音訊號,接收功能可由一機制觸發,比如說藉由輸入模組輸入特定指令或是在影像擷取模組12擷取影像並儲存於記憶模組16完畢後。 FIG. 2 is a schematic flow chart of an embodiment of a method for performing image processing by using voice recognition according to the present invention. For the image capturing device 10 described above, the operation steps are as follows. In step 200, the voice recognition module 14 receives the voice signal. The voice recognition module 14 can receive the voice signal from the external environment by using the receiver. The receiving function can be triggered by a mechanism, for example, inputting a specific instruction through the input module or The image capture module 12 captures the image and stores it in the memory module 16.
步驟202為控制器18透過語音辨識模組14對語音訊號進行解析。在語音辨識模組14接收到語音訊號後,語音辨識模組14可選擇性地對語音訊號進行濾波以及解析的功能並得到可作為指令的主訊號,之後再將主訊號判定為特定控制指令,以此為解析結果。舉例來說,當使用者說出「愛心」時,語音辨識模組14先將使用者說話時的背景雜訊濾除,之後將「愛心」的語音訊號放大後,然後再將「愛心」的語音訊號判定為控制指令「愛心」。 Step 202 is to analyze the voice signal by the controller 18 through the voice recognition module 14. After the voice recognition module 14 receives the voice signal, the voice recognition module 14 can selectively filter and analyze the voice signal and obtain the main signal that can be used as the command, and then determine the main signal as a specific control command. This is the result of the analysis. For example, when the user says "love", the voice recognition module 14 first filters out the background noise of the user's speech, then enlarges the "love" voice signal, and then "love" The voice signal is determined as the control command "Love".
步驟204為依據解析結果,控制器18連線到記憶模組16搜尋相對應之至少一影像特別效果。舉例來說,當解析結果為控制指令「愛心」時,控制器18即依此連線到記憶模組16找到以文字顯示的「愛心」字樣或以圖案顯示的「愛心」圖樣等影像特別效果,之後控制器18可將這些影像特別效果的名稱以列表方式顯示於顯示模組上。 Step 204 is based on the analysis result, and the controller 18 connects to the memory module 16 to search for at least one image special effect. For example, when the analysis result is the control command "love", the controller 18 connects to the memory module 16 to find the "love" type displayed in the text or the "love" pattern displayed in the pattern. Then, the controller 18 can display the names of the special effects of the images on the display module in a list.
步驟206為於搜尋結果中選擇所欲之影像特別效果。舉例來說,當控制器18將以文字顯示的「愛心」字樣或以圖案顯示的「愛心」圖樣的名稱以列表方式顯示於顯示模組上時,使用者可以使用聲控或是藉由輸入模組選取所欲之影像特別效果。 Step 206 is to select a desired image special effect in the search result. For example, when the controller 18 displays the name of the "love" in the text or the name of the "love" pattern displayed in the form on the display module, the user can use the voice control or the input mode. The group selects the desired image for special effects.
在使用者選取所欲之影像特別效果之後,針對特定的影像特別效果,比如說在影像上塗鴉文字或是在影像上疊合「愛心」圖樣,使用者可藉由語音辨識模組14或是輸入模組輸入指令,比如說輸入文字訊息或是調整圖樣位置,以完成影像特別效果的設定。 After the user selects the desired image special effect, for the specific image special effects, such as graffiti text on the image or superimposed "love" pattern on the image, the user can use the voice recognition module 14 or Input module input commands, such as inputting text messages or adjusting the position of the pattern, to complete the setting of the special effects of the image.
步驟202~206可以重複執行,因此使用者可以選取複數個所欲之影像特別效果。舉例來說,使用者可以在影像上塗鴉文字的同時,在影像上疊合「愛心」圖樣 Steps 202-206 can be repeated, so the user can select a plurality of desired image special effects. For example, the user can overlay the "love" pattern on the image while graffitiing the text on the image.
步驟208為控制器18連線到記憶模組16並選取至少一影像。舉例來說,控制器18可以在影像擷取模組14剛擷取一照片,並儲存於記憶模組16之後,選取這張照片。或者,控制器18可以依照使用者的指令,選取先前儲存在記憶模組16中的影像。 Step 208 connects the controller 18 to the memory module 16 and selects at least one image. For example, the controller 18 can just capture a photo in the image capturing module 14 and store it in the memory module 16 to select the photo. Alternatively, the controller 18 may select an image previously stored in the memory module 16 in accordance with a user's instruction.
步驟210為控制器18合成選取之影像特別效果與影像。控制器18合成選取之影像特別效果與影像的方法不只一種。明確來說,若影像特別效果為色彩轉換效果,控制器18可依照一色彩轉換公式轉換影像的顏色資訊,舉例來說,若影像特別效果為灰階模式,則控制器18可將影像的色調資訊除去,僅留下亮度資訊。若影像特別效果為在影像上塗鴉文字或是在影像上疊合特殊圖樣,則控制器18可將選取之影像作為基底圖層,並將選取之影像特別效果作為疊加圖層,之後再以圖層疊合的方式,把疊加圖層疊在基底圖層上,以形成合成圖像;或者,控制器18可將選取之影像特別效果作為特殊標記,並將特殊標記標記在選取之影像上,形成合成圖像。此處需要注意的是,藉由圖層疊合方式合成的合成圖像,合成圖像的資訊可以完全顯示於顯示模組上,但是藉由將影像特別效果作為特殊標記標記在影像上的方式合成的合成圖像,合成圖像的資訊無法完全以圖案形式顯示於顯示模組上。 Step 210 is for the controller 18 to synthesize the selected image special effects and images. The controller 18 synthesizes the selected image special effects and images by more than one. Specifically, if the image special effect is a color conversion effect, the controller 18 can convert the color information of the image according to a color conversion formula. For example, if the image special effect is a grayscale mode, the controller 18 can change the color of the image. Information is removed, leaving only brightness information. If the image has a special effect of graffitiing the image on the image or overlaying the special pattern on the image, the controller 18 can use the selected image as the base layer and use the selected image special effect as the overlay layer, and then layer the image. In a manner, the overlay is layered on the base layer to form a composite image; or the controller 18 can mark the selected image special effect as a special mark and mark the special mark on the selected image to form a composite image. It should be noted here that the composite image synthesized by the layer stacking method can completely display the information of the composite image on the display module, but by synthesizing the image special effect as a special mark on the image. The composite image, the information of the composite image cannot be completely displayed on the display module in a pattern.
第3圖繪示依照本發明另一實施方式之影像擷取裝置的方塊圖。影像擷取裝置10更可與影像輸出裝置20搭配,影像擷取裝置10藉由資訊傳輸與接收模組傳遞指令給影像輸出裝置20。當影像擷取裝置10指示影像輸出裝置20輸出合成圖像,影像輸出裝置20輸出合成圖片。合成圖片可包含根據影像印製的圖片基底以及根據特殊標記印製的圖片特殊效果。影像擷取裝置10以及影像輸出裝置20可組成影像處理系統。 FIG. 3 is a block diagram of an image capturing device according to another embodiment of the present invention. The image capturing device 10 can be further matched with the image output device 20, and the image capturing device 10 transmits commands to the image output device 20 through the information transmission and receiving module. When the image capturing device 10 instructs the image output device 20 to output a composite image, the image output device 20 outputs a composite image. The composite image can include a picture base printed from the image and a picture special effect printed according to the special mark. The image capturing device 10 and the image output device 20 may constitute an image processing system.
影像輸出裝置20可為雷射印表機、噴墨印表機或是相片印表機。當影像輸出裝置20為相片印表機,且使用者使用影像擷取裝置10搭配影像輸出裝置20時,使用者可以依照上述實施方式,從拍照、選擇特效、合成照片到印製照片,所有步驟可以一次完成。由於影像擷取裝置10以及影像輸出裝置20皆可為移動裝置,所以使用者可以隨身攜帶這兩個裝置,並且在出遊或者聚會等場合拍照並直接得到照片成品,如此將能帶給使用者許多樂趣。然而在上述場合中,整個從拍照到印製照片的流程顯得十分複雜,因此藉由本發明上述實施方式以聲控方式減少操作過程,將能大幅地改善使用者的使用感受。 The image output device 20 can be a laser printer, an inkjet printer, or a photo printer. When the image output device 20 is a photo printer, and the user uses the image capturing device 10 in conjunction with the image output device 20, the user can take photos, select special effects, synthesize photos, and print photos according to the above embodiments. Can be done in one go. Since both the image capturing device 10 and the image output device 20 can be mobile devices, the user can carry the two devices with them, and take photos and obtain the finished photos directly in the event of traveling or gathering, so that the user can be brought to the user. pleasure. However, in the above case, the entire process from photographing to printing photographs is very complicated. Therefore, by the above-described embodiment of the present invention, the operation process is reduced in a voice-activated manner, and the user's feeling of use can be greatly improved.
圖片特殊效果可為金屬色,例如金色或銀色,圖片特殊效果可使用金箔或是銀箔為列印材料以達成效果。如此在前述情境下將能帶來更多樂趣。舉例來說,在出遊或者聚會的場合,得到一張具有金色或是銀色的「愛心」圖樣的照片,將使照片更具有歡樂感。此處需要注意的是,金色或銀色並不屬於一般畫素所能呈現顏色的範疇,所以顯示模組無法直接顯示,且一般電子圖檔將無法表現這些顏色,因此在輸出這種具有金色或銀色的圖片特殊效果的合成圖片時,影像輸出裝置20不是根據透過圖層疊合的方式所合成的合成圖像列印,而是根據影像印製圖片基底以及根據特殊標記印製圖片特殊效果,以輸出合成圖片。 The special effect of the picture can be metallic, such as gold or silver. The special effect of the picture can be achieved by using gold foil or silver foil as the printing material. This will bring more fun in the aforementioned situation. For example, in the case of a trip or party, getting a photo with a gold or silver "love" pattern will make the photo more enjoyable. It should be noted here that gold or silver is not a category that can be rendered by a general pixel, so the display module cannot be directly displayed, and the general electronic image file will not be able to express these colors, so the output is golden or In the case of a silver-colored picture with a special effect, the image output device 20 does not print the composite image synthesized by the lamination of the images, but prints the image base according to the image and prints the special effect according to the special mark. Output a composite image.
第4圖繪示本發明之利用語音辨識進行影像處理的方法另一實施例的流程示意圖。此處與前述實施方式大 致相同,唯一不同之處在於利用語音辨識進行影像處理的方法更包含步驟211:提供影像輸出裝置20,當影像擷取裝置10指示影像輸出裝置20輸出合成圖像,影像輸出裝置20輸出合成圖片。合成圖片可包含根據影像印製的圖片基底以及根據特殊標記印製的圖片特殊效果。 FIG. 4 is a schematic flow chart showing another embodiment of a method for performing image processing by using voice recognition according to the present invention. Larger here than the previous embodiment The method is the same. The only difference is that the method for performing image processing by using voice recognition further includes step 211: providing an image output device 20, and when the image capturing device 10 instructs the image output device 20 to output a composite image, the image output device 20 outputs a composite image. . The composite image can include a picture base printed from the image and a picture special effect printed according to the special mark.
本發明上述實施方式藉由整合語音辨識以及影像擷取裝置10,讓使用者在出遊或者聚會等場合,照完照片後可以十分輕鬆方便地進行影像後製,即使在搭配隨身相片印表機時,從拍照到印製照片的流程變得更加複雜的情況下,仍能帶給使用者最佳的便利感受。 The above embodiment of the present invention integrates the voice recognition and image capturing device 10, so that the user can perform image post-production after the photo is taken out in a place such as a trip or a party, even when using the photo printer. The process of taking photos to printing photos becomes more complicated, and still gives users the best convenience.
雖然本發明已以實施方式揭露如上,然其並非用以限定本發明,任何熟習此技藝者,在不脫離本發明之精神和範圍內,當可作各種之更動與潤飾,因此本發明之保護範圍當視後附之申請專利範圍所界定者為準。 Although the present invention has been disclosed in the above embodiments, it is not intended to limit the present invention, and the present invention can be modified and modified without departing from the spirit and scope of the present invention. The scope is subject to the definition of the scope of the patent application attached.
200~212‧‧‧步驟 200~212‧‧‧Steps
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US13/958,569 US20140036102A1 (en) | 2012-08-05 | 2013-08-04 | Image capture device and method for image processing by voice recognition |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201261679777P | 2012-08-05 | 2012-08-05 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| TW201407538A true TW201407538A (en) | 2014-02-16 |
Family
ID=50067417
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW102127336A TW201407538A (en) | 2012-08-05 | 2013-07-30 | Image capturing device and method for image processing by voice recognition |
Country Status (3)
| Country | Link |
|---|---|
| CN (1) | CN104584527A (en) |
| TW (1) | TW201407538A (en) |
| WO (1) | WO2014023080A1 (en) |
Families Citing this family (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105657254B (en) * | 2015-12-28 | 2019-10-29 | 努比亚技术有限公司 | A kind of image composition method and device |
| CN106791370A (en) * | 2016-11-29 | 2017-05-31 | 北京小米移动软件有限公司 | A kind of method and apparatus for shooting photo |
| CN107886947A (en) * | 2017-10-19 | 2018-04-06 | 珠海格力电器股份有限公司 | Image processing method and device |
| CN108600635A (en) * | 2018-05-22 | 2018-09-28 | Oppo(重庆)智能科技有限公司 | Image pickup method, device and the electronic equipment of image |
| CN110334673A (en) * | 2019-07-10 | 2019-10-15 | 青海中水数易信息科技有限责任公司 | The long information system processed in river with intelligent recognition image function and method |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6289140B1 (en) * | 1998-02-19 | 2001-09-11 | Hewlett-Packard Company | Voice control input for portable capture devices |
| JP4325415B2 (en) * | 2004-01-27 | 2009-09-02 | 株式会社ニコン | An electronic camera having a finish setting function and a processing program for customizing the finish setting function of the electronic camera. |
| CN100343874C (en) * | 2005-07-11 | 2007-10-17 | 北京中星微电子有限公司 | Voice-based colored human face synthesizing method and system, coloring method and apparatus |
| JP5374080B2 (en) * | 2008-06-25 | 2013-12-25 | キヤノン株式会社 | Imaging apparatus, control method therefor, and computer program |
| US20100238323A1 (en) * | 2009-03-23 | 2010-09-23 | Sony Ericsson Mobile Communications Ab | Voice-controlled image editing |
-
2013
- 2013-07-30 TW TW102127336A patent/TW201407538A/en unknown
- 2013-08-05 CN CN201380032736.0A patent/CN104584527A/en active Pending
- 2013-08-05 WO PCT/CN2013/000916 patent/WO2014023080A1/en not_active Ceased
Also Published As
| Publication number | Publication date |
|---|---|
| WO2014023080A1 (en) | 2014-02-13 |
| CN104584527A (en) | 2015-04-29 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN108886576B (en) | The display methods of digital camera and digital camera | |
| US8599251B2 (en) | Camera | |
| CN103384304B (en) | Display control apparatus, display control method | |
| WO2004039068A1 (en) | Image combining portable terminal and image combining method used therefor | |
| CN102262787A (en) | Image composition apparatus | |
| TW201407538A (en) | Image capturing device and method for image processing by voice recognition | |
| JP7144571B2 (en) | Information equipment and camera image sharing system | |
| CN106023083A (en) | Method and device for obtaining combined image | |
| CN103327188A (en) | Self-photographing method with mobile terminal and mobile terminal | |
| JP2006261915A (en) | Imaging device | |
| JP2010081012A (en) | Imaging device, imaging control method, and program | |
| JP2010097449A (en) | Image composition device, image composition method and image composition program | |
| JP2020052947A (en) | Image processing apparatus, image processing method, and image processing program | |
| US20160057359A1 (en) | Image generating apparatus, image generating method and computer readable recording medium for recording program for generating new image from images related to reference image | |
| JP5023932B2 (en) | Imaging apparatus, image capturing method by scenario, and program | |
| JP2014158102A (en) | Imaging device and image processing device | |
| JP7128347B2 (en) | Image processing device, image processing method and program, imaging device | |
| CN104584529A (en) | Image processing device, image capture device, and program | |
| US20140036102A1 (en) | Image capture device and method for image processing by voice recognition | |
| JP2022049799A (en) | Imaging method, imaging system, imaging device, server, and program | |
| KR101610196B1 (en) | Method for composing video data in mobile terminal and mobile terminal using the same | |
| JP5359995B2 (en) | Painting style image display control device, painting style image display control program, and painting style image display control method | |
| JP5601401B2 (en) | Imaging apparatus, image generation method, and program | |
| JP4603249B2 (en) | Image processing apparatus for mobile phone with camera | |
| JP5304854B2 (en) | Imaging apparatus, image generation method, and program |