KR20170055466A

KR20170055466A - Display apparatus, Method for controlling display apparatus and Method for controlling display apparatus in Voice recognition system thereof

Info

Publication number: KR20170055466A
Application number: KR1020170059480A
Authority: KR
Inventors: 박은희; 한상진; 김재권
Original assignee: 삼성전자주식회사
Priority date: 2017-05-12
Filing date: 2017-05-12
Publication date: 2017-05-19
Anticipated expiration: 2033-01-07
Also published as: KR102045539B1

Abstract

The present invention relates to a display apparatus, a control method, and a control method of a voice recognition system. The present invention provides voice guidance information to a user in order to quickly respond to user voice and control function of the display apparatus. When users voice for controlling the display apparatus is input, the display apparatus determines whether the users voice is a command already stored in the display device, and then transmits the users voice to an interactive server. If the user voice is not a command already stored in the display apparatus, control information corresponding to the user voice and first guide information guiding an already stored command capable of performing the same function as the user voice are transmitted from the interactive server. The function of the display apparatus is performed according to control information transmitted from the interactive server, and first guide information is displayed.

Description

BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a display apparatus, a control method thereof, and a display apparatus control method for a voice recognition system.

본 발명은 디스플레이 장치, 및 이의 제어 방법, 그리고 음성 인식 시스템의 디스플레이 장치 제어 방법에 관한 것으로서, 더욱 상세하게는 입력되는 사용자 음성에 따라 디스플레이 장치의 기능을 제어할 수 있는 디스플레이 장치, 및 이의 제어 방법, 그리고 음성 인식 시스템의 디스플레이 장치 제어 방법에 관한 것이다.BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a display device, a control method thereof, and a display device control method of a speech recognition system, and more particularly, to a display device capable of controlling functions of a display device according to an input user voice, And a display device control method of the speech recognition system.

일반적으로, 음성 인식이 가능한 디스플레이 장치는 크게 임베디드(Embedded) 방식과 대화형 방식이 있다. 2. Description of the Related Art In general, a display device capable of voice recognition includes an embedded system and an interactive system.

임베디드 방식의 디스플레이 장치는 한정된 사용자 음성만을 인식할 수 있다. 즉, 임베디드 방식의 디스플레이 장치는 기저장된 명령어에 대응되는 사용자 음성이 입력된 경우에만 사용자 음성에 대응되는 기능을 수행할 수 있다. 이와 같은 임베디드 방식의 디스플레이 장치는 입력된 사용자 음성에 대응되는 기능을 빠르게 수행한다는 점에서 장점이 있으나, 기 저장된 명령어에 대응되는 사용자 음성만을 인식한다는 점에서 사용자 음성을 인식하는데 매우 한정적이다.An embedded display device can recognize only a limited user voice. That is, the display device of the embedded system can perform a function corresponding to the user's voice only when the user's voice corresponding to the pre-stored command is input. Such an embedded display device is advantageous in that it quickly performs a function corresponding to the inputted user voice, but it is very limited in recognizing the user voice in that it recognizes only the user voice corresponding to the pre-stored command word.

대화형 방식의 디스플레이 장치는 외부의 대화형 서버를 통해 다양한 사용자 음성을 인식하여 사용자 의도를 파악하고, 그 파악된 사용자 의도에 적합한 동작을 수행한다. 이 같은 대화형 방식의 디스플레이 장치는 사용자 음성이 입력되면, 대화형 서버로부터 사용자 음성에 대응되는 제어 정보 또는 응답 정보(예를 들어, 컨텐츠 검색 정보)를 수신하고, 제어 정보 또는 응답 정보에 기초하여 사용자 음성에 대응되는 기능을 수행한다. 이 같은 대화형 방식의 디스플레이 장치는 임베디드 방식의 디스플레이 장치에 비해 다양한 사용자 발화를 인식하고, 인식한 사용자 발화에 대응되는 동작을 수행한다는 점에서 장점이 있으나, 대화형 서버를 이용하여 사용자 음성에 대응되는 기능을 수행하기 때문에 임베디드 방식의 디스플레이 장치에 비해 응답 속도가 느린 문제가 있다.The interactive display device recognizes various user's voices through an external interactive server, grasps the user's intention, and performs an operation suited to the user's intention. Such an interactive display device receives control information or response information (for example, content search information) corresponding to a user's voice from the interactive server when the user's voice is input, And performs a function corresponding to the user voice. Such an interactive display device is advantageous in that it recognizes various user utterances and performs an operation corresponding to a recognized user utterance as compared with an embedded display device. However, A response speed is slower than that of an embedded display device.

특히, 근래에는 상술한 두 가지 방식을 모두 이용하는 디스플레이 장치가 개발되고 있다. 그러나, 두 가지 방식을 모두 이용하더라도 사용자가 디스플레이 장치에 임베디드된 명령어를 발화하지 않고, 임베디드된 명령어와 유사한 명령어를 발화하는 경우, 디스플레이 장치는 대화형 방식을 이용하여 사용자 음성을 인식하고 사용자 음성에 대응되는 기능을 수행하게 된다. 예를 들어, 디스플레이 장치의 볼륨 업 기능을 수행하기 위해 임베디드된 명령어가 "볼륨 키워"이나, 사용자가 "볼륨을 높여주세요"라고 발화한 경우, 디스플레이 장치는 응답 속도가 빠른 임베디드 방식이 아닌 응답 속도가 느린 대화형 방식을 이용해야 볼륨 레벨을 증가시키는 기능을 수행해야 하는 문제점이 발생하였다.Particularly, in recent years, a display device using both of the above two methods has been developed. However, even if both methods are used, when the user utters a command similar to the embedded command, without igniting the command embedded in the display device, the display device recognizes the user's voice using the interactive method, And performs a corresponding function. For example, when a command that is embedded to perform the volume up function of the display device is "volume up" or when the user says "please raise the volume ", the display device displays the response speed There has been a problem in that a slow interactive method must be used to perform the function of increasing the volume level.

즉, 사용자가 두 가지 음성 인식 방법 중 어느 방법을 이용해야 더욱 신속하게 음성 인식을 수행할 수 있을지 모르는 경우, 디스플레이 장치의 작업량이 불필요하게 증가하며, 사용자 음성에 대한 응답이 늦어지는 문제점이 발생하게 된다.That is, if the user does not know which one of the two speech recognition methods can perform the speech recognition more quickly, a problem arises in that the workload of the display device is unnecessarily increased and the response to the user voice is delayed do.

본 발명은 상술한 문제점을 해결하기 위해 안출된 것으로, 본 발명의 목적은 사용자 음성을 신속하게 응답하여 디스플레이 장치의 기능을 제어할 수 있도록 사용자에게 음성 가이드 정보를 제공하는 디스플레이 장치, 및 이의 제어 방법, 그리고 음성 인식 시스템의 제어 방법을 제공함에 있다.It is an object of the present invention to provide a display device that provides audio guide information to a user so that the user can quickly respond to user's voice and control the function of the display device, And a control method of the speech recognition system.

상기 목적을 달성하기 위한 본 발명의 일 실시예에 따른, 디스플레이 장치의 제어 방법은, 상기 디스플레이 장치를 제어하기 위한 사용자 음성을 입력받는 단계; 상기 사용자 음성이 상기 디스플레이 장치에 기 저장된 명령어인지 여부를 판단하면서 상기 사용자 음성을 대화형 서버로 전송하는 단계; 및 상기 사용자 음성이 상기 디스플레이 장치에 기 저장된 명령어가 아닌 경우, 상기 대화형 서버로부터 상기 사용자 음성에 대응되는 제어 정보 및 상기 사용자 음성과 동일한 기능을 수행할 수 있는 기 저장된 명령어를 안내하는 제1 가이드 정보가 전송되면, 상기 대화형 서버로부터 전송된 제어 정보에 따라 상기 디스플레이 장치의 기능을 수행하고, 상기 제1 가이드 정보를 디스플레이하는 단계;를 포함한다.According to another aspect of the present invention, there is provided a method of controlling a display device, the method comprising: receiving a user voice for controlling the display device; Transmitting the user voice to the interactive server while determining whether the user voice is an instruction pre-stored in the display device; And a first guide for guiding, from the interactive server, control information corresponding to the user's voice and a pre-stored command capable of performing the same function as the user's voice, when the user's voice is not an instruction previously stored in the display device Performing the function of the display device according to control information transmitted from the interactive server and displaying the first guide information when the information is transmitted.

그리고, 상기 대화형 서버는, 상기 사용자 음성에 대응되는 제어 정보를 검색하고, 상기 사용자 음성에 대응되는 제어 정보를 검색하는 동안 상기 제어 정보와 동일한 기능을 수행할 수 있는 상기 디스플레이 장치에 기저장된 명령어가 있는지 여부를 판단하며, 상기 제어 정보와 동일한 기능을 수행할 수 있는 상기 디스플레이 장치에 기저장된 명령어가 있는 경우, 상기 기 저장된 명령어를 안내하는 제1 가이드 정보를 생성하여 상기 제어 정보와 함께 상기 디스플레이 장치에 전송할 수 있다.The interactive server searches for control information corresponding to the user's voice and searches for control information corresponding to the user's voice by using the pre- And if there is an instruction previously stored in the display device capable of performing the same function as the control information, generates first guide information for guiding the pre-stored instruction, Device.

또한, 상기 사용자 음성이 기 저장된 명령어인 경우, 상기 기 저장된 명령어와 대응되는 제어 정보를 검색하는 단계; 및 상기 검색된 제어 정보에 따라 상기 디스플레이 장치의 기능을 우선적으로 수행하는 단계;를 더 포함할 수 있다.Searching the control information corresponding to the pre-stored instruction word if the user voice is a previously stored instruction word; And preferentially performing a function of the display device according to the searched control information.

*그리고, 상기 사용자 음성이 기 저장된 명령어이며, 상기 사용자 음성이 복수의 계층구조를 가지는 디스플레이 장치의 기능을 제어하기 위한 명령어인 경우, 상기 사용자 음성과 동일한 기능을 수행할 수 있는 대화형 명령어를 안내하는 제2 가이드 정보를 디스플레이하는 단계;를 포함할 수 있다.If the user voice is a command for controlling a function of a display device having a plurality of hierarchical structures, an interactive command word capable of performing the same function as the user voice is displayed And displaying the second guide information.

또한, 상기 대화형 서버는, 상기 사용자 음성을 텍스트 정보로 변환하는 제1 대화형 서버 및 상기 텍스트 정보에 따라 제어 정보 및 제1 가이드 정보를 생성하는 제2 대화형 서버를 포함할 수 있다.The interactive server may include a first interactive server for converting the user's voice into text information, and a second interactive server for generating control information and first guide information according to the text information.

한편, 상기 목적을 달성하기 위해 안출된 본 발명의 일 실시예에 따른, 디스플레이 장치는, 상기 디스플레이 장치를 제어하기 위한 사용자 음성을 입력받는 음성 입력부; 대화형 서버와 통신을 수행하는 통신부; 명령어와 제어 정보를 매칭하여 저장하는 저장부; 디스플레이부; 및 상기 음성 입력부를 통해 입력된 사용자 음성이 상기 저장부에 기 저장된 명령어인지 여부를 판단하면서 상기 사용자 음성을 상기 통신부를 통해 상기 대화형 서버로 전송하고, 상기 사용자 음성이 상기 저장부에 기 저장된 명령어가 아닌 경우, 상기 대화형 서버로부터 상기 사용자 음성에 대응되는 제어 정보 및 상기 사용자 음성과 동일한 기능을 수행할 수 있는 기 저장된 명령어를 안내하는 제1 가이드 정보가 전송되면, 상기 대화형 서버로부터 전송된 제어 정보에 따라 상기 디스플레이 장치의 기능을 수행하고, 상기 제1 가이드 정보를 디스플레이하도록 상기 디스플레이부를 제어하는 제어부;를 포함한다.According to another aspect of the present invention, there is provided a display device including: a voice input unit for receiving a user voice for controlling the display device; A communication unit for performing communication with the interactive server; A storage unit for storing and storing instructions and control information; A display unit; And transmitting the user's voice to the interactive server through the communication unit while determining whether the user's voice inputted through the voice input unit is a command previously stored in the storage unit, When the first guide information for guiding the control information corresponding to the user's voice and the pre-stored command capable of performing the same function as the user's voice is transmitted from the interactive server, And a controller for performing the function of the display device according to the control information and controlling the display unit to display the first guide information.

또한, 상기 제어부는, 상기 사용자 음성이 상기 저장부에 기 저장된 명령어인 경우, 상기 저장부에 저장된 상기 사용자 음성에 대응되는 제어 정보를 검색하고, 상기 검색된 제어 정보에 따라 상기 디스플레이 장치의 기능을 우선적으로 수행할 수 있다.The control unit searches the control information corresponding to the user's voice stored in the storage unit when the user's voice is an instruction word previously stored in the storage unit, and prioritizes the function of the display device according to the retrieved control information. . &Lt; / RTI >

그리고, 상기 제어부는, 상기 사용자 음성이 기 저장된 명령어이며, 상기 사용자 음성이 복수의 계층구조를 가지는 디스플레이 장치의 기능을 제어하기 위한 명령어인 경우, 상기 사용자 음성과 동일한 기능을 수행할 수 있는 대화형 명령어를 안내하는 제2 가이드 정보를 디스플레이하도록 상기 디스플레이부를 제어할 수 있다.If the user voice is a command for controlling a function of a display device having a plurality of hierarchical structures, the control unit may be an interactive type capable of performing the same function as the user voice, And to display the second guide information guiding the instruction.

또한, 상기 대화형 서버는, 상기 입력된 사용자 음성을 텍스트 정보로 변환하는 제1 대화형 서버 및 상기 텍스트 정보에 따라 제어 정보 및 제1 가이드 정보를 생성하는 제2 대화형 서버를 포함하며, 상기 제어부는, 상기 입력된 사용자 음성을 상기 제1 대화형 서버로 전송하며, 상기 제1 대화형 서버로부터 전송된 텍스트 정보를 상기 제2 대화형 서버로 전송하도록 상기 통신부를 제어할 수 있다.The interactive server may include a first interactive server for converting the input user voice into text information and a second interactive server for generating control information and first guide information according to the text information, The control unit may control the communication unit to transmit the inputted user voice to the first interactive server and transmit the text information transmitted from the first interactive server to the second interactive server.

한편, 상기 목적을 달성하기 위한 본 발명의 일 실시예에 따른, 대화형 서버 및 디스플레이 장치를 포함하는 음성 인식 시스템의 제어 방법은, 상기 디스플레이 장치가, 사용자 음성을 입력받는 단계; 상기 디스플레이 장치가, 상기 사용자 음성이 상기 디스플레이 장치에 기 저장된 명령어인지 여부를 판단하면서 상기 사용자 음성을 상기 대화형 서버로 전송하는 제1 전송 단계; 상기 대화형 서버가, 상기 사용자 음성에 대응되는 제어 정보 및 상기 제어 정보와 동일한 기능을 수행할 수 있는 상기 디스플레이 장치에 기 저장된 명령어를 안내하는 제1 가이드 정보 중 적어도 하나를 생성하여 상기 디스플레이 장치로 전송하는 제2 전송 단계; 및 상기 사용자 음성이 상기 디스플레이 장치에 기 저장된 명령어가 아닌 경우, 상기 디스플레이 장치가, 상기 대화형 서버로부터 전송된 제어 정보에 따라 디스플레이 장치의 기능을 수행하고 상기 제1 가이드 정보를 디스플레이하는 단계;를 포함한다.According to another aspect of the present invention, there is provided a method of controlling a voice recognition system including an interactive server and a display device, the method comprising: receiving a user voice; A first transmission step of transmitting the user's voice to the interactive server while determining whether the user's voice is a command previously stored in the display device; Wherein the interactive server generates at least one of control information corresponding to the user's voice and first guide information guiding instructions stored in the display device to perform the same function as the control information, A second transmission step of transmitting the first transmission data; And if the user's voice is not a command previously stored in the display device, the display device performs the function of the display device according to the control information transmitted from the interactive server and displays the first guide information .

그리고, 상기 사용자 음성이 상기 디스플레이 장치에 기 저장된 명령어인 경우, 상기 디스플레이 장치가, 상기 사용자 음성에 대응되는 제어 정보를 검색하고, 상기 검색된 제어 정보에 따라 상기 디스플레이 장치의 기능을 수행하는 단계;를 더 포함할 수 있다.If the user's voice is a command previously stored in the display device, the display device searches for control information corresponding to the user's voice and performs the function of the display device according to the retrieved control information. .

또한, 상기 사용자 음성이 기 저장된 명령어이며, 상기 사용자 음성이 복수의 계층구조를 가지는 디스플레이 장치의 기능을 제어하기 위한 명령어인 경우, 상기 디스플레이 장치가, 상기 사용자 음성과 동일한 기능을 수행할 수 있는 대화형 명령어를 안내하는 제2 가이드 정보를 디스플레이하는 단계;를 더 포함할 수 있다.If the user voice is a command for controlling a function of a display device having a plurality of hierarchical structures, the display device may display a dialogue that can perform the same function as the user voice And displaying second guide information for guiding the type command.

그리고, 상기 대화형 서버는, 상기 입력된 사용자 음성을 텍스트 정보로 변환하는 제1 대화형 서버 및 상기 텍스트 정보에 따라 제어 정보 및 제1 가이드 정보를 생성하는 제2 대화형 서버를 포함하며, 상기 제1 전송 단계는, 상기 디스플레이 장치가, 상기 사용자 음성을 디지털 신호로 변환하는 단계; 상기 디스플레이 장치가, 상기 디지털 신호를 제1 대화형 서버로 전송하는 단계; 상기 제1 대화형 서버가, 상기 디지털 신호에 대응되는 텍스트 정보를 생성하여 상기 디스플레이 장치로 전송하는 단계; 및 상기 디스플레이 장치가, 상기 텍스트 정보를 상기 제2 대화형 서버로 전송하는 단계;를 포함할 수 있다.The interactive server includes a first interactive server for converting the input user voice into text information and a second interactive server for generating control information and first guide information according to the text information, The first transmission step may include: the display device converting the user's voice into a digital signal; The display device transmitting the digital signal to a first interactive server; The first interactive server generating text information corresponding to the digital signal and transmitting the text information to the display device; And transmitting, by the display device, the text information to the second interactive server.

*또한, 상기 제2 전송 단계는, 상기 사용자 음성이 상기 대화형 서버에 저장된 대화 패턴이 아닌 경우, 상기 대화형 서버가, 상기 사용자 음성과 동일한 기능을 수행하면서 상기 대화형 서버에 저장된 대화 패턴에 따르는 사용자 음성을 안내하는 제3 가이드 정보를 생성하여 상기 디스플레이 장치로 전송하는 단계;를 더 포함하며, 상기 디스플레이 장치가 상기 제3 가이드 정보를 디스플레이하는 단계;를 더 포함할 수 있다.In addition, the second transmitting step may include a step of, when the user's voice is not a conversation pattern stored in the interactive server, causing the interactive server to transmit the conversation pattern stored in the interactive server And generating third guide information for guiding the following user's voice to the display device, wherein the display device displays the third guide information.

그리고, 상기 제2 전송 단계는, 상기 사용자 음성이 상기 대화형 서버가 응답할 수 없는 대화형 음성인 경우, 상기 대화형 서버가 상기 사용자 음성으로부터 키워드를 추출하여 상기 키워드와 관련된 정보를 안내하는 제4 가이드 정보를 생성하여 상기 디스플레이 장치로 전송하는 단계;를 더 포함하며, 상기 디스플레이 장치가 상기 제4 가이드 정보를 디스플레이하는 단계;를 더 포함할 수 있다.The second transmission step may include a step of extracting a keyword from the user's voice and guiding information related to the keyword if the user's voice is an interactive voice that the interactive server can not respond to, 4 guide information and transmitting the generated guide information to the display device, wherein the display device displays the fourth guide information.

상술한 바와 같은 본 발명의 다양한 실시예에 의해, 효율적인 음성 인식을 위한 가이드 정보를 제공함으로써, 사용자는 음성 인식을 이용하여 더욱 효율적이고 신속하게 디스플레이 장치의 기능을 수행할 수 있게 된다.According to various embodiments of the present invention as described above, by providing guide information for efficient voice recognition, a user can perform functions of a display device more efficiently and quickly by using voice recognition.

도 1은 본 발명의 일 실시예에 따른, 음성 인식 시스템을 도시한 도면,
도 2는 본 발명의 일 실시예에 따른, 디스플레이 장치의 구성을 나타내는 블럭도,
도 3은 본 발명의 일 실시예에 따른, 음성 입력부의 구성을 나타내는 블럭도,
도 4 내지 도 7은 본 발명의 다양한 실시예에 따른, 가이드 정보를 도시한 도면,
도 8은 본 발명의 일 실시예에 따른, 대화형 서버의 구성을 나타내는 블럭도,
도 9는 본 발명의 일 실시예에 따른, 디스플레이 장치의 제어 방법을 설명하기 위한 흐름도,
도 10은 본 발명의 일 실시예에 따른, 음성 인식 시스템의 디스플레이 장치 제어 방법을 설명하기 위한 시퀀스도, 그리고,
도 11은 본 발명의 다른 실시예에 따른, 음성 인식 시스템을 도시한 도면이다.1 illustrates a speech recognition system, in accordance with an embodiment of the present invention;
2 is a block diagram showing the configuration of a display device according to an embodiment of the present invention;
3 is a block diagram illustrating a configuration of a voice input unit according to an embodiment of the present invention;
Figures 4-7 illustrate guide information, in accordance with various embodiments of the present invention,
8 is a block diagram showing the configuration of an interactive server according to an embodiment of the present invention;
9 is a flowchart illustrating a method of controlling a display device according to an embodiment of the present invention.
10 is a sequence diagram illustrating a method of controlling a display device of a speech recognition system according to an embodiment of the present invention,
11 is a diagram showing a speech recognition system according to another embodiment of the present invention.

이하에서는 도면을 참조하여 본 발명에 대해 상세히 설명하도록 한다. 도 1은 본 발명의 일 실시예에 따른, 음성 인식 시스템을 도시한 도면이다. 도 1에 도시된 바와 같이, 음성 인식 시스템(10)은 디스플레이 장치(100) 및 대화형 서버(200)를 포함한다. 이때, 디스플레이 장치는 스마트 TV로 구현될 수 있으나, 이는 일 실시예에 불과할 뿐, 스마트폰, 데스크 탑 PC, 태블릿 PC, 노트북 PC, 내비게이션 등과 같은 다양한 전자 장치로 구현될 수 있다.Hereinafter, the present invention will be described in detail with reference to the drawings. 1 is a diagram illustrating a speech recognition system according to an embodiment of the present invention. As shown in FIG. 1, the speech recognition system 10 includes a display device 100 and an interactive server 200. At this time, the display device may be implemented as a smart TV, but it may be realized by various electronic devices such as a smart phone, a desktop PC, a tablet PC, a notebook PC, a navigation device, and the like.

디스플레이 장치(100)는 사용자 음성을 인식하여 인식된 사용자 음성을 바탕으로 디스플레이 장치(100)의 기능을 수행할 수 있다. 특히, 디스플레이 장치(100)는 임베디드 방식 및 대화형 방식을 이용하여 사용자 음성에 따라 디스플레이 장치(100)의 기능을 수행할 수 있다.The display device 100 recognizes the user's voice and can perform the function of the display device 100 based on the recognized user's voice. In particular, the display device 100 may perform the functions of the display device 100 according to the user's voice using an embedded method and an interactive method.

구체적으로, 디스플레이 장치(100)는 사용자 음성을 인식하여 디스플레이 장치(100)의 기능을 수행하기 위한 명령어를 제어 정보와 매칭하여 저장한다. 예를 들어, 디스플레이 장치(100)는 "볼륨 올려"라는 명령어와 "오디오 볼륨 레벨을 기 설정된 레벨 증가"라는 제어 정보를 매칭하여 저장할 수 있다.Specifically, the display device 100 recognizes a user's voice and stores a command for performing a function of the display device 100, matching the control information. For example, the display apparatus 100 may store the command "raise volume" and the control information "increase the audio volume level to a predetermined level"

디스플레이 장치(100)에 사용자 음성이 입력되면, 디스플레이 장치(100)는 사용자 음성을 외부의 대화형 서버(200)에 전송하는 동시에 사용자 음성이 디스플레이 장치(100)에 기 저장된 명령어인지 여부를 판단할 수 있다. When the user's voice is input to the display device 100, the display device 100 transmits the user's voice to the external interactive server 200 and determines whether the user's voice is a command stored in the display device 100 .

대화형 서버(200)는 데이터베이스를 이용하여 디스플레이 장치(100)로부터 수신된 사용자 음성에 대응되는 제어 정보를 검색할 수 있다. 예를 들어, 수신된 사용자 음성이 "볼륨을 높여줘"인 경우, 대화형 서버(200)는 키워드인 "볼륨" 및 "높여"를 이용하여 "디스플레이 장치(100)에서 출력되는 오디오의 볼륨 레벨을 기설정된 레벨(예를 들어, 3 레벨)만큼 증가"라는 제어 정보를 검색할 수 있다.The interactive server 200 can search the control information corresponding to the user's voice received from the display device 100 using the database. For example, when the received user voice is "raise the volume ", the interactive server 200 sets the volume level of the audio output from the display device 100 to "Quot; increase by a predetermined level (for example, three levels) ".

이때, 대화형 서버(200)는 사용자 음성에 대응되는 제어 정보를 검색하는 동안 제어 정보와 동일한 기능을 수행할 수 있는 디스플레이 장치(100)에 기저장된 명령어가 있는지 여부를 판단할 수 있다. 제어 정보와 동일한 기능을 수행할 수 있는 디스플레이 장치(100)에 기저장된 명령어가 있는 경우, 대화형 서버(200)는 디스플레이 장치(100)에 기 저장된 명령어를 안내하는 제1 가이드 정보를 생성하여 제어 정보와 함께 디스플레이 장치(100)에 전송할 수 있다. 예를 들어, 대화형 서버(200)는 수신된 사용자 음성인 "볼륨을 높여줘"와 동일한 기능을 수행할 수 있는 디스플레이 장치(100)에 기 저장된 명령어를 검색하고, 검색된 명령어인 "볼륨 올려"라는 명령어를 사용자에게 안내하는 제1 가이드 정보를 생성할 수 있다. 그리고, 대화형 서버(200)는 제1 가이드 정보를 기설정된 레벨만큼 디스플레이 장치(100)의 오디오 볼륨 레벨을 증가시키는 제어 정보와 함께 디스플레이 장치(100)로 전송할 수 있다. 제1 가이드 정보를 통해 디스플레이 장치(100)에 기 저장된 명령어를 사용자가 발화하도록 유도함으로써, 디스플레이 장치(100)는 더욱 신속하게 사용자 음성에 응답할 수 있게 된다.At this time, the interactive server 200 can determine whether there is a pre-stored command in the display device 100 that can perform the same function as the control information while searching for the control information corresponding to the user's voice. If there is a pre-stored instruction in the display device 100 capable of performing the same function as the control information, the interactive server 200 generates the first guide information for guiding the pre-stored instruction to the display device 100, To the display device 100 together with the information. For example, the interactive server 200 searches for a pre-stored command in the display device 100 that can perform the same function as "raise the volume ", which is the received user voice, It is possible to generate the first guide information for guiding the command to the user. The interactive server 200 may transmit the first guide information to the display device 100 together with control information for increasing the audio volume level of the display device 100 by a predetermined level. The display device 100 can promptly respond to the user's voice by inducing the user to utter the command previously stored in the display device 100 through the first guide information.

한편, 사용자 음성이 디스플레이 장치(100)에 기 저장된 명령어가 아닌 경우, 디스플레이 장치(100)는 대화형 서버(200)로부터 전송되는 제어 정보에 따라 디스플레이 장치(100)의 기능을 수행할 수 있다. 예를 들어, 기 저장된 명령어가 "볼륨 올려"이나, 사용자가 "볼륨을 높여줘"라고 발화한 경우, 디스플레이 장치(100)는 대화형 서버(200)로부터 전송된 제어 정보를 바탕으로 기 설정된 레벨만큼 디스플레이 장치(100)에서 출력되는 오디오의 볼륨 레벨을 증가시키는 기능을 수행할 수 있다. 그리고, 디스플레이 장치(100)는 대화형 서버(200)로부터 전송된 제1 가이드 정보를 디스플레이할 수 있다.Meanwhile, when the user's voice is not a command previously stored in the display device 100, the display device 100 may perform the function of the display device 100 according to the control information transmitted from the interactive server 200. For example, when the pre-stored command is "volume up ", or when the user utters" raise the volume ", the display device 100 displays, on the basis of the control information transmitted from the interactive server 200, And to increase the volume level of the audio output from the display device 100. FIG. The display device 100 may display the first guide information transmitted from the interactive server 200. [

사용자 음성이 디스플레이 장치(100)에 기 저장된 명령어인 경우, 디스플레이 장치(100)는 대화형 서버(200)로부터 전송되는 제어 정보와 무관하게 기 저장된 명령어와 대응되는 제어 정보를 검색할 수 있다. 그리고, 디스플레이 장치(100)는 제어 정보에 따라 디스플레이 장치(100)의 기능을 수행할 수 있다. 예를 들어, 디스플레이 장치(100)에 기저장된 명령어인 "볼륨 올려"라는 사용자 음성이 입력된 경우, 디스플레이 장치(100)는 기 저장된 명령어에 대응되는 제어 정보를 검색하고, 검색된 제어 정보에 따라 디스플레이 장치(100)의 오디오 레벨을 기설정된 레벨만큼 증가시키는 기능을 수행할 수 있다.When the user's voice is a command previously stored in the display device 100, the display device 100 can retrieve the control information corresponding to the pre-stored command irrespective of the control information transmitted from the interactive server 200. [ The display device 100 may perform the function of the display device 100 according to the control information. For example, when a user voice called "volume up ", which is a command previously stored in the display device 100, is input, the display device 100 searches for control information corresponding to a previously stored command, And to increase the audio level of the apparatus 100 by a predetermined level.

특히, 사용자 음성이 디스플레이 장치(100)에 기 저장된 명령어이나, 복수의 계층 구조를 가지는 디스플레이 장치의 기능을 수행하기 위한 명령어인 경우, 디스플레이 장치(100)는 사용자 음성과 동일한 기능을 수행할 수 있는 대화형 명령어를 안내하는 제2 가이드 정보를 디스플레이할 수 있다. 이는 기저장된 명령어를 이용하여 복수의 계층 구조를 가지는 디스플레이 장치의 기능을 수행하는 경우, 여러 번의 사용자 음성을 입력받아야 하는 불편함이 존재하므로, 한 번의 대화형 명령을 통해 더욱 간편하게 디스플레이 장치(100)의 기능을 제어할 수 있게 하기 위함이다.In particular, when the user's voice is a command word previously stored in the display apparatus 100 or a command for performing a function of a display apparatus having a plurality of hierarchical structures, the display apparatus 100 can perform the same function as the user's voice It is possible to display the second guide information guiding the interactive command. In the case of performing a function of a display device having a plurality of hierarchical structures using pre-stored commands, there is an inconvenience that a plurality of user's voices must be input, so that the display device 100 can be more easily accessed through a single interactive command. So that it can control the functions of the system.

상술한 바와 같이 더욱 효율적이고 신속한 음성 인식 방법을 안내하는 가이드 정보를 제공함으로써, 사용자는 더욱 효율적이고 신속하게 디스플레이 장치(100)를 제어할 수 있게 된다.The user can control the display device 100 more efficiently and promptly by providing the guide information that guides the voice recognition method more efficiently and promptly as described above.

한편, 상술한 실시예에서는 사용자 음성이 기 저장된 명령어인지 여부와 무관하게 사용자 음성이 대화형 서버(200)로 전송되는 것으로 설명하였으나, 이는 일 실시예에 불과할 뿐, 사용자 음성이 기 저장된 명령어가 아닌 경우에만 사용자 음성을 대화형 서버(200)로 전송할 수 있다.Meanwhile, in the above-described embodiment, the user's voice is transmitted to the interactive server 200 irrespective of whether or not the user's voice is a pre-stored command. However, this is only an embodiment, It is possible to transmit the user's voice to the interactive server 200 only.

이하에서는 도 2 내지 도 7을 참조하여 디스플레이 장치(100)에 대해 더욱 상세히 설명하기로 한다. 도 2는 본 발명의 일 실시예에 따른, 디스플레이 장치(100)의 구성을 나타내는 블럭도이다. 디스플레이 장치(100)는 음성 입력부(110), 통신부(120), 저장부(130), 디스플레이부(140) 및 제어부(150)를 포함한다.Hereinafter, the display device 100 will be described in more detail with reference to FIGS. 2 to 7. FIG. 2 is a block diagram showing a configuration of a display apparatus 100 according to an embodiment of the present invention. The display device 100 includes a voice input unit 110, a communication unit 120, a storage unit 130, a display unit 140, and a control unit 150.

한편, 도 2는 디스플레이 장치(100)가 음성 인식 기능, 통신 기능, 디스플레이 기능 등과 같이 다양한 기능을 구비한 장치인 경우를 예로 들어, 각종 구성 요소들을 종합적으로 도시한 것이다. 따라서, 실시 예에 따라서는, 도 2에 도시된 구성 요소 중 일부는 생략 또는 변경될 수도 있고, 다른 구성요소가 더 추가될 수도 있다.2 is a block diagram illustrating various components of the display device 100 as an example of a device having various functions such as a voice recognition function, a communication function, a display function, and the like. Therefore, depending on the embodiment, some of the components shown in Fig. 2 may be omitted or changed, and other components may be further added.

음성 입력부(110)는 사용자 음성이 포함된 오디오 신호를 입력받고, 오디오 신호를 처리하여 사용자 음성 신호를 생성한다. 이때, 음성 입력부(110)는 디스플레이 장치(100)의 본체에 구비될 수 있으나, 이는 일 실시예에 불과할 뿐, 본체의 외부(예를 들어, 리모컨 또는 별도의 마이크 등)에 구비될 수 있다. 음성 입력부(110)가 본체의 외부에 구비되는 경우, 음성 입력부(110)는 유/무선 인터페이스(예를 들어, Wi-Fi, 블루투스 등)을 통해 생성된 사용자 음성 신호를 디스플레이 장치(100)의 본체에 전송할 수 있다.The voice input unit 110 receives an audio signal including a user voice and processes the audio signal to generate a user voice signal. At this time, the voice input unit 110 may be provided in the main body of the display device 100, but it may be provided in the outside of the main body (for example, a remote control or a separate microphone) only in an embodiment. When the voice input unit 110 is provided outside the main body, the voice input unit 110 transmits a user voice signal generated through a wired / wireless interface (for example, Wi-Fi or Bluetooth) Can be transmitted to the main body.

음성 입력부(110)가 사용자 음성이 포함된 오디오 신호를 입력받아 사용자 음성 신호를 생성하는 방법에 대해서는 도 3을 참조하여 설명하기로 한다. 도 3은 본 발명의 일 실시예에 따른, 음성 입력부의 구성을 나타내는 블럭도이다. 도 3에 도시된 바와 같이, 음성 입력부(110)는 마이크(111), ADC(Analog-Digital Converter)(112), 에너지 판단부(113), 노이즈 제거부(114) 및 음성신호 생성부(115)를 포함한다. A method in which the voice input unit 110 receives an audio signal including a user voice to generate a user voice signal will be described with reference to FIG. 3 is a block diagram illustrating a configuration of a voice input unit according to an embodiment of the present invention. 3, the voice input unit 110 includes a microphone 111, an analog-digital converter (ADC) 112, an energy determination unit 113, a noise removal unit 114, and a voice signal generation unit 115 ).

마이크(111)는 사용자 음성이 포함된 아날로그 형태의 오디오 신호를 입력받는다. The microphone 111 receives an analog audio signal including a user voice.

그리고, ADC(112)는 마이크로부터 입력된 다채널 아날로그 신호를 디지털 신호로 변환한다.The ADC 112 converts the multi-channel analog signal input from the microcomputer into a digital signal.

그리고, 에너지 판단부(113)는 변환된 디지털 신호의 에너지를 계산하여, 디지털 신호의 에너지가 기설정된 값 이상인지 여부를 판단한다. 디지털 신호의 에너지가 기설정된 값 이상인 경우, 에너지 판단부(113)는 입력된 디지털 신호를 노이즈 제거부(114)로 전송하고, 디지털 신호의 에너지가 기설정된 값 미만인 경우, 에너지 판단부(113)는 입력된 디지털 신호를 외부로 출력하지 않고, 다른 입력을 기다린다. 이에 의해, 음성 신호가 아닌 소리에 의해 전체 오디오 처리 과정이 활성화되지 않아, 불필요한 전력 소모를 방지할 수 있다.The energy determining unit 113 calculates the energy of the converted digital signal to determine whether the energy of the digital signal is equal to or greater than a predetermined value. The energy determining unit 113 transmits the input digital signal to the noise removing unit 114. When the energy of the digital signal is less than a preset value, Does not output the input digital signal to the outside, but waits for another input. As a result, the entire audio processing process is not activated by sound other than a voice signal, thereby preventing unnecessary power consumption.

노이즈 제거부(114)에 입력된 디지털 신호가 입력된 경우, 노이즈 제거부(114)는 노이즈 성분과 사용자 음성 성분이 포함된 디지털 신호 중 노이즈 성분을 제거한다. 이때, 노이즈 성분은 가정 환경에서 발생할 수 있는 돌발성 잡음으로써, 에어컨 소리, 청소기 소리, 음악 소리 등이 포함될 수 있다. 그리고, 노이즈 제거부(114)는 노이즈 성분이 제거된 디지털 신호를 음성 신호 생성부(115)로 출력한다.When the digital signal input to the noise removing unit 114 is inputted, the noise removing unit 114 removes the noise component from the digital signal including the noise component and the user voice component. At this time, the noise component is sudden noise that may occur in a home environment, and may include air conditioner sound, cleaner sound, music sound, and the like. Then, the noise removing unit 114 outputs the digital signal from which the noise component has been removed to the audio signal generating unit 115.

음성 신호 생성부(115)는 Localization/Speaker Tracking 모듈을 이용하여 음성 입력부(110)를 기준으로 360˚ 범위 내에 존재하는 사용자의 발화 위치를 추적하여 사용자 음성에 대한 방향 정보를 구한다. 그리고, 음성 신호 생성부(115)는 Target Spoken Sound Extraction 모듈을 통해 노이즈가 제거된 디지털 신호와 사용자 음성에 대한 방향 정보를 이용하여 음성 입력부(110)를 기준으로 360˚ 범위 내에 존재하는 목표 음원을 추출하여 음성 신호를 생성할 수 있다.The voice signal generation unit 115 tracks a user's utterance position within a range of 360 degrees based on the voice input unit 110 using a Localization / Speaker Tracking module to obtain direction information on the user voice. The audio signal generation unit 115 generates a target sound source existing within a 360-degree range based on the audio input unit 110 using the digital signal from which the noise is removed and the direction information on the user audio through the Target Spoken Sound Extraction module So that a voice signal can be generated.

한편, 상술한 바와 같이, 불필요한 주변의 노이즈를 제거하여 음성 신호를 생성하는 것은 일 실시예에 불과할 뿐, 사용자 음성에 키워드가 존재하는지 여부를 판단하여 음성 신호를 생성하는 실시예 역시 본 발명의 기술적 사상이 적용될 수 있다.As described above, the method of generating a speech signal by determining whether or not a keyword exists in the user's voice is performed only by removing the unnecessary peripheral noise to generate a speech signal. Thoughts can be applied.

다시 도 2에 대해 설명하면, 통신부(120)는 대화형 서버(200)와 통신을 수행한다. 특히, 통신부(120)는 음성 입력부(110)에서 생성된 사용자 음성 신호를 대화형 서버(200)에 전송하며, 대화형 서버(200)로부터 제어 정보 및 가이드 정보 중 적어도 하나를 수신할 수 있다. 이때, 통신부(120)는 이더넷(Ethernet), 무선랜, Wi-Fi 등으로 구현될 수 있으나, 이에 한정되는 것은 아니다.Referring again to FIG. 2, the communication unit 120 performs communication with the interactive server 200. FIG. In particular, the communication unit 120 may transmit the user voice signal generated by the voice input unit 110 to the interactive server 200, and may receive at least one of the control information and the guide information from the interactive server 200. At this time, the communication unit 120 may be implemented by Ethernet, wireless LAN, Wi-Fi, etc., but is not limited thereto.

저장부(130)는 디스플레이 장치(100)를 구동하기 위한 다양한 프로그램 및 데이터를 저장하고 있다. 특히, 저장부(130)는 명령어와 제어 정보가 매칭되어 저장되는 음성 인식 데이터베이스를 포함할 수 있다.The storage unit 130 stores various programs and data for driving the display device 100. In particular, the storage unit 130 may include a speech recognition database in which commands and control information are matched and stored.

디스플레이부(130)는 제어부(150)의 제어에 의해 영상 데이터를 디스플레이한다. 특히, 디스플레이부(130)는 기 저장된 가이드 정보 및 대화형 서버(200)로부터 수신된 가이드 정보 중 하나를 디스플레이할 수 있다.The display unit 130 displays the image data under the control of the controller 150. In particular, the display unit 130 may display one of the pre-stored guide information and the guide information received from the interactive server 200.

제어부(150)는 사용자 명령에 따라 디스플레이 장치(100)의 전반적인 동작을 제어한다. 특히, 제어부(150)는 음성 입력부(110)를 통해 입력된 사용자 음성에 따라 디스플레이 장치(100)의 전반적인 동작을 제어할 수 있다.The control unit 150 controls the overall operation of the display device 100 according to a user command. In particular, the control unit 150 may control the overall operation of the display device 100 according to the user's voice input through the voice input unit 110. [

구체적으로, 제어부(150)는 음성 입력부(110)를 통해 입력된 사용자 음성이 저장부(130)에 기 저장된 명령어인지 여부를 판단한다. 그와 동시에, 제어부(150)는 사용자 음성을 통신부(120)를 통해 대화형 서버(200)로 전송할 수 있다. 예를 들어, 음성 입력부(110)를 통해 "볼륨을 높여줘"라는 사용자 음성이 입력되면, 제어부(150)는 입력된 "볼륨을 높여줘"가 기 저장된 명령어인지 여부를 판단한다. 그리고, 제어부(150)는 "볼륨을 높여줘"를 외부의 대화형 서버(200)로 전송하도록 통신부(120)를 제어할 수 있다.Specifically, the control unit 150 determines whether the user's voice input through the voice input unit 110 is a command stored in the storage unit 130 in advance. At the same time, the control unit 150 can transmit the user's voice to the interactive server 200 through the communication unit 120. [ For example, when a user voice called "raise the volume" is input through the voice input unit 110, the controller 150 determines whether the input "boost volume" Then, the control unit 150 can control the communication unit 120 to transmit the "Raise Volume" to the external interactive server 200. [

특히, 사용자 음성이 저장부에 기 저장된 명령어가 아닌 경우, 대화형 서버(200)로부터 사용자 음성에 대응되는 제어 정보 및 사용자 음성과 동일한 기능을 수행할 수 있는 기 저장된 명령어를 안내하는 제1 가이드 정보가 전송되면, 제어부(150)는 대화형 서버(200)로부터 전송된 제어 정보에 따라 디스플레이 장치(100)의 기능을 수행하고, 제1 가이드 정보를 디스플레이하도록 디스플레이부(120)를 제어할 수 있다. 예를 들어, 사용자 음성이 "볼륨을 높여줘"인 경우, 대화형 서버(200)로부터 "오디오 볼륨 레벨을 기설정된 레벨 증가"라는 제어 정보 및 "볼륨을 높여줘"와 동일한 기능을 수행하며 저장부(130)에 저장된 명령어인 "볼륨 올려"를 안내하는 제1 가이드 정보가 수신되면, 제어부(150)는 오디오 볼륨 레벨을 기설정된 레벨만큼 증가시키는 기능을 수행할 수 있으며, 도 4에 도시된 바와 같이, "다음부터는 "볼륨 올려"로 말해주세요."라는 텍스트 정보가 포함된 제1 가이드 정보(410)를 디스플레이하도록 디스플레이부(120)를 제어할 수 있다. In particular, when the user's voice is not a pre-stored command in the storage unit, the interactive server 200 transmits control information corresponding to the user's voice and first guide information The control unit 150 may control the display unit 120 to display the first guide information by performing the function of the display device 100 according to the control information transmitted from the interactive server 200 . For example, when the user voice is "raise the volume ", the interactive server 200 performs the same function as" raise the volume level to a predetermined level " The control unit 150 may perform a function of increasing the audio volume level by a predetermined level when the first guide information for guiding the command "volume up" stored in the storage unit 130 is received. As shown in FIG. 4, Quot ;, "Please say with the volume up ", " Tell the user "

반면, 사용자 음성이 저장부(130)에 기 저장된 명령어인 경우, 제어부(150)는 저장부(130)에 저장된 명령어에 대응되는 제어 정보를 검색하고, 검색된 제어 정보에 따라 디스플레이 장치의 기능을 수행할 수 있다. 예를 들어, 음성 입력부(110)를 통해 입력된 사용자 음성이 "볼륨 올려"인 경우, 제어부(150)는 입력된 사용자 음성인 "볼륨 올려"와 대응되는 제어 정보인 "오디오 볼륨 레벨을 기설정된 레벨 증가"를 검색하고, 검색된 제어 정보에 따라 오디오 볼륨 레벨을 기설정된 레벨만큼 증가시키는 기능을 수행할 수 있다. 이때, 제어부(150)는 외부 대화형 서버(200)로부터 제어 정보가 수신되더라도 우선적으로 임베디드된 명령어에 따라 디스플레이 장치(100)의 기능을 수행할 수 있다. On the other hand, when the user's voice is an instruction word previously stored in the storage unit 130, the control unit 150 searches for control information corresponding to the instruction stored in the storage unit 130, and performs a function of the display device according to the retrieved control information can do. For example, when the user's voice input through the voice input unit 110 is "volume up ", the control unit 150 sets the audio volume level, which is control information corresponding to the input user voice & Level increase "and increase the audio volume level by a predetermined level according to the retrieved control information. At this time, the control unit 150 can perform the function of the display device 100 according to an instruction word embedded preferentially even when the control information is received from the external interactive server 200. [

또한, 음성 입력부(110)를 통해 입력된 사용자 음성이 저장부(130)에 저장된 명령어이며, 사용자 음성이 복수의 계층구조를 가지는 디스플레이 장치의 기능을 제어하기 위한 명령어인 경우, 제어부(150)는 사용자 음성과 동일한 기능을 수행할 수 있는 대화형 명령어를 안내하는 제2 가이드 정보를 디스플레이하도록 디스플레이부(120)를 제어할 수 있다. 예를 들어, 기 저장된 명령어를 이용하여 디스플레이 장치(100)의 취침 기능을 설정하기 위해, 사용자로부터 "취침 설정"이라는 1단계 사용자 음성이 입력되면, 제어부(150)는 취침 설정을 위한 메뉴를 디스플레이하고, 사용자로부터 "30분"이라는 2단계 사용자 음성이 입력되면, 제어부(150)는 30분 뒤 디스플레이 장치(100)의 전원을 끄는 기능을 수행할 수 있다. 즉, 사용자는 기 저장된 명령어를 이용하여 복수의 계층 구조를 가지는 디스플레이 장치의 기능을 수행하는 경우 복수의 사용자 음성을 입력해야 하는 불편함이 존재한다. 그러나, "30분 후에 깨워줘"라는 대화형 방식의 사용자 음성이 입력된 경우, 제어부(150)는 대화형 서버(200)를 이용하여 복수의 사용자 음성을 입력하는 것과 동일한 기능을 수행할 수 있다. 즉, 복수의 계층 구조를 가지는 디스플레이 장치의 기능을 수행하는 경우, 제어부(150)는 한 번의 사용자 음성을 통해 디스플레이 장치(100)의 기능을 수행할 수 있도록 도 5에 도시된 바와 같은 대화형 명령어를 안내하는 제2 가이드 정보(510)를 디스플레이하도록 디스플레이부(120)를 제어할 수 있다.If the user's voice inputted through the voice input unit 110 is an instruction word stored in the storage unit 130 and the user's voice is a command for controlling the functions of the display devices having a plurality of hierarchical structures, It is possible to control the display unit 120 to display the second guide information that guides the interactive command that can perform the same function as the user voice. For example, when the user inputs a one-step user voice called "sleep setting " from the user to set the sleep function of the display device 100 using the pre-stored command, the controller 150 displays a menu for the sleep setting And if the user inputs a second level user voice of "30 minutes" from the user, the controller 150 can perform a function of turning off the power of the display device 100 after 30 minutes. That is, when a user performs a function of a display device having a plurality of hierarchical structures using pre-stored commands, there is an inconvenience to input a plurality of user's voices. However, if an interactive user's voice of "wake up after 30 minutes" is inputted, the controller 150 can perform the same function as inputting a plurality of user's voices by using the interactive server 200 . That is, when performing a function of a display device having a plurality of hierarchical structures, the controller 150 controls the display device 100 to perform the functions of the display device 100 through a single user's voice, The display unit 120 may be controlled to display the second guide information 510 guiding the user.

뿐만 아니라, 음성 입력부(110)를 통해 입력된 사용자 음성이 대화형 서버(200)에 저장된 대화 패턴이 아닌 경우, 대화형 서버(200)로부터 사용자 음성과 동일한 기능을 수행하면서 대화형 서버(200)에 저장된 대화 패턴에 따르는 사용자 음성을 안내하는 제3 가이드 정보가 전송되면, 제어부(150)는 제3 가이드 정보를 디스플레이하도록 디스플레이부(120)를 제어할 수 있다. 예를 들어, "바꿔 채널"이라는 사용자 음성이 입력된 경우, 대화형 서버(200)로부터 대화형 서버(200)에 저장된 대화 패턴의 명령어인 "채널을 ooo로 바꿔줘"라는 사용자 음성을 안내하는 제3 가이드 정보가 전송되면, 제어부(150)는 도 6에 도시된 바와 같은 제3 가이드 정보(610)를 디스플레이하도록 디스플레이부(120)를 제어할 수 있다.In addition, when the user's voice input through the voice input unit 110 is not the conversation pattern stored in the interactive server 200, the interactive server 200 performs the same function as the user voice, The controller 150 may control the display unit 120 to display the third guide information when the third guide information for guiding the user voice according to the conversation pattern stored in the first guide information is transmitted. For example, when a user voice called "exchange channel" is input, a message for instructing a user voice "exchange channel to ooo" as a command of a conversation pattern stored in the interactive server 200 from the interactive server 200 When the third guide information is transmitted, the control unit 150 may control the display unit 120 to display the third guide information 610 as shown in FIG.

또는, 음성 입력부(110)를 통해 입력된 사용자 음성이 대화형 서버(200)가 응답할 수 없는 대화형 음성인 경우, 대화형 서버(200)로부터 사용자 음성에 포함된 키워드와 관련된 정보를 안내하는 제4 가이드 정보가 전송되면, 제어부(150)는 제4 가이드 정보를 디스플레이부(120)에 디스플레이하도록 제어할 수 있다. 예를 들어, 음성 입력부(110)를 통해 "유재석 어때"라는 사용자 음성이 입력된 경우, 대화형 서버(200)는 입력된 사용자 음성을 통해 응답 정보를 생성할 수 없으므로, 입력된 사용자 음성으로부터 키워드인 "유재석"을 추출하고, 추출된 키워드와 관련된 정보(예를 들어, 직업, 출연작 등)를 안내하는 제4 가이드 정보를 생성하여 디스플레이 장치(100)로 전송할 수 있다. 대화형 서버(200)로부터 제4 가이드 정보가 디스플레이되면, 제어부(150)는 도 7에 도시된 바와 같은 제4 가이드 정보(710)를 디스플레이하도록 디스플레이부(120)를 제어할 수 있다. 또 다른 예로, 음성 입력부(110)를 통해 "맛집 갈까"라는 사용자 음성이 입력된 경우, 대화형 서버(200)는 입력된 사용자 음성을 통해 응답 정보를 생성할 수 없으므로, 입력된 사용자 음성으로부터 키워드인 "맛집"을 추출하고, 추출된 키워드와 관련된 정보(예를 들어, 오늘의 추천 맛집)를 안내하는 제 4 가이드 정보를 생성하여 디스플레이 장치(100)로 전송할 수 있다. Alternatively, when the user's voice inputted through the voice input unit 110 is an interactive voice that can not be answered by the interactive server 200, the information related to the keyword included in the user voice is guided from the interactive server 200 When the fourth guide information is transmitted, the control unit 150 may control the display unit 120 to display the fourth guide information. For example, when a user voice called "when there is a bad luck" is input through the voice input unit 110, the interactive server 200 can not generate response information through the inputted user voice, And generates fourth guide information for guiding information related to the extracted keyword (e.g., job, performance, etc.) to the display device 100. [0050] When the fourth guide information is displayed from the interactive server 200, the control unit 150 may control the display unit 120 to display the fourth guide information 710 as shown in FIG. As another example, when a user voice called "go to the restaurant" is inputted through the voice input unit 110, the interactive server 200 can not generate response information through the inputted user voice, , And transmits the generated fourth guide information to the display device 100. The fourth guide information may include information about the extracted keyword (e.g., today's recommended restaurant).

이때, 제4 가이드 정보(710)는 키워드와 관련된 정보뿐만 아니라, 새로운 사용자 음성의 입력을 요구하는 메시지를 포함할 수 있다.At this time, the fourth guide information 710 may include a message requesting input of a new user voice as well as information related to the keyword.

상술한 바와 같은 디스플레이 장치(100)에 의해, 사용자는 음성 인식을 이용하여 더욱 효율적이고 신속하게 디스플레이 장치(100)를 제어할 수 있게 된다.With the display device 100 as described above, the user can control the display device 100 more efficiently and quickly by using the voice recognition.

도 8은 본 발명의 일 실시예에 따른, 대화형 서버(200)의 구성을 나타내는 블럭도이다. 도 8에 도시된 바와 같이, 대화형 서버(200)는 통신부(210), 데이터베이스(220) 및 제어부(230)를 포함한다.8 is a block diagram showing the configuration of an interactive server 200 according to an embodiment of the present invention. As shown in FIG. 8, the interactive server 200 includes a communication unit 210, a database 220, and a control unit 230.

통신부(210)는 디스플레이 장치(100)와 통신을 수행한다. 특히, 통신부(210)디스플레이 장치(100)로부터 사용자 음성 신호를 수신하며, 제어 정보 및 가이드 정보 중 적어도 하나를 디스플레이 장치(100)로 전송할 수 있다. 이때, 통신부(120)는 이더넷(Ethernet), 무선랜, Wi-Fi 등과 같은 통신 방식을 이용하여 디스플레이 장치(100)와 통신을 수행할 수 있다.The communication unit 210 performs communication with the display device 100. In particular, the communication unit 210 may receive a user voice signal from the display device 100, and may transmit at least one of the control information and the guide information to the display device 100. At this time, the communication unit 120 can perform communication with the display device 100 using a communication method such as Ethernet, wireless LAN, Wi-Fi, or the like.

데이터베이스(220)는 대화형 음성을 이용하여 디스플레이 장치(100)의 기능을 제어하거나 컨텐츠를 검색하기 위하여, 다양한 데이터를 저장한다. 특히, 데이터베이스(220)는 사용자 음성 이력 정보 및 EPG 정보와 같은 정보를 저장할 수 있다. 또한, 데이터베이스(22)는 사용자 음성 및 제어 정보를 매칭하여 저장할 수 있다.The database 220 stores various data for controlling the functions of the display device 100 using the interactive voice or for searching contents. In particular, the database 220 may store information such as user voice history information and EPG information. In addition, the database 22 may store and store user voice and control information.

또한, 대화형 서버(200)가 제1 가이드 정보를 제공할 수 있도록 데이터베이스(220)는 디스플레이 장치(100)에 기 저장된 명령어와 유사한 명령어를 표 1과 같이, 저장할 수 있다. In addition, the database 220 may store commands similar to those stored in the display device 100 as shown in Table 1 so that the interactive server 200 can provide the first guide information.

디스플레이 장치에 기 저장된 명령어A command pre-stored in the display device 유사 명령어Similar commands 볼륨 올려Raise the volume 볼륨 높여, 볼륨 키워, 볼륨 증가, 소리 키워, 소리 높여, 크게 틀어줘 등Increase volume, increase volume, increase volume, raise sound, raise sound, turn loud, etc. 음소거Mute 소리 꺼, 볼륨 꺼 등등Sounds turned off, volume turned off, etc.

제어부(230)는 대화형 서버(200)의 전반적인 동작을 제어한다. The control unit 230 controls the overall operation of the interactive server 200.

특히, 디스플레이 장치(100)로부터 사용자 음성이 수신되면, 제어부(230)는 사용자 음성에 대응되는 제어 정보를 검색한다. 구체적으로, 제어부(230)는 사용자 음성을 텍스트 정보로 변환한 후, 사용자 음성의 발화 요소를 분류할 수 있다. 그리고, 제어부(230)는 발화 요소를 이용하여 사용자 음성이 디스플레이 장치(100)의 기능을 제어하기 위한 사용자 음성인지, 컨텐츠 검색을 위한 사용자 음성인지 여부를 판단할 수 있다. 그리고, 사용자 음성이 디스플레이 장치(100)의 기능을 제어하기 위한 사용자 음성인 경우, 제어부(230)는 데이터베이스(220)를 이용하여 사용자 음성에 대응되는 제어 정보를 검색할 수 있다.In particular, when the user's voice is received from the display device 100, the control unit 230 searches for control information corresponding to the user's voice. Specifically, the control unit 230 can classify a user's utterance as a user's voice after converting the user's voice into text information. The control unit 230 can determine whether the user's voice is a user's voice for controlling the function of the display device 100 or a user's voice for searching for contents using a speech element. If the user's voice is a user's voice for controlling the function of the display device 100, the control unit 230 can search the control information corresponding to the user's voice using the database 220. [

제어부(230)는 사용자 음성에 대응되는 제어 정보를 검색하는 동안 제어 정보와 동일한 기능을 수행할 수 있는 디스플레이 장치(100)에 기저장된 명령어가 있는지 여부를 판단한다. 그리고, 제어 정보와 동일한 기능을 수행할 수 있는 디스플레이 장치(100)에 기저장된 명령어가 존재하는 경우, 제어부(230)는 기 저장된 명령어를 안내하는 제1 가이드 정보를 생성하여 제어 정보와 함께 디스플레이 장치(100)에 전송하도록 통신부(210)를 제어할 수 있다. 예를 들어, 사용자 음성이 "볼륨 높여"인 경우, 제어부(230)는 "볼륨 높여"와 동일한 기능을 수행할 수 있는 디스플레이 장치(100)에 기 저장된 명령어가 있는지 여부를 검색하고, "볼륨 높여"와 동일한 기능을 수행하면서 디스플레이 장치(100)에 기 저장된 명령어인 "볼륨 올려"를 안내하는 제1 가이드 정보를 생성할 수 있다.The control unit 230 determines whether there is a pre-stored command in the display device 100 that can perform the same function as the control information while searching for the control information corresponding to the user's voice. When there is a command already stored in the display device 100 capable of performing the same function as the control information, the controller 230 generates the first guide information for guiding the previously stored command, The communication unit 210 may control the communication unit 210 to transmit the data to the communication unit 100. For example, when the user voice is "increased in volume ", the control unit 230 searches the display device 100 capable of performing the same function as" raise the volume " Quot; volume up ", which is a command previously stored in the display device 100, while performing the same function as "

또한, 디스플레이 장치(100)로부터 전송된 사용자 음성이 대화형 서버(200)에 저장된 대화 패턴이 아닌 경우, 제어부(230)는 사용자 음성과 동일한 기능을 수행하면서 대화형 서버(200)에 저장된 대화 패턴에 따르는 사용자 음성을 안내하는 제3 가이드 정보를 생성하여 디스플레이 장치로 전송하도록 통신부(210)를 제어할 수 있다. 예를 들어, "바꿔 채널"이라는 사용자 음성이 입력된 경우, 제어부(230)는 데이터베이스(220)에 저장된 대화 패턴이 아님을 판단한다. 그리고, 제어부(230)는 데이터베이스(220)에 저장된 대화 패턴으로 사용자가 발화하는 것을 유도하기 위해, "채널을 ooo로 바꿔줘"라는 사용자 음성을 안내하는 제3 가이드 정보를 생성할 수 있다.If the user's voice transmitted from the display device 100 is not a conversation pattern stored in the interactive server 200, the controller 230 performs the same function as the user's voice, The third guide information for guiding the user's voice according to the user's voice, and control the communication unit 210 to transmit the generated third guide information to the display device. For example, when a user voice called "replace channel" is input, the controller 230 determines that the pattern is not a conversation pattern stored in the database 220. The control unit 230 may generate third guide information for guiding a user voice "change the channel to ooo" in order to induce the user to utter the conversation pattern stored in the database 220.

또한, 디스플레이 장치(100)로부터 전송된 사용자 음성이 대화형 서버(200)가 응답할 수 없는 대화형 음성인 경우, 제어부(230)는 대화형 사용자 음성으로부터 키워드를 추출하고, 키워드와 관련된 정보를 안내하는 제4 가이드 정보를 생성하여 디스플레이 장치(100)로 전송하도록 통신부(210)를 제어할 수 있다.If the user's voice transmitted from the display device 100 is an interactive voice that the interactive server 200 can not respond to, the control unit 230 extracts keywords from the interactive user's voice, It is possible to control the communication unit 210 to transmit the generated fourth guide information to the display device 100. [

예를 들어, 디스플레이 장치(100)로부터 "유재석 어때"라는 사용자 음성이 전송된 경우, 제어부(230)는 입력된 사용자 음성을 통해 응답 정보를 생성할 수 없으므로, 입력된 사용자 음성으로부터 키워드인 "유재석"을 추출하고, 추출된 키워드와 관련된 정보(예를 들어, 직업, 출연작 등)를 안내하는 제4 가이드 정보를 생성하여 디스플레이 장치(100)로 전송할 수 있다. 또 다른 예로, 디스플레이 장치(100)로부터 "맛집 갈까"라는 사용자 음성이 입력된 경우, 제어부(230)는 입력된 사용자 음성을 통해 응답 정보를 생성할 수 없으므로, 입력된 사용자 음성으로부터 키워드인 "맛집"을 추출하고, 추출된 키워드와 관련된 정보(예를 들어, 오늘의 추천 맛집)를 안내하는 제 4 가이드 정보를 생성하여 디스플레이 장치(100)로 전송할 수 있다. For example, in the case where a user voice called "bad news stakeout" is transmitted from the display device 100, the control unit 230 can not generate response information through the inputted user voice, Quot ;, and fourth guide information for guiding information related to the extracted keyword (e.g., job, performance, etc.) to the display device 100. As another example, in the case where the user's voice "go hungry" is inputted from the display device 100, the control unit 230 can not generate the response information through the inputted user's voice. Therefore, And transmits the generated fourth guide information to the display device 100. The fourth guide information may be transmitted to the display device 100. The fourth guide information may include information related to the extracted keyword (e.g., today's recommended restaurant).

상술한 바와 같이 대화형 서버(200)가 다양한 가이드 정보를 제공함으로써, 사용자는 음성 인식을 이용하여 더욱 효율적이고 신속하게 디스플레이 장치(100)의 기능을 제어할 수 있게 된다.As described above, since the interactive server 200 provides various guide information, the user can control the functions of the display device 100 more efficiently and quickly by using the voice recognition.

도 9는 본 발명의 일 실시예에 따른, 디스플레이 장치(100)의 제어 방법을 설명하기 위한 흐름도이다.9 is a flowchart for explaining a control method of the display apparatus 100 according to an embodiment of the present invention.

디스플레이 장치(100)는 사용자 음성을 입력받는다(S910). 이때, 사용자 음성은 볼륨 제어, 채널 제어, 전원 제어와 같은 디스플레이 장치(100)의 기능을 제어하기 위한 명령어일 수 있다.The display device 100 receives the user's voice (S910). At this time, the user voice may be a command for controlling functions of the display device 100 such as volume control, channel control, and power control.

그리고, 디스플레이 장치(100)는 사용자 음성을 대화형 서버(200)로 전송한다(S920). 그리고, 디스플레이 장치(100)는 사용자 음성이 기 저장된 명령어인지 여부를 판단한다(S930). 이때, S920 단계 및 S930 단계는 동시에 수행될 수 있다.Then, the display device 100 transmits the user's voice to the interactive server 200 (S920). Then, the display device 100 determines whether the user's voice is a pre-stored command (S930). At this time, steps S920 and S930 may be performed simultaneously.

사용자 음성이 기 저장된 명령어인 경우(S930-Y), 디스플레이 장치(100)는 기 저장된 명령어에 따라 디스플레이 장치(100)의 기능을 수행한다(S940).If the user's voice is a pre-stored command (S930-Y), the display apparatus 100 performs the function of the display apparatus 100 according to the pre-stored command (S940).

사용자 음성이 기 저장된 명령어가 아닌 경우, 디스플레이 장치(100)는 대화형 서버(200)로부터 제어 정보 및 제1 가이드 정보를 수신한다(S950). 이때, 제1 가이드 정보는 사용자 음성과 동일한 기능을 수행하면서 디스플레이 장치(100)에 기 저장된 명령어를 안내하는 정보일 수 있다.If the user's voice is not a pre-stored command, the display apparatus 100 receives the control information and the first guide information from the interactive server 200 (S950). At this time, the first guide information may be information for guiding the pre-stored command to the display device 100 while performing the same function as the user's voice.

디스플레이 장치(100)는 수신된 제어 정보에 따라 디스플레이 장치의 기능을 수행하고, 제1 가이드 정보를 디스플레이한다(S960).The display device 100 performs the function of the display device according to the received control information, and displays the first guide information (S960).

한편, 기저장된 명령어에 따라 디스플레이 장치(100)의 기능을 수행하는 경우, 디스플레이 장치(100)는 디스플레이 장치(100)의 복수의 계층 구조를 가지는 디스플레이 장치(100)의 기능인지 여부를 판단한다(S970).Meanwhile, when performing the function of the display device 100 according to pre-stored commands, the display device 100 determines whether or not it is a function of the display device 100 having a plurality of hierarchical structures of the display device 100 S970).

복수의 계층 구조를 가지는 디스플레이 장치(100)의 기능인 경우(S970-Y), 디스플레이 장치(100)는 제2 가이드 정보를 디스플레이한다(S980). 이때, 제2 가이드 정보는 사용자 음성과 동일한 기능을 수행하면서 대화형 서버(200)를 이용할 수 있는 대화형 명령어를 안내하는 정보일 수 있다.In the case of the function of the display device 100 having a plurality of hierarchical structures (S970-Y), the display device 100 displays the second guide information (S980). At this time, the second guide information may be information for guiding an interactive command word that can use the interactive server 200 while performing the same function as the user voice.

도 10은 본 발명의 일 실시예에 따른, 음성 인식 시스템의 디스플레이 장치 제어 방법을 설명하기 위한 시퀀스도이다. 10 is a sequence diagram illustrating a method of controlling a display device of a speech recognition system according to an embodiment of the present invention.

우선, 디스플레이 장치(100)는 사용자 음성을 입력받는다(S1010).First, the display device 100 receives a user's voice (S1010).

그리고, 디스플레이 장치(100)는 입력된 사용자 음성을 대화형 서버(200)로 전송한다(S1020). 그와 동시에, 디스플레이 장치(100)는 사용자 음성이 기 저장된 명령어인지 여부를 판단한다(S1030).Then, the display apparatus 100 transmits the input user voice to the interactive server 200 (S1020). At the same time, the display apparatus 100 determines whether the user voice is a pre-stored command (S1030).

대화형 서버(200)는 사용자 음성에 대응되는 제어 정보 및 가이드 정보를 생성한다(S1040). 구체적으로, 대화형 서버(200)는 사용자 음성의 발화 요소를 분석하여 사용자 음성에 대응되는 제어 정보를 생성할 수 있으며, 사용자 음성의 유형에 따라 다양한 가이드 정보를 생성할 수 있다. 예를 들어, 사용자 음성이 디스플레이 장치(100)에 기 저장된 명령어가 아닌 경우, 대화형 서버(200)는 사용자 음성과 동일한 기능을 수행할 수 있는 디스플레이 장치에 기 저장된 명령어를 안내하는 제1 가이드 정보를 생성할 수 있다. 또는, 사용자 음성이 대화형 서버(200)에 저장된 대화 패턴이 아닌 경우, 대화형 서버(200)는 사용자 음성과 동일한 기능을 수행하면서 대화형 서버에 저장된 대화 패턴에 따르는 사용자 음성을 안내하는 제3 가이드 정보를 생성할 수 있다. 또는, 사용자 음성이 대화형 서버가 응답할 수 없는 대화형 음성인 경우, 대화형 서버(200)는 사용자 음성으로부터 키워드를 추출하여 키워드와 관련된 정보를 안내하는 제4 가이드 정보를 생성할 수 있다.The interactive server 200 generates control information and guide information corresponding to the user's voice (S1040). Specifically, the interactive server 200 can generate control information corresponding to the user's voice by analyzing the speech element of the user's voice, and can generate various guide information according to the type of the user's voice. For example, if the user's voice is not an instruction pre-stored in the display device 100, the interactive server 200 may display the first guide information < RTI ID = 0.0 >Lt; / RTI > Alternatively, when the user's voice is not a conversation pattern stored in the interactive server 200, the interactive server 200 may perform a third function of guiding the user's voice according to the conversation pattern stored in the interactive server, Guide information can be generated. Alternatively, if the user voice is an interactive voice that the interactive server can not respond to, the interactive server 200 may extract the keyword from the user voice and generate the fourth guide information for guiding the information related to the keyword.

그리고, 대화형 서버(200)는 제어 정보 및 가이드 정보를 디스플레이 장치(100)로 전송한다(S1050).Then, the interactive server 200 transmits the control information and the guide information to the display device 100 (S1050).

디스플레이 장치(100)는 사용자 음성에 따라 디스플레이 장치의 기능을 수행하고, 가이드 정보를 디스플레이한다(S1060). 구체적으로, 디스플레이 장치(100)는 사용자 음성이 기 저장된 명령어인지 여부에 따라 상이한 제어 정보를 이용하여 디스플레이 장치(100)의 기능을 수행할 수 있다. 사용자 음성이 기 저장된 명령어인 경우, 디스플레이 장치(100)는 기 저장된 명령어에 대응되는 제어 정보를 검색하여 검색된 제어 정보에 따라 디스플레이 장치(100)의 기능을 수행할 수 있다. 반면, 사용자 음성이 기 저장된 명령어가 아닌 경우, 디스플레이 장치(100)는 대화형 서버(200)로부터 전송된 제어 정보에 따라 디스플레이 장치(100)의 기능을 수행할 수 있다. 또한, 디스플레이 장치(100)는 사용자가 더욱 효율적이고 신속하게 음성 인식을 수행할 수 있도록 도 4 내지 도 7에서 설명한 바와 같은 가이드 정보(410,510,610,710)를 디스플레이할 수 있다.The display apparatus 100 performs the function of the display apparatus according to the user's voice and displays the guide information (S1060). Specifically, the display apparatus 100 may perform the function of the display apparatus 100 using different control information depending on whether the user's voice is a pre-stored command. When the user's voice is a pre-stored command, the display apparatus 100 searches the control information corresponding to the pre-stored command and can perform the function of the display apparatus 100 according to the retrieved control information. On the other hand, if the user's voice is not a pre-stored command, the display apparatus 100 may perform the function of the display apparatus 100 according to the control information transmitted from the interactive server 200. In addition, the display device 100 may display the guide information 410, 510, 610, and 710 as illustrated in FIGS. 4 to 7 so that the user can perform voice recognition more efficiently and quickly.

상술한 바와 같이, 효율적인 음성 인식을 위한 가이드 정보를 제공함으로써, 사용자는 음성 인식을 이용하여 더욱 효율적이고 신속하게 디스플레이 장치의 기능을 수행할 수 있게 된다.As described above, by providing the guide information for efficient voice recognition, the user can perform the function of the display device more efficiently and quickly by using the voice recognition.

한편, 상술한 실시예에서는 대화형 서버(200)가 하나의 서버로 구현되는 것으로 설명하였으나, 이는 일 실시예에 불과할 뿐, 복수의 서버로 구현될 수 있다. 예를 들어, 도 11에 도시된 바와 같이, 대화형 서버(200)는 입력된 사용자 음성을 텍스트 정보로 변환하는 제1 대화형 서버(200-1) 및 텍스트 정보에 따라 제어 정보 및 제1 가이드 정보를 생성하는 제2 대화형 서버(200-2)를 포함할 수 있다. 이 경우, 디스플레이 장치(100)는 입력된 사용자 음성을 제1 대화형 서버(200-1)로 전송하며, 제1 대화형 서버(200-1)로부터 전송된 텍스트 정보를 제2 대화형 서버(200-2)로 전송할 수 있다. 제2 대화형 서버(200-2)는 전송된 텍스트 정보를 이용하여 도 8에서 설명한 바와 같이, 제어 정보 및 가이드 정보 중 적어도 하나를 생성할 수 있다. In the above-described embodiment, the interactive server 200 is implemented as a single server. However, the interactive server 200 may be implemented by a plurality of servers. 11, the interactive server 200 includes a first interactive server 200-1 for converting input user voice into text information, and a second interactive server 200-1 for converting the control information and the first guide And a second interactive server 200-2 that generates information. In this case, the display apparatus 100 transmits the inputted user voice to the first interactive server 200-1, and transmits the text information transmitted from the first interactive server 200-1 to the second interactive server 200-1 200-2. The second interactive server 200-2 can generate at least one of the control information and the guide information as described with reference to FIG. 8 using the transmitted text information.

이상과 같은 다양한 실시 예에 따른 제어 방법을 수행하기 위한 프로그램 코드는 비일시적 판독 가능 매체(non-transitory computer readable medium)에 저장될 수 있다. 비일시적 판독 가능 매체란 레지스터, 캐쉬, 메모리 등과 같이 짧은 순간 동안 데이터를 저장하는 매체가 아니라 반영구적으로 데이터를 저장하며, 기기에 의해 판독(reading)이 가능한 매체를 의미한다. 구체적으로는, 상술한 다양한 어플리케이션 또는 프로그램들은 CD, DVD, 하드 디스크, 블루레이 디스크, USB, 메모리카드, ROM 등과 같은 비일시적 판독 가능 매체에 저장되어 제공될 수 있다.The program code for performing the control method according to various embodiments as described above may be stored in a non-transitory computer readable medium. A non-transitory readable medium is a medium that stores data for a short period of time, such as a register, cache, memory, etc., but semi-permanently stores data and is readable by the apparatus. In particular, the various applications or programs described above may be stored on non-volatile readable media such as CD, DVD, hard disk, Blu-ray disk, USB, memory card, ROM,

또한, 이상에서는 본 발명의 바람직한 실시예에 대하여 도시하고 설명하였지만, 본 발명은 상술한 특정의 실시예에 한정되지 아니하며, 청구범위에서 청구하는 본 발명의 요지를 벗어남이 없이 당해 발명이 속하는 기술분야에서 통상의 지식을 가진자에 의해 다양한 변형실시가 가능한 것은 물론이고, 이러한 변형실시들은 본 발명의 기술적 사상이나 전망으로부터 개별적으로 이해되어져서는 안될 것이다.While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is to be understood that the invention is not limited to the disclosed exemplary embodiments, but, on the contrary, It will be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the present invention.

110: 음성 입력부 120: 통신부
130: 저장부 140: 디스플레이부
150: 제어부110: voice input unit 120: communication unit
130: storage unit 140: display unit
150:

Claims

A method of controlling a display device,
Receiving a user voice for controlling the display device;
Transmitting the user voice to an interactive server regardless of whether the user voice is a previously stored command;
Determining whether the user voice is a command previously stored in the display device;
Searching for control information corresponding to the pre-stored command if the user voice is a previously stored command, and preferentially performing the function of the display device according to the retrieved control information;
A first guide information for guiding control information corresponding to the user's voice and a pre-stored command capable of performing the same function as the user's voice, from the interactive server, when the user's voice is not an instruction word previously stored in the display device ; And
Performing a function of the display device according to control information received from the interactive server, and displaying the first guide information.

The method according to claim 1,
The interactive server comprises:
Retrieving control information corresponding to the user voice,
Determining whether there is a pre-stored command in the display device capable of performing the same function as the control information while searching for control information corresponding to the user's voice,
When there is an instruction previously stored in the display device capable of performing the same function as the control information, generates first guide information for guiding the pre-stored instruction word and transmits the first guide information together with the control information to the display device Lt; / RTI >

The method according to claim 1,
The user voice is a pre-stored instruction, and when the user voice is a command for controlling a function of a display device having a plurality of hierarchical structures, And displaying the guide information.

The method according to claim 1,
The interactive server comprises:
A first interactive server for converting the user voice into text information, and a second interactive server for generating control information and first guide information according to the text information.

In the display device,
A voice input unit for receiving a user voice for controlling the display device;
A communication unit for performing communication with the interactive server;
A storage unit for storing and storing instructions and control information;
A display unit; And
Controls the communication unit to transmit the user's voice to the interactive server irrespective of whether or not the user's voice input through the voice input unit is a pre-stored command, and determines whether the inputted user's voice is a pre-stored command in the storage unit Lt; / RTI >
Searching the control information corresponding to the user's voice stored in the storage unit and preferentially performing the function of the display device according to the retrieved control information when the user's voice is an instruction word previously stored in the storage unit,
A first guide information for guiding control information corresponding to the user's voice and a pre-stored command capable of performing the same function as the user's voice, from the interactive server, when the user's voice is not an instruction previously stored in the storage unit And a control unit for controlling the display unit to display the first guide information by performing the function of the display apparatus according to the control information transmitted from the interactive server.

6. The method of claim 5,
The interactive server comprises:
Retrieving control information corresponding to the user voice,
Determining whether there is a pre-stored command in the display device capable of performing the same function as the control information while searching for control information corresponding to the user's voice,
When there is an instruction previously stored in the display device capable of performing the same function as the control information, generates first guide information for guiding the pre-stored instruction word and transmits the first guide information together with the control information to the display device / RTI >

The method according to claim 6,
Wherein,
The user voice is a pre-stored instruction, and when the user voice is a command for controlling a function of a display device having a plurality of hierarchical structures, And controls the display unit to display guide information.

6. The method of claim 5,
The interactive server comprises:
A first interactive server for converting the input user voice into text information, and a second interactive server for generating control information and first guide information according to the text information,
Wherein,
And transmits the input user voice to the first interactive server and controls the communication unit to transmit the text information transmitted from the first interactive server to the second interactive server.

A control method of a speech recognition system including an interactive server and a display device,
The display device receiving a user voice;
A first transmission step of transmitting, by the display device, the user's voice to the interactive server regardless of whether the user's voice is a pre-stored command;
Determining whether the user's voice is a command previously stored in the display device;
When the user's voice is a command previously stored in the display device, the display device searches for control information corresponding to the user's voice and preferentially performs the function of the display device according to the retrieved control information;
Wherein the interactive server generates at least one of control information corresponding to the user's voice and first guide information guiding instructions stored in the display device to perform the same function as the control information, A second transmission step of transmitting the first transmission data; And
And if the user's voice is not a command previously stored in the display device, the display device performs the function of the display device according to the control information transmitted from the interactive server and displays the first guide information Lt; / RTI >

10. The method of claim 9,
The user's voice is a previously stored instruction word and the user's voice is a command for controlling a function of a display device having a plurality of hierarchical structures, And displaying second guide information for guiding the second guide information.

10. The method of claim 9,
The interactive server comprises:
A first interactive server for converting the input user voice into text information, and a second interactive server for generating control information and first guide information according to the text information,
Wherein the first transmission step comprises:
Converting the user voice into a digital signal;
The display device transmitting the digital signal to a first interactive server;
The first interactive server generating text information corresponding to the digital signal and transmitting the text information to the display device; And
And the display device transmitting the text information to the second interactive server.

10. The method of claim 9,
Wherein the second transmission step comprises:
When the user's voice is not a conversation pattern stored in the interactive server, the interactive server transmits third guide information for guiding a user's voice according to a conversation pattern stored in the interactive server while performing the same function as the user's voice And transmitting the generated information to the display device,
And displaying the third guide information by the display device.

10. The method of claim 9,
Wherein the second transmission step comprises:
If the user's voice is an interactive voice that the interactive server can not respond to, the interactive server extracts keywords from the user's voice and generates fourth guide information for guiding information related to the keyword, The method comprising the steps of:
And displaying the fourth guide information by the display device.