CN118233442A

CN118233442A - Multimedia file generation method and device and multimedia file playing method and device

Info

Publication number: CN118233442A
Application number: CN202311354142.3A
Authority: CN
Inventors: 唐鹏
Original assignee: China Mobile Communications Group Co Ltd; MIGU Culture Technology Co Ltd
Current assignee: China Mobile Communications Group Co Ltd; MIGU Culture Technology Co Ltd
Priority date: 2023-10-18
Filing date: 2023-10-18
Publication date: 2024-06-21

Abstract

The present disclosure relates to a multimedia file generation method and apparatus, and a multimedia file playing method and apparatus, the multimedia file generation method comprising: acquiring a multimedia file, wherein the multimedia file comprises redundant information; and writing the hidden information into the redundant information to generate a multimedia file carrying the hidden information, wherein the hidden information is used for indicating that the multimedia file is played in a specific playing mode. Based on the above, different hidden information can be set for different multimedia files, and a plurality of hidden information is written into redundant information corresponding to the multimedia files, so that the content of the multimedia files is enriched.

Description

Multimedia file generation method and device and multimedia file playback method and device

技术领域Technical Field

本公开涉及计算机技术领域，尤其涉及一种多媒体文件生成方法和装置及多媒体文件播放方法和装置。The present disclosure relates to the field of computer technology, and in particular to a method and device for generating a multimedia file and a method and device for playing a multimedia file.

背景技术Background technique

目前，传统的音效主要是通过用户根据多媒体文件的文字内容、多媒体文件的频率等参数手动进行设置，然而，大部分用户都不具备专业的多媒体文件知识，无法为多媒体文件设置最适合的音效。At present, traditional sound effects are mainly set manually by users according to parameters such as text content of multimedia files and frequency of multimedia files. However, most users do not have professional knowledge of multimedia files and are unable to set the most suitable sound effects for multimedia files.

现有技术中，当用户耗费大量的时间和精力为多媒体文件设置好音效时，只能在当前终端进行播放，无法实现跨平台播放，并且针对不同的场景以及不同多媒体文件播放时段不能灵活的切换音效。In the prior art, when users spend a lot of time and energy to set sound effects for multimedia files, they can only be played on the current terminal, and cross-platform playback cannot be achieved. In addition, the sound effects cannot be flexibly switched for different scenes and different multimedia file playback periods.

发明内容Summary of the invention

有鉴于此，本公开实施例提供了一种多媒体文件生成方法和装置及多媒体文件播放方法和装置，能够在无需用户选择的情况下，为用户提供包含最佳音效的多媒体文件。In view of this, the embodiments of the present disclosure provide a multimedia file generation method and device and a multimedia file playback method and device, which can provide a user with a multimedia file containing the best sound effect without the need for user selection.

本公开实施例的第一方面，提供了一种多媒体文件生成方法，包括：获取多媒体文件，其中，多媒体文件包括冗余信息；将隐写信息写入冗余信息中，以生成携带有隐写信息的多媒体文件，其中，隐写信息用于指示以特定播放方式对多媒体文件进行播放。According to a first aspect of an embodiment of the present disclosure, a method for generating a multimedia file is provided, comprising: obtaining a multimedia file, wherein the multimedia file includes redundant information; writing steganographic information into the redundant information to generate a multimedia file carrying the steganographic information, wherein the steganographic information is used to indicate that the multimedia file is to be played in a specific playback mode.

本公开实施例的第二方面，提供了一种多媒体文件播放方法，包括：获取多媒体文件；检测多媒体文件中是否存在隐写信息，其中，隐写信息用于指示以特定的播放方式对多媒体文件进行播放；在多媒体文件中存在隐写信息的情况下，以特定的播放方式对多媒体文件进行播放。A second aspect of an embodiment of the present disclosure provides a method for playing a multimedia file, including: acquiring a multimedia file; detecting whether there is steganographic information in the multimedia file, wherein the steganographic information is used to indicate that the multimedia file is to be played in a specific playback mode; and when the steganographic information exists in the multimedia file, playing the multimedia file in the specific playback mode.

本公开实施例的第三方面，提供了一种多媒体文件生成装置，包括：获取模块，被配置为获取多媒体文件，其中，多媒体文件包括冗余信息；生成模块，被配置为将隐写信息写入冗余信息中，以生成携带有隐写信息的多媒体文件，其中，隐写信息用于指示以特定播放方式对多媒体文件进行播放。According to a third aspect of an embodiment of the present disclosure, a multimedia file generation device is provided, comprising: an acquisition module, configured to acquire a multimedia file, wherein the multimedia file includes redundant information; and a generation module, configured to write steganographic information into the redundant information to generate a multimedia file carrying the steganographic information, wherein the steganographic information is used to indicate that the multimedia file is to be played in a specific playback mode.

本公开实施例的第四方面，提供了一种多媒体文件播放装置，包括：获取模块，被配置为获取多媒体文件；检测模块，被配置为检测多媒体文件中是否存在隐写信息，其中，隐写信息用于指示以特定的播放方式对多媒体文件进行播放；播放模块，被配置为在多媒体文件中存在隐写信息的情况下，以特定的播放方式对多媒体文件进行播放。According to a fourth aspect of the embodiments of the present disclosure, a multimedia file playback device is provided, including: an acquisition module, configured to acquire a multimedia file; a detection module, configured to detect whether there is steganographic information in the multimedia file, wherein the steganographic information is used to indicate that the multimedia file is to be played in a specific playback mode; and a playback module, configured to play the multimedia file in a specific playback mode when the steganographic information is present in the multimedia file.

本公开实施例的第五方面，提供了一种电子设备，包括至少一个处理器；用于存储至少一个处理器可执行指令的存储器；其中，至少一个处理器用于执行指令，以实现上述方法的步骤。According to a fifth aspect of an embodiment of the present disclosure, an electronic device is provided, comprising at least one processor; and a memory for storing instructions executable by at least one processor; wherein the at least one processor is used to execute instructions to implement the steps of the above method.

本公开实施例的第六方面，提供了一种计算机可读存储介质，当计算机可读存储介质中的指令由电子设备的处理器执行时，使得电子设备能够执行上述方法的步骤。According to a sixth aspect of an embodiment of the present disclosure, a computer-readable storage medium is provided. When instructions in the computer-readable storage medium are executed by a processor of an electronic device, the electronic device is enabled to perform the steps of the above method.

本公开实施例采用的上述至少一个技术方案能够达到以下有益效果：通过获取多媒体文件，其中，多媒体文件包括冗余信息；将隐写信息写入冗余信息中，以生成携带有隐写信息的多媒体文件，其中，隐写信息用于指示以特定播放方式对多媒体文件进行播放，能够利用隐写法在多媒体文件的冗余信息中添加播放多媒体文件的隐写信息，从而使多媒体文件以特定的播放方式进行播放，因此，丰富了多媒体文件内容，增加了用户的体验感和趣味性。At least one of the above-mentioned technical solutions adopted in the embodiments of the present disclosure can achieve the following beneficial effects: by obtaining a multimedia file, wherein the multimedia file includes redundant information; writing steganographic information into the redundant information to generate a multimedia file carrying the steganographic information, wherein the steganographic information is used to indicate that the multimedia file is to be played in a specific playback mode, and the steganographic information for playing the multimedia file can be added to the redundant information of the multimedia file by using the steganography method, so that the multimedia file is played in a specific playback mode, thereby enriching the content of the multimedia file and increasing the user experience and fun.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

为了更清楚地说明本公开实施例中的技术方案，下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图仅仅是本公开的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动的前提下，还可以根据这些附图获得其它的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present disclosure, the drawings required for use in the embodiments or the description of the prior art will be briefly introduced below. Obviously, the drawings described below are only some embodiments of the present disclosure. For ordinary technicians in this field, other drawings can be obtained based on these drawings without paying any creative work.

图1示出了根据本公开实施例的多媒体文件生成所适用的系统的结构示意图；FIG1 is a schematic diagram showing the structure of a system applicable to multimedia file generation according to an embodiment of the present disclosure;

图2示出了本公开实施例的多媒体文件生成方法的流程示意图；FIG2 is a schematic diagram showing a flow chart of a method for generating a multimedia file according to an embodiment of the present disclosure;

图3示出了本公开实施例的在多媒体文件中写入隐写信息的流程示意图；FIG3 is a schematic diagram showing a process of writing steganographic information into a multimedia file according to an embodiment of the present disclosure;

图4示出了本公开实施例的多媒体文件播放方法的流程示意图；FIG4 is a schematic diagram showing a flow chart of a multimedia file playing method according to an embodiment of the present disclosure;

图5示出了本公开实施例的多媒体文件播放过程的流程示意图；FIG5 is a schematic diagram showing a process flow of a multimedia file playing process according to an embodiment of the present disclosure;

图6示出了本公开实施例的多媒体文件播放过程的流程示意图；FIG6 is a schematic diagram showing a process flow of a multimedia file playing process according to an embodiment of the present disclosure;

图7示出了本公开实施例的多媒体文件生成装置的结构示意图；FIG7 shows a schematic diagram of the structure of a multimedia file generating device according to an embodiment of the present disclosure;

图8示出了本公开实施例的多媒体文件播放装置的结构示意图；FIG8 is a schematic structural diagram of a multimedia file playing device according to an embodiment of the present disclosure;

图9示出了本公开实施例提供的电子设备的结构示意图；FIG9 shows a schematic structural diagram of an electronic device provided by an embodiment of the present disclosure;

图10示出了本公开实施例提供的计算机系统的结构示意图。FIG. 10 shows a schematic diagram of the structure of a computer system provided in an embodiment of the present disclosure.

具体实施方式Detailed ways

下面将参照附图更详细地描述本公开的实施例。虽然附图中显示了本公开的某些实施例，然而应当理解的是，本公开可以通过各种形式来实现，而且不应该被解释为限于这里阐述的实施例，相反提供这些实施例是为了更加透彻和完整地理解本公开。应当理解的是，本公开的附图及实施例仅用于示例性作用，并非用于限制本公开的保护范围。Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although certain embodiments of the present disclosure are shown in the accompanying drawings, it should be understood that the present disclosure can be implemented in various forms and should not be construed as being limited to the embodiments described herein, which are instead provided for a more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the present disclosure are only for exemplary purposes and are not intended to limit the scope of protection of the present disclosure.

应当理解，本公开的方法实施方式中记载的各个步骤可以按照不同的顺序执行，和/或并行执行。此外，方法实施方式可以包括附加的步骤和/或省略执行示出的步骤。本公开的范围在此方面不受限制。It should be understood that the various steps described in the method embodiments of the present disclosure may be performed in different orders and/or in parallel. In addition, the method embodiments may include additional steps and/or omit the steps shown. The scope of the present disclosure is not limited in this respect.

本文使用的术语“包括”及其变形是开放性包括，即“包括但不限于”。术语“基于”是“至少部分地基于”。术语“一个实施例”表示“至少一个实施例”；术语“另一实施例”表示“至少一个另外的实施例”；术语“一些实施例”表示“至少一些实施例”。其他术语的相关定义将在下文描述中给出。需要注意，本公开中提及的“第一”、“第二”等概念仅用于对不同的装置、模块或单元进行区分，并非用于限定这些装置、模块或单元所执行的功能的顺序或者相互依存关系。The term "including" and its variations used in this document are open inclusions, that is, "including but not limited to". The term "based on" means "based at least in part on". The term "one embodiment" means "at least one embodiment"; the term "another embodiment" means "at least one other embodiment"; the term "some embodiments" means "at least some embodiments". Relevant definitions of other terms will be given in the description below. It should be noted that the concepts of "first", "second", etc. mentioned in this disclosure are only used to distinguish different devices, modules or units, and are not used to limit the order or interdependence of the functions performed by these devices, modules or units.

需要注意，本公开中提及的“一个”、“多个”的修饰是示意性而非限制性的，本领域技术人员应当理解，除非在上下文另有明确指出，否则应该理解为“一个或多个”。It should be noted that the modifications of "one" and "plurality" mentioned in the present disclosure are illustrative rather than restrictive, and those skilled in the art should understand that unless otherwise clearly indicated in the context, it should be understood as "one or more".

本公开实施方式中的多个装置之间所交互的消息或者信息的名称仅用于说明性的目的，而并不是用于对这些消息或信息的范围进行限制。The names of the messages or information exchanged between multiple devices in the embodiments of the present disclosure are only used for illustrative purposes and are not used to limit the scope of these messages or information.

在介绍本公开实施例之前首先对本公开实施例中涉及到的相关名词作如下释义：Before introducing the embodiments of the present disclosure, the following definitions are given for the relevant terms involved in the embodiments of the present disclosure:

AES的全称是Advanced Encryption Standard，意思是高级加密标准。它的出现主要是为了取代DES加密算法的，因为我们都知道DES算法的密钥长度是56Bit，因此算法的理论安全强度是2的56次方。但二十世纪中后期正是计算机飞速发展的阶段，元器件制造工艺的进步使得计算机的处理能力越来越强，虽然出现了3DES的加密方法，但由于它的加密时间是DES算法的3倍多，64Bit的分组大小相对较小，所以还是不能满足人们对安全性的要求。AES stands for Advanced Encryption Standard. It was created to replace the DES encryption algorithm, because we all know that the key length of the DES algorithm is 56 bits, so the theoretical security strength of the algorithm is 2 to the 56th power. However, the mid-to-late 20th century was a period of rapid development of computers. The advancement of component manufacturing technology made the processing power of computers stronger and stronger. Although the 3DES encryption method appeared, its encryption time was more than 3 times that of the DES algorithm, and the 64-bit group size was relatively small, so it still could not meet people's requirements for security.

DES全称为Data Encryption Standard，即数据加密标准是一个分组加密算法，典型的DES以64位为分组对数据加密，加密和解密用的是同一个算法。The full name of DES is Data Encryption Standard, which is a block encryption algorithm. Typical DES encrypts data in 64-bit blocks, and the same algorithm is used for encryption and decryption.

RSA全称为The Republic of South Africa，RSA加密算法是一种非对称加密算法，所谓非对称，就是指该算法加密和解密使用不同的密钥，即，使用加密密钥进行加密、解密密钥进行解密。The full name of RSA is The Republic of South Africa. The RSA encryption algorithm is an asymmetric encryption algorithm. The so-called asymmetric means that the algorithm uses different keys for encryption and decryption, that is, the encryption key is used for encryption and the decryption key is used for decryption.

QR码是二维条码的一种，QR来自英文“Quick Response”的缩写，即快速反应的意思，源自发明者希望QR码可让其内容快速被解码。QR码比普通条码可储存更多资料，亦无需像普通条码般在扫描时需直线对准扫描器。QR code is a type of two-dimensional barcode. QR is the abbreviation of "Quick Response" in English, which means quick response. It comes from the inventor's hope that QR code can allow its content to be decoded quickly. QR code can store more information than ordinary barcodes, and it does not need to be aligned with the scanner when scanning like ordinary barcodes.

时域低比特位隐写主要从(Least Significant Bit，LSB)最低有效位隐写开始，其中包含LSB替换和LSB匹配。根据多媒体文件时域波形的16比特样点值进行隐写。Time domain low bit steganography mainly starts with the least significant bit (LSB) steganography, which includes LSB replacement and LSB matching. Steganography is performed based on the 16-bit sample value of the time domain waveform of the multimedia file.

相位编码法，人耳听觉系统对不同频谱分量的相对相位敏感度不同，可以将初始多媒体文件的相位替换为表示嵌入数据的参考相位，从而实现信息的嵌入。当修改位置之后的段，也要进行相应的调整，来确保各段之间的相对相位不会发生改变。就信号的感知信噪比而言，相位编码是最有效的编码方法之一。实际上，当各频率分量之间的相位关系发生显著变化时，则会发生明显的相位色散现象。只要相位修正足够小，就可以实现不可感知编码。而且，相位编码可以容忍信号失真现象。Phase encoding method: the human auditory system has different sensitivities to the relative phase of different spectral components. The phase of the initial multimedia file can be replaced by a reference phase representing the embedded data, thereby embedding information. When the segments after the position is modified, corresponding adjustments should also be made to ensure that the relative phase between the segments does not change. Phase encoding is one of the most effective encoding methods in terms of the perceived signal-to-noise ratio of the signal. In fact, when the phase relationship between the frequency components changes significantly, obvious phase dispersion will occur. As long as the phase correction is small enough, imperceptible encoding can be achieved. Moreover, phase encoding can tolerate signal distortion.

扩频法，扩频技术是在数据通信领域发展起来的概念，通过产生传递信息的冗余副本，使经过噪声信道干扰的原始数据仍然能够被正确恢复。扩频技术主要包含两种:直接序列扩频和调频。扩频法的优点是即使噪声破坏了一些隐藏信息，也可以通过副本来恢复它。Spread spectrum method, spread spectrum technology is a concept developed in the field of data communication. By generating redundant copies of transmitted information, the original data can still be correctly recovered after being interfered by a noisy channel. There are two main types of spread spectrum technology: direct sequence spread spectrum and frequency modulation. The advantage of spread spectrum method is that even if noise destroys some hidden information, it can be recovered through the copy.

回声(Echo Hiding)算法是种经典算法，在回声隐藏的算法中，编码器将载体数据延迟一定的时间并叠加到原始的载体数据上以产生回声。编码器可以用两个不同的延迟时间来嵌入“0”和“1”。在实际的操作中，用代表“0”或“1”的回声内核与载体信号进行卷积来达到添加回声的效果。The echo hiding algorithm is a classic algorithm. In the echo hiding algorithm, the encoder delays the carrier data for a certain time and superimposes it on the original carrier data to generate an echo. The encoder can embed "0" and "1" with two different delay times. In actual operation, the echo kernel representing "0" or "1" is convolved with the carrier signal to achieve the effect of adding an echo.

近年来，随着互联网的发展，多媒体产业下的多媒体文件内容有着非常广泛的用户，为了使得多媒体文件内容更加丰富，用户会为多媒体文件添加各种各样的音效，使得多媒体文件内容更加丰富。但是传统的音效主要是通过用户根据多媒体文件的文字内容、多媒体文件的频率等参数手动进行设置，由于大部分用户都不具备专业的多媒体文件知识，因此无法为多媒体文件设置最适合的音效，并且，在手动设置音效的过程中，需要用户为同一多媒体文件设置多个不同的音效，不断地进行尝试，从而耗费掉用户大量的时间以及精力，若用户选择不设置音效进行多媒体文件播放时，在不同用户的终端播放同一多媒体文件均为多媒体文件的原始多媒体文件，缺少趣味性。In recent years, with the development of the Internet, multimedia file contents under the multimedia industry have a very wide range of users. In order to enrich the content of multimedia files, users will add various sound effects to multimedia files to enrich the content of multimedia files. However, traditional sound effects are mainly set manually by users according to parameters such as the text content of multimedia files and the frequency of multimedia files. Since most users do not have professional knowledge of multimedia files, they are unable to set the most suitable sound effects for multimedia files. In addition, in the process of manually setting sound effects, users are required to set multiple different sound effects for the same multimedia file and keep trying, which consumes a lot of time and energy of users. If users choose not to set sound effects to play multimedia files, the same multimedia file played on different users' terminals is the original multimedia file, which lacks fun.

针对上述问题，本公开实施例提供了多媒体文件生成方法和多媒体文件播放方法，能够将针对多媒体文件的播放方式隐写到多媒体文件中，并将携带隐写信息的多媒体文件通过网络渠道进行传输，多媒体文件播放设备获取到多媒体文件后，首先需要对多媒体文件进行检测，确认该多媒体文件是否含有隐写信息，当多媒体文件含有隐写信息时，以特定的播放方式对多媒体文件进行播放，可以在任何多媒体文件播放设备上无感知播放，从而实现多媒体文件的跨平台播放，增加用户的体验感和趣味性。In response to the above problems, the embodiments of the present disclosure provide a multimedia file generation method and a multimedia file playback method, which can steganographically write the playback mode for the multimedia file into the multimedia file, and transmit the multimedia file carrying the steganographic information through a network channel. After the multimedia file playback device obtains the multimedia file, it is first necessary to detect the multimedia file to confirm whether the multimedia file contains the steganographic information. When the multimedia file contains the steganographic information, the multimedia file is played in a specific playback mode, and can be played imperceptibly on any multimedia file playback device, thereby realizing cross-platform playback of multimedia files and increasing the user experience and fun.

图1示出了根据本公开实施例的多媒体文件生成所适用的系统的结构示意图。如图1所示，本公开实施例的系统100可以包括：多媒体文件播放设备110、计算设备120以及数据存储系统130。Fig. 1 is a schematic diagram showing the structure of a system applicable to multimedia file generation according to an embodiment of the present disclosure. As shown in Fig. 1 , the system 100 of the embodiment of the present disclosure may include: a multimedia file playing device 110, a computing device 120 and a data storage system 130.

具体地，多媒体文件播放设备110可以通过通信网络与计算设备130通信。该通信网络可以为有线通信网络或无线通信网络。有线通信网络可以为基于电力线载波技术的通信网络，无线通信网络可以为局域无线网络或广域无线网络。局域无线网络可以为WIFI无线网络、Zigbee无线网络、移动通信网络或卫星通信网络等。同时多媒体文件播放设备可以为具有解码能力的多媒体文件播放设备。Specifically, the multimedia file playback device 110 can communicate with the computing device 130 through a communication network. The communication network can be a wired communication network or a wireless communication network. The wired communication network can be a communication network based on power line carrier technology, and the wireless communication network can be a local wireless network or a wide area wireless network. The local wireless network can be a WIFI wireless network, a Zigbee wireless network, a mobile communication network, or a satellite communication network. At the same time, the multimedia file playback device can be a multimedia file playback device with decoding capability.

多媒体文件播放设备110可以包括但不限于电脑、手机或者信息处理中心等智能终端。计算设备120可以为云服务器、网络服务器、应用服务器以及管理服务器等具有数据处理功能的服务器。The multimedia file playing device 110 may include but is not limited to a computer, a mobile phone or an intelligent terminal such as an information processing center. The computing device 120 may be a server with data processing function such as a cloud server, a network server, an application server and a management server.

数据存储系统130可以是一个统称，包括本地存储以及存储历史数据的数据库，数据库可以在计算设备120上，也可以在其它网络服务器上，还可以在数据存储系统130上。该数据存储系统130可以与计算设备120分立，也可以集成在计算设备120内。The data storage system 130 may be a general term, including local storage and a database storing historical data, and the database may be on the computing device 120, on other network servers, or on the data storage system 130. The data storage system 130 may be separate from the computing device 120, or may be integrated in the computing device 120.

在实际应用中，计算设备120可以将多媒体文件拆分成多个多媒体片段，其中，多媒体文件包括冗余信息，并确定多个多媒体片段对应的多个播放方式，进而将播放方式以隐写的方式写入冗余信息中，以生成携带有隐写信息的多媒体文件。In actual applications, the computing device 120 can split a multimedia file into multiple multimedia segments, where the multimedia file includes redundant information, and determine multiple playback modes corresponding to the multiple multimedia segments, and then write the playback modes into the redundant information in a steganographic manner to generate a multimedia file carrying steganographic information.

多媒体文件播放设备110获取来自计算设备120的多媒体文件，并对多媒体文件进行检测，若检测到多媒体文件中存在隐写信息的情况下，以特定的播放方式对多媒体文件进行播放，其中，特定方式可以为基于特定的解码方式，识别隐写信息，隐写信息用于指示以特定的播放方式对多媒体文件进行播放；若检测到多媒体文件中不存在隐写信息的情况下，则直接利用多媒体文件播放设备110播放多媒体文件。The multimedia file playing device 110 obtains the multimedia file from the computing device 120 and detects the multimedia file. If it is detected that there is steganographic information in the multimedia file, the multimedia file is played in a specific playing mode, wherein the specific mode can be based on a specific decoding mode to identify the steganographic information, and the steganographic information is used to indicate that the multimedia file is played in a specific playing mode; if it is detected that there is no steganographic information in the multimedia file, the multimedia file is directly played using the multimedia file playing device 110.

本公开实施例的多媒体文件生成方法可以由服务器执行，也可以由应用于服务器的芯片执行。The multimedia file generation method of the embodiment of the present disclosure may be executed by a server, or may be executed by a chip applied to the server.

图2示出了本公开实施例的多媒体文件生成方法的基本流程示意图。如图2所示，本公开实施例的多媒体文件生成方法可以包括S201和S202。Fig. 2 shows a basic flow chart of a multimedia file generation method according to an embodiment of the present disclosure. As shown in Fig. 2, the multimedia file generation method according to an embodiment of the present disclosure may include S201 and S202.

S201，获取多媒体文件，其中，多媒体文件包括冗余信息。S201: Acquire a multimedia file, where the multimedia file includes redundant information.

具体地，可以为多媒体文件添加冗余信息，冗余信息不仅可以用于加快多媒体文件的访问速度，还可以在不影响多媒体文件主干内容的前提下，编辑冗余信息，从而丰富多媒体文件的内容。Specifically, redundant information can be added to multimedia files. The redundant information can not only be used to speed up the access speed of multimedia files, but also can be edited without affecting the main content of the multimedia files, thereby enriching the content of the multimedia files.

S202，将隐写信息写入冗余信息中，以生成携带有隐写信息的多媒体文件，其中，隐写信息用于指示以特定播放方式对多媒体文件进行播放。S202, writing the stego information into the redundant information to generate a multimedia file carrying the stego information, wherein the stego information is used to instruct to play the multimedia file in a specific playing mode.

具体地，可以针不同的多媒体文件设置不同的隐写信息，并将多个隐写信息写入多媒体文件对应的冗余信息中，从而丰富多媒体文件内容。其中，多媒体文件可以包括视频文件，音频文件，动画文件等。下面以多媒体文件为音频文件的情况下进行详述。Specifically, different steganographic information can be set for different multimedia files, and multiple steganographic information can be written into the redundant information corresponding to the multimedia file, thereby enriching the content of the multimedia file. The multimedia file may include a video file, an audio file, an animation file, etc. The following is a detailed description of the case where the multimedia file is an audio file.

根据本公开实施例提供的技术方案，通过获取多媒体文件，其中，多媒体文件包括冗余信息；将隐写信息写入冗余信息中，以生成携带有隐写信息的多媒体文件，其中，隐写信息用于指示以特定播放方式对多媒体文件进行播放，能够利用隐写法在多媒体文件的冗余信息中添加播放多媒体文件的隐写信息，从而使多媒体文件以特定的播放方式进行播放，因此，丰富了多媒体文件内容，增加了用户的体验感和趣味性。According to the technical solution provided by the embodiment of the present disclosure, a multimedia file is obtained, wherein the multimedia file includes redundant information; steganographic information is written into the redundant information to generate a multimedia file carrying the steganographic information, wherein the steganographic information is used to indicate that the multimedia file is to be played in a specific playback mode, and the steganographic information for playing the multimedia file can be added to the redundant information of the multimedia file by using the steganography method, so that the multimedia file is played in a specific playback mode, thereby enriching the content of the multimedia file and increasing the user experience and fun.

在一些实施例中，该方法还包括：基于至少一个时间段对多媒体文件进行分段处理，得到与至少一个时间段中的每个时间段对应的多媒体片段；在预存的索引表中选择与每个时间段对应的多媒体片段匹配的播放方式，其中，索引表用于保存所有多媒体片段对应的多个播放控制信息，播放控制信息用于更改多媒体片段的播放方式。In some embodiments, the method also includes: segmenting the multimedia file based on at least one time period to obtain multimedia segments corresponding to each time period in at least one time period; selecting a playback mode matching the multimedia segment corresponding to each time period in a pre-stored index table, wherein the index table is used to store multiple playback control information corresponding to all multimedia segments, and the playback control information is used to change the playback mode of the multimedia segment.

具体地，可以通过AI模型训练将多媒体文件按照纯伴奏、纯人声、人声和伴奏进行拆分，假设拆分后的多媒体文件的纯伴奏部分在t∈∑₁(T₁,T₁′)区间内，其中T₁表示纯伴奏部分的开始时间，T₁′表示纯伴奏部分的结束时间；纯人声部分在t′∈∑₂(T₂,T₂′)区间内，其中T₂表示纯人声部分的开始时间，T₂′表示纯人声部分的结束时间；人声和伴奏部分在t″∈∑₃(T₃,T₃′)区间内，其中T₃表示人声和伴奏部分的开始时间，T₃′表示人声和伴奏部分的结束时间。其中t+t′+t″＝E，E表示多媒体文件的多媒体文件总时长。Specifically, the multimedia file can be split into pure accompaniment, pure vocals, and vocals and accompaniment through AI model training. It is assumed that the pure accompaniment part of the split multimedia file is in the interval t∈∑ ₁ (T ₁ ,T ₁ ′), where T ₁ represents the start time of the pure accompaniment part, and T ₁ ′ represents the end time of the pure accompaniment part; the pure vocal part is in the interval t′∈∑ ₂ (T ₂ ,T ₂ ′), where T ₂ represents the start time of the pure vocal part, and T ₂ ′ represents the end time of the pure vocal part; the vocals and accompaniment part are in the interval t″∈∑ ₃ (T ₃ ,T ₃ ′), where T ₃ represents the start time of the vocals and accompaniment part, and T ₃ ′ represents the end time of the vocals and accompaniment part. Wherein t+t′+t″＝E, and E represents the total duration of the multimedia file.

当多媒体文件为音频文件时，针对音频文件的多个播放控制信息可以为更改音乐风格和音效模式等；例如，当播放控制信息为音乐风格时，可以基于播放控制信息将原始音频文件的音乐风格改为电乐风格，进而以电乐风格播放多媒体文件，将原始音频文件的音乐风格改为Hi-Fi风格，进而以Hi-Fi风格播放多媒体文件等；当播放控制信息为添加音效模式时，可以基于播放控制信息为原始音频文件添加大叔音的音效，将原始音频文件中的人声改为大叔音，还可以为原始音频文件添加萝莉音的音效等，将原始音频文件中的人声改为萝莉音。When the multimedia file is an audio file, multiple playback control information for the audio file can be for changing the music style and sound effect mode, etc.; for example, when the playback control information is the music style, the music style of the original audio file can be changed to the electronic music style based on the playback control information, and then the multimedia file is played in the electronic music style, and the music style of the original audio file can be changed to the Hi-Fi style, and then the multimedia file is played in the Hi-Fi style, etc.; when the playback control information is for adding a sound effect mode, the sound effect of an uncle's voice can be added to the original audio file based on the playback control information, and the human voice in the original audio file can be changed to an uncle's voice, and the sound effect of a lolita voice can be added to the original audio file, and the human voice in the original audio file can be changed to a lolita voice, etc.

在实际应用中，在进行数据传输时，会直接相关数据进行打包，当数据量较大时，会导致数据打包过程以及数据传输过程消耗大量的时间，同时直接进行数据打包，会降低数据的安全性。因此为了尽可能的减少数据传输过程中的开销以及保证数据安全性，本公开实施例针对音频文件定义了一个双端查询的音效索引表，即可以在多媒体文件播放设备和计算设备上查询的索引表。In actual applications, when data is transmitted, related data will be directly packaged. When the amount of data is large, the data packaging process and the data transmission process will consume a lot of time. At the same time, directly packaging the data will reduce the security of the data. Therefore, in order to reduce the overhead in the data transmission process as much as possible and ensure data security, the embodiment of the present disclosure defines a dual-end query sound effect index table for audio files, that is, an index table that can be queried on a multimedia file playback device and a computing device.

在一些实施例中，可以采用利用数值表示不同的播放控制信息，同时还可以利用数值对多种不同的播放控制信息进行分类。音效索引表如表1所示：In some embodiments, different playback control information can be represented by numerical values, and multiple different playback control information can also be classified by numerical values. The sound effect index table is shown in Table 1:

表1音效索引表Table 1 Sound effect index table

播放控制信息Playback control information 索引号The index number 音乐风格music style 11 清澈人生风格Clear life style 1111 电音风格Electronic music style 1212 HI-FI风格HI-FI style 1313 音效模式Sound Mode 22 大叔音Uncle Voice 21twenty one 萝莉音Loli Voice 22twenty two 御姐音Royal sister sound 23twenty three 音频速度Audio Speed 33 0.5倍速0.5x speed 3131 1.0倍速1.0x speed 3232 1.5倍速1.5x speed 3333

在表1中，播放控制信息包括音乐风格1，音效模式2，音频速度3，其中，音乐风格1包括清澈人声风格11，电音风格12，HI-FI风格13三种音效；音效模式2包括大叔音21，萝莉音22，御姐音23；音频速度3包括0.5倍速31，1.0倍速32，1.5倍速33。应理解，播放控制信息还可以包括变调以及替换歌词等，具体可以根据实际情况设定。In Table 1, the playback control information includes music style 1, sound effect mode 2, and audio speed 3, wherein music style 1 includes three sound effects: clear vocal style 11, electronic music style 12, and HI-FI style 13; sound effect mode 2 includes uncle voice 21, loli voice 22, and royal sister voice 23; and audio speed 3 includes 0.5 times speed 31, 1.0 times speed 32, and 1.5 times speed 33. It should be understood that the playback control information may also include pitch change and replacement of lyrics, etc., which may be set according to actual conditions.

在实际应用中，当将多媒体文件拆分成三部分的多媒体片段后，可以利用隐写算法将多媒体片段的播放控制信息写入多媒体文件中，生成携带有隐写信息的多媒体文件。In practical applications, after a multimedia file is split into three multimedia segments, the playback control information of the multimedia segments can be written into the multimedia file using a steganographic algorithm to generate a multimedia file carrying steganographic information.

具体地，根据上述拆分结果，可以在多媒体文件的纯伴奏部分对应的时长t区间对应的多媒体片段中写入清澈人声风格对应的数值11，纯人声部分在t′写入电音风格对应的数值12，人声和伴奏部分在t″写入HI-FI风格对应的数值13，并将写入隐写信息的多媒体片段按照播放顺序进行合并，生成携带有隐写信息的多媒体文件。其中，播放顺序可以按照未写入隐写信息的多媒体文件的播放顺序，还可以为按照时长进行排序，具体可以根据实际情况进行设定。Specifically, according to the above splitting results, the value 11 corresponding to the clear vocal style can be written into the multimedia segment corresponding to the duration t interval corresponding to the pure accompaniment part of the multimedia file, the value 12 corresponding to the electronic music style can be written into the pure vocal part at t′, and the value 13 corresponding to the HI-FI style can be written into the vocal and accompaniment part at t″, and the multimedia segments with the stego information written can be merged according to the playback order to generate a multimedia file carrying the stego information. Among them, the playback order can be based on the playback order of the multimedia files without the stego information written, and can also be sorted by duration, which can be set according to actual conditions.

基于此，我们在数据传输过程中，可以仅传输多媒体文件对应播放方式的数值即可，这样及时在数据传输过程中，数据被截获在没有索引表的情况下也不会获取数据内容，从而保证数据的安全性，并且采用数值代替播放方式文字描述的内容会使得传输数据量较小，从而可以加快数据的传输过程。Based on this, we can only transmit the numerical value of the corresponding playback mode of the multimedia file during the data transmission process. In this way, even if the data is intercepted during the data transmission process, the data content will not be obtained without an index table, thereby ensuring the security of the data. In addition, using numerical values instead of the text description of the playback mode will reduce the amount of transmitted data, thereby speeding up the data transmission process.

图3示出了本公开实施例在多媒体文件写入隐写信息的流程示意图。如图3所示，将隐写信息写入冗余信息中，以生成携带有隐写信息的多媒体文件，包括：FIG3 shows a schematic diagram of a process of writing stego information into a multimedia file according to an embodiment of the present disclosure. As shown in FIG3 , writing the stego information into redundant information to generate a multimedia file carrying the stego information includes:

S301，对播放方式对应的数值进行编码，得到编码字符串。S301, encode the value corresponding to the playing mode to obtain an encoded character string.

具体地，本公开实施例采用ASCII或者UTF8编码的方式将播放方式对应的数值进行编码转化为转码字符的形式，获得转码字符串，基于此，可以获得一个仅包含数字的转码字符串。Specifically, the embodiment of the present disclosure uses ASCII or UTF8 encoding to encode the numerical value corresponding to the playback mode into a transcoded character form to obtain a transcoded character string. Based on this, a transcoded character string containing only numbers can be obtained.

S302，对编码字符串进行压缩，得到压缩字符串。S302, compress the encoded character string to obtain a compressed character string.

具体地，可以将转码字符串压缩，可以采用哈夫曼压缩编码对转码字符串压缩，获得压缩数据；Specifically, the transcoded character string may be compressed, and the transcoded character string may be compressed using Huffman compression coding to obtain compressed data;

S303，对压缩字符串进行加密，得到加密字符串。S303, encrypt the compressed string to obtain an encrypted string.

具体地，可以通过加密算法对压缩数据进行加密，获得加密转码字符串，其中加密算法可以采用AES、DES或者RSA等加密算法，加密转码字符串可以替换为QR码，但是由于QR码携带信息量有限，因此以本公开采用加密转码字符串的形式进行数据传输。Specifically, the compressed data can be encrypted by an encryption algorithm to obtain an encrypted transcoded string, wherein the encryption algorithm can adopt an encryption algorithm such as AES, DES or RSA, and the encrypted transcoded string can be replaced with a QR code. However, since the amount of information carried by the QR code is limited, the present disclosure adopts the form of an encrypted transcoded string for data transmission.

S304，将加密字符串隐写到冗余信息中，得到隐写有加密字符串的多媒体文件。S304, steganographically writing the encrypted string into the redundant information to obtain a multimedia file steganographically writing the encrypted string.

具体地，可以利用隐写算法将加密转码字符串写入多媒体文件的冗余信息中，生成携带有隐写信息的多媒体文件。Specifically, a steganographic algorithm may be used to write an encrypted transcoded character string into redundant information of a multimedia file, thereby generating a multimedia file carrying steganographic information.

具体地，多媒体文件隐写算法可以包括LSB法、相位编码法、扩频法和回声数据隐藏法等方法，都是利用多媒体文件的冗余信息来达到隐写信息的目的。Specifically, multimedia file steganography algorithms may include LSB method, phase coding method, spread spectrum method and echo data hiding method, all of which utilize the redundant information of multimedia files to achieve the purpose of steganographic information.

本公开实施例采用回声数据隐藏法将加密转码字符串写入多媒体文件对应的冗余信息中，例如，如果多媒体文件为双声道多媒体文件文件，则可以将左声道和右声道的振幅降低，并将左声道振幅降低的多媒体片段混入右声道，将右声道振幅降低的多媒体片段混入左声道，在混入时将两边振幅较低的多媒体片段以延迟500微秒的形式混入，进而获得交叉反馈音效(CrossFeed)，从而可以在用户无感知的情况下，在并在多媒体文件中产生冗余信息，进而通过对冗余信息的处理，完成数据隐写。基于此，通过左右声道混入的各自多媒体文件的振幅差值或时延偏移，产生0和1的信息位，进行加密转码字符串的隐写。The disclosed embodiment adopts the echo data hiding method to write the encrypted transcoded string into the redundant information corresponding to the multimedia file. For example, if the multimedia file is a dual-channel multimedia file, the amplitude of the left channel and the right channel can be reduced, and the multimedia segment with the reduced amplitude of the left channel can be mixed into the right channel, and the multimedia segment with the reduced amplitude of the right channel can be mixed into the left channel. When mixing, the multimedia segments with lower amplitudes on both sides are mixed in the form of a delay of 500 microseconds, thereby obtaining a cross-feedback sound effect (CrossFeed), so that redundant information can be generated in the multimedia file without the user's perception, and then the data steganography is completed by processing the redundant information. Based on this, the information bits of 0 and 1 are generated by the amplitude difference or delay offset of the respective multimedia files mixed into the left and right channels, and the encrypted transcoded string is steganographically written.

在实际应用中，当以左声道为例，并在针对多媒体文件中的纯人声部分进行加密的情况下，可以将右声道振幅降低的纯人声部分以延迟500微秒的方式混入左声道纯人声部分，即将右声道振幅降低的纯人声部分作为左声道的附加共振信号，其中，冗余信息对应的多媒体片段为延迟500微秒产生的音频片段，对应0信息位，而右声道振幅降低的纯人声部分，对应1信息位，在进行数据隐写时，可以将加密转码字符串写入0信息位。接着将0信息位与1信息位对应的多媒体片段进行拼接，并将拼接后的多媒体片段混入左声道对应的部分中，获得携带有隐写信息的多媒体文件。In practical applications, when taking the left channel as an example and encrypting the pure human voice part in the multimedia file, the pure human voice part with reduced amplitude of the right channel can be mixed into the pure human voice part of the left channel with a delay of 500 microseconds, that is, the pure human voice part with reduced amplitude of the right channel is used as an additional resonance signal of the left channel, wherein the multimedia segment corresponding to the redundant information is an audio segment generated by a delay of 500 microseconds, corresponding to 0 information bits, and the pure human voice part with reduced amplitude of the right channel corresponds to 1 information bit. When performing data steganography, the encrypted transcoded string can be written into 0 information bits. Then, the multimedia segments corresponding to the 0 information bits and the 1 information bits are spliced, and the spliced multimedia segments are mixed into the corresponding part of the left channel to obtain a multimedia file carrying steganographic information.

在一种可选的方式中，当多媒体文件为单人讲述故事类多媒体文件时，可以将多媒体文件中不同故事人物的对话时段对应不同的声音添加变声音效或者添加鬼畜音效，还可以将多媒体文件中某个时间段的多媒体片段重复播放，还可以为某个时间段的多媒体片段添加变速或者变调音效等。In an optional manner, when the multimedia file is a multimedia file telling a story by a single person, voice-changing sound effects or ghost sound effects may be added to the different sounds corresponding to the dialogue periods of different story characters in the multimedia file. The multimedia clips of a certain time period in the multimedia file may also be played repeatedly. Speed-changing or pitch-changing sound effects may also be added to the multimedia clips of a certain time period.

更进一步地，还可以对多媒体文件中的歌曲、歌词或者演唱者进行修改，可以将新的歌词及演唱者信息隐写到多媒体文件中，在基于此，当终端获取到修改歌词以及演唱者的隐写信息时，可以将多媒体文件中的人声部分进行乐器数字接口标准(MIDI)化，并基于AI技术替换多媒体文件中的人声，替换为隐写信息中演唱者的人声，接着按照新的歌词根据MIDI音阶进行纯人声合成，获得新的人声合成片段，最后将多媒体文件中除人声以外的其他乐器对应的声音与人声合成片段进行合成，获取多媒体文件。Furthermore, the songs, lyrics or singers in the multimedia file can be modified, and the new lyrics and singer information can be steganographically written into the multimedia file. Based on this, when the terminal obtains the modified lyrics and the stego-information of the singer, the vocal part in the multimedia file can be standardized according to the Musical Instrument Digital Interface (MIDI), and the vocals in the multimedia file can be replaced based on AI technology with the vocals of the singer in the stego-information. Then, pure vocal synthesis is performed according to the MIDI scale based on the new lyrics to obtain a new vocal synthesis segment. Finally, the sounds corresponding to other instruments in the multimedia file except the vocals are synthesized with the vocal synthesis segment to obtain the multimedia file.

本公开实施例的多媒体文件播放方法可以由终端执行，也可以由应用于终端的芯片执行。The multimedia file playing method of the embodiment of the present disclosure may be executed by a terminal, or may be executed by a chip applied to the terminal.

图4示出了本公开实施例的多媒体文件播放方法的基本流程示意图。如图4所示，本公开实施例的多媒体文件播放方法可以包括S401至S403。Fig. 4 shows a basic flow chart of a multimedia file playing method according to an embodiment of the present disclosure. As shown in Fig. 4, the multimedia file playing method according to an embodiment of the present disclosure may include S401 to S403.

S401，获取多媒体文件。S401, obtaining multimedia files.

具体地，本公开实施例可以通过网络下载多媒体文件，或者通过即时通讯软件获取多媒体文件，当用户获取到多媒体文件后，可以直接打开多媒体文件，也可以采用指定的多媒体文件播放设备打开多媒体文件。Specifically, the embodiments of the present disclosure can download multimedia files through the network, or obtain multimedia files through instant messaging software. After the user obtains the multimedia file, the user can directly open the multimedia file or use a designated multimedia file playback device to open the multimedia file.

S402，检测多媒体文件中是否存在隐写信息，其中，隐写信息用于指示以特定的播放方式对多媒体文件进行播放。S402, detecting whether there is steganographic information in the multimedia file, wherein the steganographic information is used to instruct to play the multimedia file in a specific playing mode.

具体地，本公开实施例在播放多媒体文件前，需要先检测多媒体文件是否含有隐写信息，从而避免在播放方式不匹配但多媒体文件成功播放的情况下，忽略隐写信息。Specifically, before playing a multimedia file, the embodiment of the present disclosure needs to first detect whether the multimedia file contains stego information, so as to avoid ignoring the stego information when the playback mode does not match but the multimedia file is successfully played.

S403，在多媒体文件中存在隐写信息的情况下，以特定的播放方式对多媒体文件进行播放。S403, when there is steganographic information in the multimedia file, the multimedia file is played in a specific playing mode.

具体地，以特定的方式播放多媒体文件时，既可以保证隐写信息的有效读取，还可以将隐写信息对应的内容融入到多媒体文件中，从而使得多媒体文件内容更加丰富。应理解，当多媒体文件为音频文件时，隐写信息可以用于改变音频文件的声音，还可以在音频文件中增加声音特效，从而丰富多媒体文件内容。Specifically, when playing a multimedia file in a specific manner, the effective reading of the stego information can be ensured, and the content corresponding to the stego information can be integrated into the multimedia file, thereby enriching the content of the multimedia file. It should be understood that when the multimedia file is an audio file, the stego information can be used to change the sound of the audio file, and sound effects can be added to the audio file, thereby enriching the content of the multimedia file.

图5示出了本公开实施例的多媒体文件播放过程的流程示意图。如图5所示，在多媒体文件中存在隐写信息的情况下，以特定的播放方式对多媒体文件进行播放，包括S501至S506。Fig. 5 is a schematic diagram of a process of playing a multimedia file according to an embodiment of the present disclosure. As shown in Fig. 5, when there is steganographic information in the multimedia file, the multimedia file is played in a specific playing mode, including S501 to S506.

S501，对隐写信息进行提取，得到加密字符串。S501, extract the stego information to obtain an encrypted string.

S502，对加密字符串进行解密，得到压缩字符串。S502, decrypt the encrypted string to obtain a compressed string.

具体地，如果能够对加密多媒体文件成功解密，则执行步骤S502，否则认为该多媒体文件解密失败，则直接播放未携带隐写信息的多媒体文件。其中，隐写信息为加密转码字符串，解密失败可能由于隐写信息发生错误或者网络问题导致解密失败。Specifically, if the encrypted multimedia file can be successfully decrypted, step S502 is executed, otherwise it is considered that the multimedia file decryption fails, and the multimedia file without the stego information is directly played. The stego information is an encrypted transcoded string, and the decryption failure may be caused by an error in the stego information or a network problem.

S503，对压缩字符串进行解压缩，得到解压缩字符串。应理解，可以利用霍夫曼编码对压缩数据进行解压，获得转码字符串。S503, decompress the compressed string to obtain a decompressed string. It should be understood that the compressed data can be decompressed using Huffman coding to obtain a transcoded string.

具体地，如果能够成功将压缩数据解压，则执行步骤S503，否则认为该多媒体文件无法解压，则直接播放未携带隐写信息的多媒体文件。Specifically, if the compressed data can be successfully decompressed, step S503 is executed; otherwise, it is considered that the multimedia file cannot be decompressed, and the multimedia file without the steganographic information is directly played.

S504，对解压缩字符串进行解码，得到数值。S504, decoding the decompressed character string to obtain a numerical value.

具体地，将转码字符串进行ASCII或者UTF8转码，获取数值。Specifically, the transcoded string is transcoded into ASCII or UTF8 to obtain a numerical value.

S505，在预存的索引表中选择与数值对应的播放方式。S505: Select a playback mode corresponding to the value in a pre-stored index table.

具体地，可以按照索引表将数值进行翻译，确定不同多媒体片段对应的播放控制信息，进而将多个播放控制信息与多个多媒体片段进行合成，获得多个合成多媒体片段；最后将多个合成多媒体片段按照多媒体片段的播放顺序进行拼接，获得解码多媒体文件。应理解，一个多媒体片段可以对应一种播放控制信息，还可能对应多种播放控制信息，具体可以根据实际情况进行设定。Specifically, the values can be translated according to the index table to determine the playback control information corresponding to different multimedia segments, and then multiple playback control information and multiple multimedia segments are synthesized to obtain multiple synthesized multimedia segments; finally, the multiple synthesized multimedia segments are spliced according to the playback order of the multimedia segments to obtain a decoded multimedia file. It should be understood that a multimedia segment can correspond to one playback control information, and may also correspond to multiple playback control information, which can be set according to actual conditions.

S506，控制多媒体文件以播放方式进行播放。S506, controlling the multimedia file to be played in a play mode.

具体地，在隐写信息提取失败时，即在多媒体文件中不存在隐写信息的情况下，可以直接对多媒体文件进行播放。Specifically, when the extraction of the stego information fails, that is, when the stego information does not exist in the multimedia file, the multimedia file can be played directly.

图6示出了本公开实施例的多媒体文件播放过程的流程示意图。如图6所示，该多媒体文件为音频文件包括HI-FI风格、清澈人声风格，电音风格等播放方式，针对不同的音频片段设置不同的播放方式进行播放，本公开在不同播放方式之间添加淡入淡出的音频效果进行切换，从而保证不同的播放方式之间可以平滑切换，避免直接播放造成突兀，影响用户的体验。Figure 6 shows a schematic flow chart of a multimedia file playback process according to an embodiment of the present disclosure. As shown in Figure 6, the multimedia file is an audio file including HI-FI style, clear vocal style, electronic music style and other playback modes. Different playback modes are set for different audio clips. The present disclosure adds fade-in and fade-out audio effects to switch between different playback modes, thereby ensuring smooth switching between different playback modes and avoiding abruptness caused by direct playback, which affects the user experience.

在实际应用中，当用户获取到多媒体文件时，还可以在用户终端界面上以弹窗的形式通知用户多媒体文件是一个包含隐写信息的多媒体文件，如果用户选择播放含有隐写信息的多媒体文件时，则播放含有隐写信息的多媒体文件，否则直接播放未含有隐写信息的多媒体文件，从而可以方便用户的选择。In actual applications, when a user obtains a multimedia file, a pop-up window can be displayed on the user terminal interface to notify the user that the multimedia file is a multimedia file containing stego information. If the user chooses to play a multimedia file containing stego information, the multimedia file containing stego information will be played; otherwise, the multimedia file without stego information will be played directly, thereby facilitating the user's selection.

可见，本公开实施例可以通过将目标音效对应的索引信息隐写到多媒体文件中，获取含有隐写信息的多媒体文件，可以保证多媒体文件的数据安全性以及数据传输效率，同时可以无感知进行播放，同时将播放控制信息隐写到多媒体文件中，生成携带有隐写信息的多媒体文件后，携带有隐写信息的多媒体文件可以通过网页或者聊天软件APP等网络传播，从而可以提升用户的体验感和趣味性。It can be seen that the disclosed embodiment can obtain a multimedia file containing stego information by stego-writing the index information corresponding to the target sound effect into the multimedia file, thereby ensuring the data security and data transmission efficiency of the multimedia file, and playing it without perception. At the same time, the playback control information is stego-written into the multimedia file. After the multimedia file carrying the stego-information is generated, the multimedia file carrying the stego-information can be transmitted through networks such as web pages or chat software APPs, thereby enhancing the user experience and fun.

本公开实施例采用的上述至少一个技术方案能够达到以下有益效果：可以获取多媒体文件，其中，多媒体文件包括冗余信息，冗余信息不仅可以用于加快多媒体文件的访问速度，还可以在不影响多媒体文件主干内容的前提下，编辑冗余信息，从而丰富多媒体文件的内容；同时可以将隐写信息写入冗余信息中，以生成携带有隐写信息的多媒体文件，其中，隐写信息用于指示以特定播放方式对多媒体文件进行播放，基于此，可以针不同的多媒体文件设置不同的隐写信息，并将多个隐写信息写入多媒体文件对应的冗余信息中，从而丰富多媒体文件内容。At least one of the above-mentioned technical solutions adopted in the embodiments of the present disclosure can achieve the following beneficial effects: a multimedia file can be obtained, wherein the multimedia file includes redundant information, and the redundant information can not only be used to speed up the access speed of the multimedia file, but also can be edited without affecting the main content of the multimedia file, thereby enriching the content of the multimedia file; at the same time, stego information can be written into the redundant information to generate a multimedia file carrying the stego information, wherein the stego information is used to indicate that the multimedia file is to be played in a specific playback mode, based on this, different stego information can be set for different multimedia files, and multiple stego information can be written into the redundant information corresponding to the multimedia file, thereby enriching the content of the multimedia file.

在此基础上，本公开实施例的方法应用在多媒体文件播放时，可以获取多媒体文件，并检测多媒体文件中是否存在隐写信息，其中，隐写信息用于指示以特定的播放方式对多媒体文件进行播放，检测多媒体文件中是否存在隐写信息可以避免在播放方式不匹配但多媒体文件成功播放的情况下，忽略掉隐写信息；基于此，在多媒体文件中存在隐写信息的情况下，以特定的播放方式对多媒体文件进行播放，既可以保证隐写信息的有效读取，还可以将隐写信息对应的内容融入到多媒体文件中，从而使得多媒体文件内容更加丰富。On this basis, when the method of the embodiment of the present disclosure is applied to multimedia file playback, the multimedia file can be obtained and whether there is stego information in the multimedia file can be detected, wherein the stego information is used to indicate that the multimedia file is to be played in a specific playback mode. Detecting whether there is stego information in the multimedia file can avoid ignoring the stego information when the playback mode does not match but the multimedia file is successfully played. Based on this, when there is stego information in the multimedia file, the multimedia file is played in a specific playback mode, which can not only ensure the effective reading of the stego information, but also integrate the content corresponding to the stego information into the multimedia file, thereby enriching the content of the multimedia file.

上述主要从服务器的角度对本公开实施例提供的方案进行了介绍。可以理解的是，服务器为了实现上述功能，其包含了执行各个功能相应的硬件结构和/或软件模块。本领域技术人员应该很容易意识到，结合本文中所公开的实施例描述的各示例的单元及算法步骤，本公开能够以硬件或硬件和计算机软件的结合形式来实现。某个功能究竟以硬件还是计算机软件驱动硬件的方式来执行，取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能，但是这种实现不应认为超出本公开的范围。The above mainly introduces the solution provided by the embodiment of the present disclosure from the perspective of the server. It is understandable that in order to realize the above functions, the server includes hardware structures and/or software modules corresponding to the execution of each function. Those skilled in the art should easily realize that, in combination with the units and algorithm steps of each example described in the embodiments disclosed herein, the present disclosure can be implemented in the form of hardware or a combination of hardware and computer software. Whether a function is executed in the form of hardware or computer software driving hardware depends on the specific application and design constraints of the technical solution. Professional and technical personnel can use different methods to implement the described functions for each specific application, but such implementation should not be considered to exceed the scope of the present disclosure.

本公开实施例可以根据上述方法示例对服务器进行功能单元的划分，例如，可以对应各个功能划分各个功能模块，也可以将两个或两个以上的功能集成在一个处理模块中。上述集成的模块既可以采用硬件的形式实现，也可以采用软件功能模块的形式实现。需要说明的是，本公开实施例中对模块的划分是示意性的，仅仅为一种逻辑功能划分，实际实现时可以有另外的划分方式。The embodiment of the present disclosure can divide the functional units of the server according to the above method example. For example, each functional module can be divided according to each function, or two or more functions can be integrated into one processing module. The above integrated module can be implemented in the form of hardware or in the form of software functional modules. It should be noted that the division of modules in the embodiment of the present disclosure is schematic and is only a logical functional division. There may be other division methods in actual implementation.

在采用对应各个功能划分各个功能模块的情况下，本公开实施例提供一种多媒体文件生成装置，该多媒体文件生成装置可以为服务器或应用于服务器的芯片。图7示出了本公开实施例的一种多媒体文件生成装置的结构示意图。如图7所示，该多媒体文件生成装置700包括：In the case of dividing each functional module according to each function, the embodiment of the present disclosure provides a multimedia file generation device, which can be a server or a chip applied to a server. FIG7 shows a schematic diagram of the structure of a multimedia file generation device according to an embodiment of the present disclosure. As shown in FIG7, the multimedia file generation device 700 includes:

获取模块701，被配置为获取多媒体文件，其中，所述多媒体文件包括冗余信息；The acquisition module 701 is configured to acquire a multimedia file, wherein the multimedia file includes redundant information;

生成模块702，被配置为将隐写信息写入所述冗余信息中，以生成携带有所述隐写信息的多媒体文件，其中，所述隐写信息用于指示以特定播放方式对所述多媒体文件进行播放。The generating module 702 is configured to write the stego information into the redundant information to generate a multimedia file carrying the stego information, wherein the stego information is used to indicate that the multimedia file is to be played in a specific playing mode.

在一种可能的实现方式中，基于至少一个时间段对所述多媒体文件进行分段处理，得到与所述至少一个时间段中的每个时间段对应的多媒体片段；在预存的索引表中选择与所述每个时间段对应的多媒体片段匹配的播放方式，其中，所述索引表用于保存所有多媒体片段对应的多个播放控制信息。In one possible implementation, the multimedia file is segmented based on at least one time period to obtain multimedia segments corresponding to each of the at least one time period; and a playback mode matching the multimedia segments corresponding to each time period is selected from a pre-stored index table, wherein the index table is used to store multiple playback control information corresponding to all multimedia segments.

在一种可能的实现方式中，所述生成模块702还用于对所述播放方式对应的数值进行编码，得到编码字符串；对所述编码字符串进行压缩，得到压缩字符串；对所述压缩字符串进行加密，得到加密字符串；将所述加密字符串隐写到所述冗余信息中，得到隐写有所述加密字符串的多媒体文件。In a possible implementation, the generation module 702 is also used to encode the numerical value corresponding to the playback mode to obtain a coded string; compress the coded string to obtain a compressed string; encrypt the compressed string to obtain an encrypted string; and steganographically write the encrypted string into the redundant information to obtain a multimedia file with the encrypted string steganographically written therein.

在采用对应各个功能划分各个功能模块的情况下，本公开实施例提供一种多媒体文件播放装置，该多媒体文件播放装置可以为终端或应用于终端的芯片。图8示出了本公开实施例的一种多媒体文件播放装置的结构示意图。如图8所示，该多媒体文件播放装置800包括：In the case of dividing each functional module according to each function, the embodiment of the present disclosure provides a multimedia file playback device, which can be a terminal or a chip applied to a terminal. FIG8 shows a schematic diagram of the structure of a multimedia file playback device according to the embodiment of the present disclosure. As shown in FIG8, the multimedia file playback device 800 includes:

获取模块801，被配置为获取多媒体文件；The acquisition module 801 is configured to acquire multimedia files;

检测模块802，被配置为检测所述多媒体文件中是否存在隐写信息，其中，所述隐写信息用于指示以特定的播放方式对所述多媒体文件进行播放；The detection module 802 is configured to detect whether there is stego information in the multimedia file, wherein the stego information is used to indicate that the multimedia file is to be played in a specific playback mode;

播放模块803，被配置为在所述多媒体文件中存在所述隐写信息的情况下，以特定的播放方式对所述多媒体文件进行播放。The playing module 803 is configured to play the multimedia file in a specific playing mode when the steganographic information exists in the multimedia file.

在一种可能的实现方式中，所述播放模块803还用于对所述隐写信息进行提取，得到加密字符串；对所述加密字符串进行解密，得到压缩字符串；对所述压缩字符串进行解压缩，得到解压缩字符串；对所述解压缩字符串进行解码，得到数值；在预存的索引表中选择与所述数值对应的播放方式；控制所述多媒体文件以所述播放方式进行播放。In a possible implementation, the playback module 803 is also used to extract the steganographic information to obtain an encrypted string; decrypt the encrypted string to obtain a compressed string; decompress the compressed string to obtain a decompressed string; decode the decompressed string to obtain a numerical value; select a playback mode corresponding to the numerical value in a pre-stored index table; and control the multimedia file to be played in the playback mode.

在一种可能的实现方式中，所述播放模块803还用于在所述多媒体文件中不存在所述隐写信息的情况下，直接对所述多媒体文件进行播放。In a possible implementation, the playing module 803 is further configured to directly play the multimedia file when the steganographic information does not exist in the multimedia file.

本公开实施例还提供一种电子设备，包括：至少一个处理器；用于存储至少一个处理器可执行指令的存储器；其中，至少一个处理器用于执行指令，以实现本公开实施例公开的上述方法的步骤。An embodiment of the present disclosure also provides an electronic device, comprising: at least one processor; a memory for storing instructions executable by at least one processor; wherein the at least one processor is used to execute instructions to implement the steps of the above method disclosed in the embodiment of the present disclosure.

图9示出了本公开实施例提供的电子设备的结构示意图。如图9所示，该电子设备900包括至少一个处理器901以及耦接至处理器901的存储器902，该处理器901可以执行本公开实施例公开的上述方法中的相应步骤。Fig. 9 shows a schematic diagram of the structure of an electronic device provided by an embodiment of the present disclosure. As shown in Fig. 9, the electronic device 900 includes at least one processor 901 and a memory 902 coupled to the processor 901, and the processor 901 can execute the corresponding steps in the above method disclosed in the embodiment of the present disclosure.

上述处理器901还可以称为中央处理单元(Central Processing Unit，CPU)，其可以是一种集成电路芯片，具有信号的处理能力。本公开实施例公开的上述方法中的各步骤可以通过处理器901中的硬件的集成逻辑电路或者软件形式的指令完成。上述的处理器901可以是通用处理器、数字信号处理器(Digital Signal Processing，DSP)、ASIC、现成可编程门阵列(Field-programmable Gate Array，FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。结合本公开实施例所公开的方法的步骤可以直接体现为硬件译码处理器执行完成，或者用译码处理器中的硬件及软件模块组合执行完成。软件模块可以位于存储器902中，例如随机存储器，闪存、只读存储器，可编程只读存储器或者电可擦写可编程存储器、寄存器等本领域成熟的存储介质。处理器901读取存储器902中的信息，结合其硬件完成上述方法的步骤。The processor 901 may also be referred to as a central processing unit (CPU), which may be an integrated circuit chip having signal processing capabilities. Each step in the method disclosed in the embodiment of the present disclosure may be completed by an integrated logic circuit of hardware in the processor 901 or by instructions in the form of software. The processor 901 may be a general-purpose processor, a digital signal processor (DSP), an ASIC, a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component. A general-purpose processor may be a microprocessor or the processor may also be any conventional processor, etc. The steps of the method disclosed in the embodiment of the present disclosure may be directly embodied as being executed by a hardware decoding processor, or may be executed by a combination of hardware and software modules in a decoding processor. The software module may be located in a memory 902, such as a random access memory, a flash memory, a read-only memory, a programmable read-only memory, or an electrically erasable programmable memory, a register, or other mature storage media in the art. The processor 901 reads the information in the memory 902 and completes the steps of the method in combination with its hardware.

另外，根据本公开的各种操作/处理在通过软件和/或固件实现的情况下，可从存储介质或网络向具有专用硬件结构的计算机系统，例如，图10所示的计算机系统1000安装构成该软件的程序，该计算机系统在安装有各种程序时，能够执行各种功能，包括诸如前文所述的功能等等。图10示出了本公开实施例提供的计算机系统的结构示意图。In addition, when various operations/processes according to the present disclosure are implemented by software and/or firmware, the programs constituting the software can be installed from a storage medium or a network to a computer system having a dedicated hardware structure, for example, the computer system 1000 shown in FIG10 , and the computer system can perform various functions, including the functions described above, etc. when various programs are installed. FIG10 shows a schematic diagram of the structure of a computer system provided by an embodiment of the present disclosure.

计算机系统1000旨在表示各种形式的数字电子的计算机设备，诸如，膝上型计算机、台式计算机、工作台、个人数字助理、服务器、刀片式服务器、大型计算机、和其它适合的计算机。电子设备还可以表示各种形式的移动装置，诸如，个人数字处理、蜂窝电话、智能电话、可穿戴设备和其它类似的计算装置。本文所示的部件、它们的连接和关系、以及它们的功能仅仅作为示例，并且不意在限制本文中描述的和/或者要求的本公开的实现。Computer system 1000 is intended to represent various forms of digital electronic computer devices, such as laptop computers, desktop computers, workstations, personal digital assistants, servers, blade servers, mainframe computers, and other suitable computers. Electronic devices may also represent various forms of mobile devices, such as personal digital processing, cellular phones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions are merely examples and are not intended to limit the implementation of the present disclosure described and/or claimed herein.

如图10所示，计算机系统1000包括计算单元1001，该计算单元1001可以根据存储在只读存储器(ROM)1002中的计算机程序或者从存储单元1008加载到随机存取存储器(RAM)1003中的计算机程序，来执行各种适当的动作和处理。在RAM 1003中，还可存储计算机系统1000操作所需的各种程序和数据。计算单元1001、ROM 1002以及RAM 1003通过总线1004彼此相连。输入/输出(I/O)接口1005也连接至总线1004。As shown in FIG10 , the computer system 1000 includes a computing unit 1001, which can perform various appropriate actions and processes according to a computer program stored in a read-only memory (ROM) 1002 or a computer program loaded from a storage unit 1008 into a random access memory (RAM) 1003. In the RAM 1003, various programs and data required for the operation of the computer system 1000 can also be stored. The computing unit 1001, the ROM 1002, and the RAM 1003 are connected to each other via a bus 1004. An input/output (I/O) interface 1005 is also connected to the bus 1004.

计算机系统1000中的多个部件连接至I/O接口1005，包括：输入单元1006、输出单元1007、存储单元1008以及通信单元1009。输入单元1006可以是能向计算机系统1000输入信息的任何类型的设备，输入单元1006可以接收输入的数字或字符信息，以及产生与电子设备的用户设置和/或功能控制有关的键信号输入。输出单元1007可以是能呈现信息的任何类型的设备，并且可以包括但不限于显示器、扬声器、视频/多媒体文件输出终端、振动器和/或打印机。存储单元1008可以包括但不限于磁盘、光盘。通信单元1009允许计算机系统1000通过网络诸如因特网的与其他设备交换信息/数据，并且可以包括但不限于调制解调器、网卡、红外通信设备、无线通信收发机和/或芯片组，例如，蓝牙TM设备、WiFi设备、WiMax设备、蜂窝通信设备和/或类似物。A plurality of components in the computer system 1000 are connected to the I/O interface 1005, including: an input unit 1006, an output unit 1007, a storage unit 1008, and a communication unit 1009. The input unit 1006 may be any type of device capable of inputting information to the computer system 1000, and the input unit 1006 may receive input digital or character information, and generate key signal inputs related to user settings and/or function control of the electronic device. The output unit 1007 may be any type of device capable of presenting information, and may include but is not limited to a display, a speaker, a video/multimedia file output terminal, a vibrator, and/or a printer. The storage unit 1008 may include but is not limited to a disk, an optical disk. The communication unit 1009 allows the computer system 1000 to exchange information/data with other devices via a network such as the Internet, and may include but is not limited to a modem, a network card, an infrared communication device, a wireless communication transceiver, and/or a chipset, for example, a Bluetooth™ device, a WiFi device, a WiMax device, a cellular communication device, and/or the like.

计算单元1001可以是各种具有处理和计算能力的通用和/或专用处理组件。计算单元1001的一些示例包括但不限于中央处理单元(CPU)、图形处理单元(GPU)、各种专用的人工智能(AI)计算芯片、各种运行机器学习模型算法的计算单元、数字信号处理器(DSP)、以及任何适当的处理器、控制器、微控制器等。计算单元1001执行上文所描述的各个方法和处理。例如，在一些实施例中，本公开实施例公开的上述方法可被实现为计算机软件程序，其被有形地包含于机器可读介质，例如，存储单元1008。在一些实施例中，计算机程序的部分或者全部可以经由ROM 1002和/或通信单元1009而被载入和/或安装到电子设备1000上。在一些实施例中，计算单元1001可以通过其他任何适当的方式(例如，借助于固件)而被配置为执行本公开实施例公开的上述方法。The computing unit 1001 may be a variety of general and/or special processing components with processing and computing capabilities. Some examples of the computing unit 1001 include, but are not limited to, a central processing unit (CPU), a graphics processing unit (GPU), various dedicated artificial intelligence (AI) computing chips, various computing units running machine learning model algorithms, digital signal processors (DSPs), and any appropriate processors, controllers, microcontrollers, etc. The computing unit 1001 performs the various methods and processes described above. For example, in some embodiments, the above methods disclosed in the embodiments of the present disclosure may be implemented as a computer software program, which is tangibly included in a machine-readable medium, such as a storage unit 1008. In some embodiments, part or all of the computer program may be loaded and/or installed on the electronic device 1000 via the ROM 1002 and/or the communication unit 1009. In some embodiments, the computing unit 1001 may be configured to perform the above methods disclosed in the embodiments of the present disclosure in any other appropriate manner (e.g., by means of firmware).

本公开实施例还提供一种计算机可读存储介质，其中，当计算机可读存储介质中的指令由电子设备的处理器执行时，使得该电子设备能够执行本公开实施例公开的上述方法。The embodiment of the present disclosure also provides a computer-readable storage medium, wherein when the instructions in the computer-readable storage medium are executed by a processor of an electronic device, the electronic device is enabled to execute the above method disclosed in the embodiment of the present disclosure.

本公开实施例中的计算机可读存储介质可以是有形的介质，其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的程序。上述计算机可读存储介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备，或者上述内容的任何合适组合。更具体地，上述计算机可读存储介质可以包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦除可编程只读存储器(EPROM或快闪存储器)、光纤、便捷式紧凑盘只读存储器(CD-ROM)、光学储存设备、磁储存设备、或上述内容的任何合适组合。The computer-readable storage medium in the disclosed embodiments may be a tangible medium that may contain or store a program for use by or in conjunction with an instruction execution system, device, or equipment. The computer-readable storage medium may include, but is not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, devices, or equipment, or any suitable combination of the foregoing. More specifically, the computer-readable storage medium may include an electrical connection based on one or more lines, a portable computer disk, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disk read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.

上述计算机可读介质可以是上述电子设备中所包含的；也可以是单独存在，而未装配入该电子设备中。The computer-readable medium may be included in the electronic device, or may exist independently without being incorporated into the electronic device.

本公开实施例还提供一种计算机程序产品，包括计算机程序，其中，该计算机程序被处理器执行时实现本公开实施例公开的上述方法。The embodiments of the present disclosure also provide a computer program product, including a computer program, wherein when the computer program is executed by a processor, the above method disclosed in the embodiments of the present disclosure is implemented.

在本公开的实施例中，可以以一种或多种程序设计语言或其组合来编写用于执行本公开的操作的计算机程序代码，上述程序设计语言包括但不限于面向对象的程序设计语言，诸如Java、Smalltalk、C++，还包括常规的过程式程序设计语言，诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中，远程计算机可以通过任意种类的网络(包括局域网(LAN)或广域网(WAN))连接到用户计算机，或者，可以连接到外部计算机。In embodiments of the present disclosure, computer program codes for performing the operations of the present disclosure may be written in one or more programming languages or combinations thereof, including but not limited to object-oriented programming languages, such as Java, Smalltalk, C++, and conventional procedural programming languages, such as "C" language or similar programming languages. The program code may be executed entirely on a user's computer, partially on a user's computer, as an independent software package, partially on a user's computer, partially on a remote computer, or entirely on a remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer via any type of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer.

附图中的流程图和框图，图示了按照本公开各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上，流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分，该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意，在有些作为替换的实现中，方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如，两个接连地表示的方框实际上可以基本并行地执行，它们有时也可以按相反的顺序执行，这依所涉及的功能而定。也要注意的是，框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合，可以用执行规定的功能或操作的专用的基于硬件的系统来实现，或者可以用专用硬件与计算机指令的组合来实现。The flow chart and block diagram in the accompanying drawings illustrate the possible architecture, function and operation of the system, method and computer program product according to various embodiments of the present disclosure. In this regard, each square box in the flow chart or block diagram can represent a module, a program segment or a part of a code, and the module, the program segment or a part of the code contains one or more executable instructions for realizing the specified logical function. It should also be noted that in some implementations as replacements, the functions marked in the square box can also occur in a sequence different from that marked in the accompanying drawings. For example, two square boxes represented in succession can actually be executed substantially in parallel, and they can sometimes be executed in the opposite order, depending on the functions involved. It should also be noted that each square box in the block diagram and/or flow chart, and the combination of the square boxes in the block diagram and/or flow chart can be implemented with a dedicated hardware-based system that performs a specified function or operation, or can be implemented with a combination of dedicated hardware and computer instructions.

描述于本公开实施例中所涉及到的模块、部件或单元可以通过软件的方式实现，也可以通过硬件的方式来实现。其中，模块、部件或单元的名称在某种情况下并不构成对该模块、部件或单元本身的限定。The modules, components or units involved in the embodiments described in the present disclosure may be implemented by software or hardware, wherein the names of the modules, components or units do not, in some cases, limit the modules, components or units themselves.

本文中以上描述的功能可以至少部分地由一个或多个硬件逻辑部件来执行。例如，非限制性地，可以使用的示例性的硬件逻辑部件包括：现场可编程门阵列(FPGA)、专用集成电路(ASIC)、专用标准产品(ASSP)、片上系统(SOC)、复杂可编程逻辑设备(CPLD)等等。The functions described above herein may be performed at least in part by one or more hardware logic components. For example, exemplary hardware logic components that may be used include, without limitation, field programmable gate arrays (FPGAs), application specific integrated circuits (ASICs), application specific standard products (ASSPs), systems on chip (SOCs), complex programmable logic devices (CPLDs), and the like.

以上描述仅为本公开的一些实施例以及对所运用技术原理的说明。本领域技术人员应当理解，本公开中所涉及的公开范围，并不限于上述技术特征的特定组合而成的技术方案，同时也应涵盖在不脱离上述公开构思的情况下，由上述技术特征或其等同特征进行任意组合而形成的其它技术方案。例如上述特征与本公开中公开的(但不限于)具有类似功能的技术特征进行互相替换而形成的技术方案。The above descriptions are only some embodiments of the present disclosure and an explanation of the technical principles used. Those skilled in the art should understand that the scope of disclosure involved in the present disclosure is not limited to the technical solutions formed by a specific combination of the above technical features, but should also cover other technical solutions formed by any combination of the above technical features or their equivalent features without departing from the above disclosed concept. For example, a technical solution formed by replacing the above features with the technical features with similar functions disclosed in the present disclosure (but not limited to).

虽然已经通过示例对本公开的一些特定实施例进行了详细说明，但是本领域的技术人员应该理解，以上示例仅是为了进行说明，而不是为了限制本公开的范围。本领域的技术人员应该理解，可在不脱离本公开的范围和精神的情况下，对以上实施例进行修改。本公开的范围由所附权利要求来限定。Although some specific embodiments of the present disclosure have been described in detail by way of example, it should be understood by those skilled in the art that the above examples are for illustration only and are not intended to limit the scope of the present disclosure. It should be understood by those skilled in the art that the above embodiments may be modified without departing from the scope and spirit of the present disclosure. The scope of the present disclosure is defined by the appended claims.

Claims

1. A method for generating a multimedia file, comprising:

Acquire a multimedia file, wherein the multimedia file includes redundant information;

The steganographic information is written into the redundant information to generate a multimedia file carrying the steganographic information, wherein the steganographic information is used to indicate that the multimedia file is to be played in a specific playing mode.

2. The method according to claim 1, characterized in that the method further comprises:

Segmenting the multimedia file based on at least one time period to obtain multimedia segments corresponding to each time period in the at least one time period;

A play mode matching the multimedia segment corresponding to each time period is selected in a pre-stored index table, wherein the index table is used to store a plurality of play control information corresponding to all multimedia segments.

3. The method according to claim 2, characterized in that the step of writing the stego information into the redundant information to generate a multimedia file carrying the stego information comprises:

Encode the numerical value corresponding to the playing mode to obtain a coded character string;

Compressing the encoded string to obtain a compressed string;

Encrypting the compressed string to obtain an encrypted string;

The encrypted string is steganographically written into the redundant information to obtain a multimedia file in which the encrypted string is steganographically written.

4. A multimedia file playing method, characterized by comprising:

Get multimedia files;

Detecting whether there is steganographic information in the multimedia file, wherein the steganographic information is used to instruct to play the multimedia file in a specific playing mode;

In the case where the steganographic information exists in the multimedia file, the multimedia file is played in a specific playing mode.

5. The method according to claim 4, characterized in that, when the steganographic information exists in the multimedia file, playing the multimedia file in a specific playing mode comprises:

Extracting the stego information to obtain an encrypted string;

Decrypting the encrypted string to obtain a compressed string;

Decompressing the compressed string to obtain a decompressed string;

Decoding the decompressed character string to obtain a numerical value;

Selecting a playback mode corresponding to the value in a pre-stored index table;

Control the multimedia file to be played in the playing mode.

6. The method according to claim 4 or 5, characterized in that the method further comprises:

When the steganographic information does not exist in the multimedia file, the multimedia file is directly played.

7. A multimedia file generating device, characterized by comprising:

An acquisition module, configured to acquire a multimedia file, wherein the multimedia file includes redundant information;

The generating module is configured to write the stego information into the redundant information to generate a multimedia file carrying the stego information, wherein the stego information is used to indicate that the multimedia file is to be played in a specific playing mode.

8. A multimedia file playing device, comprising:

An acquisition module, configured to acquire multimedia files;

a detection module configured to detect whether there is steganographic information in the multimedia file, wherein the steganographic information is used to indicate that the multimedia file is to be played in a specific playback mode;

The playing module is configured to play the multimedia file in a specific playing mode when the steganographic information exists in the multimedia file.

9. An electronic device, comprising:

at least one processor;

a memory for storing the at least one processor-executable instruction;

The at least one processor is configured to execute the instructions to implement the steps of the method as claimed in any one of claims 1 to 6.

10. A computer-readable storage medium, characterized in that when instructions in the computer-readable storage medium are executed by a processor of an electronic device, the electronic device is enabled to execute the method according to any one of claims 1 to 6.