[go: up one dir, main page]

WO2025139877A1 - Video editing method and apparatus, and electronic device and storage medium - Google Patents

Video editing method and apparatus, and electronic device and storage medium Download PDF

Info

Publication number
WO2025139877A1
WO2025139877A1 PCT/CN2024/139575 CN2024139575W WO2025139877A1 WO 2025139877 A1 WO2025139877 A1 WO 2025139877A1 CN 2024139575 W CN2024139575 W CN 2024139575W WO 2025139877 A1 WO2025139877 A1 WO 2025139877A1
Authority
WO
WIPO (PCT)
Prior art keywords
editing
video
segment
target
template
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/CN2024/139575
Other languages
French (fr)
Chinese (zh)
Inventor
周颖枝
郑涵奇
郭鹏举
万秭兰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Zitiao Network Technology Co Ltd
Original Assignee
Beijing Zitiao Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Zitiao Network Technology Co Ltd filed Critical Beijing Zitiao Network Technology Co Ltd
Publication of WO2025139877A1 publication Critical patent/WO2025139877A1/en
Pending legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47205End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for manipulating displayed content, e.g. interacting with MPEG-4 objects, editing locally

Definitions

  • Embodiments of the present disclosure relate to a video editing method and device, an electronic device, and a storage medium.
  • Video content production can be seen everywhere in daily life. Users use video content production to record their lives, show their personality, and output value. There are usually two ways to produce videos. One is to record and edit the video by yourself; the other is to make videos using video templates. Since making videos using video templates is convenient, fast, and has rich effects, it has become the main way to share video content.
  • At least one embodiment of the present disclosure provides a video editing method, comprising: in response to a trigger operation on a video editing template, displaying a video editing interface, the video editing template comprising structural information and editing information, the structural information being used to indicate at least one video segment, the editing information being used to indicate at least one editing effect applied in the at least one video segment, the video editing interface comprising segment editing areas corresponding to the at least one video segment respectively; in response to a material import operation for a target video segment, displaying an identifier of at least one image material in the segment editing area of the target video segment, the at least one video segment comprising the target video segment, the at least one image material being a material imported into the target video segment based on the material import operation; and in response to a trigger operation for editing processing, generating a video editing result according to the video editing template and the at least one image material, the portion of the video editing result corresponding to the target video segment being an editing result obtained based on a target editing effect and at least one image material, the target editing effect matching an
  • At least one embodiment of the present disclosure further provides a video editing device, comprising: a first display unit, configured to display a video editing interface in response to a trigger operation on a video editing template, wherein the video editing template includes structural information and editing information, the structural information is used to indicate at least one video segment, the editing information is used to indicate at least one editing effect applied in the at least one video segment, and the video editing interface includes segment editing areas corresponding to the at least one video segment respectively; a second display unit, configured to display an identifier of at least one image material in the segment editing area of the target video segment in response to a material import operation for a target video segment, the at least one video segment includes the target video segment, and the at least one image material is a material imported into the target video segment based on the material import operation; and a result generating unit, configured to generate a video editing result according to the video editing template and the at least one image material in response to a trigger operation of an editing process, wherein the portion of the video editing result corresponding to the target video segment is an
  • At least one embodiment of the present disclosure also provides an electronic device, comprising: a processor; a memory, comprising one or more computer program modules; wherein the one or more computer program modules are stored in the memory and configured to be executed by the processor, and the one or more computer program modules include instructions for implementing the video editing method described in any embodiment of the present disclosure.
  • At least one embodiment of the present disclosure further provides a storage medium for storing non-transitory computer-readable instructions.
  • the non-transitory computer-readable instructions are executed by a computer, the video editing method described in any embodiment of the present disclosure can be implemented.
  • FIG1A is a schematic diagram showing a flow chart of a video editing method provided by at least one embodiment of the present disclosure
  • FIG1B is a schematic diagram showing a video editing method provided by at least one embodiment of the present disclosure.
  • FIG1C shows a schematic diagram of structural information provided by at least one embodiment of the present disclosure
  • FIG1D shows a schematic diagram of a video editing interface provided by at least one embodiment of the present disclosure
  • FIG2 shows a schematic diagram of adjusting the editing effect in a video editing template to obtain the editing effect in a video editing result provided by some embodiments of the present disclosure
  • FIG. 3 shows a method flow chart of step S30 in FIG. 1A provided by at least one embodiment of the present disclosure
  • FIG4 is a schematic diagram showing a preview page provided by at least some embodiments of the present disclosure.
  • 5A to 5F are schematic diagrams showing a method of modifying subtitles provided by at least some embodiments of the present disclosure.
  • FIG6 is a schematic block diagram of a video editing device provided by some embodiments of the present disclosure.
  • FIG7 is a schematic block diagram of an electronic device provided by some embodiments of the present disclosure.
  • FIG8 is a schematic block diagram of another electronic device provided by some embodiments of the present disclosure.
  • FIG. 9 is a schematic diagram of a storage medium provided in some embodiments of the present disclosure.
  • Video templates for short videos have fixed slots, and users need to import the same number of materials, the same number of video clips, and the same length of video clips as the video template. Therefore, video templates for short videos have higher requirements for the materials imported by users, more restrictions, and the use of video templates is not flexible.
  • At least one embodiment of the present disclosure provides a video editing method and apparatus, an electronic device and a storage medium, which can reduce the requirements and restrictions of video templates on materials imported by users and improve the flexibility of using video templates.
  • At least one embodiment of the present disclosure provides a method for generating videos using video templates, which method not only reduces the requirements for materials and makes the use of video templates more flexible, but also is applicable to medium videos in addition to short videos.
  • Short videos usually refer to videos that are originally created by users and are generally less than 1 minute in length, that is, user-generated content (UGC).
  • Long videos usually refer to videos that are professionally produced and are more than 30 minutes long, with higher content quality, that is, professionally generated content (PGC).
  • Medium videos are between short videos and long videos. Although they are original content generated by users, the professional level of users is higher than that of short videos, that is, professionally generated content (PUGC) and the length of medium videos is usually between short videos and long videos.
  • short videos do not require deliberate time to watch, and they deliver fragmented content. They can obtain the key content of the video in a relatively short period of time, but it may not be retained. Medium and long videos require finding a suitable time and place to watch, and require more effort, so that the video content can be retained in memory for a longer period of time.
  • video template and template video are the same, both refer to templates used as reference when making videos, and the template is displayed in the form of a video.
  • At least one embodiment of the present disclosure provides a video editing method, a video editing device, an electronic device, and a computer-readable storage medium.
  • the video editing method includes: in response to a trigger operation on a video editing template, displaying a video editing interface, the video editing template includes structural information and editing information, the structural information is used to indicate at least one video segment, the editing information is used to indicate at least one editing effect applied in the at least one video segment, and the video editing interface includes a segment editing area corresponding to the at least one video segment; in response to a material import operation for a target video segment, displaying an identifier of at least one image material in the segment editing area of the target video segment, the at least one video segment includes a target video segment, and the at least one image material is a material imported into the target video segment based on the material import operation; and in response to a trigger operation of an editing process, generating a video editing result according to the video editing template and at least one image material, the portion of the video editing result corresponding to the target video segment is
  • This video editing method does not limit the at least one imported image material to match the video editing template, but applies the editing effect of the video editing template to the at least one imported image material. There are no requirements on the quantity and time length of the at least one imported image material. This not only reduces the requirements on the image material, making the use of video editing templates more flexible, but also the method is applicable to medium videos in addition to short videos.
  • Figure 1A is a flowchart of a video editing method provided by at least one embodiment of the present disclosure
  • Figure 1B is a schematic diagram of a video editing method provided by at least one embodiment of the present disclosure
  • Figure 1C is a schematic diagram of structural information provided by at least one embodiment of the present disclosure
  • Figure 1D is a schematic diagram of a video editing interface provided by at least one embodiment of the present disclosure.
  • the method includes the following operations.
  • Step S10 In response to a triggering operation on a video editing template, a video editing interface is displayed, wherein the video editing template includes structural information and editing information, wherein the structural information is used to indicate at least one video segment, and the editing information is used to indicate at least one editing effect applied in at least one video segment, and the video editing interface includes segment editing areas corresponding to the at least one video segment.
  • step S10 the triggering operation of the video editing template is, for example, clicking on the use icon in the main page of the video editing template.
  • the main page of the video editing template is page 101, and the page 101 includes a use icon 102. If the client receives a click operation on the use icon 102, the video editing interface 103 shown in FIG1D is displayed.
  • the video editing method is applied to a video application, and multiple video editing template covers are displayed in the template recommendation page of the video application. If the user selects the cover of a certain video editing template, the demonstration video of the video editing template is played for the user's reference. When the demonstration video ends, the main page of the video editing template is entered. The user can operate in the main page (for example, page 101) to use the video editing template or not. If the user performs a trigger operation in the main page, it means that the user chooses to use the video editing template, and then enters the video editing interface 103 of the video editing template to perform video editing in the video editing interface 103.
  • the video editing template includes structure information 100.
  • the structure information 100 is set by, for example, the producer of the video editing template.
  • the publishing link and the vimo platform support adding the video editing template structure information.
  • the structure information 100 shown in Fig. 1 is added through the vimo platform.
  • the structural information 100 includes three parts: overall description, intelligent capability, and template structure decomposition.
  • the overall description part is controlled by the operator, for example, to fill in the description of the video editing template, such as content, style, etc.
  • the overall description part does not limit the maximum number of characters to be entered.
  • the structure of each part in the structural information is similar, for example, the first part 1 is used as an example for explanation.
  • the first part 1 includes a title, a description, a start and end time, and a start and end segment.
  • the title is, for example, a refined description of the template image material of the first part 1 in the video editing template.
  • the template image material is, for example, the material used in the video editing template.
  • the description is, for example, a general description of the template image material of the first part 1.
  • the structural information indicates each of at least one video segment by a time interval or a serial number of continuous video segments.
  • each video segment is determined by a start and end time, or each video segment is determined by a start and end segment.
  • the template image material is at least one continuous video segment
  • the start and end time is selected from the Kth second to the Lth second (time interval) from the at least one continuous video segment as the material of the first part 1
  • the start and end segments are, for example, selected from the jth segment to the ith segment from the at least one continuous video segment as the material of the first part 1.
  • j and i are examples of serial numbers of video segments.
  • the video editing template includes the template image material. In some other embodiments of the present disclosure, the video editing template may not include the template image material.
  • the video editing template of the medium video can directly reuse the production and publishing process of the short video template. After exporting the project file through the editing tool, select "Publish Template” to obtain the video editing template.
  • the medium video template After the medium video template is released, it supports storage and management in the vimo background, and reuses the short video template logic. For example, by manually marking it on vimo, it is determined whether the template is of the "medium video template” type. Vimo supports batch addition of identifiers of the "medium video template” type. For the medium video template type, it is necessary to add an attribute field of the "medium video template” type to the metadata, and distinguish the video type through the type attribute field identifier (for example, the identifier is video_type_id). For those marked as medium video template types, the medium video template logic is performed on the overall template link.
  • editing information for example, it can be similar to a short video template.
  • Producers can add text information, dubbing information, music information, sticker information, filter information, transition animation information and other editing information according to their own ideas and needs.
  • the editing information indicates the editing effects, and the editing effects include at least one of subtitles, dubbing, music, text, filters, and transition animations.
  • the video editing interface 103 includes at least one paragraph editing area corresponding to each video segment, and each paragraph editing area is used as an editing area for a theme, and displays video clips of different themes.
  • the paragraph editing area 11 corresponding to the video segment indicated by the first part 1 in FIG1C if the content entered in the title of the first part 1 is "finished dish", then the title of the paragraph editing area 11 corresponding to the video segment indicated by the first part 1 is "finished dish", for example, the structural information also includes a second part, if the content entered in the title of the second part is "food preparation process", then the video segment indicated by the second part can correspond to the paragraph editing area 12, and the title of the paragraph editing area 12 is "food preparation process”.
  • Each video segment is regarded as a subject, and each segment editing area is regarded as an editing area of a subject.
  • the video segment is referred to as a subject, and the segment editing area is referred to as an editing area of a subject.
  • the video editing interface 103 can be divided into multiple parts, each part is used to display different content.
  • the video editing interface 103 includes a navigation bar title 113, in which the title is "Select Material”, and the video editing interface 103 also includes M topics arranged in sequence.
  • the editing area, M is a positive integer.
  • the first theme is "Finished Dish”
  • the second theme is "Food Production Process”.
  • the user can import the corresponding creative video clips or pictures for each theme in the editing area of each theme; or choose not to import creative video clips or pictures in at least one of the M themes.
  • the relevant description 163 of the video segment corresponding to the theme can be included. If there is no relevant description of the theme in the video editing template, for example, if the corresponding characters are not entered in the description part of Figure 1C, the relevant description will not be displayed.
  • the short video template is a video editing draft in which some video clips are designated as materials that need to be replaced and filled; on the other hand, from the perspective of template use, the medium video template is to apply the video editing effect in each paragraph separately, and the application logic in the paragraph is based on the editing operation in the paragraph indicated by the editing information, which is applied to the material imported into the paragraph after changes (because the number of materials is not bound to the clips in the template, the editing operation cannot be directly reused on the material), and the application logic of the short video template is to import the video clips in the template into the material, and the editing operation directly acts on the material, without the need to divide the paragraphs or change the editing operation.
  • the editing area of each theme includes an add entry 123.
  • an album list is pulled up for the user to select an image material to be imported into the theme from the album list. That is, the material import operation is, for example, a click operation on the add entry 123 and a selection of an image material to be imported into the theme from the album list.
  • the adding entry 123 has no slot concept and does not limit the number of segments and the length of the segments.
  • a target video segment also referred to as a target theme
  • an identifier of at least one video material is displayed in a segment editing area of the target theme.
  • a target theme can be selected from the M themes for a material import operation. For example, if the user selects N themes in sequence for a material import operation, then the N themes are all used as target video segments. M and N are both positive integers.
  • Information prompt paragraph Information prompt paragraph In the example of FIG. 1D , for example, in response to the user clicking on the add entry 123 in “Dish Finished Product”, the album list 133 is pulled up for the user to select at least one image material related to the dish finished product from the album list 133. Afterwards, the identification of each of the at least one image material is displayed in the paragraph editing area 11 of the target theme “Dish Finished Product”.
  • the identification of each of the at least one image material is, for example, a frame of image extracted from each image material, for example, the first frame of image of each image material is extracted as the identification of the image material.
  • the user imports image material 1 and image material 8 from the album list 133, and the first frame of image material 1 and image material 8 are images showing the number 1 and the number 8, respectively, and the images showing the number 1 and the number 8 are displayed in the paragraph editing area 11.
  • the user can click the add entry 123 in the "food making process" to pull up the album list 133 again, so that the user can select at least one video material related to the food making process from the album list 133.
  • the user can perform a material selection operation for each theme, or only perform a material selection operation for some of the M themes.
  • the video editing interface of the video editing template includes M themes arranged in sequence, so that one or more image materials are grouped according to themes, so that the video structure is rationalized.
  • the amount and duration of imported materials can be subject to maximum restrictions. For example, the duration of a single material (i.e., a video clip) can only be 30 minutes at most.
  • the total duration of at least one image material is limited by the upper limit of the total duration of user materials in Figure 1C.
  • the album list may include a prompt text, for example, when no image material is selected, the default prompt text is "Select the image material to be imported”; when the image material has been selected, the prompt text is changed to "%d segments have been selected” according to the number of materials. As shown in FIG. 1D, image material 1 and image material 8 are selected, and the prompt text is "2 segments have been selected”. As shown in FIG. 1D, the album list may also include a confirmation add button 143.
  • the confirmation add button 143 includes two states: lit and grayed out.
  • the state of the confirmation add button 143 is lit, that is, the lighting logic is that when a segment of image material is imported, the button is lit, otherwise it is grayed out; after clicking the add button, the image material imported by the user is placed in the corresponding frame in its order.
  • the logos of image material 1 and image material 8 are placed in the theme of the paragraph editing area 11 with the theme of "finished dish". If the current theme frame is filled with image materials, enter the album list, and the added image materials are displayed by default.
  • the addition entry 123 moves backward following the position of the filled material, and the image material added later is automatically sorted behind the existing image material. If it exceeds one line, it will automatically wrap.
  • the added image material will display the length of its original image material, and will be intelligently captured and split when entering the next step of synthesis.
  • image material deletion operations are supported.
  • the image material is deleted, and the image material of the subject is automatically postponed to the front row.
  • the delete button can be selected in the album list page and the video editing interface 103 to perform the deletion operation.
  • the user can also sort and edit the imported image materials. For example, the positions of two image materials are swapped, or editing operations such as beautification and cropping are performed on a certain image material.
  • the image material may be a video clip or a picture.
  • the triggering operation of the editing process is, for example, a selection operation of an export control or a preview control after the editing is completed through the video editing interface.
  • an export synthesized video or an export generated file can be generated based on the video editing template and at least one image material.
  • a preview object of the preview page is generated based on the video editing template and at least one image material.
  • the preview object can be a video or a picture, etc.
  • the video editing result includes, for example, a plurality of video parts, each of which corresponds to a target video segment, that is, each video part is an editing result in which a target editing effect is applied to at least one video material in the target video segment.
  • the target editing effect is obtained based on an editing effect located in the target video segment among at least one editing effect.
  • a neural network for natural language processing may be used to analyze the prompt information to obtain an analysis result, thereby more accurately understanding the user's demands.
  • a sequence-to-sequence neural network model, a long short-term memory network, etc. may be used to understand the prompt information to obtain an analysis result.
  • feature recognition of image materials is performed using image processing technology to obtain an analysis result of at least one image material. Combining the analysis result of the prompt information with the analysis result of at least one image material, a second copy matching the prompt information and at least one image material is generated.
  • the second copy content can be "sour and sweet, refreshing”.
  • the influence of the analysis result on the content of the second copy is greater than the influence of at least one image material on the content of the second copy.
  • the weight of the analysis result is greater than the weight of at least one image material, which can better meet the user's demands and the generated copy content is more in line with the user's expectations.
  • the copy can be displayed according to the effective duration range of the copy in the video editing template.
  • the text 3 is displayed for the entire duration of Topic 3 (for example, a total of 2 minutes), but the duration of Topic 3 in the video editing result is a total of 3 minutes, then the copy is also only displayed within the first 2 minutes of Topic 3 in the video editing result.
  • the text 1 is displayed for the entire duration of Topic 1 (for example, a total of 1 minute), but the duration of Topic 1 in the video editing result is a total of 2 minutes, then the copy is also only displayed within the first 1 minute of Topic 3 in the video editing result.
  • image recognition is performed on the second material to identify the scene and tone of the second material, thereby generating text content corresponding to the scene and tone video based on the understanding of the second material.
  • the text format used in the video editing template is directly applied to the text content to obtain the second text.
  • transition animation (abbreviated as “transition”) 1 between Topic 1 and Topic 2, and there is a transition 2 between Topic 2 and Topic 3.
  • the transition animation is used to make the video clips of adjacent topics transition naturally, making the switching of the content in the video more natural and smooth.
  • a neural network for natural language processing may be used to analyze the prompt information to obtain analysis results, thereby more accurately understanding the user's demands.
  • a sequence-to-sequence neural network model, a long short-term memory network, etc. may be used to understand the prompt information.
  • FIG. 3 shows a method flow chart of step S30 in FIG. 1A provided by at least one embodiment of the present disclosure.
  • the method includes steps S301 to S304 .
  • Step S301 In response to a triggering operation of an editing process, the prompt information and at least one image material are analyzed to obtain an analysis result.
  • Step S302 in response to the editing effect including subtitles and/or dubbing, the content of the subtitles and/or dubbing in the target editing effect is obtained according to a matching strategy of matching the content of the subtitles and/or dubbing in the target editing effect with the analysis result.
  • Step S303 according to the matching strategy that the format of the subtitles in the target editing effect is the same as the format of the subtitles in the editing effect, the format of the subtitles in the target editing effect is obtained.
  • Step S304 according to the matching strategy that the sound parameters of the dubbing in the target editing effect are the same as the sound parameters of the dubbing in the editing effect, the sound parameters of the dubbing in the target editing effect are obtained.
  • step S301 the analysis results obtained by analyzing the prompt information and at least one image material are similar to those in the previous embodiment and will not be repeated here. For example, based on the image material imported by the user and the prompt information input, the content and appeal are understood, and appropriate subtitles and/or dubbing content are generated. If it is recognized that the image material imported by the user includes sound, then appropriate subtitles and/or dubbing content can be generated by combining the original sound in the image material and the image content of the image material and the prompt information. Alternatively, regardless of whether the image material itself includes sound, intelligent dubbing is used by default.
  • the video material is an image about making delicious food
  • the prompt information is "weekend”
  • the subtitles in the preview video can include "happy weekend”.
  • the matching strategy includes a matching strategy for the content of the subtitles and/or the content of the dubbing, a matching strategy for the subtitle format, and a matching strategy for the sound parameters.
  • the matching strategy may be that the content of subtitles and/or dubbing in the target editing effect matches the analysis result.
  • the matching of the content of subtitles and/or dubbing with the analysis result is similar to the matching of the content of the copywriting with the analysis result described above, and will not be repeated.
  • the sound parameters of dubbing may include, for example, the timbre, frequency, pitch, etc. of the dubbing.
  • the matching strategy is, for example, that the sound parameters of the dubbing in the target editing effect are the same as the sound parameters of the dubbing in the editing effect.
  • the timbre of the dubbing in the target editing effect is the same as the timbre, frequency, and pitch of the dubbing in the editing effect in the video editing template. That is, the subtitle format and the sound parameters of the sound can directly apply the format and sound parameters in the editing effect in the video editing template.
  • the default timbre is used, and the default timbre can be pre-set. That is, when the image material of the video editing template has an original sound and the video editing template retains the original sound, when the video editing template is used to synthesize the video editing result, the original sound in the video editing template will not be retained by default.
  • subtitles and/or dubbing may be effective for the entire video editing result.
  • subtitles are automatically added to the final video of all the video materials imported by the user, that is, the subtitles are displayed according to the duration of the final video.
  • a preset duration threshold such as 3 seconds
  • no subtitles and/or dubbing are added to the clip.
  • the duration of the subtitles is the same as the duration of the final video.
  • subtitles and dubbing can be one-to-one.
  • the underlying content of the dubbing is the subtitle content. If the subtitle content is modified, it is allowed to apply the modified content to the dubbing, and the reading content of the dubbing also changes automatically.
  • playing the dubbing and displaying the subtitles are decoupled.
  • Table 1 shows an example of decoupling dubbing and subtitles provided by at least some embodiments of the present disclosure.
  • dubbing and subtitles are decoupled. If the video editing template includes the "Smart Add Dubbing" tag and the "Smart Add Subtitles" tag, for example, the structural information of Figure 1C, the synthesized video editing result is also expected to have dubbing and subtitles. If the video editing template does not include the "Smart Add Dubbing" tag and the “Smart Add Subtitles” tag, the synthesized video editing result is also expected to not include dubbing and subtitles. If the video editing template includes the "Smart Add Dubbing" tag but does not include the "Smart Add Subtitles” tag, the synthesized video editing result is expected to include dubbing but not subtitles. If the video editing template does not include the "Smart Add Dubbing" tag but includes the "Smart Add Subtitles” tag, the synthesized video editing result is expected to not include dubbing but include subtitles.
  • the dubbing mark in the preview video corresponding to the video editing result is displayed as a closed state.
  • the subtitle mark in the preview video corresponding to the video editing result is displayed as a closed state.
  • the dubbing mark is in the open state and the subtitle mark is in the closed state; in scene 3, the dubbing mark is in the closed state and the subtitle mark is in the open state.
  • the dubbing icon includes two different display states, respectively representing the closed state and the open state
  • the subtitle icon also includes two different display states, respectively representing the closed state and the open state.
  • step S30 includes splicing the video parts corresponding to each target video segment to obtain a video editing result.
  • Table 2 shows an example of a material import method provided by some embodiments of the present disclosure.
  • the template video includes Topic 1, Topic 2, and Topic 3 arranged in sequence
  • the image material imported into Topic 1 includes Segment A, Segment B, and Segment C
  • the three segments are arranged as Segment A, Segment B, and Segment C
  • the image material of Topic 2 includes Segment D
  • the image material of Topic 3 includes Segment E, Segment F, and Segment G
  • the three segments are arranged as Segment E, Segment F, and Segment G.
  • the video part of Topic 1, the video part of Topic 2, and the video part of Topic 3 are spliced in the order of the themes to obtain the video editing result.
  • the video part of each theme is obtained based on the image material and the target editing effect. For example, the target editing effect is applied to the image material.
  • the video editing result is obtained by applying the first target editing effect to Segment A, Segment B, and Segment C, applying the second target editing effect to Segment D, and applying the third target editing effect to Segment E, Segment F, and Segment G, and then combining them in order.
  • the video material of subject 1 includes segment A, segment B and segment C, and the three segments are ordered as segment A, segment B and segment C, and no material is imported into subject 2, and no material is imported into subject 3.
  • Multiple segments are spliced together in the order of the themes to obtain a video that is combined in the order of segment A, segment B, and segment C, and the first target editing effect is applied to the spliced video of segment A, segment B, and segment C to obtain a video editing result.
  • the image material of Topic 2 includes Segment D
  • the image material of Topic 3 includes Segment E, Segment F, and Segment G
  • the three segments are arranged in the order of Segment E, Segment F, and Segment G.
  • Multiple segments are spliced in the order of themes to obtain a spliced video that is combined in the order of Segment D, Segment E, Segment F, and Segment G
  • the second target editing effect is applied to a portion of Segment D in the spliced video
  • the third target editing effect is applied to portions of Segments F and G in the spliced video to obtain a video editing result.
  • the music is global to the video and there is only one music.
  • a video editing template includes a music, and the music serves as the background music of the video editing template.
  • the background music in the video editing template can be directly applied to the generated video editing result, but the time length of the background music is appropriately adjusted.
  • the video editing template 201 includes theme 1, theme 2 and theme 3. At least one template image material of theme 1 includes segment 1 and segment 2, at least one template image material of theme 2 includes segment 3, segment 4, segment 5, segment 6 and segment 7, and at least one template image material of theme 3 includes segment 8, segment 9, segment 10 and segment 11.
  • the music in the video editing template 201 runs through the entire template video, for example, the entire video editing template uses the music as background music, and the original duration of the music (i.e., the first duration) is consistent with the video time length of the video editing template 201.
  • the user only imports video materials into Theme 2 and Theme 3 in the video editing interface, and does not import video materials into Theme 1, so that the playback time of the target video material in the video editing result is less than the time length of the combination of multiple template video materials of the video editing template 201 (hereinafter referred to as "video time length")
  • video time length the time length of the combination of multiple template video materials of the video editing template 201
  • the background music in the video editing result gradually weakens until it disappears, that is, the background music is faded out.
  • the playback time of the target video material in the adopted video material is shorter than the video time length of the video editing template 201, and the original duration of the music is also directly shortened to be consistent with the playback time length of the target video material.
  • the background music in the video editing result gradually weakens until it disappears, that is, the background music is faded out.
  • the user imported image materials into Theme 1, Theme 2 and Theme 3 respectively in the video editing interface.
  • the playback duration of the target image material in the image materials used is longer than the video time length of the video editing template 201.
  • the original duration of the music is extended to be consistent with the playback duration of the target image material.
  • the playback duration of the target image material is longer than the playback duration of the template image material in the video editing template
  • the end time of the music in the video editing template is extended backward, and the start time of the music in the video editing result is aligned with the video editing template.
  • the remaining music will be played continuously. For example, if the total duration of the music is 3 minutes, but the video editing template only uses the duration range of 0-1 minutes (i.e., the original duration is 1 minute), if the target video material exceeds 1 minute, the remaining content after 1 minute of the music can be played continuously.
  • the music can be extended by intelligently extending the music.
  • the intelligently extending music can automatically generate a music clip that connects to the current music by using a neural network, or find a melody or music that matches the current music from a music library.
  • the playing time is extended by automatically looping the current music.
  • the global editing effect includes a filter.
  • the filter effective time length in the video editing template is extended; or in response to the target image material playing time being shorter than the video time length, the filter effective time length in the video editing template is shortened.
  • subtitles can also be used as a global editing effect.
  • the processing logic of subtitles as a global editing effect is similar to that of the aforementioned filters and music, and will not be repeated here.
  • step S30 generates a video editing result based on a matching strategy.
  • step S30 may include: in response to a trigger operation of the editing process, using a neural network to process at least one image material and an editing effect to obtain a video editing result.
  • the image material and the editing effect are analyzed through a neural network, and the editing effect adapted to the image material is applied to the image material to obtain a video editing result.
  • the neural network may include, for example, a neural network for image processing, such as a convolutional neural network, a deep learning neural network, etc.
  • step S30 includes: in response to a triggering operation of editing processing, first pre-processing at least one image material to obtain a target image material, and then applying a corresponding editing effect to the target image material to obtain a video editing result.
  • Pre-processing includes, for example, highlight recognition, material segmentation, etc.
  • step S30 includes: extracting a target segment from at least one image material in response to a triggering operation of the editing process; and generating a video editing result according to the video editing template and the target segment.
  • Extracting the target segment from at least one image material may be, for example, performing highlight recognition on at least one image material to obtain a target segment at a highlight moment of at least one image material.
  • the target segment at a highlight moment refers to, for example, an optimal material obtained by intercepting an optimal portion of at least one of the at least one image material.
  • At least one image material can be automatically identified to obtain a preferred segment of at least one image material to generate a video editing result.
  • step S30 includes: in response to a trigger operation of editing processing, if there is a first image material in at least one image material, dividing the first image material into multiple sub-segments, wherein the first image material is a material with a duration greater than a preset duration; and generating a video editing result based on the video editing template and the multiple sub-segments.
  • the user can import at least one image material as shown in FIG1D .
  • at least one image material there may be a first image material whose time length is greater than a preset time length.
  • the first image material whose time length is greater than the preset time length can be segmented and processed, and the first image material is segmented into a plurality of sub-segments. These sub-segments are still used as materials of the theme corresponding to the first image material and are not used across themes to avoid unexpected user effects.
  • Dividing the first image material into a plurality of sub-segments is conducive to identifying and utilizing the sub-segments, at least partially avoiding the problem of long processing time caused by excessively long image materials, and is conducive to the flexible use of sub-segments.
  • the image material and prompt information imported by the user are analyzed and identified in advance, and when the trigger operation of the editing process is obtained, the image material is directly extracted (i.e., intercepted) and segmented according to the analysis result, thereby reducing the waiting time for generating the video editing result.
  • the material is analyzed, without waiting for the user to perform the trigger operation of the editing process before starting the analysis.
  • the method further includes playing a preview video on a preview page according to the video editing result. Playing the preview video allows the user to watch the video draft generated according to the image material, which is convenient for the user to modify in time when the user needs to modify.
  • the playback time of at least part of the material is consistent with the preset playback time.
  • the semantic analysis of the prompt information shows that the playback duration of the video the user hopes to produce is 20 minutes, and the duration of the target image material extracted from the image material and applied to the video editing result is also 20 minutes.
  • FIG. 4 shows a schematic diagram of a preview page provided by at least some embodiments of the present disclosure.
  • the preview page 401 includes a video playback window 411 , a material display window 421 , and an editing entry window 431 .
  • the material window unit B is selected, the material adding sub-window 4213 is clicked, and the album list is pulled up to display multiple image materials for selection. If the user adds and selects a target material from the multiple image materials for selection, the target material is added after the material window unit B. For example, a material window unit E is added between the material window unit B and the material window unit C to display the target material. Please refer to the above description for the album list.
  • the subtitle entry 4311 if the video editing template includes a tag for intelligently adding subtitles or a tag for intelligently adding dubbing, the condition for intelligently adding subtitles to the preview video is met, and the subtitle entry 4311 is displayed in the editing entry window 431. If the template video does not include a tag for intelligently adding subtitles or a tag for intelligently adding dubbing, the subtitle entry 4311 is not displayed in the editing entry window 431.
  • some video editing templates may have only dubbing or only subtitles.
  • the subtitle entry 4311 When there is only dubbing, the subtitle entry 4311 is normally displayed in the editing entry window 431, but the "subtitle" display in the subtitle editing page is closed by default, and the subtitle only serves the dubbing and reading content.
  • there may be only subtitles and the subtitle entry 4311 is normally displayed in the editing entry window 431, but the "dubbing" display in the subtitle editing page is closed by default, and only the subtitles are displayed without reading and dubbing.
  • a pop-up window 506 shown in Figure 5D may appear, and the pop-up window 506 is used to ask the user whether to apply the subtitle to dubbing. If the user chooses to apply to dubbing, the dubbing is modified to be consistent with the changed subtitles accordingly.
  • the subtitle secondary page 502 may further include a subtitle duration setting button 532 .
  • the subtitle duration setting button 532 For example, after the user selects a corresponding subtitle, he clicks the subtitle duration setting button 532, and the segment range 542 corresponding to the subtitle is displayed at the bottom of the subtitle secondary page, supporting the selection of a retractable duration range.
  • Other subtitles 552 can be displayed above the displayed segment range 542 corresponding to the subtitle, and by selecting other subtitles 552, the display can be switched to other subtitle duration ranges.
  • a "change a batch” button 562 may be permanently located on the subtitles primary page 501 and the subtitles secondary page 502. In response to a click operation on the "change a batch” button 562, new subtitle content is regenerated. When the subtitles are slid in the subtitles primary page 501 or the subtitles secondary page 502, the "change a batch” button 562 may disappear, and when the sliding stops, the "change a batch” button 562 continues to appear.
  • the first subtitle page 501 and the second subtitle page 502 may further include a first viewing icon for viewing the previous subtitle and a second viewing icon for viewing the next subtitle. Clicking the first viewing icon returns to the previous version, and clicking the second viewing icon continues to generate new subtitles.
  • the new dubbing modification is saved and the dubbing takes effect. If the effective icon is not clicked or the exit icon is clicked, the dubbing modification is not saved and the user returns to the previous page of the dubbing editing page.
  • music can be recommended by similar music. For example, music with a similarity greater than a predetermined value in the video editing template is recommended. Alternatively, music is recommended based on the understanding of the second material and the prompt information.
  • the user can select music on his own and set the volume of the music.
  • the music can also be processed at the point of the music.
  • the following devices may be connected to the I/O interface 350: an input device 360 including, for example, a touch screen, a touchpad, a keyboard, a mouse, a camera, a microphone, an accelerometer, a gyroscope, etc.; an output device 370 including, for example, a liquid crystal display (LCD), a speaker, a vibrator, etc.; a storage device 380 including, for example, a magnetic tape, a hard disk, etc.; and a communication device 390.
  • the communication device 390 may allow the electronic device 300 to communicate with other electronic devices wirelessly or by wire to exchange data.
  • FIG. 9 shows an electronic device 300 having various devices, it should be understood that it is not required to implement or have all of the devices shown, and the electronic device 300 may alternatively implement or have more or fewer devices.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

Provided in the embodiments of the present disclosure are a video editing method and apparatus, and an electronic device and a storage medium. The video editing method comprises: in response to a trigger operation on a video editing template, presenting a video editing interface; in response to a material import operation in respect of a target video segment, presenting identifiers of at least one image material in a segment editing area of the target video segment, wherein at least one video segment comprises the target video segment, and the at least one image material is material which is imported into the target video segment on the basis of the material import operation; and in response to a trigger operation for editing processing, generating a video editing result on the basis of the video editing template and the at least one image material.

Description

视频编辑方法及装置、电子设备及存储介质Video editing method and device, electronic device and storage medium

本申请要求于2023年12月28日递交的中国专利申请第202311843312.4号的优先权,在此全文引用上述中国专利申请公开的内容以作为本申请的一部分。This application claims priority to Chinese Patent Application No. 202311843312.4 filed on December 28, 2023, and the contents of the above-mentioned Chinese patent application disclosure are hereby cited in their entirety as a part of this application.

技术领域Technical Field

本公开的实施例涉及一种视频编辑方法及装置、电子设备及存储介质。Embodiments of the present disclosure relate to a video editing method and device, an electronic device, and a storage medium.

背景技术Background Art

视频内容制作在日常生活中随处可见,用户通过制作视频内容来记录生活、彰显个性以及输出价值。视频制作通常有两种方式,一是自行录制视频并剪辑完善;二是通过视频模板制作视频。由于通过视频模板制作视频操作方便、快捷、效果丰富,已经成为了视频内容分享的主要方式。Video content production can be seen everywhere in daily life. Users use video content production to record their lives, show their personality, and output value. There are usually two ways to produce videos. One is to record and edit the video by yourself; the other is to make videos using video templates. Since making videos using video templates is convenient, fast, and has rich effects, it has become the main way to share video content.

发明内容Summary of the invention

本公开至少一个实施例提供一种视频编辑方法,包括:响应于对视频编辑模板的触发操作,展示视频编辑界面,所述视频编辑模板包括结构信息和编辑信息,所述结构信息用于指示至少一个视频段落,所述编辑信息用于指示在所述至少一个视频段落中施加的至少一个编辑效果,所述视频编辑界面包括与所述至少一个视频段落分别对应的段落编辑区域;响应于针对目标视频段落的素材导入操作,在所述目标视频段落的段落编辑区域展示至少一个影像素材的标识,所述至少一个视频段落包括所述目标视频段落,所述至少一个影像素材为基于所述素材导入操作被导入在所述目标视频段落中的素材;以及响应于编辑处理的触发操作,根据所述视频编辑模板和所述至少一个影像素材,生成视频编辑结果,所述视频编辑结果对应于所述目标视频段落的部分为基于目标编辑效果和至少一个影像素材得到的编辑结果,所述目标编辑效果匹配所述至少一个编辑效果中位于所述目标视频段落内的编辑效果。At least one embodiment of the present disclosure provides a video editing method, comprising: in response to a trigger operation on a video editing template, displaying a video editing interface, the video editing template comprising structural information and editing information, the structural information being used to indicate at least one video segment, the editing information being used to indicate at least one editing effect applied in the at least one video segment, the video editing interface comprising segment editing areas corresponding to the at least one video segment respectively; in response to a material import operation for a target video segment, displaying an identifier of at least one image material in the segment editing area of the target video segment, the at least one video segment comprising the target video segment, the at least one image material being a material imported into the target video segment based on the material import operation; and in response to a trigger operation for editing processing, generating a video editing result according to the video editing template and the at least one image material, the portion of the video editing result corresponding to the target video segment being an editing result obtained based on a target editing effect and at least one image material, the target editing effect matching an editing effect of the at least one editing effect located within the target video segment.

本公开至少一个实施例还提供一种视频编辑装置,包括:第一展示单元,配置为响应于对视频编辑模板的触发操作,展示视频编辑界面,其中,所述视频编辑模板包括结构信息和编辑信息,所述结构信息用于指示至少一个视频段落,所述编辑信息用于指示在所述至少一个视频段落中施加的至少一个编辑效果,所述视频编辑界面包括与所述至少一个视频段落分别对应的段落编辑区域;第二展示单元,配置为响应于针对目标视频段落的素材导入操作,在所述目标视频段落的段落编辑区域展示至少一个影像素材的标识,所述至少一个视频段落包括所述目标视频段落,所述至少一个影像素材为基于所述素材导入操作被导入在所述目标视频段落中的素材;以及结果生成单元,配置为响应于编辑处理的触发操作,根据所述视频编辑模板和所述至少一个影像素材,生成视频编辑结果,其中,所述视频编辑结果对应于所述目标视频段落的部分为基于目标编辑效果和所述至少一个影像素材得到的编辑结果,所述目标编辑效果匹配所述至少一个编辑效果中位于所述目标视频段落内的编辑效果。At least one embodiment of the present disclosure further provides a video editing device, comprising: a first display unit, configured to display a video editing interface in response to a trigger operation on a video editing template, wherein the video editing template includes structural information and editing information, the structural information is used to indicate at least one video segment, the editing information is used to indicate at least one editing effect applied in the at least one video segment, and the video editing interface includes segment editing areas corresponding to the at least one video segment respectively; a second display unit, configured to display an identifier of at least one image material in the segment editing area of the target video segment in response to a material import operation for a target video segment, the at least one video segment includes the target video segment, and the at least one image material is a material imported into the target video segment based on the material import operation; and a result generating unit, configured to generate a video editing result according to the video editing template and the at least one image material in response to a trigger operation of an editing process, wherein the portion of the video editing result corresponding to the target video segment is an editing result obtained based on a target editing effect and the at least one image material, and the target editing effect matches an editing effect of the at least one editing effect located within the target video segment.

本公开至少一个实施例还提供一种电子设备,包括:处理器;存储器,包括一个或多个计算机程序模块;其中,所述一个或多个计算机程序模块被存储在所述存储器中并被配置为由所述处理器执行,所述一个或多个计算机程序模块包括用于实现本公开的任一实施例所述的视频编辑方法的指令。At least one embodiment of the present disclosure also provides an electronic device, comprising: a processor; a memory, comprising one or more computer program modules; wherein the one or more computer program modules are stored in the memory and configured to be executed by the processor, and the one or more computer program modules include instructions for implementing the video editing method described in any embodiment of the present disclosure.

本公开至少一个实施例还提供一种存储介质,用于存储非暂时性计算机可读指令,当所述非暂时性计算机可读指令由计算机执行时可以实现本公开的任一实施例所述的视频编辑方法。At least one embodiment of the present disclosure further provides a storage medium for storing non-transitory computer-readable instructions. When the non-transitory computer-readable instructions are executed by a computer, the video editing method described in any embodiment of the present disclosure can be implemented.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

结合附图并参考以下具体实施方式,本公开各实施例的上述和其他特征、优点及方面将变得更加明显。贯穿附图中,相同的附图标记表示相同的元素。应当理解附图是示意性的,原件和元素不一定按照比例绘制。The above and other features, advantages and aspects of the embodiments of the present disclosure will become more apparent with reference to the following detailed description in conjunction with the accompanying drawings. Throughout the accompanying drawings, the same reference numerals represent the same elements. It should be understood that the drawings are schematic and the originals and elements are not necessarily drawn to scale.

图1A示出了本公开至少一个实施例提供的一种视频编辑方法的流程示意图;FIG1A is a schematic diagram showing a flow chart of a video editing method provided by at least one embodiment of the present disclosure;

图1B示出了本公开至少一个实施例提供的一种视频编辑方法的示意图;FIG1B is a schematic diagram showing a video editing method provided by at least one embodiment of the present disclosure;

图1C示出了本公开至少一个实施例提供的一种结构信息的示意图;FIG1C shows a schematic diagram of structural information provided by at least one embodiment of the present disclosure;

图1D示出了本公开至少一个实施例提供的一种视频编辑界面的示意图;FIG1D shows a schematic diagram of a video editing interface provided by at least one embodiment of the present disclosure;

图2示出了本公开一些实施例提供的一种对视频编辑模板中的编辑效果调整得到视频编辑结果中的编辑效果的示意图;FIG2 shows a schematic diagram of adjusting the editing effect in a video editing template to obtain the editing effect in a video editing result provided by some embodiments of the present disclosure;

图3示出了本公开至少一个实施例提供的图1A中步骤S30的方法流程图;FIG. 3 shows a method flow chart of step S30 in FIG. 1A provided by at least one embodiment of the present disclosure;

图4示出了本公开至少一些实施例提供的一种预览页面的示意图;FIG4 is a schematic diagram showing a preview page provided by at least some embodiments of the present disclosure;

图5A~5F示出了本公开至少一些实施例提供的一种修改字幕的示意图;5A to 5F are schematic diagrams showing a method of modifying subtitles provided by at least some embodiments of the present disclosure;

图6为本公开一些实施例提供的一种视频编辑装置的示意框图;FIG6 is a schematic block diagram of a video editing device provided by some embodiments of the present disclosure;

图7为本公开一些实施例提供的一种电子设备的示意框图;FIG7 is a schematic block diagram of an electronic device provided by some embodiments of the present disclosure;

图8为本公开一些实施例提供的另一种电子设备的示意框图;以及FIG8 is a schematic block diagram of another electronic device provided by some embodiments of the present disclosure; and

图9为本公开一些实施例提供的一种存储介质的示意图。FIG. 9 is a schematic diagram of a storage medium provided in some embodiments of the present disclosure.

具体实施方式DETAILED DESCRIPTION

下面将参照附图更详细地描述本公开的实施例。虽然附图中显示了本公开的某些实施例,然而应当理解的是,本公开可以通过各种形式来实现,而且不应该被解释为限于这里阐述的实施例,相反提供这些实施例是为了更加透彻和完整地理解本公开。应当理解的是,本公开的附图及实施例仅用于示例性作用,并非用于限制本公开的保护范围。Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although certain embodiments of the present disclosure are shown in the accompanying drawings, it should be understood that the present disclosure can be implemented in various forms and should not be construed as being limited to the embodiments described herein, which are instead provided for a more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the present disclosure are only for exemplary purposes and are not intended to limit the scope of protection of the present disclosure.

应当理解,本公开的方法实施方式中记载的各个步骤可以按照不同的顺序执行,和/或并行执行。此外,方法实施方式可以包括附加的步骤和/或省略执行示出的步骤。本公开的范围在此方面不受限制。It should be understood that the various steps described in the method embodiments of the present disclosure may be performed in different orders and/or in parallel. In addition, the method embodiments may include additional steps and/or omit the steps shown. The scope of the present disclosure is not limited in this respect.

本文使用的术语“包括”及其变形是开放性包括,即“包括但不限于”。术语“基于”是“至少部分地基于”。术语“一个实施例”表示“至少一个实施例”;术语“另一实施例”表示“至少一个另外的实施例”;术语“一些实施例”表示“至少一些实施例”。其他术语的相关定义将在下文描述中给出。The term "including" and its variations used herein are open inclusions, i.e., "including but not limited to". The term "based on" means "based at least in part on". The term "one embodiment" means "at least one embodiment"; the term "another embodiment" means "at least one additional embodiment"; the term "some embodiments" means "at least some embodiments". The relevant definitions of other terms will be given in the following description.

需要注意,本公开中提及的“第一”、“第二”等概念仅用于对不同的装置、模块或单元进行区分,并非用于限定这些装置、模块或单元所执行的功能的顺序或者相互依存关系。It should be noted that the concepts such as "first" and "second" mentioned in the present disclosure are only used to distinguish different devices, modules or units, and are not used to limit the order or interdependence of the functions performed by these devices, modules or units.

需要注意,本公开中提及的“一个”、“多个”的修饰是示意性而非限制性的,本领域技术人员应当理解,除非在上下文另有明确指出,否则应该理解为“一个或多个”。“多个”应理解为两个或以上。It should be noted that the modifications of "one" and "plurality" mentioned in the present disclosure are illustrative rather than restrictive, and those skilled in the art should understand that unless otherwise clearly indicated in the context, it should be understood as "one or more". "Plurality" should be understood as two or more.

本公开实施方式中的多个装置之间所交互的消息或者信息的名称仅用于说明性的目的,而并不是用于对这些消息或信息的范围进行限制。The names of the messages or information exchanged between multiple devices in the embodiments of the present disclosure are only used for illustrative purposes and are not used to limit the scope of these messages or information.

随着视频模板效果的丰富和类型的增加,基于视频模板完成视频制作分享的需求和场景会日益增多。但是,目前的视频模板大多是针对短视频的,针对短视频的视频模板有固定槽位,用户需要导入和视频模板一样的素材数量、视频片段的数量和视频片段的时长也需要与视频模板一致。因此,针对短视频的视频模板对用户导入的素材的要求较高,限制较多,视频模板的使用不灵活。As the effects and types of video templates increase, the demand and scenarios for video production and sharing based on video templates will increase. However, most of the current video templates are for short videos. Video templates for short videos have fixed slots, and users need to import the same number of materials, the same number of video clips, and the same length of video clips as the video template. Therefore, video templates for short videos have higher requirements for the materials imported by users, more restrictions, and the use of video templates is not flexible.

针对上述问题,本公开至少一个实施例提供一种视频编辑方法及装置、电子设备及存储介质,可以降低视频模板对用户导入的素材的要求和限制,提高视频模板使用的灵活性。In view of the above problems, at least one embodiment of the present disclosure provides a video editing method and apparatus, an electronic device and a storage medium, which can reduce the requirements and restrictions of video templates on materials imported by users and improve the flexibility of using video templates.

本公开至少一个实施例提供一种利用视频模板生成视频的方法,该方法不仅降低了对素材的要求,使得视频模板的使用更加灵活,而且该方法除了适用于短视频之外还适用于中视频。短视频通常是指用户原创的时长一般小于1分钟的视频,即用户生产内容(User-generated Content,UGC),长视频通常是指专业结构制作的时长在30分钟以上视频,内容质量更高,即专业生产内容(Professionally-generated Content,PGC),中视频是介于短视频和长视频之间,虽然是用户生产原创内容,但用户的专业水平相较于短视频来说会高一点,即专业用户生产内容(Professionally-User-generated,PUGC)并且中视频的时长通常也介于短视频片和长视频之间。从用户角度来说,短视频不需要刻意腾出时间,传递着碎片化的内容,能在较短的时间里,获取视频的关键内容,但不一定能保留下来,而中视频和长视频需要找一个合适的时间和地方进行观看,需要投入更多的精力,视频内容能够较长时间的保留在记忆里。At least one embodiment of the present disclosure provides a method for generating videos using video templates, which method not only reduces the requirements for materials and makes the use of video templates more flexible, but also is applicable to medium videos in addition to short videos. Short videos usually refer to videos that are originally created by users and are generally less than 1 minute in length, that is, user-generated content (UGC). Long videos usually refer to videos that are professionally produced and are more than 30 minutes long, with higher content quality, that is, professionally generated content (PGC). Medium videos are between short videos and long videos. Although they are original content generated by users, the professional level of users is higher than that of short videos, that is, professionally generated content (PUGC) and the length of medium videos is usually between short videos and long videos. From the user's perspective, short videos do not require deliberate time to watch, and they deliver fragmented content. They can obtain the key content of the video in a relatively short period of time, but it may not be retained. Medium and long videos require finding a suitable time and place to watch, and require more effort, so that the video content can be retained in memory for a longer period of time.

需要说明的是,在本公开的实施例中,视频模板与模板视频的含义是相同的,都是指制作视频时参考的模板,且该模板是以视频的形式展示的。It should be noted that, in the embodiments of the present disclosure, the meanings of video template and template video are the same, both refer to templates used as reference when making videos, and the template is displayed in the form of a video.

本公开至少一个实施例提供一种视频编辑方法、视频编辑装置、电子设备和计算机可读存储介质。该视频编辑方法包括:响应于对视频编辑模板的触发操作,展示视频编辑界面,视频编辑模板包括结构信息和编辑信息,结构信息用于指示至少一个视频段落,编辑信息用于指示在至少一个视频段落中施加的至少一个编辑效果,视频编辑界面包括与至少一个视频段落分别对应的段落编辑区域;响应于针对目标视频段落的素材导入操作,在目标视频段落的段落编辑区域展示至少一个影像素材的标识,至少一个视频段落包括目标视频段落,至少一个影像素材为基于素材导入操作被导入在目标视频段落中的素材;以及响应于编辑处理的触发操作,根据视频编辑模板和至少一个影像素材,生成视频编辑结果,视频编辑结果对应于目标视频段落的部分为目标编辑效果被施加在至少一个影像素材上的编辑结果,目标编辑效果匹配至少一个编辑效果中位于目标视频段落内的编辑效果。该视频编辑方法不是限制导入的至少一个影像素材与视频编辑模板匹配,而是将视频编辑模板的编辑效果应用于导入的至少一个影像素材,对导入的至少一个影像素材的数量、时间长度没有要求,不仅降低了对影像素材的要求,使得视频编辑模板的使用更加灵活,而且该方法除了适用于短视频之外还适用于中视频。At least one embodiment of the present disclosure provides a video editing method, a video editing device, an electronic device, and a computer-readable storage medium. The video editing method includes: in response to a trigger operation on a video editing template, displaying a video editing interface, the video editing template includes structural information and editing information, the structural information is used to indicate at least one video segment, the editing information is used to indicate at least one editing effect applied in the at least one video segment, and the video editing interface includes a segment editing area corresponding to the at least one video segment; in response to a material import operation for a target video segment, displaying an identifier of at least one image material in the segment editing area of the target video segment, the at least one video segment includes a target video segment, and the at least one image material is a material imported into the target video segment based on the material import operation; and in response to a trigger operation of an editing process, generating a video editing result according to the video editing template and at least one image material, the portion of the video editing result corresponding to the target video segment is an editing result in which the target editing effect is applied to the at least one image material, and the target editing effect matches an editing effect located in the target video segment among the at least one editing effect. This video editing method does not limit the at least one imported image material to match the video editing template, but applies the editing effect of the video editing template to the at least one imported image material. There are no requirements on the quantity and time length of the at least one imported image material. This not only reduces the requirements on the image material, making the use of video editing templates more flexible, but also the method is applicable to medium videos in addition to short videos.

下面,将参考附图详细地说明本公开的实施例。Hereinafter, embodiments of the present disclosure will be described in detail with reference to the accompanying drawings.

图1A示出了本公开至少一个实施例提供的一种视频编辑方法的流程示意图;图1B示出了本公开至少一个实施例提供的一种视频编辑方法的示意图;图1C示出了本公开至少一个实施例提供的一种结构信息的示意图;图1D示出了本公开至少一个实施例提供的一种视频编辑界面的示意图。Figure 1A is a flowchart of a video editing method provided by at least one embodiment of the present disclosure; Figure 1B is a schematic diagram of a video editing method provided by at least one embodiment of the present disclosure; Figure 1C is a schematic diagram of structural information provided by at least one embodiment of the present disclosure; Figure 1D is a schematic diagram of a video editing interface provided by at least one embodiment of the present disclosure.

如图1A所示,在至少一个实施例中,该方法包括如下操作。As shown in FIG. 1A , in at least one embodiment, the method includes the following operations.

步骤S10:响应于对视频编辑模板的触发操作,展示视频编辑界面,视频编辑模板包括结构信息和编辑信息,结构信息用于指示至少一个视频段落,编辑信息用于指示在至少一个视频段落中施加的至少一个编辑效果,视频编辑界面包括与至少一个视频段落分别对应的段落编辑区域。Step S10: In response to a triggering operation on a video editing template, a video editing interface is displayed, wherein the video editing template includes structural information and editing information, wherein the structural information is used to indicate at least one video segment, and the editing information is used to indicate at least one editing effect applied in at least one video segment, and the video editing interface includes segment editing areas corresponding to the at least one video segment.

步骤S20:响应于针对目标视频段落的素材导入操作,在目标视频段落的段落编辑区域展示至少一个影像素材的标识,至少一个视频段落包括目标视频段落,至少一个影像素材为基于素材导入操作被导入在目标视频段落中的素材。Step S20: In response to a material import operation for a target video segment, an identifier of at least one image material is displayed in a segment editing area of the target video segment, at least one video segment includes the target video segment, and at least one image material is a material imported into the target video segment based on the material import operation.

步骤S30:响应于编辑处理的触发操作,根据视频编辑模板和至少一个影像素材,生成视频编辑结果,其中,视频编辑结果对应于目标视频段落的部分为目标编辑效果被施加在至少一个影像素材上的编辑结果,目标编辑效果匹配至少一个编辑效果中位于目标视频段落内的编辑效果。Step S30: In response to a triggering operation of the editing process, a video editing result is generated according to a video editing template and at least one image material, wherein the portion of the video editing result corresponding to the target video segment is an editing result in which a target editing effect is applied to at least one image material, and the target editing effect matches at least one editing effect located within the target video segment.

例如,在步骤S10中,对视频编辑模板的触发操作例如是在视频编辑模板的主页面中对使用图标进行点击操作。如图1B所示,视频编辑模板的主页面为页面101,该页面101包括使用图标102,若客户端接收到对使用图标102的点击操作,展示图1D所示的视频编辑界面103。For example, in step S10, the triggering operation of the video editing template is, for example, clicking on the use icon in the main page of the video editing template. As shown in FIG1B , the main page of the video editing template is page 101, and the page 101 includes a use icon 102. If the client receives a click operation on the use icon 102, the video editing interface 103 shown in FIG1D is displayed.

例如,该视频编辑方法应用于视频应用,在该视频应用的模板推荐页面中展示多个视频编辑模板的封面,若用户对某个视频编辑模板的封面进行了选择操作,则播放该视频编辑模板的展示视频供用户参考,在该展示视频播放结束时进入该视频编辑模板的主页面,用户可以在主页面(例如,页面101)中进行操作以使用该视频编辑模板或者不使用该视频编辑模板。若用户在主页面中执行了触发操作,则表示该用户选择使用该视频编辑模板,继而进入该视频编辑模板的视频编辑界面103,以在视频编辑界面103进行视频编辑。For example, the video editing method is applied to a video application, and multiple video editing template covers are displayed in the template recommendation page of the video application. If the user selects the cover of a certain video editing template, the demonstration video of the video editing template is played for the user's reference. When the demonstration video ends, the main page of the video editing template is entered. The user can operate in the main page (for example, page 101) to use the video editing template or not. If the user performs a trigger operation in the main page, it means that the user chooses to use the video editing template, and then enters the video editing interface 103 of the video editing template to perform video editing in the video editing interface 103.

在本公开的一些实施例中,视频编辑模板包括结构信息和编辑信息。结构信息用于指示至少一个视频段落,编辑信息用于指示在指示一个视频段落中施加的至少一个编辑效果。编辑信息例如可以包括字幕、配音、配乐、文案、滤镜、转场动画中至少一种。In some embodiments of the present disclosure, a video editing template includes structural information and editing information. The structural information is used to indicate at least one video segment, and the editing information is used to indicate at least one editing effect applied in indicating a video segment. The editing information may include at least one of subtitles, dubbing, soundtrack, text, filters, and transition animations.

如图1C所示,视频编辑模板包括结构信息100。该结构信息100例如是由视频编辑模板的生产者设置的。例如,发布链路和vimo平台支持添加视频编辑模板结构信息。例如,通过vimo平台添加如图1所示的结构信息100。As shown in Fig. 1C, the video editing template includes structure information 100. The structure information 100 is set by, for example, the producer of the video editing template. For example, the publishing link and the vimo platform support adding the video editing template structure information. For example, the structure information 100 shown in Fig. 1 is added through the vimo platform.

如图1C所示,结构信息100包括整体说明、智能能力和模板结构拆解三大部分。As shown in FIG. 1C , the structural information 100 includes three parts: overall description, intelligent capability, and template structure decomposition.

模板结构拆解包括第一部分1,并且允许添加更多部分。第一部分1表示一个视频段落,该视频段落的标题、描述等信息可以由运营人员填写。例如,通过点击添加更多部分控件2来添加第二部分、第三部分等,每个部分表示一个视频段落。The template structure includes the first part 1, and allows adding more parts. The first part 1 represents a video segment, and the title, description and other information of the video segment can be filled in by the operator. For example, by clicking the add more parts control 2 to add the second part, the third part, etc., each part represents a video segment.

整体说明部分例如由运营自行把控,填写视频编辑模板的例如内容、风格等的说明。整体说明部分例如不限制输入的最大字符数。The overall description part is controlled by the operator, for example, to fill in the description of the video editing template, such as content, style, etc. The overall description part does not limit the maximum number of characters to be entered.

智能能力例如支持运营勾选,例如包括智能添加字幕、智能添加配音等。运营可以自己选择是否勾选智能能力,以及勾选哪一种或者多种智能能力。若运行勾选了某一种智能能力,则视频编辑模板具有该智能能力。For example, intelligent capabilities can be selected by operators, such as intelligent subtitle addition and intelligent dubbing. Operators can choose whether to select intelligent capabilities and which one or more intelligent capabilities to select. If an intelligent capability is selected, the video editing template has the intelligent capability.

对于结构信息中的每个部分的结构类似,例如以第一部分1为例进行说明。例如,第一部分1包括标题、描述、起止时间和起止片段。标题例如为对视频编辑模板中第一部分1的模板影像素材的提炼性地描述。模板影像素材例如为视频编辑模板中使用的素材。描述例如为对第一部分1的模板影像素材的概括性地描述。The structure of each part in the structural information is similar, for example, the first part 1 is used as an example for explanation. For example, the first part 1 includes a title, a description, a start and end time, and a start and end segment. The title is, for example, a refined description of the template image material of the first part 1 in the video editing template. The template image material is, for example, the material used in the video editing template. The description is, for example, a general description of the template image material of the first part 1.

在本公开的一些实施例中,结构信息通过时间区间或者连续的视频片段的序号指示至少一个视频段落的每个。例如,在图1C的结构信息中,通过起止时间确定每个视频段落,或者通过起止片段确定每个视频段落。In some embodiments of the present disclosure, the structural information indicates each of at least one video segment by a time interval or a serial number of continuous video segments. For example, in the structural information of FIG. 1C , each video segment is determined by a start and end time, or each video segment is determined by a start and end segment.

例如,模板影像素材为至少一个连续视频片段,起止时间为从至少一个连续视频片段中选取从第K秒到第L秒(时间区间)作为该第一部分1的素材,起止片段例如为从至少一个连续视频片段中选取第j片段到第i片段作为该第一部分1的素材。j和i为视频片段的序号的示例。For example, the template image material is at least one continuous video segment, the start and end time is selected from the Kth second to the Lth second (time interval) from the at least one continuous video segment as the material of the first part 1, and the start and end segments are, for example, selected from the jth segment to the ith segment from the at least one continuous video segment as the material of the first part 1. j and i are examples of serial numbers of video segments.

需要说明的是,结构信息对起止时间和起止片段的划分是为了对视频编辑模板的理解,便于给用户成片,而用户导入的素材不一定按照结构信息的划分。例如,用户可以导入任意形式的素材,然后根据结构信息对用户导入的素材进行分割、高光切片等处理。It should be noted that the division of the start and end time and the start and end segments by the structural information is for the purpose of understanding the video editing template and facilitating the production of videos for users, and the materials imported by users are not necessarily divided according to the structural information. For example, users can import materials of any form, and then perform segmentation, highlight slicing, etc. on the materials imported by users according to the structural information.

在上述实施例中,视频编辑模板包括模板影像素材。在本公开的另外一些实施例中,视频编辑模板也可以不包括模板影像素材。In the above embodiment, the video editing template includes the template image material. In some other embodiments of the present disclosure, the video editing template may not include the template image material.

在本公开的一些实施例中,中视频的视频编辑模板可以直接复用短视频模板的生产和发布流程,通过剪辑工具工程文件导出后,选择“发布模板”而得到视频编辑模板。In some embodiments of the present disclosure, the video editing template of the medium video can directly reuse the production and publishing process of the short video template. After exporting the project file through the editing tool, select "Publish Template" to obtain the video editing template.

中视频模板发布后,支持在vimo后台存储与管理,复用短视频模板逻辑。例如,通过在vimo上运营手动标识完成,来判断该模板是否为“中视频模板”类型,vimo支持批量添加是否为“中视频模板”类型的标识。对于中视频模板类型,需要在元数据上新增“中视频模板”类型的属性字段,通过类型属性字段标识(例如,标识为video_type_id)进行视频类型的区分。被标注为中视频模板类型的,在模板整体链路上,都进行中视频模板逻辑。After the medium video template is released, it supports storage and management in the vimo background, and reuses the short video template logic. For example, by manually marking it on vimo, it is determined whether the template is of the "medium video template" type. Vimo supports batch addition of identifiers of the "medium video template" type. For the medium video template type, it is necessary to add an attribute field of the "medium video template" type to the metadata, and distinguish the video type through the type attribute field identifier (for example, the identifier is video_type_id). For those marked as medium video template types, the medium video template logic is performed on the overall template link.

对于编辑信息,例如可以类似于短视频模板,生产者可以按照自身想法需要,添加文案信息、配音信息、音乐信息、贴纸信息、滤镜信息、转场动画信息等编辑信息,这些编辑信息指示编辑效果,编辑效果例如包括字幕、配音、配乐、文案、滤镜、转场动画中至少一种。For editing information, for example, it can be similar to a short video template. Producers can add text information, dubbing information, music information, sticker information, filter information, transition animation information and other editing information according to their own ideas and needs. The editing information indicates the editing effects, and the editing effects include at least one of subtitles, dubbing, music, text, filters, and transition animations.

如图1D所示,视频编辑界面103包括至少一个视频段落分别对应的段落编辑区域,每个段落编辑区域作为一个主题的编辑区域,分别展示不同主题的视频片段。例如,图1C中的第一部分1指示的视频段落对应的段落编辑区域11,若第一部分1的标题中输入的内容为“菜肴成品”,则第一部分1指示的视频段落对应的段落编辑区域11的标题为“菜肴成品”,例如结构信息还包括第二部分,若第二部分的标题中输入的内容为“美食制作过程”,则第二部分指示的视频段落可以对应段落编辑区域12,该段落编辑区域12的标题为“美食制作过程”。As shown in FIG1D , the video editing interface 103 includes at least one paragraph editing area corresponding to each video segment, and each paragraph editing area is used as an editing area for a theme, and displays video clips of different themes. For example, the paragraph editing area 11 corresponding to the video segment indicated by the first part 1 in FIG1C , if the content entered in the title of the first part 1 is "finished dish", then the title of the paragraph editing area 11 corresponding to the video segment indicated by the first part 1 is "finished dish", for example, the structural information also includes a second part, if the content entered in the title of the second part is "food preparation process", then the video segment indicated by the second part can correspond to the paragraph editing area 12, and the title of the paragraph editing area 12 is "food preparation process".

每个视频段落作为一个主题,每个段落编辑区域作为一个主题的编辑区域,在下文的描述中将视频段落称为主题,将段落编辑区域称为主题的编辑区域。Each video segment is regarded as a subject, and each segment editing area is regarded as an editing area of a subject. In the following description, the video segment is referred to as a subject, and the segment editing area is referred to as an editing area of a subject.

视频编辑界面103可以划分为多个部分,每个部分用于展示不同的内容。例如,在图1D的示例中,视频编辑界面103包括导航栏大标题113,在该示例中大标题为“选择素材”,视频编辑界面103还包括依序排列的M个主题各自的编辑区域,M为正整数。例如,第一个主题为“菜肴成品”、第二个主题为“美食制作过程”,用户可以在每个主题的编辑区域针对该主题导入相应的创作视频片段或者图片;也可以选择在M个主题中的至少一个主题中不导入创作视频片段或者图片。在每个主题的编辑区域的下方可以包括该主题对应的视频段落的相关描述163。若视频编辑模板中没有该主题的相关描述,例如在图1C的描述部分没有输入相应的字符,则不显示相关描述。The video editing interface 103 can be divided into multiple parts, each part is used to display different content. For example, in the example of Figure 1D, the video editing interface 103 includes a navigation bar title 113, in which the title is "Select Material", and the video editing interface 103 also includes M topics arranged in sequence. The editing area, M is a positive integer. For example, the first theme is "Finished Dish", and the second theme is "Food Production Process". The user can import the corresponding creative video clips or pictures for each theme in the editing area of each theme; or choose not to import creative video clips or pictures in at least one of the M themes. Below the editing area of each theme, the relevant description 163 of the video segment corresponding to the theme can be included. If there is no relevant description of the theme in the video editing template, for example, if the corresponding characters are not entered in the description part of Figure 1C, the relevant description will not be displayed.

需要说明的是,在本公开的实施例中,视频编辑模板可以是基于视频编辑草稿生成的,视频编辑草稿里包含了素材(比如视频、音频、图片)和编辑信息,视频编辑模板需要指示出来视频编辑草稿中哪些素材是需要被替换的。中视频模板相对于短视频模板的主要存在两方面区别。一方面,中视频模板是有段落结构的,视频段落对应一个时间线区间,视频段落包括一个或多个视频片段,而短视频模板没有结构段落,短视频模板就是一个视频编辑草稿中有些视频片段被指定为需要替换填充的素材;另一方面,从模板使用来说,中视频模板是每个段落分别去套用该段落内的视频编辑效果,段落内的套用逻辑是基于编辑信息指示的该段落内的编辑操作经过变化施加在段落内导入的素材上(因为素材的数量不是跟模板中片段绑定,无法直接将编辑操作复用在素材上),而短视频模板的套用逻辑就是把模板中的视频片段导入素材,编辑操作就直接作用在素材上了,不需要分段落也不需要变化编辑操作。It should be noted that in the embodiments of the present disclosure, the video editing template can be generated based on the video editing draft, and the video editing draft contains materials (such as video, audio, pictures) and editing information. The video editing template needs to indicate which materials in the video editing draft need to be replaced. There are two main differences between the medium video template and the short video template. On the one hand, the medium video template has a paragraph structure, the video paragraph corresponds to a timeline interval, and the video paragraph includes one or more video clips, while the short video template has no structural paragraph. The short video template is a video editing draft in which some video clips are designated as materials that need to be replaced and filled; on the other hand, from the perspective of template use, the medium video template is to apply the video editing effect in each paragraph separately, and the application logic in the paragraph is based on the editing operation in the paragraph indicated by the editing information, which is applied to the material imported into the paragraph after changes (because the number of materials is not bound to the clips in the template, the editing operation cannot be directly reused on the material), and the application logic of the short video template is to import the video clips in the template into the material, and the editing operation directly acts on the material, without the need to divide the paragraphs or change the editing operation.

对于步骤S20,如图1D所示,在每个主题的编辑区域包括添加入口123。例如,响应于对添加入口123的点击操作,拉起相册列表,供用户从相册列表中选择导入该主题的影像素材。即,素材导入操作例如为对添加入口123的点击操作以及从相册列表中选择导入该主题的影像素材。For step S20, as shown in FIG1D , the editing area of each theme includes an add entry 123. For example, in response to a click operation on the add entry 123, an album list is pulled up for the user to select an image material to be imported into the theme from the album list. That is, the material import operation is, for example, a click operation on the add entry 123 and a selection of an image material to be imported into the theme from the album list.

在本公开的一些实施例中,添加入口123没有槽位概念,不对片段数量和片段时长进行限制。In some embodiments of the present disclosure, the adding entry 123 has no slot concept and does not limit the number of segments and the length of the segments.

例如,在针对目标视频段落(又称为目标主题)进行素材导入操作之后,在该目标主题的段落编辑区域展示至少一个影像素材的标识。For example, after a material import operation is performed on a target video segment (also referred to as a target theme), an identifier of at least one video material is displayed in a segment editing area of the target theme.

在本公开的一些实施例中,例如视频编辑模板共包括M个主题,则可以从M个主题中选择目标主题进行素材导入操作。例如,用户依次选择了N个主题进行了素材导入操作,则该N个主题均作为目标视频段落。M和N均为正整数。信息提示段落信息提示段落在图1D的示例中,例如响应用户点击“菜肴成品”中的添加入口123,拉起相册列表133,供用户从相册列表133中选择与菜肴成品相关的至少一个影像素材。之后,在该目标主题“菜肴成品”的段落编辑区域11展示该至少一个影像素材每个的标识。该至少一个影像素材每个的标识,例如是从每个影像素材中提取的一帧图像,例如提取每个影像素材的第一帧图像作为该影像素材的标识。例如,在图1D的示例中,用户从相册列表133中影像素材1和影像素材8导入,影像素材1和影像素材8的第一帧图像分别为显示数字1和显示数字8的图像,则在段落编辑区域11展示显示数字1和显示数字8的图像。In some embodiments of the present disclosure, for example, if the video editing template includes a total of M themes, then a target theme can be selected from the M themes for a material import operation. For example, if the user selects N themes in sequence for a material import operation, then the N themes are all used as target video segments. M and N are both positive integers. Information prompt paragraph Information prompt paragraph In the example of FIG. 1D , for example, in response to the user clicking on the add entry 123 in “Dish Finished Product”, the album list 133 is pulled up for the user to select at least one image material related to the dish finished product from the album list 133. Afterwards, the identification of each of the at least one image material is displayed in the paragraph editing area 11 of the target theme “Dish Finished Product”. The identification of each of the at least one image material is, for example, a frame of image extracted from each image material, for example, the first frame of image of each image material is extracted as the identification of the image material. For example, in the example of FIG. 1D , the user imports image material 1 and image material 8 from the album list 133, and the first frame of image material 1 and image material 8 are images showing the number 1 and the number 8, respectively, and the images showing the number 1 and the number 8 are displayed in the paragraph editing area 11.

用户可以点击“美食制作过程”中的添加入口123,再次拉起相册列表133,供用户再次从相册列表133中选择与美食制作过程相关的至少一个影像素材。在本公开的实施例中,用户可以针对每个主题都进行素材选择操作,也可以只针对M个主题中的部分进行素材选择操作。The user can click the add entry 123 in the "food making process" to pull up the album list 133 again, so that the user can select at least one video material related to the food making process from the album list 133. In the embodiment of the present disclosure, the user can perform a material selection operation for each theme, or only perform a material selection operation for some of the M themes.

中视频相对于短视频有明显的剪辑结构套路,根据其结构信息对用户导入的一个或者多个影像素材分组和排序,在该实施例中视频编辑模板的视频编辑界面包括依序排列的M个主题,从而将一个或者多个影像素材按照主题进行分组,使得视频结构合理化。在本公开的实施例中,用户导入素材量和时长基本不做限制,支持批量选中和删除当前选中素材。但为了避免极端情况出现,在本公开的一些实施例中,可以对导入素材量和时长进行最大限制,例如单个素材(即,一个视频片段)时长最多只能是30分钟,超出即提示“最多只能导入30分钟以内视频”,每个主题导入素材的数量上限为100个片段,超过则提示“最多只能导入100个片段”。例如,通过图1C中的用户素材总时长上限对至少一个影像素材的总时长进行限制。Compared with short videos, medium videos have obvious editing structure routines. According to their structural information, one or more image materials imported by users are grouped and sorted. In this embodiment, the video editing interface of the video editing template includes M themes arranged in sequence, so that one or more image materials are grouped according to themes, so that the video structure is rationalized. In the embodiment of the present disclosure, there is basically no restriction on the amount and duration of user-imported materials, and batch selection and deletion of currently selected materials are supported. However, in order to avoid extreme situations, in some embodiments of the present disclosure, the amount and duration of imported materials can be subject to maximum restrictions. For example, the duration of a single material (i.e., a video clip) can only be 30 minutes at most. If it exceeds, it will prompt "only videos within 30 minutes can be imported at most", and the upper limit of the number of imported materials for each theme is 100 clips. If it exceeds, it will prompt "only 100 clips can be imported at most". For example, the total duration of at least one image material is limited by the upper limit of the total duration of user materials in Figure 1C.

在本公开的一些实施例中,相册列表可以包括提示文案,例如当未选择影像素材时,默认提示文案为“选择需要导入的影像素材”;当已经选择了影像素材时,根据素材数量,提示文案改为“已选择%d个片段”。如图1D中,影像素材1和影像素材8被选择,则提示文案为“已选择2个片段”。如图1D中,相册列表还可以包括确认添加按钮143。确认添加按钮143包括点亮和置灰两种状态。当导入片段后,确认添加按钮143的状态为点亮状态,即点亮逻辑为当导入了一段影像素材则按钮点亮,否则置灰;点击添加按钮后,将用户导入的影像素材按照其顺序放到对应框架内。如图1D所示,当点击添加按钮143之后,影像素材1和影像素材8的标识被放到主题为“菜肴成品”的段落编辑区域11主题中。若当前主题框架已填充影像素材时,进入相册列表,默认展示已添加影像素材。In some embodiments of the present disclosure, the album list may include a prompt text, for example, when no image material is selected, the default prompt text is "Select the image material to be imported"; when the image material has been selected, the prompt text is changed to "%d segments have been selected" according to the number of materials. As shown in FIG. 1D, image material 1 and image material 8 are selected, and the prompt text is "2 segments have been selected". As shown in FIG. 1D, the album list may also include a confirmation add button 143. The confirmation add button 143 includes two states: lit and grayed out. After the segment is imported, the state of the confirmation add button 143 is lit, that is, the lighting logic is that when a segment of image material is imported, the button is lit, otherwise it is grayed out; after clicking the add button, the image material imported by the user is placed in the corresponding frame in its order. As shown in FIG. 1D, after clicking the add button 143, the logos of image material 1 and image material 8 are placed in the theme of the paragraph editing area 11 with the theme of "finished dish". If the current theme frame is filled with image materials, enter the album list, and the added image materials are displayed by default.

在本公开的一些实施例中,添加入口123跟随已填充素材位置后移,后添加的影像素材自动排序在已有影像素材的后面,超过一行则自动换行,已添加的影像素材会展示其原始影像素材时长,等进入下一步合成时会智能截取和分割。In some embodiments of the present disclosure, the addition entry 123 moves backward following the position of the filled material, and the image material added later is automatically sorted behind the existing image material. If it exceeds one line, it will automatically wrap. The added image material will display the length of its original image material, and will be intelligently captured and split when entering the next step of synthesis.

在本公开的一些实施例中,支持影像素材删除操作。响应于对影像素材的删除操作,删除影像素材,并且该主题的影像素材自动顺延往前排。例如,可以在相册列表页和视频编辑界面103中选择删除按钮来进行删除操作。在本公开的一些实施例中,用户还可以对已经导入的影像素材进行排序和编辑。例如,调换两个影像素材的位置,或者对某个影像素材进行美颜、裁剪等编辑操作。In some embodiments of the present disclosure, image material deletion operations are supported. In response to the deletion operation of the image material, the image material is deleted, and the image material of the subject is automatically postponed to the front row. For example, the delete button can be selected in the album list page and the video editing interface 103 to perform the deletion operation. In some embodiments of the present disclosure, the user can also sort and edit the imported image materials. For example, the positions of two image materials are swapped, or editing operations such as beautification and cropping are performed on a certain image material.

在本公开的实施例中,影像素材可以是视频片段也可以是图片。In the embodiments of the present disclosure, the image material may be a video clip or a picture.

对于步骤S30,编辑处理的触发操作例如为在通过视频编辑界面完成编辑后,对导出控件或者预览控件的选择操作。例如,对导出控件进行选择操作,则可以根据视频编辑模板和至少一个影像素材,生成导出合成的视频或者导出生成的文件。例如,对预览控件的选择操作,则根据视频编辑模板和至少一个影像素材,生成预览页的预览对象。预览对象可以是视频也可以是图片等。For step S30, the triggering operation of the editing process is, for example, a selection operation of an export control or a preview control after the editing is completed through the video editing interface. For example, by selecting the export control, an export synthesized video or an export generated file can be generated based on the video editing template and at least one image material. For example, by selecting the preview control, a preview object of the preview page is generated based on the video editing template and at least one image material. The preview object can be a video or a picture, etc.

视频编辑结果例如包括多个视频部分,每个视频部分对应一个目标视频段落,即每个视频部分为目标编辑效果被施加在该目标视频段落中至少一个影像素材上的编辑结果。目标编辑效果是根据至少一个编辑效果中位于目标视频段落内的编辑效果得到的。The video editing result includes, for example, a plurality of video parts, each of which corresponds to a target video segment, that is, each video part is an editing result in which a target editing effect is applied to at least one video material in the target video segment. The target editing effect is obtained based on an editing effect located in the target video segment among at least one editing effect.

例如,向主题“菜肴成品”导入第一影像素材,对于主题“菜肴成品”这个目标视频段落,向该第一影像素材施加目标编辑效果,该目标编辑效果为与视频编辑模板中位于该目标视频段落内的编辑效果相匹配的编辑效果。For example, the first image material is imported into the theme "Finished Dish", and for the target video segment of the theme "Finished Dish", the target editing effect is applied to the first image material, and the target editing effect is an editing effect that matches the editing effect located in the target video segment in the video editing template.

例如,目标编辑效果与视频编辑模板中位于该目标视频段落内的编辑效果相同,或者目标编辑效果为对视频编辑模板中位于该目标视频段落内的编辑效果进行变化得到的。For example, the target editing effect is the same as the editing effect in the video editing template located in the target video segment, or the target editing effect is obtained by changing the editing effect in the video editing template located in the target video segment.

在本公开的一些实施例中,步骤S30包括:响应于编辑处理的触发操作,按照匹配策略,对编辑效果进行调整得到目标编辑效果;根据视频编辑模板,将目标编辑效果施加到至少一个影像素材的至少部分上得到目标视频段落的视频部分;以及基于至少一个视频段落中每个目标视频段落对应的视频部分,生成视频编辑结果。In some embodiments of the present disclosure, step S30 includes: in response to a triggering operation of editing processing, adjusting the editing effect according to a matching strategy to obtain a target editing effect; applying the target editing effect to at least a portion of at least one image material according to a video editing template to obtain a video portion of a target video segment; and generating a video editing result based on the video portion corresponding to each target video segment in at least one video segment.

在本公开的一些实施例中,匹配策略包括:在编辑效果包括第一文案的情况下,目标编辑效果包括第二文案,第二文案的文案格式与第一文案的文案格式相同,第二文案的文案内容与至少一个影像素材匹配。In some embodiments of the present disclosure, the matching strategy includes: when the editing effect includes a first text, the target editing effect includes a second text, the text format of the second text is the same as the text format of the first text, and the text content of the second text matches at least one image material.

在本公开的一些实施例中,文案可以是指除了字幕之外的所有文本,如标题类装饰性文本。例如,每一个主题添加一个文案内容,文案内容基于主题生效。In some embodiments of the present disclosure, the copy may refer to all texts except subtitles, such as decorative texts such as titles. For example, each theme adds a copy content, and the copy content takes effect based on the theme.

图2示出了本公开一些实施例提供的一种对视频编辑模板中的编辑效果调整得到视频编辑结果中的编辑效果的示意图。FIG2 shows a schematic diagram of adjusting the editing effects in a video editing template to obtain the editing effects in a video editing result provided by some embodiments of the present disclosure.

如图2(a)所示,在视频编辑模板中主题1的文案为文字1,主题2的文案为文字2以及主题3的文案为文字3,即文字1对应于主题1只对主题1生效,文件2对应于主题2只对主题2生效,文件3对应于主题3只对主题3生效。主题1例如为图1D中的“菜肴成品”主题、主题2例如为图1D中的“美食制作过程”主题。主题3例如为添加的其他主题。文字1、文字2和文字3为第一文案的示例。As shown in FIG2(a), in the video editing template, the text of Topic 1 is Text 1, the text of Topic 2 is Text 2, and the text of Topic 3 is Text 3, that is, Text 1 corresponds to Topic 1 and is only effective for Topic 1, File 2 corresponds to Topic 2 and is only effective for Topic 2, and File 3 corresponds to Topic 3 and is only effective for Topic 3. Topic 1 is, for example, the "finished dish" theme in FIG1D, and Topic 2 is, for example, the "food production process" theme in FIG1D. Topic 3 is, for example, another added theme. Text 1, Text 2, and Text 3 are examples of the first text.

文案格式例如包括文字的字体、文字的颜色、文字的样式等。The text format includes, for example, the font, color, and style of the text.

例如,文字1为楷体,则视频编辑结果中主题1中的文案的字体也是楷体、文字2为渐变色,则视频编辑结果中主题2中的文案也是同样的渐变色,文字3为倾斜字体,则视频编辑结果中主题3中的文案也是倾斜字体。For example, if text 1 is in regular font, the font of the text in Topic 1 in the video editing result will also be in regular font; if text 2 is in gradient color, the font of the text in Topic 2 in the video editing result will also be in the same gradient color; if text 3 is in italic font, the font of the text in Topic 3 in the video editing result will also be in italic font.

例如,步骤S30还包括获取第二文案的内容。例如在图1C的示例中,视频编辑界面103还可以包括信息提示段落104,用户可以在信息提示段落104输入提示信息以对该主题进行说明和描述。在该实施例中,获取第二文案的内容包括:对提示信息和至少一个影像素材进行分析得到分析结果;以及根据分析结果,生成第二文案的内容。For example, step S30 also includes obtaining the content of the second copy. For example, in the example of FIG. 1C , the video editing interface 103 may also include an information prompt section 104, and the user may enter prompt information in the information prompt section 104 to explain and describe the subject. In this embodiment, obtaining the content of the second copy includes: analyzing the prompt information and at least one video material to obtain an analysis result; and generating the content of the second copy according to the analysis result.

例如,可以采用用于对自然语言处理的神经网络对提示信息进行分析来获得分析结果,从而更准确地理解用户诉求。例如,采用序列到序列神经网络模型、长短期记忆网络等对提示信息进行理解获得分析结果。例如,通过图像处理技术对影像素材进行特征识别,得到对至少一个影像素材的分析结果。结合对提示信息的分析结果和对至少一个影像素材的分析结果,生成与该提示信息和至少一个影像素材匹配的第二文案的内容。For example, a neural network for natural language processing may be used to analyze the prompt information to obtain an analysis result, thereby more accurately understanding the user's demands. For example, a sequence-to-sequence neural network model, a long short-term memory network, etc. may be used to understand the prompt information to obtain an analysis result. For example, feature recognition of image materials is performed using image processing technology to obtain an analysis result of at least one image material. Combining the analysis result of the prompt information with the analysis result of at least one image material, a second copy matching the prompt information and at least one image material is generated.

例如,提示信息为“酸甜”,至少一个影像素材为饮料的图片,则第二文案内容可以是“酸酸甜甜,沁人心脾”。For example, if the prompt message is "sweet and sour" and at least one image material is a picture of a beverage, the second copy content can be "sour and sweet, refreshing".

在本公开的一些实施例中,例如分析结果对第二文案的内容的影响程度大于至少一个影像素材对第二文案的内容的影响程度。例如,在例如神经网络生成第二文案的内容时,分析结果的权重大于至少一个影像素材的权重,这样能够更加满足用户的诉求,生成的文案内容更加符合用户预期。In some embodiments of the present disclosure, for example, the influence of the analysis result on the content of the second copy is greater than the influence of at least one image material on the content of the second copy. For example, when the neural network generates the content of the second copy, the weight of the analysis result is greater than the weight of at least one image material, which can better meet the user's demands and the generated copy content is more in line with the user's expectations.

在上述实施例中,在视频编辑结果的生成过程中智能生成与影像素材适配的文案。例如,文案内容基于视频编辑模板的文本结构、影像素材和提示信息智能编写。例如,智能生成的文案仅改动视频编辑模板中文案内容,不改动视频编辑模板中的文案格式,与视频编辑模板中的文案格式一致。在本公开的一些实施例中,智能生成的文案的文本字数和视频编辑模板中的字数接近,例如二者之间的字数相差5个字以内。例如,视频编辑模板的文案为“居家记录”,影响素材是美食,提示信息为美食制作,则对应文案的内容可以是“制作美食记录”。在本公开的另一些实施例中,视频编辑结果的生成过程中也可以直接采用视频编辑模板中的文案呈现,或者用户可以自行在预览页面进行个性化编写。关于预览页面请参考下文的描述。In the above embodiments, in the process of generating the video editing results, the copy adapted to the image material is intelligently generated. For example, the copy content is intelligently written based on the text structure of the video editing template, the image material and the prompt information. For example, the intelligently generated copy only changes the copy content in the video editing template, does not change the copy format in the video editing template, and is consistent with the copy format in the video editing template. In some embodiments of the present disclosure, the number of words in the text of the intelligently generated copy is close to the number of words in the video editing template, for example, the number of words between the two is within 5 words. For example, the copy of the video editing template is "Home Record", the influencing material is food, and the prompt information is food production, then the content of the corresponding copy can be "Making Food Record". In some other embodiments of the present disclosure, the copy in the video editing template can also be directly presented in the process of generating the video editing results, or the user can personalize it on the preview page. Please refer to the description below for the preview page.

将第一文案的格式应用于第二文案的内容,从而得到目标编辑效果所包括的第二文案。之后,将该第二文案应用于目标视频段落对应的至少一个影像素材(例如,主题1中导入的影像素材1和影像素材8)。The format of the first copy is applied to the content of the second copy, thereby obtaining the second copy included in the target editing effect. Afterwards, the second copy is applied to at least one image material corresponding to the target video segment (for example, image material 1 and image material 8 imported in theme 1).

在本公开的一些实施例中,匹配策略还包括:响应于第一文案应用于目标视频段落的整个段落,第二文案应用于视频部分的整个部分;响应于第一文案应用于目标视频段落中部分时间段,第二文案应用于视频部分中部分时间段。In some embodiments of the present disclosure, the matching strategy also includes: in response to the first copy being applied to the entire paragraph of the target video segment, the second copy is applied to the entire part of the video portion; in response to the first copy being applied to a partial time period in the target video segment, the second copy is applied to a partial time period in the video portion.

在本公开的一些实施例中,判断某主题的文案是针对完整主题一直展示还是只展示某段时长。当视频编辑模板中的第一文案覆盖了某个主题的整个时,则第二文案也对于整个目标主题生效。当视频编辑模板中的第一文案只覆盖了某个主题的某段时长时,则第二文件在目标主题的时长生效范围与该段时长一致。In some embodiments of the present disclosure, it is determined whether the text of a certain theme is displayed for the entire theme or only for a certain duration. When the first text in the video editing template covers the entire theme, the second text is also effective for the entire target theme. When the first text in the video editing template only covers a certain duration of a certain theme, the second file is effective for the duration of the target theme consistent with the duration.

如图2(a)和(b)所示,在视频编辑模板中文字2(第一文案的示例)在主题2的第0秒到第60秒展示,那么在视频编辑结果中文字2(第二文案的示例)也只在视频编辑结果的主题2的第0秒到第60秒展示。在该示例中,第一文案和第二文案的内容相同,格式也相同。在视频编辑模板中文字3在主题3的整段时长展示,那么在视频编辑结果中文字3例如也在主题3的整段时长展示。图2(c)和(d)也类似,不再赘述。As shown in Figures 2(a) and (b), in the video editing template, text 2 (an example of the first text) is displayed from the 0th second to the 60th second of Topic 2, then in the video editing result, text 2 (an example of the second text) is also only displayed from the 0th second to the 60th second of Topic 2 of the video editing result. In this example, the first text and the second text have the same content and format. In the video editing template, text 3 is displayed throughout the duration of Topic 3, then in the video editing result, text 3 is also displayed throughout the duration of Topic 3. Figures 2(c) and (d) are also similar and will not be repeated here.

当用户修改了文案展示时长,后续又修改了影像素材,则按照用户修改后的时长范围展示;若影像素材时长变短,则时长对应缩短。When the user modifies the display duration of the text and subsequently modifies the image material, it will be displayed according to the duration range modified by the user; if the duration of the image material becomes shorter, the duration will be shortened accordingly.

在本公开的另一些实施例中,文案可以按照视频编辑模板的文案生效时长范围展示。如图2(e)所示,例如,在视频编辑模板中文字3在主题3的整段时长(例如,一共2分钟)展示,但是视频编辑结果中主题3的时长一共3分钟,则文案也仅在视频编辑结果中主题3的前2分钟内展示。在视频编辑模板中文字1在主题1的整段时长(例如,一共1分钟)展示,但是视频编辑结果中主题1的时长一共2分钟,则文案也仅在视频编辑结果中主题3的前1分钟内展示。In some other embodiments of the present disclosure, the copy can be displayed according to the effective duration range of the copy in the video editing template. As shown in Figure 2(e), for example, in the video editing template, the text 3 is displayed for the entire duration of Topic 3 (for example, a total of 2 minutes), but the duration of Topic 3 in the video editing result is a total of 3 minutes, then the copy is also only displayed within the first 2 minutes of Topic 3 in the video editing result. In the video editing template, the text 1 is displayed for the entire duration of Topic 1 (for example, a total of 1 minute), but the duration of Topic 1 in the video editing result is a total of 2 minutes, then the copy is also only displayed within the first 1 minute of Topic 3 in the video editing result.

在本公开的一些实施例中,目标视频段落中的文案可以直接采用视频编辑模板中的文案,这样能够提高合成视频编辑结果的速度。在本公开的一些实施例中,可以智能改写文案内容,使得文案与导入的素材更加匹配。In some embodiments of the present disclosure, the text in the target video segment can directly adopt the text in the video editing template, which can increase the speed of synthesizing the video editing result. In some embodiments of the present disclosure, the content of the text can be intelligently rewritten so that the text is more compatible with the imported material.

在本公开的一些实施例中,例如对第二素材进行图像识别,识别出第二素材的场景、色调,从而根据对第二素材的理解来生成与场景、色调视频的文案内容。将视频编辑模板中采用的文案格式直接应用于该文案内容,得到具有第二文案。In some embodiments of the present disclosure, for example, image recognition is performed on the second material to identify the scene and tone of the second material, thereby generating text content corresponding to the scene and tone video based on the understanding of the second material. The text format used in the video editing template is directly applied to the text content to obtain the second text.

在本公开的一些实施例中,匹配策略包括:在编辑效果包括转场动画的情况下,目标编辑效果包括转场动画。In some embodiments of the present disclosure, the matching strategy includes: when the editing effect includes a transition animation, the target editing effect includes the transition animation.

在视频编辑模板包括M个主题,向N个主题进行了素材导入操作,在N<M的情况下,对于没有导入影像素材的第一主题,删除视频编辑模板中位于第一主题和第一主题之后的下一主题之间的转场动画。即,若视频编辑模板中的某个主题没有导入影像素材,则紧邻该主题之后的转场动画也删除。在该实施例中,只对导入了影像素材的目标视频段落应用视频编辑模板中与该目标视频段落对应的转场动画。When the video editing template includes M themes, and material import operations are performed on N themes, in the case where N<M, for the first theme without imported image material, the transition animation between the first theme and the next theme after the first theme in the video editing template is deleted. That is, if a theme in the video editing template does not import image material, the transition animation immediately after the theme is also deleted. In this embodiment, only the transition animation corresponding to the target video segment in the video editing template is applied to the target video segment with imported image material.

如图2(a)所示,主题1和主题2之间包括转场动画(简称“转场”)1,主题2和主题3之间包括转场2。转场动画用于使相邻主题的视频片段自然地过渡,使得视频中内容的切换更加自然顺畅。As shown in FIG2(a), there is a transition animation (abbreviated as “transition”) 1 between Topic 1 and Topic 2, and there is a transition 2 between Topic 2 and Topic 3. The transition animation is used to make the video clips of adjacent topics transition naturally, making the switching of the content in the video more natural and smooth.

如图2(b)所示,主题1没有导入初始素材,则主题1之后的转场1也删除,视频编辑模板主题2和主题3之间包括转场2同样应用于视频编辑结果的主题2和主题3之间。As shown in FIG. 2( b ), if the initial material is not imported into Topic 1, the Transition 1 after Topic 1 is also deleted, and the video editing template including Transition 2 between Topic 2 and Topic 3 is also applied to Topic 2 and Topic 3 in the video editing result.

在本公开的一些实施例中,如图1D所示,结构信息还包括信息提示段落104,所述信息提示段落104用于获取提示信息。步骤S30包括:接收在信息提示段落输入的提示信息;以及对提示信息进行语义分析得到分析结果。In some embodiments of the present disclosure, as shown in Figure 1D, the structural information further includes an information prompt section 104, and the information prompt section 104 is used to obtain prompt information. Step S30 includes: receiving prompt information input in the information prompt section; and performing semantic analysis on the prompt information to obtain an analysis result.

例如,可以采用用于对自然语言处理的神经网络对提示信息进行分析来获得分析结果,从而更准确地理解用户诉求。例如,采用序列到序列神经网络模型、长短期记忆网络等对提示信息进行理解。For example, a neural network for natural language processing may be used to analyze the prompt information to obtain analysis results, thereby more accurately understanding the user's demands. For example, a sequence-to-sequence neural network model, a long short-term memory network, etc. may be used to understand the prompt information.

图3示出了本公开至少一个实施例提供的图1A中步骤S30的方法流程图。FIG. 3 shows a method flow chart of step S30 in FIG. 1A provided by at least one embodiment of the present disclosure.

如图3所示,该方法包括步骤S301~S304。As shown in FIG. 3 , the method includes steps S301 to S304 .

步骤S301:响应于编辑处理的触发操作,对提示信息和至少一个影像素材进行分析得到分析结果。Step S301: In response to a triggering operation of an editing process, the prompt information and at least one image material are analyzed to obtain an analysis result.

步骤S302:响应于编辑效果包括字幕和/或配音,按照目标编辑效果中的字幕和/或配音的内容与分析结果匹配的匹配策略,得到目标编辑效果中字幕和/或配音的内容。Step S302: in response to the editing effect including subtitles and/or dubbing, the content of the subtitles and/or dubbing in the target editing effect is obtained according to a matching strategy of matching the content of the subtitles and/or dubbing in the target editing effect with the analysis result.

步骤S303:按照目标编辑效果中字幕的格式与编辑效果中的字幕格式相同的匹配策略,得到目标编辑效果中字幕的格式。Step S303: according to the matching strategy that the format of the subtitles in the target editing effect is the same as the format of the subtitles in the editing effect, the format of the subtitles in the target editing effect is obtained.

步骤S304:按照目标编辑效果中配音的声音参数与编辑效果中的配音的声音参数相同的匹配策略,得到目标编辑效果中配音的声音参数。Step S304: according to the matching strategy that the sound parameters of the dubbing in the target editing effect are the same as the sound parameters of the dubbing in the editing effect, the sound parameters of the dubbing in the target editing effect are obtained.

在本公开的一些实施例中,如果视频编辑模板包括智能添加字幕和/或智能添加配音的标签,则自动为视频编辑结果添加字幕和/或配音。In some embodiments of the present disclosure, if the video editing template includes tags for intelligently adding subtitles and/or intelligently adding dubbing, subtitles and/or dubbing are automatically added to the video editing result.

该方法能够使具有字幕和/或配音的内容与提示信息适配,从而提高用户体验。The method can adapt the content with subtitles and/or dubbing to the prompt information, thereby improving the user experience.

对于步骤S301,关于对提示信息和至少一个影像素材进行分析得到分析结果与前述实施例类似,不再赘述。例如,根据用户导入的影像素材和输入的提示信息,进行内容和诉求理解,生成合适字幕和/或配音内容。如果识别到用户导入的影像素材包括声音,那么可以结合影像素材中的原有的声音和影像素材的图像内容以及提示信息,生成合适字幕和/或配音内容。或者,不管影像素材本身是否包括声音,均默认为其智能配音。For step S301, the analysis results obtained by analyzing the prompt information and at least one image material are similar to those in the previous embodiment and will not be repeated here. For example, based on the image material imported by the user and the prompt information input, the content and appeal are understood, and appropriate subtitles and/or dubbing content are generated. If it is recognized that the image material imported by the user includes sound, then appropriate subtitles and/or dubbing content can be generated by combining the original sound in the image material and the image content of the image material and the prompt information. Alternatively, regardless of whether the image material itself includes sound, intelligent dubbing is used by default.

例如,通过图像识别技术确定影像素材是关于制作美食的图像,提示信息为“周末”,那么确定预览视频中的字幕可以包括“幸福周末”。For example, it is determined through image recognition technology that the video material is an image about making delicious food, and the prompt information is "weekend", then it is determined that the subtitles in the preview video can include "happy weekend".

对于步骤S302~S304,匹配策略包括字幕的内容和/或配音的内容的匹配策略、字幕格式的匹配策略和声音参数的匹配策略。For steps S302 to S304, the matching strategy includes a matching strategy for the content of the subtitles and/or the content of the dubbing, a matching strategy for the subtitle format, and a matching strategy for the sound parameters.

对于字幕的内容和/或配音的内容,匹配策略可以是目标编辑效果中的字幕和/或配音的内容与分析结果匹配。字幕和/配音的内容与分析结果匹配与上文描述的文案内容与分析结果匹配类似,不再赘述。For the content of subtitles and/or dubbing, the matching strategy may be that the content of subtitles and/or dubbing in the target editing effect matches the analysis result. The matching of the content of subtitles and/or dubbing with the analysis result is similar to the matching of the content of the copywriting with the analysis result described above, and will not be repeated.

对于字幕格式,匹配策略可以是目标编辑效果中字幕的格式与编辑效果中的字幕格式相同。例如,目标编辑效果中字幕的格式可以与视频编辑模板中编辑效果中的字幕格式相同。当模板视频中没有字幕格式信息时,可以采用默认的例如“白底黑边”的字体样式。字幕格式例如包括字幕的文本字体、文本颜色、文本样式等。For subtitle format, the matching strategy can be that the format of subtitles in the target editing effect is the same as that in the editing effect. For example, the format of subtitles in the target editing effect can be the same as that in the editing effect in the video editing template. When there is no subtitle format information in the template video, a default font style such as "white background with black border" can be used. The subtitle format includes, for example, the text font, text color, and text style of the subtitle.

配音的声音参数例如可以包括配音的音色、频率、音调等。对于声音参数,匹配策略例如为目标编辑效果中配音的声音参数与编辑效果中的配音的声音参数相同。例如,目标编辑效果中配音的音色与视频编辑模板中编辑效果中配音的音色、频率和音调等相同。即,字幕格式和声音的声音参数可以直接套用视频编辑模板中编辑效果中的格式和声音参数。当视频编辑模板中编辑效果中没有音色的情况下(即,视频编辑模板为视频原声),则采用默认音色,默认音色可以是预先设定的。即,当视频编辑模板的影像素材有原声且视频编辑模板保留了该原始声音时,使用该视频编辑模板来合成视频编辑结果时,默认不会保留视频编辑模板中的原声。The sound parameters of dubbing may include, for example, the timbre, frequency, pitch, etc. of the dubbing. For the sound parameters, the matching strategy is, for example, that the sound parameters of the dubbing in the target editing effect are the same as the sound parameters of the dubbing in the editing effect. For example, the timbre of the dubbing in the target editing effect is the same as the timbre, frequency, and pitch of the dubbing in the editing effect in the video editing template. That is, the subtitle format and the sound parameters of the sound can directly apply the format and sound parameters in the editing effect in the video editing template. When there is no timbre in the editing effect in the video editing template (that is, the video editing template is the original sound of the video), the default timbre is used, and the default timbre can be pre-set. That is, when the image material of the video editing template has an original sound and the video editing template retains the original sound, when the video editing template is used to synthesize the video editing result, the original sound in the video editing template will not be retained by default.

例如,当视频编辑模板中已经存在文本朗读、录音等音频内容的情况下,做自动静音处理。For example, when the video editing template already contains audio content such as text reading or recording, automatic muting is performed.

在本公开的一些实施例中,例如字幕和/或配音可以针对整个视频编辑结果生效。In some embodiments of the present disclosure, for example, subtitles and/or dubbing may be effective for the entire video editing result.

例如,用户导入的所有影像素材的最终视频成品,都自动添加字幕,即按照最终视频成品的时长展示字幕。当用户的单个片段时长小于预设时长阈值例如3s的情况下,不为该片段配字幕和/或配音。如图2(a)~2(e)所示,字幕时长与最终视频时长相同。For example, subtitles are automatically added to the final video of all the video materials imported by the user, that is, the subtitles are displayed according to the duration of the final video. When the duration of a single clip of the user is less than a preset duration threshold, such as 3 seconds, no subtitles and/or dubbing are added to the clip. As shown in Figures 2(a) to 2(e), the duration of the subtitles is the same as the duration of the final video.

在本公开的一些实施例中,字幕和配音可以是一一对应的。配音的内容底层为字幕内容。如果字幕内容发生修改,同时将修改内容应用到配音被允许,配音的朗读内容也自动变化。在本公开的一些实施例中,播放配音和展示字幕是解耦的。例如,表一展示了本公开至少一些实施例提供的配音和字幕解耦的示例。In some embodiments of the present disclosure, subtitles and dubbing can be one-to-one. The underlying content of the dubbing is the subtitle content. If the subtitle content is modified, it is allowed to apply the modified content to the dubbing, and the reading content of the dubbing also changes automatically. In some embodiments of the present disclosure, playing the dubbing and displaying the subtitles are decoupled. For example, Table 1 shows an example of decoupling dubbing and subtitles provided by at least some embodiments of the present disclosure.

表一
Table 1

如表一中的场景1、场景2、场景3和场景4所示,配音和字幕是解耦的,如果视频编辑模板包括“智能添加配音”标签和“智能添加字幕”标签,例如,图1C的结构信息,则合成的视频编辑结果预期也是有配音和字幕的。如果视频编辑模板中不包括“智能添加配音”标签和“智能添加字幕”标签,则合成的视频编辑结果预期也是不包括配音和字幕的。如果视频编辑模板中包括“智能添加配音”标签,但不包括“智能添加字幕”标签,则合成的视频编辑结果预期是包括配音,但不包括字幕的。如果视频编辑模板中不包括“智能添加配音”标签,但包括“智能添加字幕”标签,则合成的视频编辑结果预期是不包括配音,但包括字幕的。As shown in Scene 1, Scene 2, Scene 3 and Scene 4 in Table 1, dubbing and subtitles are decoupled. If the video editing template includes the "Smart Add Dubbing" tag and the "Smart Add Subtitles" tag, for example, the structural information of Figure 1C, the synthesized video editing result is also expected to have dubbing and subtitles. If the video editing template does not include the "Smart Add Dubbing" tag and the "Smart Add Subtitles" tag, the synthesized video editing result is also expected to not include dubbing and subtitles. If the video editing template includes the "Smart Add Dubbing" tag but does not include the "Smart Add Subtitles" tag, the synthesized video editing result is expected to include dubbing but not subtitles. If the video editing template does not include the "Smart Add Dubbing" tag but includes the "Smart Add Subtitles" tag, the synthesized video editing result is expected to not include dubbing but include subtitles.

在本公开的一些实施例中,在目标编辑效果只包括字幕的情况下,视频编辑结果对应的预览视频中的配音标识展示为关闭状态,在目标编辑效果只包括配音的情况下,视频编辑结果对应的预览视频中的字幕标识展示为关闭状态。例如,在场景2中,配音标识为开启状态,字幕标识为关闭状态;在场景3中,配音标识为关闭状态,字幕标识为开启状态。本领域技术人员可以设置任意的图标分别表示配音和字幕,配音的图标又包括两种不同的显示状态分别表示关闭状态和开启状态,字幕图标也包括两种不同的显示状态分别表示关闭状态和开启状态。In some embodiments of the present disclosure, when the target editing effect includes only subtitles, the dubbing mark in the preview video corresponding to the video editing result is displayed as a closed state. When the target editing effect includes only dubbing, the subtitle mark in the preview video corresponding to the video editing result is displayed as a closed state. For example, in scene 2, the dubbing mark is in the open state and the subtitle mark is in the closed state; in scene 3, the dubbing mark is in the closed state and the subtitle mark is in the open state. Those skilled in the art can set arbitrary icons to represent dubbing and subtitles respectively. The dubbing icon includes two different display states, respectively representing the closed state and the open state, and the subtitle icon also includes two different display states, respectively representing the closed state and the open state.

在本公开的一些实施例中,步骤S30例如包括将每个目标视频段落对应的视频部分进行拼接,得到视频编辑结果。In some embodiments of the present disclosure, step S30, for example, includes splicing the video parts corresponding to each target video segment to obtain a video editing result.

表二示出了本公开一些实施例提供的一种素材导入方式的示例。
Table 2 shows an example of a material import method provided by some embodiments of the present disclosure.

如表二所示,模板视频包括依序排列的主题1、主题2和主题3,主题1中导入的影像素材包括片段A、片段B和片段C,并且三个片段的排序为片段A、片段B和片段C,主题2的影像素材包括了片段D,主题3的影像素材包括了片段E、片段F、片段G,并且三个片段的排序为片段E、片段F、片段G。按照主题顺序对主题1的视频部分、主题2的视频部分和主题3的视频部分拼接得到视频编辑结果。如上所描述,每个主题的视频部分为基于影像素材和目标编辑效果得到的。例如,将目标编辑效果施加到影像素材。该视频编辑结果例如为将第一目标编辑效果施加到片段A、片段B、片段C,将第二目标编辑效果施加到片段D,将第三目标编辑效果施加到片段E、片段F和片段G后,按照顺序组合在一起得到的。As shown in Table 2, the template video includes Topic 1, Topic 2, and Topic 3 arranged in sequence, the image material imported into Topic 1 includes Segment A, Segment B, and Segment C, and the three segments are arranged as Segment A, Segment B, and Segment C, the image material of Topic 2 includes Segment D, and the image material of Topic 3 includes Segment E, Segment F, and Segment G, and the three segments are arranged as Segment E, Segment F, and Segment G. The video part of Topic 1, the video part of Topic 2, and the video part of Topic 3 are spliced in the order of the themes to obtain the video editing result. As described above, the video part of each theme is obtained based on the image material and the target editing effect. For example, the target editing effect is applied to the image material. The video editing result is obtained by applying the first target editing effect to Segment A, Segment B, and Segment C, applying the second target editing effect to Segment D, and applying the third target editing effect to Segment E, Segment F, and Segment G, and then combining them in order.

若主题1的影像素材包括片段A、片段B和片段C,并且三个片段的排序为片段A、片段B和片段C,主题2未导入素材,主题3也未导入素材。按照主题顺序对多个片段拼接得到按照片段A、片段B、片段C的顺序组合在一起,并且向片段A、片段B、片段C拼接视频施加第一目标编辑效果得到视频编辑结果。If the video material of subject 1 includes segment A, segment B and segment C, and the three segments are ordered as segment A, segment B and segment C, and no material is imported into subject 2, and no material is imported into subject 3. Multiple segments are spliced together in the order of the themes to obtain a video that is combined in the order of segment A, segment B, and segment C, and the first target editing effect is applied to the spliced video of segment A, segment B, and segment C to obtain a video editing result.

若主题1的影像素材包括片段A、片段B和片段C,并且三个片段的排序为片段A、片段B和片段C,主题2未导入,主题3的影像素材包括片段E、片段F、片段G,并且三个片段的排序为片段E、片段F、片段G。按照主题顺序对多个片段拼接得到按照片段A、片段B、片段C、片段E、片段F和片段G的顺序组合在一起的拼接视频,并且向拼接视频中的片段A、片段B、片段C的部分施加第一目标编辑效果,向拼接视频中的片段F和片段G的部分施加第三目标编辑效果,得到视频编辑结果。If the image material of subject 1 includes segment A, segment B and segment C, and the three segments are arranged in the order of segment A, segment B and segment C, subject 2 is not imported, and the image material of subject 3 includes segment E, segment F and segment G, and the three segments are arranged in the order of segment E, segment F and segment G. Multiple segments are spliced in the order of themes to obtain a spliced video combined in the order of segment A, segment B, segment C, segment E, segment F and segment G, and the first target editing effect is applied to the parts of segment A, segment B and segment C in the spliced video, and the third target editing effect is applied to the parts of segment F and segment G in the spliced video to obtain a video editing result.

若主题1中未导入素材,主题2的影像素材包括片段D,主题3的影像素材包括片段E、片段F、片段G,并且三个片段的排序为片段E、片段F、片段G。按照主题顺序对多个片段拼接得到按照片段D、片段E、片段F和片段G的顺序组合在一起的拼接视频,并且向拼接视频中的片段D的部分时间第二目标编辑效果,向拼接视频中的片段F和片段G的部分施加第三目标编辑效果,得到视频编辑结果。If no material is imported into Topic 1, the image material of Topic 2 includes Segment D, and the image material of Topic 3 includes Segment E, Segment F, and Segment G, and the three segments are arranged in the order of Segment E, Segment F, and Segment G. Multiple segments are spliced in the order of themes to obtain a spliced video that is combined in the order of Segment D, Segment E, Segment F, and Segment G, and the second target editing effect is applied to a portion of Segment D in the spliced video, and the third target editing effect is applied to portions of Segments F and G in the spliced video to obtain a video editing result.

在本公开的一些实施例中,在N<M的情况下,对于没有导入优化素材的第一主题,从导入影像素材的第二主题中选择视频片段作为第一主题的目标片段;或者在按照M个主题顺序拼接多个视频部分的过程中跳过第一主题,拼接位于第一主题之后的主题的视频部分。In some embodiments of the present disclosure, when N<M, for the first theme without imported optimized material, a video clip is selected from the second theme with imported image material as the target clip of the first theme; or the first theme is skipped in the process of splicing multiple video parts in sequence of M themes, and the video part of the theme after the first theme is spliced.

例如,当第二个主题(例如,图1D中的“美食制作过程”主题)没有素材时,则自动顺延到第三个主题与第一主题进行拼接,第三主题的素材直接排在第一主题素材的后面。For example, when there is no material for the second theme (e.g., the theme of "food making process" in FIG. 1D ), it is automatically postponed to the third theme for splicing with the first theme, and the material of the third theme is directly placed behind the material of the first theme.

例如,若视频编辑模板被设置为“默认填充”,则即使在该主题没有导入任何素材,都会从已导入素材里选择一段(例如,随机选择)进行填充,保证整体预览效果完整性。For example, if the video editing template is set to "default fill", even if no material is imported into the theme, a segment will be selected from the imported material (for example, randomly selected) for filling to ensure the integrity of the overall preview effect.

在本公开的一些实施例中,至少一个编辑效果除了包括上述文案、字幕、配音等针对每个主题的段落编辑效果,还包括全局编辑效果。全局编辑效果例如包括音乐、滤镜和字幕中的至少一种。在该实施例中,步骤S30包括:响应于编辑处理的触发操作,调整全局编辑效果的时间长度与视频编辑结果中采用的目标影像素材的时间长度匹配,得到调整后的全局编辑效果。在此基础上,例如基于调整后的全局编辑效果和每个目标视频段落对应的视频部分,生成视频编辑结果。In some embodiments of the present disclosure, at least one editing effect includes not only the above-mentioned paragraph editing effects for each theme, such as text, subtitles, and dubbing, but also a global editing effect. The global editing effect includes, for example, at least one of music, filters, and subtitles. In this embodiment, step S30 includes: in response to the triggering operation of the editing process, adjusting the time length of the global editing effect to match the time length of the target image material used in the video editing result, and obtaining the adjusted global editing effect. On this basis, for example, based on the adjusted global editing effect and the video portion corresponding to each target video paragraph, a video editing result is generated.

目标影像素材例如是视频编辑结果中包括的影像素材。在本公开的一些实施例中,用户导入的至少一个影像素材不需要全部都用于生成视频编辑结果。例如,可以使用至少一个影像素材进行高光处理得到高光片段来生成视频编辑结果。在该实施例中,目标视频段落的采用的影像素材的时间长度小于目标视频段落导入的至少一个影像素材的时间长度。The target image material is, for example, an image material included in the video editing result. In some embodiments of the present disclosure, the at least one image material imported by the user does not need to be used to generate the video editing result. For example, at least one image material can be used to perform highlight processing to obtain a highlight segment to generate the video editing result. In this embodiment, the time length of the image material used in the target video segment is less than the time length of the at least one image material imported into the target video segment.

在本公开的一些实施例中,例如视频编辑结果包括的目标影像素材是利用一个或者多个目标视频段落的高光片段拼接得到的。若全局编辑效果包括音乐,调整音乐播放的时间长度与目标影像素材的时间长度匹配。例如,视频编辑模板中音乐播放的时间长度为第一时长,目标影像素材的时间长度为第二时长,则将音乐播放的时间长度由第一时长调整为第二时长。In some embodiments of the present disclosure, for example, the target image material included in the video editing result is obtained by splicing highlight clips of one or more target video segments. If the global editing effect includes music, the duration of the music playback is adjusted to match the duration of the target image material. For example, if the duration of the music playback in the video editing template is a first duration and the duration of the target image material is a second duration, the duration of the music playback is adjusted from the first duration to the second duration.

例如,响应于第二时长大于第一时长,将音乐的播放时间长度延长为第二时长,并且在视频编辑结果中音乐开始播放的时刻与在视频编辑模板中音乐开始播放的时刻相同;或者响应于第二时长小于第一时长,缩短音乐的播放时间长度为第二时长,并且在视频编辑结果中音乐逐渐减弱至消失。For example, in response to the second duration being greater than the first duration, the playback time of the music is extended to the second duration, and the time when the music starts playing in the video editing result is the same as the time when the music starts playing in the video editing template; or in response to the second duration being less than the first duration, the playback time of the music is shortened to the second duration, and the music gradually weakens until it disappears in the video editing result.

在本公开的实施例中,音乐针对视频全局且只有一个,例如视频编辑模板中包括一个音乐,该音乐作为视频编辑模板的背景音乐,在生成的视频编辑结果中可以直接套用该视频编辑模板中的背景音乐,但是适当地调整背景音乐的时间长度。In the embodiments of the present disclosure, the music is global to the video and there is only one music. For example, a video editing template includes a music, and the music serves as the background music of the video editing template. The background music in the video editing template can be directly applied to the generated video editing result, but the time length of the background music is appropriately adjusted.

如图2(a)所示,视频编辑模板201包括主题1、主题2和主题3。主题1的至少一个模板影像素材包括片段1和片段2,主题2的至少一个模板影像素材包括片段3、片段4、片段5、片段6和片段7,主题3的至少一个模板影像素材包括片段8、片段9、片段10和片段11。视频编辑模板201中的音乐贯穿整个模板视频,例如整个视频编辑模板都以该音乐作为背景音乐,音乐的原时长(即,第一时长)与视频编辑模板201的视频时间长度一致。As shown in FIG2(a), the video editing template 201 includes theme 1, theme 2 and theme 3. At least one template image material of theme 1 includes segment 1 and segment 2, at least one template image material of theme 2 includes segment 3, segment 4, segment 5, segment 6 and segment 7, and at least one template image material of theme 3 includes segment 8, segment 9, segment 10 and segment 11. The music in the video editing template 201 runs through the entire template video, for example, the entire video editing template uses the music as background music, and the original duration of the music (i.e., the first duration) is consistent with the video time length of the video editing template 201.

如图2(b)所示,若用户在视频编辑界面中只向主题2和主题3导入了影像素材,并未向主题1导入影像素材,使得视频编辑结果中目标影像素材的播放时长小于视频编辑模板201的多个模板影像素材组合的时间长度(以下简称为“视频时间长度”),则直接缩短音乐的原时长为主题2和主题3所采用的目标影像素材的播放时长。并且,在视频编辑结果中的背景音乐逐渐减弱至消失,即对背景音乐进行淡出处理。As shown in FIG2(b), if the user only imports video materials into Theme 2 and Theme 3 in the video editing interface, and does not import video materials into Theme 1, so that the playback time of the target video material in the video editing result is less than the time length of the combination of multiple template video materials of the video editing template 201 (hereinafter referred to as "video time length"), the original time length of the music is directly shortened to the playback time length of the target video material used by Theme 2 and Theme 3. In addition, the background music in the video editing result gradually weakens until it disappears, that is, the background music is faded out.

如图2(c)所示,虽然用户在视频编辑界面中向主题1、主题2和主题3分别导入了影像素材,但是采用的影像素材中的目标影像素材的播放时长小于视频编辑模板201的视频时间长度,则同样直接缩短音乐的原时长与目标影像素材的播放时长一致。并且,在视频编辑结果中的背景音乐逐渐减弱至消失,即对背景音乐进行淡出处理。As shown in FIG2(c), although the user has imported video materials into Topic 1, Topic 2, and Topic 3 in the video editing interface, the playback time of the target video material in the adopted video material is shorter than the video time length of the video editing template 201, and the original duration of the music is also directly shortened to be consistent with the playback time length of the target video material. In addition, the background music in the video editing result gradually weakens until it disappears, that is, the background music is faded out.

如图2(d)所示,用户在视频编辑界面中向主题1、主题2和主题3分别导入了影像素材,但是采用的影像素材中的目标影像素材的播放时长大于视频编辑模板201的视频时间长度,则延长音乐的原时长与目标影像素材播放时长一致。As shown in Figure 2(d), the user imported image materials into Theme 1, Theme 2 and Theme 3 respectively in the video editing interface. However, the playback duration of the target image material in the image materials used is longer than the video time length of the video editing template 201. In this case, the original duration of the music is extended to be consistent with the playback duration of the target image material.

在本公开的一些实施例中,当目标影像素材的播放时长比视频编辑模板中的模板影像素材的播放时长长时,将视频编辑模板中音乐结束时刻向后延长,视频编辑结果中音乐开始播放的时刻对齐视频编辑模板。In some embodiments of the present disclosure, when the playback duration of the target image material is longer than the playback duration of the template image material in the video editing template, the end time of the music in the video editing template is extended backward, and the start time of the music in the video editing result is aligned with the video editing template.

在本公开的一些实施例中,如果音乐的总时长较长,则续播剩余音乐。例如,音乐总时长为3分钟,但视频编辑模板只截取了0-1分钟的时长范围使用(即,原时长为1分钟),此时若目标影像素材超过了1分钟,则可以继续播放音乐1分钟之后的剩余内容。In some embodiments of the present disclosure, if the total duration of the music is long, the remaining music will be played continuously. For example, if the total duration of the music is 3 minutes, but the video editing template only uses the duration range of 0-1 minutes (i.e., the original duration is 1 minute), if the target video material exceeds 1 minute, the remaining content after 1 minute of the music can be played continuously.

若音乐的总时长比目标影像素材的播放时长短,则可以采用智能延长音乐的方式来延长音乐。智能延长音乐的方式例如应用神经网络的方式自动生成衔接在当前音乐后面的音乐片段,或者从音乐库中寻找与当前音乐适配的旋律或者音乐。If the total duration of the music is shorter than the playback duration of the target video material, the music can be extended by intelligently extending the music. For example, the intelligently extending music can automatically generate a music clip that connects to the current music by using a neural network, or find a melody or music that matches the current music from a music library.

在本公开的一些实施例中,若存在智能延长难以实现的情况,则通过自动循环播放当前音乐的方式来延长至播放时长。In some embodiments of the present disclosure, if there is a situation where intelligent extension is difficult to achieve, the playing time is extended by automatically looping the current music.

在本公开的一些实施例中,全局编辑效果包括滤镜。例如,响应于目标影像素材的播放时长大于视频时间长度,将视频编辑模板中滤镜生效时间长度延长;或者响应于目标影像素材的播放时长小于视频时间长度,将视频编辑模板中滤镜生效时间长度缩短。In some embodiments of the present disclosure, the global editing effect includes a filter. For example, in response to the target image material playing time being longer than the video time, the filter effective time length in the video editing template is extended; or in response to the target image material playing time being shorter than the video time length, the filter effective time length in the video editing template is shortened.

如图2中的(b)和(c)所示,第一滤镜生效时间长度例如为原时长,当目标影像素材的播放时长比原时长小时,缩短滤镜生效时间长度为目标影像素材的播放时间长度。如图2中的(d)所示,当目标影像素材的播放时长比原时长大时,延长滤镜生效时间长度为目标影像素材的播放时间长度。滤镜的配置的调整方式与音乐的配置的调整方式类似。As shown in (b) and (c) of FIG. 2 , the effective time length of the first filter is, for example, the original time length. When the playback time length of the target image material is shorter than the original time length, the effective time length of the filter is shortened to the playback time length of the target image material. As shown in (d) of FIG. 2 , when the playback time length of the target image material is longer than the original time length, the effective time length of the filter is extended to the playback time length of the target image material. The adjustment method of the filter configuration is similar to the adjustment method of the music configuration.

在本公开的一些实施例中,字幕也可以作为一种全局编辑效果,关于字幕作为全局编辑效果与前述滤镜和音乐的处理逻辑类似,不再赘述。In some embodiments of the present disclosure, subtitles can also be used as a global editing effect. The processing logic of subtitles as a global editing effect is similar to that of the aforementioned filters and music, and will not be repeated here.

在上述一些实施例中,步骤S30是根据匹配策略生成视频编辑结果,在本公开的另外一些实施例中,步骤S30可以包括:响应于编辑处理的触发操作,利用神经网络对至少一个影像素材和编辑效果进行处理,得到视频编辑结果。In some of the above embodiments, step S30 generates a video editing result based on a matching strategy. In some other embodiments of the present disclosure, step S30 may include: in response to a trigger operation of the editing process, using a neural network to process at least one image material and an editing effect to obtain a video editing result.

例如,响应于编辑处理的触发操作,通过神经网络对影像素材和编辑效果进行分析,得到与影像素材适配的编辑效果应用于影像素材,得到视频编辑结果。For example, in response to a trigger operation of editing processing, the image material and the editing effect are analyzed through a neural network, and the editing effect adapted to the image material is applied to the image material to obtain a video editing result.

神经网络例如可以包括用于图像处理的神经网络,例如卷积神经网络、深度学习神经网络等。The neural network may include, for example, a neural network for image processing, such as a convolutional neural network, a deep learning neural network, etc.

在本公开的一些实施例中,步骤S30包括:响应于编辑处理的触发操作,先对至少一个影像素材进行预处理得到目标影像素材,然后对目标影像素材应用对应的编辑效果从而得到视频编辑结果。预处理例如包括高光识别、素材分割等。In some embodiments of the present disclosure, step S30 includes: in response to a triggering operation of editing processing, first pre-processing at least one image material to obtain a target image material, and then applying a corresponding editing effect to the target image material to obtain a video editing result. Pre-processing includes, for example, highlight recognition, material segmentation, etc.

例如,步骤S30包括:响应于编辑处理的触发操作,提取至少一个影像素材中的目标片段;以及根据视频编辑模板和目标片段,生成视频编辑结果。For example, step S30 includes: extracting a target segment from at least one image material in response to a triggering operation of the editing process; and generating a video editing result according to the video editing template and the target segment.

提取至少一个影像素材中的目标片段例如可以是对至少一个影像素材进行高光识别,得到至少一个影像素材的高光时刻的目标片段。高光时刻的目标片段例如是指对至少一个影像素材中的至少一个中的最优部分进行截取得到的最优素材。Extracting the target segment from at least one image material may be, for example, performing highlight recognition on at least one image material to obtain a target segment at a highlight moment of at least one image material. The target segment at a highlight moment refers to, for example, an optimal material obtained by intercepting an optimal portion of at least one of the at least one image material.

在该实施例中,能够自动对至少一个影像素材进行识别得到至少一个影像素材的优选片段生成视频编辑结果。In this embodiment, at least one image material can be automatically identified to obtain a preferred segment of at least one image material to generate a video editing result.

在本公开的一些实施例中,步骤S30包括:响应于编辑处理的触发操作,若至少一个影像素材中存在第一影像素材,将第一影像素材分割为多个子片段,其中,第一影像素材为时长大于预设时长的素材;以及基于视频编辑模板和多个子片段,生成视频编辑结果。In some embodiments of the present disclosure, step S30 includes: in response to a trigger operation of editing processing, if there is a first image material in at least one image material, dividing the first image material into multiple sub-segments, wherein the first image material is a material with a duration greater than a preset duration; and generating a video editing result based on the video editing template and the multiple sub-segments.

每个主题中用户可以向图1D所示的导入至少一个影像素材。至少一个影像素材中可能存在时间长度大于预设时长的第一影像素材,在本公开的实施例中,对于时间长度大于预设时长的第一影像素材可以进行分割处理,将第一影像素材分割为多个子片段,这些子片段仍然作为该第一影像素材对应的主题的素材,不跨主题使用,以避免用户没有预期。将第一影像素材分割为多个子片段有利于对子片段进行识别和利用,至少部分地避免影像素材过长导致的处理时间过长的问题,并且有利于子片段的灵活使用。In each theme, the user can import at least one image material as shown in FIG1D . In at least one image material, there may be a first image material whose time length is greater than a preset time length. In the embodiment of the present disclosure, the first image material whose time length is greater than the preset time length can be segmented and processed, and the first image material is segmented into a plurality of sub-segments. These sub-segments are still used as materials of the theme corresponding to the first image material and are not used across themes to avoid unexpected user effects. Dividing the first image material into a plurality of sub-segments is conducive to identifying and utilizing the sub-segments, at least partially avoiding the problem of long processing time caused by excessively long image materials, and is conducive to the flexible use of sub-segments.

在本公开的一些实施例中,例如在视频编辑界面页面中,提前对用户导入的影像素材和提示信息做分析识别,当获取到编辑处理的触发操作后直接根据分析结果对影像素材进行提取(即,截取)和分割,从而减少生成视频编辑结果的等待时间。例如,只要用户导入了一个或多个素材便对素材进行分析,不用等待用户执行了编辑处理的触发操作才开始分析。In some embodiments of the present disclosure, for example, in a video editing interface page, the image material and prompt information imported by the user are analyzed and identified in advance, and when the trigger operation of the editing process is obtained, the image material is directly extracted (i.e., intercepted) and segmented according to the analysis result, thereby reducing the waiting time for generating the video editing result. For example, as long as the user imports one or more materials, the material is analyzed, without waiting for the user to perform the trigger operation of the editing process before starting the analysis.

在本公开的一些实施例中,当用户开始选择另一个主题的素材,就认为之前的主题已经处理完了,需要考虑兼容用户修改的情况。In some embodiments of the present disclosure, when a user starts to select materials for another theme, it is considered that the previous theme has been processed, and compatibility with user modifications needs to be considered.

在本公开的一些实施例中,该方法还包括根据视频编辑结果,在预览页面播放预览视频。通过播放预览视频可供用户观看根据影像素材生成的视频草稿,在用户需要修改时便于用户及时修改。在本公开的一些实施例中,在分析结果指示提示信息包括预设播放时长的情况下,至少部分素材的播放时长与预设播放时长一致。In some embodiments of the present disclosure, the method further includes playing a preview video on a preview page according to the video editing result. Playing the preview video allows the user to watch the video draft generated according to the image material, which is convenient for the user to modify in time when the user needs to modify. In some embodiments of the present disclosure, when the analysis result indicates that the prompt information includes a preset playback time, the playback time of at least part of the material is consistent with the preset playback time.

例如,用户输入的提示信息为“20分钟学会3个美食”,那么对提示信息语义分析得到用户希望生产出的视频的播放时长为20分钟,那么从影像素材中提取的应用于视频编辑结果的目标影像素材的时长也为20分钟。For example, if the prompt information input by the user is "Learn 3 delicacies in 20 minutes", then the semantic analysis of the prompt information shows that the playback duration of the video the user hopes to produce is 20 minutes, and the duration of the target image material extracted from the image material and applied to the video editing result is also 20 minutes.

图4示出了本公开至少一些实施例提供的一种预览页面的示意图。FIG. 4 shows a schematic diagram of a preview page provided by at least some embodiments of the present disclosure.

如图4所示,该预览页面401包括视频播放窗口411、素材展示窗口421和编辑入口窗口431。As shown in FIG. 4 , the preview page 401 includes a video playback window 411 , a material display window 421 , and an editing entry window 431 .

视频播放窗口411用于展示或者播放预览视频。素材展示窗口421包括用于展示预览视频中的多个影像素材每个的至少部分的多个素材子窗口。多个影像素材每个的至少部分例如是多个影像素材每个中的目标片段。例如,多个素材子窗口包括用于展示主题1的素材子窗口4211和用于展示主题2的素材子窗口4212。即,在本公开的一些实施例中,每个主题对应一个素材子窗口,用于展示该主题采用的素材,每个素材子窗口包括多个素材窗口单元,每个素材窗口单元用于显示该主题中的一个素材。如图4所示,主题1中显示有4个素材窗口单元,分别为素材窗口单元A、素材窗口单元B、素材窗口单元C和素材窗口单元D,4个素材窗口单元分别显示该主题1中的不同素材。The video playback window 411 is used to display or play the preview video. The material display window 421 includes multiple material sub-windows for displaying at least part of each of the multiple image materials in the preview video. At least part of each of the multiple image materials is, for example, a target segment in each of the multiple image materials. For example, the multiple material sub-windows include a material sub-window 4211 for displaying theme 1 and a material sub-window 4212 for displaying theme 2. That is, in some embodiments of the present disclosure, each theme corresponds to a material sub-window for displaying the material used in the theme, and each material sub-window includes multiple material window units, and each material window unit is used to display a material in the theme. As shown in Figure 4, four material window units are displayed in theme 1, namely material window unit A, material window unit B, material window unit C and material window unit D, and the four material window units respectively display different materials in the theme 1.

在本公开的一些实施例中,素材展示窗口421还包括用于添加素材的素材添加子窗口4213。在该实施例中,视频编辑方法还包括:响应于在多个素材子窗口4213中的素材窗口单元被选中的情况下对素材添加子窗口的触发操作,展示多个供选择素材;根据对多个供选择素材的添加选择操作,将目标素材添加到素材窗口单元之后。In some embodiments of the present disclosure, the material display window 421 further includes a material adding sub-window 4213 for adding materials. In this embodiment, the video editing method further includes: in response to a triggering operation on the material adding sub-window when a material window unit in the multiple material sub-windows 4213 is selected, displaying multiple materials for selection; and according to the adding selection operation on the multiple materials for selection, adding the target material after the material window unit.

如图4所示,若在素材窗口单元B被选中的情况下,对素材添加子窗口4213进行点击操作,拉起相册列表以展示多个供选择影像素材,若用户对多个供选择影像素材中的目标素材进行添加选择操作则将目标素材添加到素材窗口单元B之后,例如,在素材窗口单元B和素材窗口单元C之间增加了素材窗口单元E,以展示目标素材。关于相册列表请参考上文的描述。As shown in FIG4 , if the material window unit B is selected, the material adding sub-window 4213 is clicked, and the album list is pulled up to display multiple image materials for selection. If the user adds and selects a target material from the multiple image materials for selection, the target material is added after the material window unit B. For example, a material window unit E is added between the material window unit B and the material window unit C to display the target material. Please refer to the above description for the album list.

编辑入口窗口431用于展示对素材进行编辑的入口。例如,编辑入口窗口431包括对用于对字幕进行编辑的字幕入口4311、用于对音乐进行编辑的音乐入口4312、对滤镜进行编辑的滤镜入口4313以及用于对文案进行编辑的文案入口4314中的至少一个。编辑入口窗口431还可以包括其他功能的入口,本领域技术人员可以根据编辑需求设置在编辑入口窗口。在本公开的一些实施例中,在点击字幕入口4311进入字幕编辑页面,在该字幕编辑页面中可以包括用于对配音进行编辑的配音编辑入口。配音编辑入口可以常驻于字幕编辑页面,无论是否进行配音都可以展示配音编辑入口。若视频编辑模版中没有配音,点击进入配音编辑入口后,配音默认为禁用状态。The editing entry window 431 is used to display the entry for editing the material. For example, the editing entry window 431 includes at least one of a subtitle entry 4311 for editing subtitles, a music entry 4312 for editing music, a filter entry 4313 for editing filters, and a text entry 4314 for editing text. The editing entry window 431 may also include an entry for other functions, which can be set in the editing entry window by those skilled in the art according to editing requirements. In some embodiments of the present disclosure, when clicking on the subtitle entry 4311 to enter the subtitle editing page, a dubbing editing entry for editing dubbing may be included in the subtitle editing page. The dubbing editing entry may be resident on the subtitle editing page, and the dubbing editing entry may be displayed regardless of whether dubbing is performed. If there is no dubbing in the video editing template, after clicking to enter the dubbing editing entry, dubbing is disabled by default.

在本公开的一些实施例中,对于字幕入口4311,若视频编辑模板中包括智能添加字幕的标签或者智能添加配音的标签,则满足向预览视频中智能添加字幕的条件,则在编辑入口窗口431展示该字幕入口4311。若模版视频不包括智能添加字幕的标签和智能添加配音的标签,则不在编辑入口窗口431展示字幕入口4311。In some embodiments of the present disclosure, for the subtitle entry 4311, if the video editing template includes a tag for intelligently adding subtitles or a tag for intelligently adding dubbing, the condition for intelligently adding subtitles to the preview video is met, and the subtitle entry 4311 is displayed in the editing entry window 431. If the template video does not include a tag for intelligently adding subtitles or a tag for intelligently adding dubbing, the subtitle entry 4311 is not displayed in the editing entry window 431.

在本公开的一些实施例中,一些视频编辑模板可以只有配音或者只有字幕。在只有配音时,字幕入口4311正常展示于编辑入口窗口431,只是在字幕编辑页面中“字幕”展示默认为关闭状态,字幕只是服务配音朗读内容。在一些模版视频中可以只有字幕,字幕入口4311正常展示于编辑入口窗口431,只是在字幕编辑页面中“配音”展示默认为关闭状态,仅展示字幕,不进行朗读配音。In some embodiments of the present disclosure, some video editing templates may have only dubbing or only subtitles. When there is only dubbing, the subtitle entry 4311 is normally displayed in the editing entry window 431, but the "subtitle" display in the subtitle editing page is closed by default, and the subtitle only serves the dubbing and reading content. In some template videos, there may be only subtitles, and the subtitle entry 4311 is normally displayed in the editing entry window 431, but the "dubbing" display in the subtitle editing page is closed by default, and only the subtitles are displayed without reading and dubbing.

在字幕编辑页面中,用户可以对字幕进行修改内容、设置字幕时长、文本朗读以及删除等操作。On the subtitle editing page, users can modify the subtitle content, set the subtitle duration, read the text aloud, and delete it.

图5A~5F示出了本公开至少一些实施例提供的一种修改字幕的示意图。5A to 5F are schematic diagrams showing a method of modifying subtitles provided by at least some embodiments of the present disclosure.

例如,如图5A所示,在字幕一级页面(即,字幕编辑页面)501中,展示有字幕“abcd”、字幕“efg”以及字幕“higk”。若在字幕编辑页面501中选中字幕“efg”,并且此时画面中也展示该字幕“efg”,则进入图5B所示的字幕二级页面502,或者直接点击字幕内容进入字幕编辑页面502。在字幕编辑页面502中点击“编辑”入口512,进入字幕内容修改。在本公开的一些实施例中,字幕的选中态跟随滑动操作选中不同字幕,同时被选中字幕在画面中自动展示。For example, as shown in FIG5A , in the first-level subtitle page (i.e., the subtitle editing page) 501, the subtitles "abcd", "efg" and "higk" are displayed. If the subtitle "efg" is selected in the subtitle editing page 501, and the subtitle "efg" is also displayed on the screen at this time, the second-level subtitle page 502 shown in FIG5B is entered, or the subtitle content is directly clicked to enter the subtitle editing page 502. In the subtitle editing page 502, click the "Edit" entry 512 to enter the subtitle content modification. In some embodiments of the present disclosure, the selected state of the subtitle follows the sliding operation to select different subtitles, and the selected subtitles are automatically displayed on the screen.

如图5C~5D所示,在对字幕内容修改时,自动拉起输入框504及键盘503,以利用输入框504和键盘503修改字幕内容。支持上屏的“删除”按钮;或字幕二级页面的删除方式删除字幕,例如如图5B所示,删除按钮522位于字幕二级页面的最后一个。字幕位置可以被拖动,拖动后默认应用到全局,所有字幕都改为新位置。例如,在对字幕内容修改完成后,通过点击生效图标505应用在预览视频中。在本公开的一些实施例中,在点击了生效图标505之后,可以出现图5D所示的弹窗506,该弹窗506用于询问用户是否将该字幕应用于配音。若用户选择应用于配音,则相应地,将配音修改为与改字幕一致。As shown in Figures 5C to 5D, when the subtitle content is modified, the input box 504 and the keyboard 503 are automatically pulled up to modify the subtitle content using the input box 504 and the keyboard 503. Support the "Delete" button on the upper screen; or delete the subtitles in the deletion method of the subtitle secondary page. For example, as shown in Figure 5B, the delete button 522 is located at the last one of the subtitle secondary page. The subtitle position can be dragged, and after dragging, it is applied globally by default, and all subtitles are changed to the new position. For example, after the modification of the subtitle content is completed, it is applied in the preview video by clicking the effective icon 505. In some embodiments of the present disclosure, after clicking the effective icon 505, a pop-up window 506 shown in Figure 5D may appear, and the pop-up window 506 is used to ask the user whether to apply the subtitle to dubbing. If the user chooses to apply to dubbing, the dubbing is modified to be consistent with the changed subtitles accordingly.

如图5B和5D所示,在字幕二级页面502中还可以包括字幕时长设置按钮532。As shown in FIGS. 5B and 5D , the subtitle secondary page 502 may further include a subtitle duration setting button 532 .

例如,当用户选择了对应字幕后,点击字幕时长设置按钮532,在字幕二级页面中下方展示该字幕对应的片段范围542,支持伸缩选取时长范围。在展示字幕对应的片段范围542的上方可以展示其他字幕552,选中其他字幕552可以切换到其他字幕时长范围显示。For example, after the user selects a corresponding subtitle, he clicks the subtitle duration setting button 532, and the segment range 542 corresponding to the subtitle is displayed at the bottom of the subtitle secondary page, supporting the selection of a retractable duration range. Other subtitles 552 can be displayed above the displayed segment range 542 corresponding to the subtitle, and by selecting other subtitles 552, the display can be switched to other subtitle duration ranges.

在本公开的一些实施例中,在字幕一级页面501和字幕二级页面502可以常驻“换一批”按钮562。响应于对“换一批”按钮562的点击操作,重新生成新的字幕内容。当在字幕一级页面501或者字幕二级页面502中滑动字幕时,该“换一批”按钮562可以消失,当滑动停止是,该“换一批”按钮562继续出现。In some embodiments of the present disclosure, a "change a batch" button 562 may be permanently located on the subtitles primary page 501 and the subtitles secondary page 502. In response to a click operation on the "change a batch" button 562, new subtitle content is regenerated. When the subtitles are slid in the subtitles primary page 501 or the subtitles secondary page 502, the "change a batch" button 562 may disappear, and when the sliding stops, the "change a batch" button 562 continues to appear.

在本公开的一些实施例中,在字幕一级页面501和字幕二级页面502还可以包括用于查看上一个字幕的第一查看图标和用于查看下一个字幕的第二查看图标。点击第一查看图标则回到原来上一个版本,点击第二查看图标则继续生成新字幕。In some embodiments of the present disclosure, the first subtitle page 501 and the second subtitle page 502 may further include a first viewing icon for viewing the previous subtitle and a second viewing icon for viewing the next subtitle. Clicking the first viewing icon returns to the previous version, and clicking the second viewing icon continues to generate new subtitles.

在本公开的一些实施例中,在字幕一级页面501和字幕二级页面502还可以包括字幕开关入口511。当有应用的字幕时,字幕开关入口511的图标呈现选中状态,此时字幕展示在预览视频中,点击关闭后,字幕开关入口511变为非选中态,此时字幕隐藏,不展示在预览视频上,仅供朗读使用。当从展示与隐藏状态互相切换时,点击展开时提示“字幕已展示”,点击隐藏时提示“字幕已隐藏”。In some embodiments of the present disclosure, the subtitles level 1 page 501 and the subtitles level 2 page 502 may further include a subtitles switch entry 511. When there are applied subtitles, the icon of the subtitles switch entry 511 is in a selected state, and the subtitles are displayed in the preview video. After clicking to close, the subtitles switch entry 511 becomes unselected, and the subtitles are hidden and not displayed on the preview video, and are only used for reading. When switching between the display and hidden states, when clicking to expand, it prompts "Subtitles are displayed", and when clicking to hide, it prompts "Subtitles are hidden".

如上文所描述的,如图5F所示,在字幕编辑页面502中可以包括用于对配音进行编辑的配音编辑入口562,通过选中配音编辑入口562进入配音编辑页面507,在配音编辑页面507中可以修改音色。在配音编辑页面507中,默认选中当前应用的音色,例如图5F中音色1为当前应用的音色,呈选中状态。点击切换其他音色则马上预览其效果朗读当前字幕内容。若在配音编辑页面中点击了“禁用”按钮,则不朗读配音。As described above, as shown in FIG5F, the subtitle editing page 502 may include a dubbing editing entry 562 for editing the dubbing. By selecting the dubbing editing entry 562, the dubbing editing page 507 is entered, and the timbre can be modified in the dubbing editing page 507. In the dubbing editing page 507, the currently applied timbre is selected by default. For example, in FIG5F, timbre 1 is the currently applied timbre and is in a selected state. Click to switch to other timbres to immediately preview their effects and read the current subtitle content. If the "Disable" button is clicked in the dubbing editing page, the dubbing will not be read.

在配音编辑页面中支持设置配音音量和视频原声音量,当有配音时,视频原声默认为0。The dubbing editing page supports setting the dubbing volume and the original video sound volume. When there is dubbing, the original video sound defaults to 0.

在本公开的一些实施例中,可以在配音编辑页面中选择是否将设置的配音应用于全局,若配音应用于全局,则所有字幕均生效新的配音效果。In some embodiments of the present disclosure, it can be selected in the dubbing editing page whether to apply the set dubbing globally. If the dubbing is applied globally, the new dubbing effect will take effect on all subtitles.

在本公开的一些实施例中,例如若用户点击了生效图标,则保存新的配音修改,并且配音生效,若未点击生效图标或者点击了退出图标,则不保存配音修改,并且返回到配音编辑页面的上一级页面。In some embodiments of the present disclosure, for example, if the user clicks on the effective icon, the new dubbing modification is saved and the dubbing takes effect. If the effective icon is not clicked or the exit icon is clicked, the dubbing modification is not saved and the user returns to the previous page of the dubbing editing page.

在本公开的一些实施例中,点击音乐入口4312进入音乐编辑页面。在音乐编辑页面可以通过相似音乐推荐音乐。例如,推荐于视频编辑模板中相似度大于预定值的音乐。或者,根据对第二素材和提示信息的理解推荐音乐。In some embodiments of the present disclosure, click on the music entry 4312 to enter the music editing page. In the music editing page, music can be recommended by similar music. For example, music with a similarity greater than a predetermined value in the video editing template is recommended. Alternatively, music is recommended based on the understanding of the second material and the prompt information.

在音乐编辑页面中,用户可以自主选择音乐,并且可以设置音乐的音量。在本公开的一些实施例中,还可以对音乐进行卡点处理。In the music editing page, the user can select music on his own and set the volume of the music. In some embodiments of the present disclosure, the music can also be processed at the point of the music.

在本公开的一些实施例中个,编辑入口窗口还包括:用于对预览视频中的多个优选素材进行选择删除的删除图标。例如,在预览页面的编辑入口窗口431包括“删除”入口,位置在编辑入口窗口431的最后位置处。选择单个素材,在预览页面中点击“删除”图标删除该选择素材。该选择素材被删除后,该选择素材所在的主题中的其他素材自动往前顺延,其他主题的素材不会跳主题顺延。In some embodiments of the present disclosure, the editing entry window also includes: a delete icon for selecting and deleting multiple preferred materials in the preview video. For example, the editing entry window 431 of the preview page includes a "delete" entry, which is located at the last position of the editing entry window 431. Select a single material and click the "delete" icon in the preview page to delete the selected material. After the selected material is deleted, other materials in the theme where the selected material is located will automatically move forward, and materials in other themes will not jump to the theme.

在本公开的一些实施例中,编辑入口窗口431和素材展示窗口421支持左右滑动以展示更多的编辑入口和素材。例如,向左滑动编辑入口窗口431展示出删除图标。In some embodiments of the present disclosure, the editing entry window 431 and the material display window 421 support left and right sliding to display more editing entries and materials. For example, sliding the editing entry window 431 to the left displays a delete icon.

在本公开的一些实施例中,编辑入口窗口还包括重新组织入口,方法还包括:响应于重新组织入口被选择,展示排序面板,排序面板包括第二素材和提示信息;接收对排序面板中的第二素材进行的组织操作,重新组织第二素材。In some embodiments of the present disclosure, the editing entry window also includes a reorganization entry, and the method also includes: in response to the reorganization entry being selected, displaying a sorting panel, the sorting panel including a second material and prompt information; receiving an organization operation on the second material in the sorting panel, and reorganizing the second material.

例如,在预览页面的编辑入口窗口中还包括重新组织入口。在点击重新组织入口之后,展示排序面板,例如排序面板与原结构框架的逻辑相同,除了排序面板包括之前导入的第二素材和提示信息之外,其他与上述结构框架相同,以便修改提示信息和导入第二素材。响应于通过重新组织入口进入排序面板,在该结构框架中展示之前填写的提示信息以及按照各个主题展示各个第二素材,并且文案也按照原文案展示。组织操作例如包括导入素材的修改、添加、删除和拖动排序(能够跨主题排序)以及对提示信息的修改等。接收对结构框架中提示信息和导入素材的修改、添加、删除和拖动排序(能够跨主题排序),并且在接收到生效指令后,保存上述修改,当涉及到提示信息和素材填充内容变化时,算法素材的合成和算法的所有智能效果都要重新生成一遍(包括:素材拼接选取、整体包装合成、智能改写文案、智能生成字幕/配音、智能推荐音乐),并且在生效之后,返回到上一级页面。若用户对结构框架进行修改后,接收到退出指令,则所有修改不生效。For example, the editing entry window of the preview page also includes a reorganization entry. After clicking the reorganization entry, the sorting panel is displayed. For example, the logic of the sorting panel is the same as that of the original structural framework. Except that the sorting panel includes the second material and prompt information imported previously, the other is the same as the above structural framework, so as to modify the prompt information and import the second material. In response to entering the sorting panel through the reorganization entry, the previously filled prompt information is displayed in the structural framework, and each second material is displayed according to each theme, and the copy is also displayed according to the original copy. The organization operation includes, for example, modification, addition, deletion and drag sorting of the imported material (able to sort across themes) and modification of the prompt information. Receive the modification, addition, deletion and drag sorting of the prompt information and the imported material in the structural framework (able to sort across themes), and save the above modifications after receiving the effective instruction. When it comes to changes in the prompt information and material filling content, the synthesis of the algorithm material and all the intelligent effects of the algorithm must be regenerated (including: material splicing selection, overall packaging synthesis, intelligent rewriting of copy, intelligent generation of subtitles/dubbing, intelligent recommendation of music), and after taking effect, return to the previous page. If the user modifies the structure framework and receives an exit command, all modifications will not take effect.

在本公开的一些实施例中,对素材的排序除了通过重新组织入口之外,还可以通过特殊操作(例如,长按某个片段素材)进行重新排序。例如,特殊操作能够支持跨组排序。In some embodiments of the present disclosure, in addition to reorganizing the entry, the order of the materials can also be reordered through special operations (for example, long pressing a certain clip material). For example, the special operation can support cross-group ordering.

在本公开的一些实施例中,还可以对素材进行截取。例如,素材的截取范围默认定位为实际选中范围,支持调整截取范围,并非固定时长。In some embodiments of the present disclosure, the material can also be intercepted. For example, the interception range of the material is positioned as the actual selected range by default, and the interception range can be adjusted instead of a fixed duration.

在本公开的一些实施例中,可以在预览页面中进行新增、删除素材等各种操作,或者通过预览页面中的重新组织入口重新展示视频编辑界面,在视频编辑界面中对影像素材进行增加、删除和排序等操作,并且可以对单个影像素材调整截取时长范围的操作。在通过预览页面中的重新组织入口重新展示的视频编辑界面中,还可以对提示信息进行修改。In some embodiments of the present disclosure, various operations such as adding and deleting materials can be performed on the preview page, or the video editing interface can be re-displayed through the reorganization entrance in the preview page, and operations such as adding, deleting and sorting image materials can be performed in the video editing interface, and the capture time range of a single image material can be adjusted. In the video editing interface re-displayed through the reorganization entrance in the preview page, the prompt information can also be modified.

当影像素材发生变化时(例如,增加、删除和排序素材),对于新导入的影像素材进行分析识别出目标片段和对素材进行分割;对于原有素材(之前导入的素材),继续保留原有处理效果,无需重新处理。若在导入新影像素材的基础上,还对原有素材进行了重新排序,则重新拼接得到新的视频部分。基于新的影像素材,重新包装,包括相应的滤镜、音乐、文本、转场效果,请参考上文的描述。When the video material changes (for example, adding, deleting and sorting the material), the newly imported video material is analyzed to identify the target segment and segment the material; for the original material (previously imported material), the original processing effect is retained without reprocessing. If the original material is reordered on the basis of importing the new video material, the new video part is re-joined. Based on the new video material, re-packaging, including corresponding filters, music, text, transition effects, please refer to the description above.

在本公开的一些实施例中,对于新导入的影像素材,字幕和配音需要重新生成,对于原有素材,继续保留原有处理效果,无需字幕和配音重新生成,对于删减素材,则对应字幕/配音也对应删除,对于新的影像素材排序,则字幕和配音按照新排序重新生成。In some embodiments of the present disclosure, for newly imported image materials, subtitles and dubbing need to be regenerated. For original materials, the original processing effects continue to be retained and there is no need to regenerate subtitles and dubbing. For deleted materials, the corresponding subtitles/dubbing are also deleted. For new image material sorting, the subtitles and dubbing are regenerated according to the new sorting.

在本公开的一些实施例中,对于新导入的影像素材无需重新生成文案。对于原有素材的文案,继续保留原有处理效果,无需重新处理,对于删减素材,导致某主题完全没有素材,则该主题对应文案也删除。对于新的影像素材排序,则文案展示位置也按照新排序重新处理。In some embodiments of the present disclosure, there is no need to regenerate text for newly imported video materials. For the text of the original materials, the original processing effect is retained without reprocessing. If the deleted materials result in a certain theme having no materials at all, the text corresponding to the theme is also deleted. For the new image material sorting, the text display position is also reprocessed according to the new sorting.

在本公开的一些实施例中,当素材仅发生截取时长变化时,按照用户截取范围选取,不会再做新的高光识别,并且按照新的截取素材进行重新拼接。基于最新素材时长,按照上文描述的音乐、滤镜和文案的处理方式对素材重新包装。例如,重新生成的视频编辑结果中的音乐、滤镜和文案仅改变播放时长和生效时长,不改变内容本身。当影像素材仅发生截取时长变化时,按照新的影像素材时长生成新的字幕和配音,当用户单个片段时长小于3s时,则不为该片段配字幕和配音,此时现有的字幕和配音可以不保留,因为太短了很难展示和播放。当影像素材仅发生截取时长变化时,无需重新生成文案,对于全局主题生效的文本,则素材有多少长,文本展示时长就多长;对于根据主题部分生效的文案,如果影像素材变长时,则文案还是按照原时长展示;如果素材变短时,则文本按照素材变短后的时长展示。In some embodiments of the present disclosure, when the material only changes in the interception time, it is selected according to the user's interception range, no new highlight recognition will be performed, and the new intercepted material will be re-stitched. Based on the latest material duration, the material is repackaged according to the processing method of the music, filter and copy described above. For example, the music, filter and copy in the regenerated video editing result only change the playback time and the effective time, and do not change the content itself. When the image material only changes in the interception time, new subtitles and dubbing are generated according to the new image material duration. When the user's single clip duration is less than 3s, the clip is not subtitled and dubbed. At this time, the existing subtitles and dubbing can be not retained because it is too short to display and play. When the image material only changes in the interception time, there is no need to regenerate the copy. For the text that takes effect on the global theme, the length of the material is as long as the text display time; for the copy that takes effect according to the theme part, if the image material becomes longer, the copy is still displayed according to the original duration; if the material becomes shorter, the text is displayed according to the shortened duration of the material.

在本公开的一些实施例中,若提示信息的主题和风格发生变化,所有字幕和/或配音都会刷新。In some embodiments of the present disclosure, if the theme and style of the prompt information changes, all subtitles and/or dubbing will be refreshed.

在本公开的一些实施例中,当字幕和配音发生变化时,若字幕比修改之前的字幕长,则继续按照修改之前的字幕的时长范围展示,字幕自动换行,换行逻辑对齐当前的字幕换行。若字幕比修改之前的字幕短,则继续按照字幕原来的时长范围展示,字幕有多少展示多少。如果字幕时长范围横跨了多个片段,导致多个字幕重叠,则可以按照重新效果展示。In some embodiments of the present disclosure, when the subtitles and dubbing are changed, if the subtitles are longer than the subtitles before the modification, they will continue to be displayed according to the duration range of the subtitles before the modification, and the subtitles will automatically wrap, and the line wrap logic will be aligned with the current subtitle wrap. If the subtitles are shorter than the subtitles before the modification, they will continue to be displayed according to the original duration range of the subtitles, and as many subtitles as there are will be displayed. If the duration range of the subtitles spans multiple segments, resulting in multiple subtitles overlapping, they can be displayed according to the re-effect.

当字幕过长,导致对应的配音也过长时,按照配音朗读按变长后的长度正常朗读,支持导入影像素材的全部播放。当一段片段有多段配音音频重叠时,保持多段音频一起播放。当字幕变短导致对应的配音也变短时,按照配音朗读变短后的长度正常朗读。When the subtitle is too long, causing the corresponding dubbing to be too long, it will be read normally according to the length of the dubbing reading, and support the full playback of the imported video material. When a clip has multiple dubbing audios overlapping, keep playing the multiple audios together. When the subtitle becomes shorter, causing the corresponding dubbing to be shorter, it will be read normally according to the length of the shortened dubbing reading.

在本公开的一些实施例中,存在算法重新合成视频编辑结果过程与用户编辑出现冲突,例如在合成视频编辑结果的过程中,用户对素材、配音等进行编辑操作。当用户再一次修改的素材和当前算法处理的素材重合时,则优先展示用户主动修改后的素材效果,如用户已经对某段素材手动截取时长范围,则按照其时长选取,而不是按照上述描述的选择目标片段。若在算法重新合成视频编辑结果过程中,响应于对字幕编辑入口或者配音编辑入口的选择操作,在预览页面显示通知消息以告知用户正在生成字幕和配音。等待视频编辑结果重新合成完毕,预览页面正常显示并支持操作。若在算法重新合成视频编辑结果过程中,上屏字幕被选中,则进行正常编辑修改,智能字幕进度正常加载。当用户确认保存修改,则此时优先展示该片段用户修改字幕内容,该段字幕智能生成进度停止。当用户没有保存修改,则智能字幕正常加载并展示。当改写文案还未生成时,此时文本内容展示为原模板文案;若二次编辑后触发的重新合成,当新的改写文案还未生成时,展示当前的文案。用户可以正常编辑修改内容,改写文案进行正常加载,有效展示该片段用户修改的文本,该段智能改写文案进度停止。In some embodiments of the present disclosure, there is a conflict between the algorithm re-synthesizing the video editing results and the user editing. For example, in the process of synthesizing the video editing results, the user performs editing operations on the material, dubbing, etc. When the material modified by the user again coincides with the material processed by the current algorithm, the material effect after the user's active modification is displayed first. For example, if the user has manually intercepted the duration range of a certain segment of material, it is selected according to its duration, rather than selecting the target segment as described above. If in the process of the algorithm re-synthesizing the video editing results, in response to the selection operation of the subtitle editing entrance or the dubbing editing entrance, a notification message is displayed on the preview page to inform the user that subtitles and dubbing are being generated. Wait for the video editing results to be re-synthesized, and the preview page is displayed normally and supports operations. If the upper screen subtitle is selected in the process of the algorithm re-synthesizing the video editing results, normal editing and modification are performed, and the intelligent subtitle progress is loaded normally. When the user confirms to save the modification, the subtitle content modified by the user in the segment is displayed first, and the intelligent generation progress of the subtitle segment stops. When the user does not save the modification, the intelligent subtitle is loaded and displayed normally. When the rewritten copy has not been generated, the text content is displayed as the original template copy; if the re-synthesis is triggered after the second editing, when the new rewritten copy has not been generated, the current copy is displayed. The user can edit and modify the content normally, and the rewritten copy is loaded normally, effectively displaying the text modified by the user in this segment, and the progress of this segment of intelligent rewriting copy stops.

在本公开的一些实施例中,视频编辑模板可以由工程文件转化得到。例如,即通过剪辑工具工程文件导出后,选择“发布模板”。通过在发布链路和vimo平台添加视频模板的结构套路信息,通过模板剪辑特点,识别在什么节点拆分各组镜头以及各组镜头对应主题。中视频模板发布后,可以在vimo后台存储与管理。例如,通过在vimo上添加中视频模板标识,通过运营手动标识判断该模板是否为中视频模板类型。例如,素材类别添加“中视频模板”类型,在元数据中新增“中视频模板”类型的属性字段。In some embodiments of the present disclosure, a video editing template can be converted from a project file. For example, after exporting the project file through the editing tool, select "Publish Template". By adding the structural routine information of the video template in the publishing link and the vimo platform, and through the template editing characteristics, it is possible to identify at which nodes to split each group of shots and the corresponding themes of each group of shots. After the medium video template is published, it can be stored and managed in the vimo background. For example, by adding a medium video template identifier on vimo, it is determined whether the template is a medium video template type through manual identification by the operator. For example, the "medium video template" type is added to the material category, and a new attribute field of the "medium video template" type is added to the metadata.

对于模板结构信息的输入,例如在vimo平台上依次通过模板管理入口、模板素材管理入口和操作详情入口来增加“修改结构信息”入口。在修改结构信息的页面中,主要划分为三大部分,分别为整体说明、智能能力、模板结构拆解。整体说明部分用于对视频编辑模板进行整体性说明,不限制输入最大字符数,当在应用程序页面展示时,如果超过设计范围,则显示渐隐的蒙层。For input of template structure information, for example, on the vimo platform, the "Modify Structure Information" entry is added through the template management entry, template material management entry, and operation details entry. The page for modifying structure information is mainly divided into three parts, namely overall description, intelligent capabilities, and template structure disassembly. The overall description part is used to provide an overall description of the video editing template. There is no limit on the maximum number of characters to be entered. When displayed on the application page, if it exceeds the design range, a fading mask will be displayed.

智能能力部分支持运营勾选智能能力,包括智能添加字幕、智能添加配音。模板结构拆解部分按照模板结构情况,自行拆解为多个部分,每个部分填写内容包括:标题、描述、起止时间、起止片段、用户素材总时长上限、默认填充和默认占位。标题不限制输入最大字符数,当在应用程序页面展示超过设计范围时,显示渐隐的蒙层。描述也不限制输入最大字符数,当在应用程序页面展示超过设计范围时,显示渐隐的蒙层。起止时间用于描述视频片段从第几秒开始到第几秒结束。起止片段用于描述从第几个片段开始到第几个片段结束。用户素材总时长上限用于说明导入素材相对长时,算法可以参考上限,让最终截取合成的素材不要过长,从而达到最优的效果。默认填充是指即在该组部分没有导入任何素材的情况下,从已导入素材里选择一段进行填充,保证整体预览效果完整性。默认占位是指在该组部分没有导入任何素材,进入预览页面时,该组的素材展示由空白占位。若在该模板结构拆解部分进行确认之后,弹出二次确认弹窗,点击二次弹窗后,正式生效上线该视频编辑模板。The intelligent capability section supports operators to check intelligent capabilities, including intelligent subtitle addition and intelligent dubbing. The template structure disassembly section is automatically disassembled into multiple parts according to the template structure. The content of each part includes: title, description, start and end time, start and end segments, upper limit of the total duration of user materials, default padding and default placeholder. There is no limit on the maximum number of characters for the title. When the display on the application page exceeds the design range, a fade-out mask is displayed. There is no limit on the maximum number of characters for the description. When the display on the application page exceeds the design range, a fade-out mask is displayed. The start and end time are used to describe the video clip from the second to the second. The start and end segments are used to describe the segment from the beginning to the end. The upper limit of the total duration of user materials is used to explain that when the imported material is relatively long, the algorithm can refer to the upper limit so that the final intercepted and synthesized material is not too long, so as to achieve the best effect. Default padding means that when no material is imported into this group, a segment is selected from the imported material for filling to ensure the integrity of the overall preview effect. The default placeholder means that no material is imported into this group. When entering the preview page, the material display of this group is blank. If you confirm the template structure disassembly part, a secondary confirmation pop-up window will pop up. After clicking the secondary pop-up window, the video editing template will be officially launched.

在本公开的一些实施例中,只要有一个素材填入,则“预览”按钮点亮,可以进入预览页面,当素材为空时,“预览”按钮置灰,无法进入预览页面。“预览”按钮可以固定在页面下方不跟随滑动,当其他内容超出一屏时,需要下滑查看更多。In some embodiments of the present disclosure, as long as there is a material filled in, the "Preview" button will light up and you can enter the preview page. When the material is empty, the "Preview" button will be grayed out and you cannot enter the preview page. The "Preview" button can be fixed at the bottom of the page and will not follow the slide. When other content exceeds one screen, you need to slide down to view more.

图6为本公开一些实施例提供的一种视频编辑装置的示意框图。如图6所示,该视频编辑装置600包括第一展示单元110、第二展示单元120、结果生成单元130。例如,该视频编辑装置600可以应用于用户终端中,也可以应用于任意的需要实现设计素材的预览的设备或系统中,本公开的实施例对此不作限制。FIG6 is a schematic block diagram of a video editing device provided by some embodiments of the present disclosure. As shown in FIG6, the video editing device 600 includes a first display unit 110, a second display unit 120, and a result generation unit 130. For example, the video editing device 600 can be applied to a user terminal, or to any device or system that needs to implement a preview of a design material, and the embodiments of the present disclosure are not limited to this.

第一展示单元110配置为响应于对视频编辑模板的触发操作,展示视频编辑界面,其中,所述视频编辑模板包括结构信息和编辑信息,所述结构信息用于指示至少一个视频段落,所述编辑信息用于指示在所述至少一个视频段落中施加的至少一个编辑效果,所述视频编辑界面包括与所述至少一个视频段落分别对应的段落编辑区域。例如,第一展示单元110可以执行如图1A所示的视频编辑方法的步骤S10。The first display unit 110 is configured to display a video editing interface in response to a trigger operation on a video editing template, wherein the video editing template includes structural information and editing information, the structural information is used to indicate at least one video segment, the editing information is used to indicate at least one editing effect applied in the at least one video segment, and the video editing interface includes segment editing areas corresponding to the at least one video segment. For example, the first display unit 110 can perform step S10 of the video editing method shown in FIG1A.

第二展示单元120配置为响应于针对目标视频段落的素材导入操作,在所述目标视频段落的段落编辑区域展示至少一个影像素材的标识,所述至少一个视频段落包括所述目标视频段落,所述至少一个影像素材为基于所述素材导入操作被导入在所述目标视频段落中的素材例如,第二展示单元120可以执行如图1A所示的视频编辑方法的步骤S20。The second display unit 120 is configured to display an identifier of at least one image material in a segment editing area of a target video segment in response to a material import operation for the target video segment, wherein the at least one video segment includes the target video segment, and the at least one image material is a material imported into the target video segment based on the material import operation. For example, the second display unit 120 can execute step S20 of the video editing method as shown in Figure 1A.

结果生成单元130配置为响应于编辑处理的触发操作,根据所述视频编辑模板和所述至少一个影像素材,生成视频编辑结果,其中,所述视频编辑结果对应于所述目标视频段落的部分为目标编辑效果被施加在所述至少一个影像素材上的编辑结果,所述目标编辑效果匹配所述至少一个编辑效果中位于所述目标视频段落内的编辑效果。例如,页面展示单元130可以执行如图1A所示的视频编辑方法的步骤S30。The result generation unit 130 is configured to generate a video editing result according to the video editing template and the at least one video material in response to a triggering operation of the editing process, wherein the portion of the video editing result corresponding to the target video segment is an editing result in which a target editing effect is applied to the at least one video material, and the target editing effect matches an editing effect of the at least one editing effect located within the target video segment. For example, the page display unit 130 may execute step S30 of the video editing method as shown in FIG1A.

该视频编辑装置不仅降低了对素材的要求,使得视频模板的使用更加灵活,而且该方法除了适用于短视频之外还适用于中视频。The video editing device not only reduces the requirements for materials, making the use of video templates more flexible, but also the method is applicable to medium videos in addition to short videos.

例如,第一展示单元110、第二展示单元120、结果生成单元130可以为硬件、软件、固件以及它们的任意可行的组合。例如,第一展示单元110、第二展示单元120、结果生成单元130可以为专用或通用的电路、芯片或装置等,也可以为处理器和存储器的结合。关于第一展示单元110、第二展示单元120、结果生成单元130的具体实现形式,本公开的实施例对此不作限制。For example, the first display unit 110, the second display unit 120, and the result generation unit 130 may be hardware, software, firmware, or any feasible combination thereof. For example, the first display unit 110, the second display unit 120, and the result generation unit 130 may be a dedicated or general circuit, chip, or device, or may be a combination of a processor and a memory. The embodiments of the present disclosure do not limit the specific implementation of the first display unit 110, the second display unit 120, and the result generation unit 130.

需要说明的是,本公开的实施例中,视频编辑装置600的各个单元与前述的视频编辑方法的各个步骤对应,关于该视频编辑装置600的具体功能可以参考上文中视频编辑方法的相关描述,此处不再赘述。图6所示的视频编辑装置600的组件和结构只是示例性的,而非限制性的,根据需要,该视频编辑装置600还可以包括其他组件和结构。It should be noted that in the embodiment of the present disclosure, each unit of the video editing device 600 corresponds to each step of the aforementioned video editing method. For the specific functions of the video editing device 600, reference can be made to the relevant description of the video editing method above, which will not be repeated here. The components and structure of the video editing device 600 shown in FIG6 are only exemplary and not restrictive. The video editing device 600 may also include other components and structures as needed.

图7为本公开一些实施例提供的一种电子设备的示意框图。如图7所示,电子设备200包括处理器210和存储器220。存储器220用于存储非暂时性计算机可读指令(例如一个或多个计算机程序模块)。处理器210用于运行非暂时性计算机可读指令,非暂时性计算机可读指令被处理器210运行时可以执行上文所述的视频编辑方法中的一个或多个步骤。存储器220和处理器210可以通过总线系统和/或其它形式的连接机构(未示出)互连。FIG7 is a schematic block diagram of an electronic device provided by some embodiments of the present disclosure. As shown in FIG7, the electronic device 200 includes a processor 210 and a memory 220. The memory 220 is used to store non-transitory computer-readable instructions (e.g., one or more computer program modules). The processor 210 is used to run non-transitory computer-readable instructions, and when the non-transitory computer-readable instructions are run by the processor 210, one or more steps in the video editing method described above can be executed. The memory 220 and the processor 210 can be interconnected via a bus system and/or other forms of connection mechanisms (not shown).

例如,处理器210可以是中央处理单元(CPU)、数字信号处理器(DSP)或者具有数据处理能力和/或程序执行能力的其它形式的处理单元,例如现场可编程门阵列(FPGA)等;例如,中央处理单元(CPU)可以为X86或ARM架构等。处理器210可以为通用处理器或专用处理器,可以控制电子设备200中的其它组件以执行期望的功能。For example, the processor 210 may be a central processing unit (CPU), a digital signal processor (DSP), or other forms of processing units with data processing capabilities and/or program execution capabilities, such as a field programmable gate array (FPGA), etc.; for example, the central processing unit (CPU) may be an X86 or ARM architecture, etc. The processor 210 may be a general-purpose processor or a dedicated processor, and may control other components in the electronic device 200 to perform desired functions.

例如,存储器220可以包括一个或多个计算机程序产品的任意组合,计算机程序产品可以包括各种形式的计算机可读存储介质,例如易失性存储器和/或非易失性存储器。易失性存储器例如可以包括随机存取存储器(RAM)和/或高速缓冲存储器(cache)等。非易失性存储器例如可以包括只读存储器(ROM)、硬盘、可擦除可编程只读存储器(EPROM)、便携式紧致盘只读存储器(CD-ROM)、USB存储器、闪存等。在计算机可读存储介质上可以存储一个或多个计算机程序模块,处理器210可以运行一个或多个计算机程序模块,以实现电子设备200的各种功能。在计算机可读存储介质中还可以存储各种应用程序和各种数据以及应用程序使用和/或产生的各种数据等。For example, the memory 220 may include any combination of one or more computer program products, and the computer program product may include various forms of computer-readable storage media, such as volatile memory and/or non-volatile memory. Volatile memory may include, for example, random access memory (RAM) and/or cache memory (cache), etc. Non-volatile memory may include, for example, read-only memory (ROM), hard disk, erasable programmable read-only memory (EPROM), portable compact disk read-only memory (CD-ROM), USB memory, flash memory, etc. One or more computer program modules may be stored on the computer-readable storage medium, and the processor 210 may run one or more computer program modules to implement various functions of the electronic device 200. Various applications and various data, as well as various data used and/or generated by the application, etc. may also be stored in the computer-readable storage medium.

需要说明的是,本公开的实施例中,电子设备200的具体功能和技术效果可以参考上文中关于视频编辑方法的描述,此处不再赘述。It should be noted that, in the embodiment of the present disclosure, the specific functions and technical effects of the electronic device 200 can refer to the above description of the video editing method, which will not be repeated here.

图8为本公开一些实施例提供的另一种电子设备的示意框图。该电子设备300例如适于用来实施本公开实施例提供的视频编辑方法。电子设备300可以是用户终端等。需要注意的是,图8示出的电子设备300仅仅是一个示例,其不会对本公开实施例的功能和使用范围带来任何限制。FIG8 is a schematic block diagram of another electronic device provided by some embodiments of the present disclosure. The electronic device 300 is suitable for implementing the video editing method provided by the embodiments of the present disclosure, for example. The electronic device 300 may be a user terminal, etc. It should be noted that the electronic device 300 shown in FIG8 is only an example, and does not impose any limitation on the functions and scope of use of the embodiments of the present disclosure.

如图8所示,电子设备300可以包括处理装置(例如中央处理器、图形处理器等)310,其可以根据存储在只读存储器(ROM)320中的程序或者从存储装置380加载到随机访问存储器(RAM)330中的程序而执行各种适当的动作和处理。在RAM 330中,还存储有电子设备300操作所需的各种程序和数据。处理装置310、ROM 320以及RAM 330通过总线340彼此相连。输入/输出(I/O)接口350也连接至总线340。As shown in FIG8 , the electronic device 300 may include a processing device (e.g., a central processing unit, a graphics processing unit, etc.) 310, which can perform various appropriate actions and processes according to a program stored in a read-only memory (ROM) 320 or a program loaded from a storage device 380 to a random access memory (RAM) 330. In the RAM 330, various programs and data required for the operation of the electronic device 300 are also stored. The processing device 310, the ROM 320, and the RAM 330 are connected to each other via a bus 340. An input/output (I/O) interface 350 is also connected to the bus 340.

通常,以下装置可以连接至I/O接口350:包括例如触摸屏、触摸板、键盘、鼠标、摄像头、麦克风、加速度计、陀螺仪等的输入装置360;包括例如液晶显示器(LCD)、扬声器、振动器等的输出装置370;包括例如磁带、硬盘等的存储装置380;以及通信装置390。通信装置390可以允许电子设备300与其他电子设备进行无线或有线通信以交换数据。虽然图9示出了具有各种装置的电子设备300,但应理解的是,并不要求实施或具备所有示出的装置,电子设备300可以替代地实施或具备更多或更少的装置。Typically, the following devices may be connected to the I/O interface 350: an input device 360 including, for example, a touch screen, a touchpad, a keyboard, a mouse, a camera, a microphone, an accelerometer, a gyroscope, etc.; an output device 370 including, for example, a liquid crystal display (LCD), a speaker, a vibrator, etc.; a storage device 380 including, for example, a magnetic tape, a hard disk, etc.; and a communication device 390. The communication device 390 may allow the electronic device 300 to communicate with other electronic devices wirelessly or by wire to exchange data. Although FIG. 9 shows an electronic device 300 having various devices, it should be understood that it is not required to implement or have all of the devices shown, and the electronic device 300 may alternatively implement or have more or fewer devices.

例如,根据本公开的实施例,图1A所示的视频编辑方法可以被实现为计算机软件程序。例如,本公开的实施例包括一种计算机程序产品,其包括承载在非暂态计算机可读介质上的计算机程序,该计算机程序包括用于执行上述视频编辑方法的程序代码。在这样的实施例中,该计算机程序可以通过通信装置390从网络上被下载和安装,或者从存储装置380安装,或者从ROM 320安装。在该计算机程序被处理装置310执行时,可以执行本公开实施例提供的视频编辑方法中限定的功能。For example, according to an embodiment of the present disclosure, the video editing method shown in FIG. 1A can be implemented as a computer software program. For example, an embodiment of the present disclosure includes a computer program product, which includes a computer program carried on a non-transitory computer-readable medium, and the computer program includes a program code for executing the above-mentioned video editing method. In such an embodiment, the computer program can be downloaded and installed from a network through a communication device 390, or installed from a storage device 380, or installed from a ROM 320. When the computer program is executed by the processing device 310, the functions defined in the video editing method provided in the embodiment of the present disclosure can be executed.

本公开至少一个实施例还提供一种存储介质,用于存储非暂时性计算机可读指令,当该非暂时性计算机可读指令由计算机执行时可以实现本公开任一实施例所述的视频编辑方法。利用该存储介质,不仅降低了对素材的要求,使得视频模板的使用更加灵活,而且该方法除了适用于短视频之外还适用于中视频。At least one embodiment of the present disclosure further provides a storage medium for storing non-transitory computer-readable instructions, which can implement the video editing method described in any embodiment of the present disclosure when the non-transitory computer-readable instructions are executed by a computer. Using the storage medium not only reduces the requirements for materials, making the use of video templates more flexible, but also the method is applicable to medium videos in addition to short videos.

图9为本公开一些实施例提供的一种存储介质的示意图。如图9所示,存储介质400用于存储非暂时性计算机可读指令410。例如,当非暂时性计算机可读指令410由计算机执行时可以执行根据上文所述的视频编辑方法中的一个或多个步骤。Fig. 9 is a schematic diagram of a storage medium provided by some embodiments of the present disclosure. As shown in Fig. 9, the storage medium 400 is used to store non-transitory computer-readable instructions 410. For example, when the non-transitory computer-readable instructions 410 are executed by a computer, one or more steps in the video editing method described above can be executed.

例如,该存储介质400可以应用于上述电子设备200中。例如,存储介质400可以为图7所示的电子设备200中的存储器220。例如,关于存储介质400的相关说明可以参考图7所示的电子设备200中的存储器220的相应描述,此处不再赘述。For example, the storage medium 400 may be applied to the electronic device 200. For example, the storage medium 400 may be the memory 220 in the electronic device 200 shown in FIG7. For example, the relevant description of the storage medium 400 may refer to the corresponding description of the memory 220 in the electronic device 200 shown in FIG7, and will not be repeated here.

在上文中,结合图1A至图9描述了本公开实施例提供的视频编辑方法、视频编辑装置、电子设备及存储介质。本公开实施例提供的视频编辑方法不仅降低了对素材的要求,使得视频模板的使用更加灵活,而且该方法除了适用于短视频之外还适用于中视频。In the above, the video editing method, video editing device, electronic device and storage medium provided by the embodiments of the present disclosure are described in combination with Figures 1A to 9. The video editing method provided by the embodiments of the present disclosure not only reduces the requirements for materials, making the use of video templates more flexible, but also is applicable to medium videos in addition to short videos.

需要说明的是,本公开上述的存储介质(计算机可读介质)可以是计算机可读信号介质或者非暂时性计算机可读存储介质或者是上述两者的任意组合。非暂时性计算机可读存储介质例如可以是,但不限于,电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。非暂时性计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开中,非暂时性计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本公开中,计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读信号介质还可以是非暂时性计算机可读存储介质以外的任何计算机可读介质,该计算机可读信号介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:电线、光缆、RF(射频)等,或者上述的任意合适的组合。It should be noted that the storage medium (computer-readable medium) disclosed above may be a computer-readable signal medium or a non-transitory computer-readable storage medium or any combination of the above two. Non-transitory computer-readable storage media may be, for example, but not limited to, electrical, magnetic, optical, electromagnetic, infrared, or semiconductor systems, devices or devices, or any combination of the above. More specific examples of non-transitory computer-readable storage media may include, but are not limited to: an electrical connection with one or more wires, a portable computer disk, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disk read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above. In the present disclosure, a non-transitory computer-readable storage medium may be any tangible medium containing or storing a program, which may be used by or in combination with an instruction execution system, device or device. In the present disclosure, a computer-readable signal medium may include a data signal propagated in a baseband or as part of a carrier wave, which carries a computer-readable program code. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above. Computer readable signal media may also be any computer readable medium other than a non-transitory computer readable storage medium, which may send, propagate, or transmit a program for use by or in conjunction with an instruction execution system, apparatus, or device. The program code contained on the computer readable medium may be transmitted using any suitable medium, including but not limited to: wires, optical cables, RF (radio frequency), etc., or any suitable combination of the above.

在一些实施方式中,客户端、服务器可以利用诸如超文本传输协议(Hyper Text Transfer Protocol,HTTP)之类的任何当前已知或未来研发的网络协议进行通信,并且可以与任意形式或介质的数字数据通信(例如,通信网络)互连。通信网络的示例包括局域网(LAN),广域网(WAN),网际网(例如,互联网)以及端对端网络(例如,ad hoc端对端网络),以及任何当前已知或未来研发的网络。In some embodiments, the client and server can communicate using any currently known or future developed network protocol, such as Hyper Text Transfer Protocol (HTTP), and can be interconnected with any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include local area networks (LANs), wide area networks (WANs), internets (e.g., the Internet), and peer-to-peer networks (e.g., ad hoc peer-to-peer networks), as well as any currently known or future developed networks.

上述计算机可读介质可以是上述电子设备中所包含的;也可以是单独存在,而未装配入该电子设备中。The computer-readable medium may be included in the electronic device, or may exist independently without being incorporated into the electronic device.

上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被该电子设备执行时,使得该电子设备执行根据上文所述的视频编辑方法中的一个或多个步骤。可以以一种或多种程序设计语言或其组合来编写用于执行本公开的操作的计算机程序代码,上述程序设计语言包括但不限于面向对象的程序设计语言,诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言,诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络,例如局域网(LAN)或广域网(WAN),连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。The computer-readable medium carries one or more programs, and when the one or more programs are executed by the electronic device, the electronic device performs one or more steps in the video editing method described above. The computer program code for performing the operation of the present disclosure can be written in one or more programming languages or a combination thereof, and the programming language includes but is not limited to object-oriented programming languages, such as Java, Smalltalk, C++, and also includes conventional procedural programming languages, such as "C" language or similar programming languages. The program code can be executed completely on the user's computer, partially on the user's computer, as an independent software package, partially on the user's computer and partially on the remote computer, or completely on the remote computer or server. In the case of a remote computer, the remote computer can be connected to the user's computer through any type of network, such as a local area network (LAN) or a wide area network (WAN), or can be connected to an external computer (for example, using an Internet service provider to connect through the Internet).

附图中的流程图和框图,图示了按照本公开各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这根据所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flow chart and block diagram in the accompanying drawings illustrate the possible architecture, function and operation of the system, method and computer program product according to various embodiments of the present disclosure. In this regard, each square box in the flow chart or block diagram can represent a module, a program segment or a part of a code, and the module, the program segment or a part of the code contains one or more executable instructions for realizing the specified logical function. It should also be noted that in some implementations as replacements, the functions marked in the square box can also occur in a sequence different from that marked in the accompanying drawings. For example, two square boxes represented in succession can actually be executed substantially in parallel, and they can sometimes be executed in the opposite order, depending on the functions involved. It should also be noted that each square box in the block diagram and/or flow chart, and the combination of the square boxes in the block diagram and/or flow chart can be implemented with a dedicated hardware-based system that performs a specified function or operation, or can be implemented with a combination of dedicated hardware and computer instructions.

描述于本公开实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现。其中,单元的名称在某种情况下并不构成对该单元本身的限定。The units involved in the embodiments described in the present disclosure may be implemented by software or hardware, wherein the name of a unit does not, in some cases, limit the unit itself.

本文中以上描述的功能可以至少部分地由一个或多个硬件逻辑部件来执行。例如,非限制性地,可以使用的示范类型的硬件逻辑部件包括:现场可编程门阵列(FPGA)、专用集成电路(ASIC)、专用标准产品(ASSP)、片上系统(SOC)、复杂可编程逻辑设备(CPLD)等。The functions described above herein may be performed at least in part by one or more hardware logic components. For example, without limitation, exemplary types of hardware logic components that may be used include: field programmable gate arrays (FPGAs), application specific integrated circuits (ASICs), application specific standard products (ASSPs), systems on chips (SOCs), complex programmable logic devices (CPLDs), etc.

在本公开中,机器可读介质可以是有形的介质,其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的程序。机器可读介质可以是机器可读信号介质或机器可读储存介质。机器可读介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备,或者上述内容的任何合适组合。机器可读存储介质的更具体示例会包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦除可编程只读存储器(EPROM或快闪存储器)、光纤、便捷式紧凑盘只读存储器(CD-ROM)、光学储存设备、磁储存设备、或上述内容的任何合适组合。In the present disclosure, a machine-readable medium may be a tangible medium that may contain or store a program for use by or in conjunction with an instruction execution system, device, or equipment. A machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. A machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, device, or equipment, or any suitable combination of the foregoing. A more specific example of a machine-readable storage medium may include an electrical connection based on one or more lines, a portable computer disk, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disk read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.

以上描述仅为本公开的部分实施例以及对所运用技术原理的说明。本领域技术人员应当理解,本公开中所涉及的公开范围,并不限于上述技术特征的特定组合而成的技术方案,同时也应涵盖在不脱离上述公开构思的情况下,由上述技术特征或其等同特征进行任意组合而形成的其它技术方案。例如上述特征与本公开中公开的(但不限于)具有类似功能的技术特征进行互相替换而形成的技术方案。The above description is only a partial embodiment of the present disclosure and an explanation of the technical principles used. Those skilled in the art should understand that the scope of disclosure involved in the present disclosure is not limited to the technical solutions formed by a specific combination of the above technical features, but should also cover other technical solutions formed by any combination of the above technical features or their equivalent features without departing from the above disclosed concept. For example, the above features are replaced with the technical features with similar functions disclosed in the present disclosure (but not limited to) by each other.

此外,虽然采用特定次序描绘了各操作,但是这不应当理解为要求这些操作以所示出的特定次序或以顺序次序来执行。在一定环境下,多任务和并行处理可能是有利的。同样地,虽然在上面论述中包含了若干具体实现细节,但是这些不应当被解释为对本公开的范围的限制。在单独的实施例的上下文中描述的某些特征还可以组合地实现在单个实施例中。相反地,在单个实施例的上下文中描述的各种特征也可以单独地或以任何合适的子组合的方式实现在多个实施例中。In addition, although each operation is described in a specific order, this should not be understood as requiring these operations to be performed in the specific order shown or in a sequential order. Under certain circumstances, multitasking and parallel processing may be advantageous. Similarly, although some specific implementation details are included in the above discussion, these should not be interpreted as limiting the scope of the present disclosure. Some features described in the context of a separate embodiment can also be implemented in a single embodiment in combination. On the contrary, the various features described in the context of a single embodiment can also be implemented in multiple embodiments individually or in any suitable sub-combination mode.

尽管已经采用特定于结构特征和/或方法逻辑动作的语言描述了本主题,但是应当理解所附权利要求书中所限定的主题未必局限于上面描述的特定特征或动作。相反,上面所描述的特定特征和动作仅仅是实现权利要求书的示例形式。Although the subject matter has been described in language specific to structural features and/or methodological logical actions, it should be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or actions described above. On the contrary, the specific features and actions described above are merely example forms of implementing the claims.

Claims (24)

一种视频编辑方法,包括:A video editing method, comprising: 响应于对视频编辑模板的触发操作,展示视频编辑界面,其中,所述视频编辑模板包括结构信息和编辑信息,所述结构信息用于指示至少一个视频段落,所述编辑信息用于指示在所述至少一个视频段落中施加的至少一个编辑效果,所述视频编辑界面包括与所述至少一个视频段落分别对应的段落编辑区域;In response to a triggering operation on a video editing template, displaying a video editing interface, wherein the video editing template includes structure information and editing information, the structure information is used to indicate at least one video segment, the editing information is used to indicate at least one editing effect applied to the at least one video segment, and the video editing interface includes segment editing areas corresponding to the at least one video segment respectively; 响应于针对目标视频段落的素材导入操作,在所述目标视频段落的段落编辑区域展示至少一个影像素材的标识,所述至少一个视频段落包括所述目标视频段落,所述至少一个影像素材为基于所述素材导入操作被导入在所述目标视频段落中的素材;以及In response to a material import operation for a target video segment, displaying an identifier of at least one image material in a segment editing area of the target video segment, the at least one video segment including the target video segment, and the at least one image material being a material imported into the target video segment based on the material import operation; and 响应于编辑处理的触发操作,根据所述视频编辑模板和所述至少一个影像素材,生成视频编辑结果,其中,所述视频编辑结果对应于所述目标视频段落的部分为基于目标编辑效果和所述至少一个影像素材得到的编辑结果,所述目标编辑效果匹配所述至少一个编辑效果中位于所述目标视频段落内的编辑效果。In response to a triggering operation of the editing process, a video editing result is generated according to the video editing template and the at least one image material, wherein the part of the video editing result corresponding to the target video segment is an editing result obtained based on a target editing effect and the at least one image material, and the target editing effect matches the editing effect of the at least one editing effect located within the target video segment. 根据权利要求1所述的方法,其中,所述响应于编辑处理的触发操作,根据所述视频编辑模板和所述至少一个影像素材,生成视频编辑结果,包括:The method according to claim 1, wherein the step of generating a video editing result according to the video editing template and the at least one video material in response to the triggering operation of the editing process comprises: 响应于所述编辑处理的触发操作,按照匹配策略,对所述编辑效果进行调整得到所述目标编辑效果;In response to the triggering operation of the editing process, adjusting the editing effect according to the matching strategy to obtain the target editing effect; 根据所述视频编辑模板,将所述目标编辑效果施加到所述至少一个影像素材的至少部分上得到所述目标视频段落的视频部分;以及According to the video editing template, applying the target editing effect to at least a portion of the at least one video material to obtain a video portion of the target video segment; and 基于所述至少一个视频段落中每个所述目标视频段落对应的所述视频部分,生成所述视频编辑结果。The video editing result is generated based on the video portion corresponding to each of the target video segments in the at least one video segment. 根据权利要求2所述的方法,其中,所述匹配策略包括:The method according to claim 2, wherein the matching strategy comprises: 在所述编辑效果包括第一文案的情况下,所述目标编辑效果包括第二文案,其中,所述第二文案的文案格式与所述第一文案的文案格式相同,所述第二文案的文案内容与所述至少一个影像素材匹配。In the case where the editing effect includes a first text, the target editing effect includes a second text, wherein a text format of the second text is the same as a text format of the first text, and text content of the second text matches the at least one image material. 根据权利要求3所述的方法,其中,所述匹配策略还包括:The method according to claim 3, wherein the matching strategy further comprises: 响应于所述第一文案应用于所述目标视频段落的整个段落,所述第二文案应用于所述视频部分的整个部分;In response to the first copy being applied to the entire segment of the target video segment, the second copy is applied to the entire portion of the video portion; 响应于所述第一文案应用于所述目标视频段落中部分时间段,所述第二文案应用于所述视频部分中部分时间段。In response to the first text being applied to a partial time period in the target video segment, the second text is applied to a partial time period in the video portion. 根据权利要求3所述的方法,其中,所述响应于编辑处理的触发操作,根据所述视频编辑模板和所述至少一个影像素材,生成视频编辑结果,还包括:The method according to claim 3, wherein the generating of the video editing result according to the video editing template and the at least one video material in response to the triggering operation of the editing process further comprises: 获取所述第二文案的内容,Get the content of the second copy, 其中,所述结构信息还包括信息提示段落,所述信息提示段落用于获取提示信息,获取所述第二文案的内容包括:The structural information further includes an information prompt paragraph, and the information prompt paragraph is used to obtain prompt information. Obtaining the content of the second copy includes: 对所述提示信息和所述至少一个影像素材进行分析得到分析结果;以及Analyzing the prompt information and the at least one image material to obtain an analysis result; and 根据所述分析结果,生成所述第二文案的内容。The content of the second copy is generated according to the analysis result. 根据权利要求5所述的方法,其中,所述分析结果对所述第二文案的内容的影响程度大于所述至少一个影像素材对所述第二文案的内容的影响程度。The method according to claim 5, wherein the influence of the analysis result on the content of the second copy is greater than the influence of the at least one image material on the content of the second copy. 根据权利要求2所述的方法,其中,所述匹配策略包括:The method according to claim 2, wherein the matching strategy comprises: 在所述编辑效果包括转场动画的情况下,所述目标编辑效果包括所述转场动画。In the case where the editing effect includes a transition animation, the target editing effect includes the transition animation. 根据权利要求2所述的方法,其中,所述结构信息还包括信息提示段落,所述信息提示段落用于获取提示信息,The method according to claim 2, wherein the structural information further includes an information prompt paragraph, and the information prompt paragraph is used to obtain prompt information. 所述响应于编辑处理的触发操作,按照匹配策略,对所述编辑效果进行调整得到所述目标编辑效果,包括:In response to the triggering operation of the editing process, adjusting the editing effect according to the matching strategy to obtain the target editing effect includes: 响应于所述编辑处理的触发操作,对所述提示信息和所述至少一个影像素材进行分析得到分析结果;In response to the triggering operation of the editing process, analyzing the prompt information and the at least one image material to obtain an analysis result; 响应于所述编辑效果包括字幕和/或配音,按照所述目标编辑效果中的字幕和/或配音的内容与所述分析结果匹配的匹配策略,得到所述目标编辑效果中字幕和/或配音的内容;In response to the editing effect including subtitles and/or dubbing, obtaining the content of the subtitles and/or dubbing in the target editing effect according to a matching strategy of matching the content of the subtitles and/or dubbing in the target editing effect with the analysis result; 按照所述目标编辑效果中字幕的格式与所述编辑效果中的字幕格式相同的匹配策略,得到所述目标编辑效果中字幕的格式;以及According to the matching strategy that the format of the subtitles in the target editing effect is the same as the format of the subtitles in the editing effect, obtaining the format of the subtitles in the target editing effect; and 按照所述目标编辑效果中配音的声音参数与所述编辑效果中的配音的声音参数相同的匹配策略,得到所述目标编辑效果中配音的声音参数。According to the matching strategy that the sound parameters of the dubbing in the target editing effect are the same as the sound parameters of the dubbing in the editing effect, the sound parameters of the dubbing in the target editing effect are obtained. 根据权利要求2所述的方法,其中,所述基于所述至少一个视频段落中每个所述目标视频段落对应的所述视频部分,生成所述视频编辑结果,包括:The method according to claim 2, wherein generating the video editing result based on the video portion corresponding to each of the target video segments in the at least one video segment comprises: 将每个所述目标视频段落对应的所述视频部分进行拼接,得到所述视频编辑结果。The video parts corresponding to each of the target video segments are spliced to obtain the video editing result. 根据权利要求1所述的方法,其中,所述响应于编辑处理的触发操作,根据所述视频编辑模板和所述至少一个影像素材,生成视频编辑结果,包括:The method according to claim 1, wherein the step of generating a video editing result according to the video editing template and the at least one video material in response to the triggering operation of the editing process comprises: 响应于所述编辑处理的触发操作,利用神经网络对所述至少一个影像素材和所述编辑效果进行处理,得到所述视频编辑结果。In response to a triggering operation of the editing process, the at least one image material and the editing effect are processed using a neural network to obtain the video editing result. 根据权利要求2所述的方法,其中,所述至少一个编辑效果还包括应用于所述至少一个视频段落的全局编辑效果,The method according to claim 2, wherein the at least one editing effect further comprises a global editing effect applied to the at least one video segment, 所述响应于编辑处理的触发操作,根据所述视频编辑模板和所述至少一个影像素材,生成视频编辑结果,还包括:The step of generating a video editing result according to the video editing template and the at least one video material in response to the triggering operation of the editing process further includes: 响应于所述编辑处理的所述触发操作,调整所述全局编辑效果的时间长度与所述视频编辑结果中采用的目标影像素材的时间长度匹配,得到调整后的全局编辑效果;In response to the triggering operation of the editing process, adjusting the time length of the global editing effect to match the time length of the target image material used in the video editing result, to obtain an adjusted global editing effect; 所述基于所述至少一个视频段落中每个所述目标视频段落对应的所述视频部分,生成所述视频编辑结果,包括:The generating the video editing result based on the video portion corresponding to each of the target video segments in the at least one video segment includes: 基于所述调整后的全局编辑效果和每个所述目标视频段落对应的所述视频部分,生成所述视频编辑结果。The video editing result is generated based on the adjusted global editing effect and the video portion corresponding to each target video segment. 根据权利要求10或11所述的方法,其中,所述全局编辑效果包括音乐、滤镜和字幕中的至少一种。The method according to claim 10 or 11, wherein the global editing effect includes at least one of music, filters and subtitles. 根据权利要求1所述的方法,其中,所述结构信息通过时间区间或者连续的视频片段的序号指示所述至少一个视频段落的每个。The method according to claim 1, wherein the structural information indicates each of the at least one video segment by a time interval or a serial number of consecutive video segments. 根据权利要求1所述的方法,其中,所述响应于编辑处理的触发操作,根据所述视频编辑模板和所述至少一个影像素材,生成视频编辑结果,包括:The method according to claim 1, wherein the step of generating a video editing result according to the video editing template and the at least one video material in response to the triggering operation of the editing process comprises: 响应于所述编辑处理的所述触发操作,提取所述至少一个影像素材中的目标片段;以及In response to the triggering operation of the editing process, extracting a target segment from the at least one image material; and 根据所述视频编辑模板和所述目标片段,生成所述视频编辑结果。The video editing result is generated according to the video editing template and the target segment. 根据权利要求1所述的方法,其中,所述响应于编辑处理的触发操作,根据所述视频编辑模板和所述至少一个影像素材,生成视频编辑结果,包括:The method according to claim 1, wherein the step of generating a video editing result according to the video editing template and the at least one video material in response to the triggering operation of the editing process comprises: 响应于所述编辑处理的所述触发操作,若所述至少一个影像素材中存在第一影像素材,将所述第一影像素材分割为多个子片段,其中,所述第一影像素材为时长大于预设时长的素材;以及In response to the triggering operation of the editing process, if there is a first image material in the at least one image material, dividing the first image material into a plurality of sub-segments, wherein the first image material is a material having a duration greater than a preset duration; and 基于所述视频编辑模板和所述多个子片段,生成所述视频编辑结果。The video editing result is generated based on the video editing template and the multiple sub-segments. 根据权利要求1-15中任一项所述的方法,还包括:The method according to any one of claims 1 to 15, further comprising: 根据所述视频编辑结果,在预览页面播放预览视频。According to the video editing result, the preview video is played on the preview page. 根据权利要求16所述的方法,其中,The method according to claim 16, wherein 在所述编辑效果只包括字幕的情况下,所述预览页面中的配音标识展示为关闭状态,在所述编辑效果只包括配音的情况下,所述预览页面中的字幕标识展示为所述关闭状态。In the case where the editing effect only includes subtitles, the dubbing mark in the preview page is displayed as a closed state. In the case where the editing effect only includes dubbing, the subtitle mark in the preview page is displayed as the closed state. 根据权利要求16或17所述的方法,其中,所述预览页面包括视频播放窗口、素材展示窗口和编辑入口窗口,The method according to claim 16 or 17, wherein the preview page includes a video playback window, a material display window and an editing entry window, 所述视频播放窗口用于播放所述预览视频,所述素材展示窗口包括用于展示所述预览视频中的多个影像素材每个的至少部分的多个素材子窗口,所述编辑入口窗口用于展示对所述影像素材进行编辑的入口。The video playback window is used to play the preview video, the material display window includes multiple material sub-windows for displaying at least part of each of the multiple image materials in the preview video, and the editing entry window is used to display an entry for editing the image material. 根据权利要求18所述的方法,其中,所述素材展示窗口还包括素材添加子窗口,The method according to claim 18, wherein the material display window further includes a material adding sub-window, 所述方法还包括:The method further comprises: 响应于在所述多个素材子窗口中的素材窗口单元被选中的情况下,对所述素材添加子窗口的触发操作,展示多个供选择素材;In response to a material window unit in the plurality of material sub-windows being selected, a triggering operation of adding a sub-window to the material is performed to display a plurality of materials for selection; 根据对所述多个供选择素材的添加选择操作,将目标素材添加到所述素材窗口单元之后。According to the adding and selecting operation on the plurality of selectable materials, the target material is added after the material window unit. 根据权利要求18所述的方法,其中,所述编辑入口窗口包括用于对所述字幕进行编辑的字幕入口、用于对滤镜进行编辑的滤镜入口、用于对文案进行编辑的文案入口、用于对音乐进行编辑的音乐入口中的至少一个,The method according to claim 18, wherein the editing entry window includes at least one of a subtitle entry for editing the subtitles, a filter entry for editing filters, a text entry for editing text, and a music entry for editing music, 所述编辑入口窗口还包括:用于对所述预览视频中的多个优选素材进行选择删除的删除图标。The editing entry window also includes: a deletion icon for selecting and deleting multiple preferred materials in the preview video. 根据权利要求20所述的方法,其中,所述编辑入口窗口还包括重新组织入口,所述方法还包括:The method according to claim 20, wherein the editing entry window further comprises reorganizing the entry, the method further comprising: 响应于所述重新组织入口被选择,展示排序面板,其中,所述排序面板包括所述至少一个影像素材;以及In response to the reorganization portal being selected, displaying a sorting panel, wherein the sorting panel includes the at least one image material; and 接收对所述排序面板中的所述至少一个影像素材进行的组织操作,重新组织所述至少一个影像素材。An organization operation performed on the at least one image material in the sorting panel is received, and the at least one image material is reorganized. 一种视频编辑装置,包括:A video editing device, comprising: 第一展示单元,配置为响应于对视频编辑模板的触发操作,展示视频编辑界面,其中,所述视频编辑模板包括结构信息和编辑信息,所述结构信息用于指示至少一个视频段落,所述编辑信息用于指示在所述至少一个视频段落中施加的至少一个编辑效果,所述视频编辑界面包括与所述至少一个视频段落分别对应的段落编辑区域;A first display unit is configured to display a video editing interface in response to a trigger operation on a video editing template, wherein the video editing template includes structure information and editing information, the structure information is used to indicate at least one video segment, the editing information is used to indicate at least one editing effect applied to the at least one video segment, and the video editing interface includes segment editing areas corresponding to the at least one video segment respectively; 第二展示单元,配置为响应于针对目标视频段落的素材导入操作,在所述目标视频段落的段落编辑区域展示至少一个影像素材的标识,所述至少一个视频段落包括所述目标视频段落,所述至少一个影像素材为基于所述素材导入操作被导入在所述目标视频段落中的素材;以及a second display unit, configured to display, in response to a material import operation for a target video segment, an identification of at least one image material in a segment editing area of the target video segment, wherein the at least one video segment includes the target video segment, and the at least one image material is a material imported into the target video segment based on the material import operation; and 结果生成单元,配置为响应于编辑处理的触发操作,根据所述视频编辑模板和所述至少一个影像素材,生成视频编辑结果,其中,所述视频编辑结果对应于所述目标视频段落的部分为基于目标编辑效果和所述至少一个影像素材得到的编辑结果,所述目标编辑效果匹配所述至少一个编辑效果中位于所述目标视频段落内的编辑效果。A result generating unit is configured to generate a video editing result in response to a triggering operation of an editing process, based on the video editing template and the at least one image material, wherein a portion of the video editing result corresponding to the target video segment is an editing result obtained based on a target editing effect and the at least one image material, and the target editing effect matches an editing effect of the at least one editing effect located within the target video segment. 一种电子设备,包括:An electronic device, comprising: 处理器;processor; 存储器,包括一个或多个计算机程序指令;a memory including one or more computer program instructions; 其中,所述一个或多个计算机程序指令被存储在所述存储器中,并由所述处理器执行时实现权利要求1-21中任一项所述的视频编辑方法。The one or more computer program instructions are stored in the memory and, when executed by the processor, implement the video editing method according to any one of claims 1 to 21. 一种计算机可读存储介质,非暂时性存储有计算机可读指令,其中,当所述计算机可读指令由处理器执行时实现权利要求1-21中任一项所述的视频编辑方法。A computer-readable storage medium non-temporarily stores computer-readable instructions, wherein when the computer-readable instructions are executed by a processor, the video editing method according to any one of claims 1 to 21 is implemented.
PCT/CN2024/139575 2023-12-28 2024-12-16 Video editing method and apparatus, and electronic device and storage medium Pending WO2025139877A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202311843312.4A CN120238696A (en) 2023-12-28 2023-12-28 Video editing method and device, electronic device and storage medium
CN202311843312.4 2023-12-28

Publications (1)

Publication Number Publication Date
WO2025139877A1 true WO2025139877A1 (en) 2025-07-03

Family

ID=96157866

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2024/139575 Pending WO2025139877A1 (en) 2023-12-28 2024-12-16 Video editing method and apparatus, and electronic device and storage medium

Country Status (2)

Country Link
CN (1) CN120238696A (en)
WO (1) WO2025139877A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070083851A1 (en) * 2005-10-06 2007-04-12 Moda Co., Ltd. Template-based multimedia editor and editing method thereof
JP2007318450A (en) * 2006-05-25 2007-12-06 Nippon Telegr & Teleph Corp <Ntt> Video editing method and device
CN111866587A (en) * 2020-07-30 2020-10-30 口碑(上海)信息技术有限公司 Short video generation method and device
CN113891113A (en) * 2021-09-29 2022-01-04 阿里巴巴(中国)有限公司 Video clip synthesis method and electronic equipment
CN114697700A (en) * 2020-12-28 2022-07-01 北京小米移动软件有限公司 Video editing method, video editing device and storage medium
CN115250335A (en) * 2021-04-28 2022-10-28 深圳市腾讯计算机系统有限公司 Video processing method, apparatus, device and storage medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070083851A1 (en) * 2005-10-06 2007-04-12 Moda Co., Ltd. Template-based multimedia editor and editing method thereof
JP2007318450A (en) * 2006-05-25 2007-12-06 Nippon Telegr & Teleph Corp <Ntt> Video editing method and device
CN111866587A (en) * 2020-07-30 2020-10-30 口碑(上海)信息技术有限公司 Short video generation method and device
CN114697700A (en) * 2020-12-28 2022-07-01 北京小米移动软件有限公司 Video editing method, video editing device and storage medium
CN115250335A (en) * 2021-04-28 2022-10-28 深圳市腾讯计算机系统有限公司 Video processing method, apparatus, device and storage medium
CN113891113A (en) * 2021-09-29 2022-01-04 阿里巴巴(中国)有限公司 Video clip synthesis method and electronic equipment

Also Published As

Publication number Publication date
CN120238696A (en) 2025-07-01

Similar Documents

Publication Publication Date Title
KR20230042523A (en) Multimedia data processing method, generation method and related device
US20060204214A1 (en) Picture line audio augmentation
US10997364B2 (en) Operations on sound files associated with cells in spreadsheets
CN114938473B (en) Comment video generation method and device
JP7718760B2 (en) Effect profile generation method, device, equipment, and medium
WO2022253349A1 (en) Video editing method and apparatus, and device and storage medium
CN110324718A (en) Audio-video generation method, device, electronic equipment and readable medium
WO2017062961A1 (en) Methods and systems for interactive multimedia creation
WO2025036351A1 (en) Media content generation method and device, storage medium, and program product
CN111432290B (en) Video generation method based on audio adjustment
CN117998163A (en) Video editing method, device, electronic equipment and storage medium
US12159650B2 (en) Method and apparatus, electronic device, and storage medium for video editing
WO2025139877A1 (en) Video editing method and apparatus, and electronic device and storage medium
CN113571061B (en) Speech transcription text editing system, method, device and equipment
US20250037335A1 (en) Video processing method, apparatus, electronic device and storage medium
US12423895B2 (en) Adaptive editing experience for mixed media content
CN118828141A (en) Video processing method, device, electronic device and storage medium
WO2024217226A1 (en) Video producing method and apparatus, device, and storage medium
US20240386915A1 (en) Video generation method and apparatus, electronic device, and readable storage medium
CN113556576B (en) Video generation method and device
US20240397138A1 (en) Video production method and device, apparatus, and storage medium
CN115426517B (en) Method and device for generating gradual change animation frame, electronic equipment and storage medium
US12170829B1 (en) Video processing method and device, storage medium and program product
CN120186432A (en) Video template variable method, device, storage medium and program product
CN119342276A (en) Video processing method, device, storage medium and program product

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24910808

Country of ref document: EP

Kind code of ref document: A1