-
2019-07-23
15/704,242
2017-09-14
US 10,360,651 B1
2019-07-23
-
-
Nancy Bitar
Polsinelli LLP
2037-09-14
Smart Summary: An array of small grids made up of pixels is used to create a watermark in videos. The movement of these pixel blocks is controlled by a random sequence generated from a starting value. Data is embedded into the video by slightly changing the color values of small groups of pixels, with each group representing a single binary bit. The changes are designed to be subtle, making them hard for people to notice. Only a small central area of each pixel patch is modified, and the adjustments are based on the colors of surrounding pixels to ensure minimal visibility. 🚀 TL;DR
An array of a multiplicity of ten-by-ten grids of pixels per video frame is employed to form a watermark mask. Motion of a pixel block is determined by a pseudo-random sequence from a seed value. The resulting invention embeds data as binary bits by shifting one or more color space values of a small patch of pixels.
Get notified when new applications in this technology area are published.
G06T1/0028 » CPC main
General purpose image data processing; Image watermarking Adaptive watermarking, e.g. Human Visual System [HVS]-based watermarking
G06T1/00 IPC
General purpose image data processing
The present application is related to and/or claims the benefits of the earliest effective priority date and/or the earliest effective filing date of the below-referenced applications, each of which is hereby incorporated by reference in its entirety, to the extent such subject matter is not inconsistent herewith, as if fully set forth herein:
(1) this application constitutes a non-provisional of U.S. Provisional Patent Application No. 62/394,390, entitled COMPATIBLE ADAPTIVE VIDEO WATERMARKING, naming W. Leo Hoarty as the inventor, filed Sep. 14, 2016, which is currently or is an application of which a currently application is entitled to the benefit of the filing date.
This invention relates generally to video, and, more specifically, to compatible adaptive video watermarking.
Recent advances in video technology have exposed new opportunities for solutions directed to compatible adaptive video watermarking.
System of the invention embeds data as binary bits by shifting one or more color space values of a small patch of pixels. Each patch of pixels represents a single binary bit. An array of pixel patch areas is distributed across a video frame and modified per patch to encode a binary word of watermark data. Watermarked pixel patches are cloaked by shifting a color space value of a respective pixel patch area that is the least noticeable to human visual perception.
Only the center cluster of pixels (typically 5×5 area within a patch area of 11×11 is modified). Color space value to shift per pixel patch is determined by surrounding pixels to produce the least noticeable change to human visual perception.
In another embodiment, pixel patches that are modified can be shifted in x, y position within the pixel patch area in the next video frame in a pseudo-random orbit to reduce visibility. Data words fill a video frame (from 8 to 32 bits) and can be encoded using redundancy such as repeating data word in subsequent frames as well as Gray coding (reflected binary coding) to detect errors during decoding. Encoded watermark will survive multiple re-encodings.
Informative Definitions:
The foregoing is a summary and thus contains, by necessity, simplifications, generalizations and omissions of detail; consequently, those skilled in the art will appreciate that the summary is illustrative only and is not intended to be in any way limiting. Other aspects, embodiments, features and advantages of the device and/or processes and/or other subject matter described herein will become apparent in the teachings set forth herein.
Certain embodiments of the present invention are described in detail below with reference to the following drawings:
FIG. 1 depicts an HDTV display area showing 32 pixel patch locations.
FIG. 2 depicts a pixel patch area of 11×11 pixels.
FIG. 3 depicts color space graphs of HSL and RGB color space.
FIG. 4 depicts data block pseudo-random motion around a pixel patch.
FIG. 5 depicts data repetition by frame grouping.
FIG. 6 depicts an encoding flow diagram.
FIG. 7 depicts a de-encoding flow diagram.
FIG. 8 depicts an HDTV video frame showing exemplary pixel path locations.
FIG. 9 depicts an HDTV video frame showing exemplary watermarked pixel patches.
FIG. 10 depicts an HDTV video frame showing a closeup of a watermark altered pixel path.
FIG. 11 depicts standard definition television pixel dimensions.
FIG. 12 depicts standard definition television pixel dimensions overlaid onto HDTV.
FIG. 13a depicts standard definition television pixel dimensions scaled vertically to 1080 lines and overlaid on HDTV.
FIG. 13b depicts standard definition television pixel dimensions scaled vertically to 1080 lines and overlaid on HDTV.
FIG. 14a depicts standard definition television pixel dimensions scaled horizontally to 1920 lines and overlaid on HDTV.
FIG. 14b depicts standard definition television pixel dimensions scaled horizontally to 1920 lines and overlaid on HDTV.
FIG. 15a depicts standard definition television pixel dimensions distorted to fit an HDTV frame.
FIG. 15b depicts standard definition television pixel dimensions distorted to fit an HDTV frame.
FIG. 16 depicts HDTV scaled down to standard definition then overlaid on HDTV frame.
FIG. 17 depicts HDTV scaled down to standard definition then scaled vertically and overlaid on HDTV frame.
FIG. 18 depicts HDTV scaled down to standard definition then scaled horizontally and overlaid on HDTV frame.
Specific details of certain embodiments of the invention are set forth in the following description and in the figures to provide a thorough understanding of such embodiments. The present invention may have additional embodiments, may be practiced without one or more of the details described for any particular described embodiment, or may have any detail described for one particular embodiment practiced with any other detail described for another embodiment.
Importantly, a grouping of inventive aspects in any particular “embodiment” within this detailed description, and/or a grouping of limitations in the claims presented herein, is not intended to be a limiting disclosure of those particular aspects and/or limitations to that particular embodiment and/or claim. The inventive entity presenting this disclosure fully intends that any disclosed aspect of any embodiment in the detailed description and/or any claim limitation ever presented relative to the instant disclosure and/or any continuing application claiming priority from the instant application (e.g. continuation, continuation-in-part, and/or divisional applications) may be practiced with any other disclosed aspect of any embodiment in the detailed description and/or any claim limitation. Claimed combinations which draw from different embodiments and/or originally-presented claims are fully within the possession of the inventive entity at the time the instant disclosure is being filed. Any future claim comprising any combination of limitations, each such limitation being herein disclosed and therefore having support in the original claims or in the specification as originally filed (or that of any continuing application claiming priority from the instant application), is possessed by the inventive entity at present irrespective of whether such combination is described in the instant specification because all such combinations are viewed by the inventive entity as currently operable without undue experimentation given the disclosure herein and therefore that any such future claim would not represent new matter.
FIG. 1 depicts an HDTV display area showing 32 pixel patch locations, and
FIG. 2 depicts a pixel patch area of 11×11 pixels, the foregoing depictions showing a possible environment in which an Adaptive Video Watermark (AVW) Embedding Process such as that summarized in FIG. 2 may be employed.
FIG. 3 depicts color space graphs of HSL and RGB color space in which the following Normative Definitions will apply:
within pixel sample area
FIG. 4 depicts data block pseudo-random motion around a pixel patch in which encoding is concealed from human visual perception due to the following observed traits of Human eye sensitivity:
FIG. 5 depicts data repetition by frame grouping, including an example of repeat data for multiple frames, the analysis of which may be a portion of an encoding flow diagrammed in FIG. 6 specifying in part the following Encoding Rules of the Invention (a de-encoding of which being depicted in FIG. 7):
When a binary one is encoded and pixels are shifted has shown, for example by the example watermarked video frame of FIG. 9, and the highlighted example shown in FIG. 10), the direction of shift will be in the opposite respective HSL direction of the previous shift (if any) of the same pixel block position of the previous frame.
An Application of Adaptive Video Watermarking
Using Video Watermarking to Detect Aspect Ratio Distortion
FIGS. 11, 12, 13a, 13b, 14a, 14b, 15a, 15b, 16, 17, and 18 depict standard television pixel dimensions, frame distortion associated with SD in various HDTV scaling schemes, and corresponding frame distortion detections using adaptive video watermarks.
Using Adaptive Video Watermark (AVW) to Detect Video Frame Distortion—Basic Principles:
FIGS. 13b, 14b, and 15b illustrate exemplary pixels shifts as a result of the respective video frame distortion.
While particular aspects of the present subject matter described herein have been shown and described, it will be apparent to those skilled in the art that, based upon the teachings herein, changes and modifications may be made without departing from the subject matter described herein and its broader aspects and, therefore, the appended claims are to encompass within their scope all such changes and modifications as are within the true spirit and scope of this subject matter described herein. Furthermore, it is to be understood that the invention is defined by the appended claims. It will be understood by those within the art that, in general, terms used herein, and especially in the appended claims (e.g., bodies of the appended claims) are generally intended as “open” terms (e.g., the term “including” should be interpreted as “including but not limited to,” the term “having” should be interpreted as “having at least,” the term “includes” should be interpreted as “includes but is not limited to,” etc.). It will be further understood by those within the art that if a specific number of an introduced claim recitation is intended, such an intent will be explicitly recited in the claim, and in the absence of such recitation no such intent is present. For example, as an aid to understanding, the following appended claims may contain usage of the introductory phrases “at least one” and “one or more” to introduce claim recitations. However, the use of such phrases should not be construed to imply that the introduction of a claim recitation by the indefinite articles “a” or “an” limits any particular claim containing such introduced claim recitation to inventions containing only one such recitation, even when the same claim includes the introductory phrases “one or more” or “at least one” and indefinite articles such as “a” or “an” (e.g., “a” and/or “an” should typically be interpreted to mean “at least one” or “one or more”); the same holds true for the use of definite articles used to introduce claim recitations. In addition, even if a specific number of an introduced claim recitation is explicitly recited, those skilled in the art will recognize that such recitation should typically be interpreted to mean at least the recited number (e.g., the bare recitation of “two recitations,” without other modifiers, typically means at least two recitations, or two or more recitations). Furthermore, in those instances where a convention analogous to “at least one of A, B, and C, etc.” is used, in general such a construction is intended in the sense one having skill in the art would understand the convention (e.g., “a system having at least one of A, B, and C” would include but not be limited to systems that have A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.).
While preferred and alternative embodiments of the invention have been illustrated and described, as noted above, many changes can be made without departing from the spirit and scope of the invention. Accordingly, the scope of the invention is not limited by the disclosure of these preferred and alternate embodiments. Instead, the invention should be determined entirely by reference to the claims that follow.
1. A system, comprising:
at least one processing device; and
one or more instructions which, when executed by the at least one processing device, configured the at least one processing device to perform one or more operations including at least:
forming a watermark mask, the watermark mask formed at least partially based on at least one ten-by-ten grid of pixels selected from a video frame, including at least selecting a four-by-four array of pixels within the at least one ten-by-ten grid of pixels selected from the video frame;
modifying the four-by-four array of pixels to encode a binary “one” represented by a modified four-by-four array of pixels, including at least:
deciding to shift at least one of an h value, an s value, or an 1 value of at least one pixel of the at least one ten-by-ten grid of pixels at least partially based on one or more values of one or more surrounding pixels around the at least one pixel; and
painting one or more pixels within the four-by-four array of pixels to represent the binary “one” at least partially based on at least one shifted value to form the watermark mask; and
determining motion occurring between the selected video frame and at least one adjacent frame at least partially based on the watermark mask including at least the binary “one” associated with the selected video frame and at least partially based on a corresponding at least one ten-by-ten grid of pixels selected from the at least one adjacent frame.