US20230286163A1
2023-09-14
18/083,507
2022-12-17
US 11,766,784 B1
2023-09-26
-
-
Spencer D Patton
SZDC LAW P.C.
2042-12-17
The invention provides a motion capture method of a robotic arm, including: fastening a visual sensor on a robotic arm to acquire data as a source domain, fastening an inertial sensor on a corresponding human arm to acquire data as a target domain, and establishing a state space expression of a system; setting an optimal unknown state observed joint distribution by using a total probability theory and an observed prediction distribution of the source domain as a condition, decomposing a conditional joint observed distribution model, and solving an optimal distribution using KL divergence; and transferring knowledge of the source domain measured by the visual sensor into the target domain measured by the inertial sensor based on a Kalman filter and the total probability theory, performing data fusion based on Kalman filtering, and predicting a state of the system at a next moment to implement motion capture of the robotic arm.
Get notified when new applications in this technology area are published.
B25J9/1669 » CPC further
Programme-controlled manipulators; Programme controls characterised by programming, planning systems for manipulators characterised by special application, e.g. multi-arm co-operation, assembly, grasping
B25J9/16 IPC
Programme-controlled manipulators Programme controls
B25J9/1697 » CPC main
Programme-controlled manipulators; Programme controls characterised by use of sensors other than normal servo-feedback from position, speed or acceleration sensors, perception control, multi-sensor controlled systems, sensor fusion Vision controlled systems
The present invention relates to the technical fields of robotic arm motion capture and robotic arm reality synchronization operation, and in particular to motion capture method and system of a robotic arm, a medium, and an electronic device.
One of the most significant challenges facing the application of robotic arms in industrial production, medicine, and other industries is how to operate the robotic arms. Compared with operations through computer instructions and joysticks, an operation mode of implementing reality synchronization through human motion capture is simpler, more direct, and efficient. Currently used non-contact motion capture is mostly vision-based technology to mark human body parts, track human motion, and therefore control a remote robotic arm through pose analysis and the like. However, there is the problem of marker occlusion, the accuracy of visual acquisition data in dark environment is greatly reduced, and poses cannot be obtained accurately under the impact of light brightness. A mode of vision-based motion capture is greatly limited.
For dynamic robotic arm motion capture, it is more straightforward to track the human body through an exoskeleton sensing system. However, a contact operation mode hinders human motion and is not conducive to actual operations. In a markerless non-contact tracking method, an inertial sensor may be used to obtain pose information of human motion. In inertial navigation, navigation parameters at a current moment are calculated by using known information at a previous moment and measurement information at a current moment. This process is continuously repeated. With the presence of external interference and sensing errors, errors occur in all steps of calculation, and the errors in inertial navigation accumulate over time. Therefore, in the use of a combined navigation mode, a state at a next moment can be predicted to the maximum extent by fusing acquired data.
A data fusion method with knowledge transfer based on a Kalman filter (KF) is used by combining motion capture modes of vision and inertial sensors. The basic principle of transfer learning is to use knowledge information in a source domain to improve learning in a target domain, and transfer learning has been researched in many fields such as autonomous driving, computer vision, and sensor networks. A human motion state can be accurately predicted by using this data fusion method with knowledge transfer to implement motion capture of markerless reality synchronization of robotic arms. However, little research has been conducted in this area.
A technical problem to be resolved by the present invention is to provide a robotic arm motion capture method that takes deficiencies in visual sensing and inertial sensing into comprehensive consideration and uses visual sensing as a source domain by using the concept of transfer learning to improve a target domain of inertia sensing, thereby improving prediction accuracy.
To resolve the foregoing problem, the present invention provides a motion capture method of a robotic arm, including the following steps:
As a further improvement to the present invention, in step 1, the state space expression of the system established for motion capture of a human body is as follows:
Xi=AXiβ1+Buiβ1+wiβ1ββ(1),
Zi=CXi+viββ(2), and
Zi,s=CsXi+vi,sββ(3),
As a further improvement to the present invention, step S2 includes:
m(Xi,Zi|fs)=m(Zi|Xi,fs)m(Xi|fs)ββ(4), and
m I ( X n , Z n ) = f β‘ ( X n , Z n ) = f β‘ ( Z n | X n ) β’ f β‘ ( X n ) β‘ β i = 1 n f β‘ ( Z i | X i ) β’ f β‘ ( X i | X i - 1 ) ; ( 5 )
m β‘ ( Z n | X n , f s ) = β i = 1 n m β‘ ( z i | X i , Z i - 1 , f s ) , ( 6 ) m β‘ ( X n | f s ) = β i = 1 n m β‘ ( x i | X i - 1 , f s ) = β i = 1 n m β‘ ( x i | x i - 1 , f s ) , ( 7 )
m(Xn,Zn|fs)=fs(Zn)m(Xn|fs)ββ(8),
m o ( X n | f s ) = β i = 1 n m o ( x i | x i - 1 , f s ) , ( 9 )
and
m o ( x i | x i - 1 , f s ) β‘ f β‘ ( x i | x i - 1 ) β’ exp β’ { - β« f s ( z i | Z i - 1 , , s ) β’ ln β’ f s ( z i | Z i - 1 , s ) f β‘ ( z i | x i ) β’ dz i } β’ Ξ³ β‘ ( x i ) Ξ³ β‘ ( x i - 1 ) , ( 10 )
As a further improvement to the present invention, step S3 includes:
f(xi|xiβ1)β‘Nxi(Axiβ1,Q)ββ(11),
f(zi|xi)β‘Nzi(Cxi,R)ββ(12),
f(zi,s|xi)β‘Nzi,s(Csxi,R)ββ(13), and
f(xi|xiβ1)β‘Nxi(ΞΌi|i,Ξ£i|i)ββ(14),
f(xi|ziβ1)β‘Nxi(ΞΌi|iβ1,Ξ£i|iβ1)ββ(15),
f(zi|ziβ1)β‘Nzi(zi|iβ1,Ri|iβ1)ββ(16), and
f(zi,s|ziβ1,s)β‘Nzi,s(zi|iβ1,s,Ri|iβ1,s)ββ(17);
Ξ³ β‘ ( x i - 1 ) β‘ β« f β‘ ( x i | x i - 1 ) Γ exp β’ { - β« f s ( z i | Z i - 1 , , s ) β’ ln β’ f s ( z i | Z i - 1 , s ) f β‘ ( z i | x i ) β’ dz i } β’ Ξ³ β‘ ( x i ) β’ dx i ,
Ξ³(xiβ1)βexp[βΒ½(xiβ1TSiβ1|ixiβ1β2xiβ1Triβ1|i)]ββ(18), and
L β‘ S i | i β’ Q 1 2 ( Q T 2 β’ S i | i β’ Q 1 2 + I n x ) - 1 β’ Q T 2
is:
riβ1|i=AT(InxβL)ri|i,ββ(19), and
Siβ1|i=AT(InxβL)Si|iAββ(20), and
mo=(mi|xiβ1,fs)=Nxi(ΞΌio,Ξ£io)ββ(21),
ΞΌ i o = ( I n x - Ξ£ i o β’ S i | i ) β’ A β’ x i - 1 + Ξ£ i o β’ r i | i , Ξ£ i o = Q 1 2 ( Q T 2 β’ S i | i β’ Q 1 2 + I n x ) - 1 β’ Q T 2 ,
and an optimal state a priori predicted distribution after transfer of knowledge from the source domain to the target domain for fusion may be obtained according to Formula (14) of the state prior distribution of the KF:
ΞΌi|iβ1=(InxβΞ£ioSi|i)AΞΌiβ1|iβ1+Ξ£iori|iββ(22), and
Ξ£i|iβ1=(InxβΞ£ioSi|i)AΞ£iβ1|iβ1AT(InxβΞ£ioSi|i)T+Ξ£ioββ(23).
The present invention further provides a computer-readable storage medium, the storage medium including a stored program, where the program performs the foregoing robotic arm motion capture method.
The present invention further provides an electronic device, including: one or more processors, a memory, and one or more programs, where the one or more programs are stored in the memory and are configured to be performed by the one or more processors, and the one or more programs are used for performing the foregoing robotic arm motion capture method.
The present invention further provides a motion capture system of a robotic arm, including the following modules:
As a further improvement to the present invention, the state space expression of the system established for motion capture of a human body is as follows:
Xi=AXiβ1+Buiβ1+wiβ1ββ(1),
Zi=CXi+viββ(2), and
Zi,s=CsXi+vi,sββ(3),
As a further improvement to the present invention, the optimal distribution solving module is configured to perform the following steps:
m(Xi,Zi|fs)=m(Zi|Xi,fs)m(Xi|fs)ββ(4), and
m I ( X n , Z n ) = f β‘ ( X n , Z n ) = f β‘ ( Z n | X n ) β’ f β‘ ( X n ) β‘ β i = 1 n f β‘ ( Z i | X i ) β’ f β‘ ( X i | X i - 1 ) ; ( 5 )
m β‘ ( Z n | X n , f s ) = β i = 1 n m β‘ ( z i | X i , Z i - 1 , f s ) , ( 6 ) m β‘ ( X n | f s ) = β i = 1 n m β‘ ( x i | X i - 1 , f s ) = β i = 1 n m β‘ ( x i | x i - 1 , f s ) , ( 7 )
m(Xn,Zn|fs)=fs(Zn)m(Xn|fs)ββ(8),
m o ( X n | f s ) = β i = 1 n m o ( x i | x i - 1 , f s ) , and ( 9 ) m o ( x i | x i - 1 , f s ) β‘ f β‘ ( x i | x i - 1 ) β’ exp β’ { - β« f s ( z i | Z i - 1 , , s ) β’ ln β’ f s ( z i | Z i - 1 , s ) f β‘ ( z i | x i ) β’ dz i } β’ Ξ³ β‘ ( x i ) Ξ³ β‘ ( x i - 1 ) , ( 10 )
As a further improvement to the present invention, the motion capture module is configured to perform the following steps:
f(xi|xiβ1)β‘Nxi(Axiβ1,Q)ββ(11),
f(zi|xi)β‘Nzi(Cxi,R)ββ(12),
f(zi,s|xi)β‘Nzi,s(Csxi,R)ββ(13), and
f(xi|xiβ1)β‘Nxi(ΞΌi|i,Ξ£i|i)ββ(14),
f(xi|ziβ1)β‘Nxi(ΞΌi|iβ1,Ξ£i|iβ1)ββ(15),
f(zi|ziβ1)β‘Nzi(zi|iβ1,Ri|iβ1)ββ(16), and
f(zi,s|ziβ1,s)β‘Nzi,s(zi|iβ1,s,Ri|iβ1,s)ββ(17);
Ξ³ β‘ ( x i - 1 ) β‘ β« f β‘ ( x i | x i - 1 ) Γ exp β’ { - β« f s ( z i | Z i - 1 , , s ) β’ ln β’ f s ( z i | Z i - 1 , s ) f β‘ ( z i | x i ) β’ dz i } β’ Ξ³ β‘ ( x i ) β’ dx i ,
Ξ³(xiβ1)βexp[βΒ½(xiβ1TSiβ1|ixiβ1β2xiβ1Triβ1|i)]ββ(18), and
L β‘ S i | i β’ Q 1 2 ( Q T 2 β’ S i | i β’ Q 1 2 + I n x ) - 1 β’ Q T 2
is:
riβ1|i=AT(InxβL)ri|i,ββ(19), and
Siβ1|i=AT(InxβL)Si|iAββ(20), and
mo=(mi|xiβ1,fs)=Nxi(ΞΌio,Ξ£io)ββ(21),
ΞΌ i o = ( I n x - Ξ£ i o β’ S i | i ) β’ A β’ x i - 1 + Ξ£ i o β’ r i | i , Ξ£ i o = Q 1 2 ( Q T 2 β’ S i | i β’ Q 1 2 + I n x ) - 1 β’ Q T 2 ,
and an optimal state a priori predicted distribution after transfer of knowledge from the source domain to the target domain for fusion may be obtained according to Formula (14) of the state prior distribution of the KF:
ΞΌi|iβ1=(InxβΞ£ioSi|i)AΞΌiβ1|iβ1+Ξ£iori|iββ(22), and
Ξ£i|iβ1=(InxβΞ£ioSi|i)AΞ£iβ1|iβ1AT(InxβΞ£ioSi|i)T+Ξ£ioββ(23).
The beneficial effects of the invention are as follows:
The robotic arm motion capture method provides in the present invention takes deficiencies in visual sensing and inertial sensing into comprehensive consideration and uses visual sensing as a source domain by using the concept of transfer learning to improve a target domain of inertia sensing, thereby improving prediction accuracy.
The above description is only an overview of the technical solutions of the present invention. For a clearer understanding of the technical measure of the present invention and implementation according to the content of the specification, and to make the above and other objectives, features, and advantages of the present invention clearer and more comprehensible, detailed description is provided as follows with reference to preferred embodiments and the accompanying drawings.
FIG. 1 is a flowchart of a motion capture method of a robotic arm according to a preferred embodiment of the present invention;
FIG. 2 is a schematic diagram of motion capture model of a robotic arm according to a preferred embodiment of the present invention;
FIG. 3 is a diagram showing simulated comparison between a case when the method of the present invention is not used and after the method of the present invention is used for processing; and
FIG. 4 is a diagram of accuracy comparison of state prediction using the algorithm of the present invention and previous algorithms.
The present invention is further described below with reference to the accompanying drawings and specific embodiments, to enable a person skilled in the art to better understand and implement the present invention. However, the embodiments are not used to limit the present invention.
As shown in FIG. 1, a motion capture method of a robotic arm according to a preferred embodiment of the present invention includes the following steps:
For a human hand of motion capture required for a robotic arm, the three-dimensional position and direction of the center of mass of the hand are used for controlling the position and direction of a robot end effector, as shown in FIG. 2. X, Y, and Z are used as a coordinate system of the human hand, Ο, ΞΈ, and Ο are defined as a direction yaw angle, a pitch angle, and a roll angle of the hand, wx, wy, and wz are angular velocity components of the hand on X, Y, Z axes, and a motion direction of the hand is estimated by using a quaternion method. The conversion between a quaternion and a Eulerian angle is as follows:
[ q 0 q 1 q 2 q 3 ] = [ cos β’ ( Ο 2 ) β’ cos β’ ( ΞΈ 2 ) β’ cos β’ ( Ο 2 ) + sin β‘ ( Ο 2 ) β’ sin β’ ( ΞΈ 2 ) β’ sin β’ ( Ο 2 ) sin β’ ( Ο 2 ) β’ cos β’ ( ΞΈ 2 ) β’ cos β’ ( Ο 2 ) - cos β’ ( Ο 2 ) β’ sin β’ ( ΞΈ 2 ) β’ sin β’ ( Ο 2 ) cos β’ ( Ο 2 ) β’ sin β’ ( ΞΈ 2 ) β’ cos β’ ( Ο 2 ) + sin β’ ( Ο 2 ) β’ cos β’ ( ΞΈ 2 ) β’ sin β’ ( Ο 2 ) cos β’ ( Ο 2 ) β’ cos β’ ( ΞΈ 2 ) β’ sin β’ ( Ο 2 ) - sin β’ ( Ο 2 ) β’ sin β’ ( ΞΈ 2 ) β’ cos β’ ( Ο 2 ) ] .
The quaternion satisfies q02+q12+q22+q32=1. A state space expression of a robotic arm motion capture model is established as follows according to the quaternion method:
Xi=AXiβ1+Buiβ1+wiβ1ββ(1),
Zi=CXi+viββ(2), and
Zi,s=CsXi+vi,sββ(3),
Measurement Matrices of the Target Domain and the Source Domain:
C = C s = [ 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 ] .
State Transfer Matrix:
A i = [ 1 0 0 0 - q 1 , i 2 - q 2 , i 2 - q 3 , i 2 0 1 0 0 q 0 , i 2 q 3 , i 2 q 2 , i 2 0 0 1 0 q 3 , i 2 q 0 , i 2 - q 1 , i 2 0 0 0 1 - q 2 , i 2 q 1 , i 2 q 0 , i 2 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 ] .
m(Xi,Zi|fs)=m(Zi|Xi,fs)m(Xi|fs)ββ(4), and
m I ( X n , Z n ) = f β‘ ( X n , Z n ) = f β‘ ( Z n | X n ) β’ f β‘ ( X n ) β‘ β i = 1 n f β‘ ( Z i | X i ) β’ f β‘ ( X i | X i - 1 ) ; ( 5 )
m β‘ ( Z n | X n , f s ) = β i = 1 n m β‘ ( z i | X i , Z i - 1 , f s ) , ( 6 ) m β‘ ( X n | f s ) = β i = 1 n m β‘ ( x i | X i - 1 , f s ) = β i = 1 n m β‘ ( x i | x i - 1 , f s ) , ( 7 )
m(Xn,Zn|fs)=fs(Zn)m(Xn|fs)ββ(8),
m o ( X n | f s ) = β i = 1 n m o ( x i | x i - 1 , f s ) , and ( 9 ) m o ( x i | x i - 1 , f s ) β‘ f β‘ ( x i | x i - 1 ) β’ exp β’ { - β« f s ( z i | Z i - 1 , , s ) β’ ln β’ f s ( z i | Z i - 1 , s ) f β‘ ( z i | x i ) β’ dz i } β’ Ξ³ β‘ ( x i ) Ξ³ β‘ ( x i - 1 ) , ( 10 )
f(xi|xiβ1)β‘Nxi(Axiβ1,Q)ββ(11),
f(zi|xi)β‘Nzi(Cxi,R)ββ(12),
f(zi,s|xi)β‘Nzi,s(Csxi,R)ββ(13), and
f(xi|xiβ1)β‘Nxi(ΞΌi|i,Ξ£i|i)ββ(14),
f(xi|ziβ1)β‘Nxi(ΞΌi|iβ1,Ξ£i|iβ1)ββ(15),
f(zi|ziβ1)β‘Nzi(zi|iβ1,Ri|iβ1)ββ(16), and
f(zi,s|ziβ1,s)β‘Nzi,s(zi|iβ1,s,Ri|iβ1,s)ββ(17);
Ξ³ β‘ ( x i - 1 ) β‘ β« f β‘ ( x i | x i - 1 ) Γ exp β’ { - β« f s ( z i | Z i - 1 , , s ) β’ ln β’ f s ( z i | Z i - 1 , s ) f β‘ ( z i | x i ) β’ dz i } β’ Ξ³ β‘ ( x i ) β’ dx i ,
Ξ³(xiβ1)βexp[βΒ½(xiβ1TSiβ1|ixiβ1β2xiβ1Triβ1|i)]ββ(18), and
L β‘ S i β i β’ Q 1 2 ( Q T 2 β’ S i β i β’ Q 1 2 + I n x ) - 1 β’ Q T 2
is
riβ1|i=AT(InxβL)ri|i,ββ(19), and
Siβ1|i=AT(InxβL)Si|iAββ(20), and
mo=(mi|xiβ1,fs)=Nxi(ΞΌio,Ξ£io)ββ(21),
ΞΌ i o = ( I n x - Ξ£ i o β’ S i β i ) β’ A β’ x i - 1 + Ξ£ i o β’ r i | i , Ξ£ i o = Q 1 2 ( Q T 2 β’ S i | i β’ Q 1 2 + I n x ) - 1 β’ Q T 2 ,
and an optimal state a priori predicted distribution after transfer of knowledge from the source domain to the target domain for fusion may be obtained according to Formula (14) of the state prior distribution of the KF:
ΞΌi|iβ1=(InxβΞ£ioSi|i)AΞΌiβ1|iβ1+Ξ£iori|iββ(22), and
Ξ£i|iβ1=(InxβΞ£ioSi|i)AΞ£iβ1|iβ1AT(InxβΞ£ioSi|i)T+Ξ£ioββ(23).
Through Formulas (18), (21), (22), and (23), a total probability design algorithm of knowledge transfer based on a KF may be obtained, specifically:
A. Source Domain Knowledge Transfer:
B. Target Domain State Prediction
Referring to steps S2 and S3, through the use of the total probability design algorithm of knowledge transfer based on a KF, under a condition of transferring measured prediction knowledge of a source domain, state prediction of a target domain is improved, and measurement data of the target domain is fused to obtain a state estimation value with a better tracking effect to perform effective object tracking, thereby implementing better motion capture. Simulation is performed by using an iterative algorithm, and comparison is made with a conventional method. Root-mean-square deviation indicators (mean norm squared-errors, MNSEs) according to different source domain measurement noise Rs are used for comparison. For the obtained results and simulated diagram, reference is made to FIG. 3, FIG. 4, and the following Table 1:
| TABLE 1 | ||||||
| Rs = 10β7 I3 | Rs = 10β6 I3 | Rs = 10β5 I3 | Rs = 10β4 I3 | Rs = 10β3 I3 | Rs = 10β2 I3 | |
| KF | β2.1025 | β2.1025 | β2.1025 | β2.1025 | β2.1025 | β2.1025 |
| DT | β2.7172 | β2.7387 | β2.7182 | β2.7782 | β2.2728 | β1.9064 |
| DTi | β6.4199 | β5.4236 | β4.9413 | β4.4720 | β2.2728 | β2.1125 |
An experimental method of reference comparison is used in a whole simulation process to compare the tracking effects of a case in which the present invention is not used and a case in which dynamic knowledge transfer of the present invention is used. A conventional KF method is compared with the method in the present invention in the case of different measurement noise covariance matrices in a source domain. As can be seen FIG. 3 and FIG. 4, the state estimation performance of dynamic knowledge transfer when a measurement noise covariance matrix of the source domain is greater than that of a target domain is inferior to a conventional method. The dynamic knowledge transfer is insensitive to the measurement prediction of the source domain in a transfer process. Therefore, A (source domain knowledge transfer) of the algorithm based on the total probability design of knowledge transfer based on a KF is replaced with a form of a dynamic knowledge transfer variant (DTi), and the tracking effect is better regardless of a change in the measurement noise covariance matrix of the source domain. Therefore, the knowledge of the source domain is particularly important for the knowledge processing of the target domain in the axiomatic framework of transfer learning, and the total probability design of knowledge transfer based on a KF solves the limitation of a single sensor for robotic arm motion capture and improves the tracking performance of motion capture.
The motion capture method of robotic arm provides in the present invention takes deficiencies in visual sensing and inertial sensing into comprehensive consideration and uses visual sensing as a source domain by using the concept of transfer learning to improve a target domain of inertia sensing, thereby improving prediction accuracy.
Further disclosed in a preferred embodiment of the present invention is a computer-readable storage medium, the storage medium including a stored program, where the program performs the motion capture method of robotic arm in Embodiment 1.
Further disclosed in a preferred embodiment of the present invention is an electronic device. The electronic device includes: one or more processors, a memory, and one or more programs, where the one or more programs are stored in the memory and are configured to be performed by the one or more processors, and the one or more programs are used for performing the motion capture method of robotic arm in Embodiment 1.
Further disclosed in a preferred embodiment of the present invention is a motion capture system of a robotic arm, including the following modules:
The state space expression of the system established for motion capture of a human body is as follows:
Xi=AXiβ1+Buiβ1+wiβ1ββ(1),
Zi=CXi+viββ(2), and
Zi,s=CsXi+vi,sββ(3),
The optimal distribution solving module is configured to perform the following steps:
m(Xi,Zi|fs)=m(Zi|Xi,fs)m(Xi|fs)ββ(4), and
m I ( X n , Z n ) = f β‘ ( X n , Z n ) = f β‘ ( Z n | X n ) β’ f β‘ ( X n ) β‘ β i = 1 n f β‘ ( Z i | X i ) β’ f β‘ ( X i | X i - 1 ) . ( 5 )
m β‘ ( Z n | X n , f s ) = β i = 1 n m β‘ ( z i | X i , Z i - 1 , f s ) , ( 6 ) m β‘ ( X n | f s ) = β i = 1 n m β‘ ( x i | X i - 1 , f s ) = β i = 1 n m β‘ ( x i | x i - 1 , f s ) , ( 7 )
m(Xn,Zn|fs)=fs(Zn)m(Xn|fs)ββ(8),
m o ( X n | f s ) = β - 1 n m o ( x , β | x - 1 , f s ) , and ( 9 ) m o ( x i | x i - 1 , f s ) β‘ f β‘ ( x i β x i - 1 ) β’ exp β’ { - β« f s ( z i | Z i - 1 , , s ) β’ ln β’ f s ( z i | Z i - 1 , s ) f β‘ ( z i | x i ) β’ dz i } β’ Ξ³ β‘ ( x i ) Ξ³ β‘ ( x i - 1 ) , ( 10 )
The motion capture module is configured to perform the following steps:
f(xi|xiβ1)β‘Nxi(Axiβ1,Q)ββ(11),
f(zi|xi)β‘Nzi(Cxi,R)ββ(12),
f(zi,s|xi)β‘Nzi,s(Csxi,R)ββ(13), and
f(xi|xiβ1)β‘Nxi(ΞΌi|i,Ξ£i|i)ββ(14),
f(xi|ziβ1)β‘Nxi(ΞΌi|iβ1,Ξ£i|iβ1)ββ(15),
f(zi|ziβ1)β‘Nzi(zi|iβ1,Ri|iβ1)ββ(16), and
f(zi,s|ziβ1,s)β‘Nzi,s(zi|iβ1,s,Ri|iβ1,s)ββ(17);
Ξ³ β‘ ( x i - 1 ) β‘ β« f β‘ ( x i | x i - 1 ) Γ exp β’ { - β« f s ( z i | Z i - 1 , , s ) β’ ln β’ f s ( z i | Z i - 1 , s ) f β‘ ( z i | x i ) β’ dz i } β’ Ξ³ β‘ ( x i ) β’ dx i ,
Ξ³(xiβ1)βexp[βΒ½(xiβ1TSiβ1|ixiβ1β2xiβ1Triβ1|i)]ββ(18), and
L β‘ S i β i β’ Q 1 2 ( Q T 2 β’ S i β i β’ Q 1 2 + I n x ) - 1 β’ Q T 2
is:
riβ1|i=AT(InxβL)ri|i,ββ(19), and
Siβ1|i=AT(InxβL)Si|iAββ(20), and
mo=(mi|xiβ1,fs)=Nxi(ΞΌio,Ξ£io)ββ(21),
ΞΌ i o = ( I n x - Ξ£ i o β’ S i β i ) β’ A β’ x i - 1 + Ξ£ i o β’ r i | i , Ξ£ i o = Q 1 2 ( Q T 2 β’ S i | i β’ Q 1 2 + I n x ) - 1 β’ Q T 2 ,
and an optimal state a priori predicted distribution after transfer of knowledge from the source domain to the target domain for fusion may be obtained according to Formula (14) of the state prior distribution of the KF:
ΞΌi|iβ1=(InxβΞ£ioSi|i)AΞΌiβ1|iβ1+Ξ£iori|iββ(22), and
Ξ£i|iβ1=(InxβΞ£ioSi|i)AΞ£iβ1|iβ1AT(InxβΞ£ioSi|i)T+Ξ£ioββ(23).
The motion capture system of robotic arm in the embodiments of the present invention is configured to implement the foregoing robotic arm motion capture method. Therefore, for specific implementations of the system, reference may be made to the embodiment part of the robotic arm motion capture method above. Therefore, for specific implementations of the system, reference may be made to the description of corresponding embodiment parts. Details are not described again herein.
In addition, the robotic arm motion capture system in this embodiment is configured to implement the foregoing robotic arm motion capture method. Therefore, the functions of the system correspond to the functions of the foregoing method. Details are not described again herein.
The foregoing embodiments are merely preferred embodiments used to fully describe the present invention, and the protection scope of the present invention is not limited thereto. Equivalent replacements or variations made by a person skilled in the art to the present invention all fall within the protection scope of the present invention. The protection scope of the present invention is as defined in the claims.
1. A motion capture method of a robotic arm, comprising steps of:
S1. fastening a visual sensor on a robotic arm to acquire data as a source domain, fastening an inertial sensor on a corresponding human arm to acquire data as a target domain, and establishing a state space expression of a system;
S2. setting an optimal unknown state observed joint distribution based on the state space expression and by using a total probability theory and using an observed prediction distribution of the source domain as a condition, decomposing a conditional joint observed distribution model, and solving an optimal distribution by using KL divergence; and
S3. transferring knowledge of the source domain measured by the visual sensor into the target domain measured by the inertial sensor based on a Kalman filter (KF) and the total probability theory, performing data fusion based on Kalman filtering, and predicting a state of the system at a next moment to implement motion capture of the robotic arm.
2. The motion capture method according to claim 1, wherein in step 1, the state space expression of the system established for motion capture of a human body is:
Xi=AXiβ1+Buiβ1+wiβ1ββ(1),
Zi=CXi+viββ(2), and
Zi,s=CsXi+vi,sββ(3),
wherein a motion angle ΞΈi and an angular velocity {dot over (ΞΈ)}i in a process of arm motion form a state matrix Xi of the system, Zi is a measurement matrix of the target domain measured by the inertial sensor of the system, and Zi,s is a measurement matrix of the source domain of the system; A, B, C, and Cs are determined by parameter requirements of the system, and uiβ1 is a control input of the system; and wiβ1 is process noise of the system, and vi and vi,s are measurement noise, both assumed to be Gaussian white noise with an average value of zero.
3. The motion capture method according to claim 2, wherein step S2 comprises:
S21. setting the optimal unknown state observed joint distribution by using the total probability theory and using the observed prediction distribution fs of the source domain as the condition as follows:
m(Xi,Zi|fs)=m(Zi|Xi,fs)m(Xi|fs)ββ(4), and
wherein it is defined that an ideal state observed distribution model is:
m I ( X n , Z n ) = f β‘ ( X n , Z n ) = f β‘ ( Z n | X n ) β’ f β‘ ( X n ) β‘ β i = 1 n f β‘ ( Z i | X i ) β’ f β‘ ( X i | X i - 1 ) . ( 5 )
S22. decomposing the conditional joint observed distribution model into:
m β‘ ( Z n | X n , f s ) = β i = 1 n m β‘ ( z i | X i , Z i - 1 , f s ) , ( 6 ) m β‘ ( X n | f s ) = β i = 1 n m β‘ ( x i | X i - 1 , f s ) = β i = 1 n m β‘ ( x i | x i - 1 , f s ) , ( 7 )
wherein it is assumed that m(zi|Xi,Ziβ1, fs)β‘fs(zi,s|Ziβ1,s)|zi,s=zi, and under a specified source domain knowledge condition constraint, the unknown state observed joint distribution in Formula (4) is changed into:
m(Xn,Zn|fs)=fs(Zn)m(Xn|fs)ββ(8),
wherein fs(Z n) is knowledge information that needs to be transferred from the source domain into the target domain, and under a condition that fs(Zn) is fixed and known, m(Xn|fs) becomes the only part that allows design optimization; and
S23. solving the optimal distribution by using KL divergence as:
m o ( X n | f s ) = β - 1 n m o ( x , β | x - 1 , f s ) , and ( 9 ) m o ( x i | x i - 1 , f s ) β‘ f β‘ ( x i β x i - 1 ) β’ exp β’ { - β« f s ( z i | Z i - 1 , , s ) β’ ln β’ f s ( z i | Z i - 1 , s ) f β‘ ( z i | x i ) β’ dz i } β’ Ξ³ β‘ ( x i ) Ξ³ β‘ ( x i - 1 ) , ( 10 )
wherein Ξ³(xiβ1) is a normalization function.
4. The motion capture method according to claim 3, wherein step S3 comprises:
S31. assuming that the arm motion in the motion capture of the robotic arm meets a linear Gaussian case as follows:
f(xi|xiβ1)β‘Nxi(Axiβ1,Q)ββ(11),
f(zi|xi)β‘Nzi(Cxi,R)ββ(12),
f(zi,s|xi)β‘Nzi,s(Csxi,R)ββ(13), and
performing explicit recursive calculation according to Kalman filtering to obtain a corresponding state prior distribution, a corresponding state posterior distribution, and observed prior distribution, as follows:
f(xi|xiβ1)β‘Nxi(ΞΌi|i,Ξ£i|i)ββ(14),
f(xi|ziβ1)β‘Nxi(ΞΌi|iβ1,Ξ£i|iβ1)ββ(15),
f(zi|ziβ1)β‘Nzi(zi|iβ1,Ri|iβ1)ββ(16), and
f(zi,s|ziβ1,s)β‘Nzi,s(zi|iβ1,s,Ri|iβ1,s)ββ(17);
S32. according to the total probability theory, under a condition that fs(Zn) is fixed and known, mo(Xn|fs) is a target domain state prediction optimal distribution, wherein Ξ³(xiβ1) is a normalization function, that is:
Ξ³ β‘ ( x i - 1 ) β‘ β« f β‘ ( x i | x i - 1 ) Γ exp β’ { - β« f s ( z i | Z i - 1 , , s ) β’ ln β’ f s ( z i | Z i - 1 , s ) f β‘ ( z i | x i ) β’ dz i } β’ Ξ³ β‘ ( x i ) β’ dx i ,
deriving the normalization function according to the observed prediction distribution:
Ξ³(xiβ1)βexp[βΒ½(xiβ1TSiβ1|ixiβ1β2xiβ1Triβ1|i)]ββ(18), and
wherein a parameter recursion form of
L β‘ S i β i β’ Q 1 2 ( Q T 2 β’ S i β i β’ Q 1 2 + I n x ) - 1 β’ Q T 2
is:
riβ1|i=AT(InxβL)ri|i,ββ(19), and
Siβ1|i=AT(InxβL)Si|iAββ(20), and
it is satisfied that when i=nβ1, . . . , 2, ri|i=ri|i+1+CT Rβ1zi|iβ,s, Si|i=Si|i+1+CT Rβ1C, when i=n, rn|n=CT Rβ1zn|nβ1,s,Sn|n=CT Rβ1C, recursion of the normalization function represents dynamic transfer of observed information at n moments of the source domain, and the target domain obtains an optimal distribution under this condition; and
S33. obtaining state prediction optimal distribution of the target domain according to a recursion representation form of the normalization function obtained in step S31:
mo=(mi|xiβ1,fs)=Nxi(ΞΌio,Ξ£io)ββ(21),
wherein
ΞΌ i o = ( I n x - Ξ£ i o β’ S i β i ) β’ A β’ x i - 1 + Ξ£ i o β’ r i | i , Ξ£ i o = Q 1 2 ( Q T 2 β’ S i | i β’ Q 1 2 + I n x ) - 1 β’ Q T 2 ,
and an optimal state a priori predicted distribution after transfer of knowledge from the source domain to the target domain for fusion may be obtained according to Formula (14) of the state prior distribution of the KF:
ΞΌi|iβ1=(InxβΞ£ioSi|i)AΞΌiβ1|iβ1+Ξ£iori|iββ(22), and
Ξ£i|iβ1=(InxβΞ£ioSi|i)AΞ£iβ1|iβ1AT(InxβΞ£ioSi|i)T+Ξ£ioββ(23).
5. A computer-readable storage medium, comprising a stored program, wherein the program performs the robotic arm motion capture method according to claim 1.
6. An electronic device, comprising: one or more processors, a memory, and one or more programs, wherein the one or more programs are stored in the memory and are configured to be performed by the one or more processors, and the one or more programs are used for performing the robotic arm motion capture method according to claim 1.
7. (canceled)
8. (canceled)
9. (canceled)
10. (canceled)