-
2006-02-28
10/093,464
2002-03-07
US 7,007,269 B2
2006-02-28
-
-
Anil Khatri
2024-03-21
A data structure for exchanging profiling data in the form of trace elements between collection agents and analysis agents in a profiling tool, wherein the data structure comprises elements for identifying fragments, and attributes defining a logical hierarchy between the trace elements. A system is also set forth for providing access to profiling data of an application over an network, comprising a profiling interface for transmitting event data from the application, a collection agent for receiving and encoding the event data in trace element fragments each containing at least one attribute, and in response transmitting the fragments over the network, and at least one analysis client for receiving and collating the fragments in accordance with the at least one attribute.
Get notified when new applications in this technology area are published.
G06F9/44 IPC
Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs Arrangements for executing specific programs
This invention relates generally to profiling tools for analyzing the execution path or characteristics of a software application, and more particularly to a method and apparatus for providing open access to application profiling data across a network.
Profiling tools are used to evaluate applications for a variety of purposes including, for example, “trouble shooting”, “application tuning”, and “resource utilization optimization”. It is known in the prior art to use profiling tools that analyze a software execution path in a specific execution environment, language or operating system. However, where the application consists of more than one process or more than one language (e.g. a simple client-server networked application where the client is written in Java and the server has C stored procedures in an Relational Database Management System (RDBMS)), the client profiling tool traces only the single client activity, and is not capable of tracing the execution path across the network.
To further compound these problems, prior art profiling tools incorporate vendor specific or proprietary data collection engines and data exchange formats. Consequently, it is difficult to correlate execution traces detected by different vendor profiling tools in a distributed system incorporating diverse execution environments, languages and/or operating systems. Each profiling tool requires a specific or proprietary collection agent for each execution environment.
In the prior art model of application profiling, each application is assumed to be written in one programming language and executed on one computing machine in one execution environment. The profiling traces are composed of trace events, which occur sequentially, and which are stored in large files for later transport to profiling tools. The transmission of such large files can be a significant stress on a communication system. In addition, real time or near real time profiling of application is very difficult in such a system.
The advent of computer networking, and more recently the Internet, has, however, changed and expanded what constitutes an application. Over the Internet, an application may have a number of parts, written in a number of different programming languages, distributed on a number of different computer machines, and operated in a number different execution environments (herein referred to as “distributed environment”). The machines, languages, and environments of such an application are typically from a number of different vendors.
It is exceedingly difficult to develop a profiling tool that is able to interface with all of the products that may be available from various vendors in this environment. This is assuming that the vendors are prepared to disclose their application programming interfaces (APIs) to profiling tool vendors who may be competitors. Further, any change to the APIs may cause operability problems to the profiling tools. There are significant problems with developing and maintaining profiling tools, which are able to interface with products of a number of vendors (distributed access).
Additionally, prior art profiling tools obtain and store trace events in sequential form. In a distributed environment, trace events are not sequential especially as between different machines. There is no capability to correlate trace events between different machines or processes.
Recently, the Java programming language has been expanded to define a Java Virtual Machine Profiling Tool (JVMPI) for profiling hooks in a Java virtual machine. JVMPI is a two-way function call interface between the Java virtual machine and an in-process profiler or collection agent. On the one hand, the virtual machine notifies the collection agent of various events, corresponding to, for example, heap allocation, method calls, etc. The collection agent then communicates with a front-end analysis client, which may or may not run in the same process as the collection agent. On the other hand, the collection agent issues control requests to the Java virtual machine through the JVMPI. For example, the collection agent can turn on/off a specific event notification, based on the needs of the analysis client. The client may reside in a different process on the same machine, or on a remote machine connected via a network using vendor specific wire protocol suitable to the analysis client.
Although profiling tools based on JVMPI are useful for obtaining a variety of information performance data, the prior art problems relating to lack of distributed access and correlation capabilities remain.
A portion of the disclosure of this patent document contains material, which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
The essential problem with the prior art application profiling model is the lack of a consistent data exchange format between collection engines and analysis portions of profiling tools of different vendors. For each execution environment, the analysis portions remain essential unchanged, but each of the vendors has to modify their collection engines accordingly. The inventors have discovered that an open data exchange format may be adapted for application profiling data that allows a single collection agent in an execution environment to provide the profiling data to any number of profiling tools from different vendors. Thus, vendors are able to focus on the development of the analysis portions of their profiling tools and allow other parties, such as vendors of the execution environments, to provide the collection agents.
Conversely, the open data exchange format also allows a profiling tool to obtain analysis data from more than one collection agent using only one standard interface. Thus, a profiling tool is able to profile an application executing on more than one machine and execution environment where the data exchange format provides attributes to correlate the trace events.
The open data exchange format thus has a number of advantages including allowing application vendors to retain confidentiality and revision options for their internal APIs without affecting the profiling tools; and reducing development and maintenance costs for profiling tool vendors.
According to the present invention, an open data structure is provided for exchanging profiling data in the form of trace elements between collection agents and analysis agents in a profiling tool. The data structure accommodates transmission of the profiling data in fragments across a network for storage in a database repository or receipt by one or more analysis agents. Each of the fragments comprises one or more trace elements defining trace events. The data structure of trace elements includes attributes, which define a logical hierarchy between the trace elements. More particularly, correlation attributes are provided with each trace element to enable the analysis clients to correlate the trace elements. Each of the trace elements defines a relationship to each of the other trace elements in a trace, and is not just a sequence of collated trace elements. Thus, for example, a profiling tool is able to correlate a method call to an object.
The data structure according to the present invention facilitates distributed access to profiling data across a network. Also, only a single collection agent is required per execution environment, thereby overcoming the prior art requirement for multiple vendor-specific proprietary collection agents.
According to the preferred embodiment, the data structure for the profiling data conforms to XML (Extensible Markup Language). Thus, the load on the network to transport the profiling data is more evenly distributed as the data is sent in XML fragments instead of large files.
A detailed description of the preferred embodiment is set forth herein below, with reference to the accompanying drawing, in which:
FIG. 1 is a block diagram of a profiling tool incorporating the stat structure according to the present invention;
FIG. 2 is a logical hierarchy diagram of a traceRoot element according to the preferred embodiment;
FIG. 3 is a logical hierarchy diagram of trace behavior elements according to the preferred embodiment;
FIG. 4 is a logical hierarchy diagram of thread elements according to the preferred embodiment;
FIG. 5 is a logical hierarchy diagram of class elements according to the preferred embodiment;
FIG. 6 is a logical hierarchy diagram of object elements according to the preferred embodiment;
FIG. 7 is a logical hierarchy diagram of method elements according to the preferred embodiment;
FIG. 8 is a logical hierarchy diagram of line elements according to the preferred embodiment;
FIG. 9 is a logical hierarchy diagram of memory management elements according to the preferred embodiment;
FIG. 10 is a logical hierarchy diagram of exception elements according to the preferred embodiment;
FIG. 11 is a logical hierarchy diagram of runtime elements according to the preferred embodiment; and
FIG. 12 is a sample showing parts of a trace according to the preferred embodiment.
Referring to FIG. 1, a profiling tool is shown comprising one or more Java Virtual Machines (JVM) 100, associated collection agents 102, and analysis clients 104. Each JVM 100 incorporates a Java Virtual Machine Profiling Interface (JVMPI) 120 for communicating with an associated collection agent 102. The collection agents 102 request and receive event or trace data from the JVMs 100 over the JVMPIs 120 in accordance with standard JVMPI protocol, as discussed in greater detail below. The collection agents 102 then encode and transmit the trace data in the form of trace elements, over a network (e.g. Internet 200), to at least one analysis client 104. The trace elements received by the analysis clients 104 are correlated using identification or correlation attribute definitions and references which define logical relationships between the elements, the details of which are set forth below.
The collection agents 102 are stored on computer readable mediums 150 such as, for example, diskettes, zip drives, CDs, and hard drives.
As discussed above, the preferred embodiment is described herein with reference to a Java application environment, with XML being used for implementation of the data structure. However, it is contemplated that the principles of the invention may be extended to other application environments and the data structures may be implemented using other languages, such as other mark-up languages (SGML, etc.). It is further contemplated that principles of the invention may be extended to programming languages other than object orientated languages such as Java.
The collection agent 102 uses the following command line syntax to attach itself to the JVM 100:
java -XrunpiAgent [:<option=value[,option=value]*]
[-Xbootclasspath:<location of picb.jar>,<location of rt.jar>] <Java class>
The JVM 100 attempts to locate the profiler agent library called ipAgent in the Java library directory. Once located, the JVM 100 loads the profiler agent library and hooks for the entry point in a well-known manner. The options supported in the first argument to the invocation command do not form part of the present invention. The collection agent 102 obtains a function call interface by issuing a GetEnv call on the JavaVM pointer in a well known manner. The GetEnv function returns a pointer to a JVMPI_Interface. The JVMPI_Interface returned by GetEnv has all the functions set up except for NotifyEvent. The collection agent 102 must set up the NotifyEvent function pointer.
The JVM 100 sends an event, through the JVMPI 120, to the collection agent 102 by calling NotifyEvent with a JVMPI_Interface data structure as the argument. The JVMPI_Event structure contains the event type, the pointer of the current thread, and other event-specific information, all of which is well known according to the JVMPI specification.
The event-specific information of a trace event is received by the collection agent 102 and is then converted to a data structure in the format of trace elements, which, as discussed above, define or represent trace events of object oriented programming languages.
The trace elements of a trace are emitted to Analysis Clients 104 as XML fragments, where each XML fragment comprises one or more trace elements. Alternatively, the trace elements may be collected and emitted to the Analysis Clients 104 as XML document. The trace elements are formatted as text in compliance with the XML standard.
According to the present embodiment of the invention, the trace elements represent logic components of object oriented programming languages. The logic components include, for example, threads, methods, objects, and classes. Trace elements further include elements to provide context information about a trace as a whole such as, for example, a node trace element to identify the JVM 100 that a trace event is from.
Each of the trace elements contains at least one correlation attribute. A number of different correlation attributes are defined according to this invention. The correlation attributes include other trace elements, reference numbers, time information, and counter information. The primary correlation attribute for associating a trace element to a parent element is by referring to the parent trace element and parent reference number. In particular, each parent trace element or identifier (Id) has a reference number attribute. All parent events are represented by trace elements (or Ids) and are further assigned unique identifying reference numbers. All related or associated trace elements reference parent trace elements (or Ids) and their reference numbers in a nested logical hierarchy. Trace elements in combination with reference numbers uniquely identify instances of certain components of the application.
According to the present embodiment of the invention, the trace elements for defining tracing events of object oriented programming languages are listed in Table 1 in the Id (identifier) column. Each Id has a defining element and an undefining element as shown in Table 1. A defining element provides the information related to an Id. For example, the defining element for a thread Id (see below) contains, among other entries, the name of the thread.
An Id is valid until its undefining element arrives. Where a defining element is enabled, the defining element is sent before the Id appears in other elements. An undefining element invalidates the Id. There are two kinds of Ids; transient Ids and non transient Ids. Both have the same logical lifetime as prescribed by their defining and undefining elements. However, transient Id values may be reused after they have been undefined, and in fact may be reused as a different type of Id. The value of a transientThreadId, for example, may be redefined as a transientClassId after a thread ends. Non transient Ids are unique within the scope of a trace.
| TABLE 1 | ||
| Id | Defining element | Undefining element |
| ThreadId | ThreadStart | threadEnd |
| TransientThreadId | ThreadStart | threadEnd |
| ObjId | ObjAlloc | objFree |
| TransientObjId | objAlloc, objMove, objDef | objFree, objMove |
| ClassId | ClassDef | classUnload |
| TransientClassId | ClassDef | classUnload |
| MethodId | MethodDef | defining classUnload |
| TransientMethodId | MethodDef | defining classUnload |
| LineTableId | LineTable | not applicable |
| NodeId | Node | not applicable |
| ProcessId | ProcessCreate | not available |
| AgentId | AgentCreate | agentDestroy |
| TraceId | TraceStart | traceEnd |
Identifiers (Ids) and identifier references (IdRef) define a logical hierarchy between the various trace elements. Thus, as discussed in greater detail below with reference to FIGS. 2–11, threads, classes, methods, and objects each have unique Ids which are referenced by other elements in accordance with defined relationships of the Id/IdRef attributes. Specifically, with reference to FIG. 2, a <node nodeId> contains a <processCreate processId, nodeIdRef> which contains an <agentCreate agentId, processIdRef> which contains a <traceStart traceId, agentIdRef> which, in turn, contains most other elements by means of the traceIdRef. The forgoing nested element hierarchy is established by element referring back to its parent through a specific IdRef attribute.
Many of the trace elements share the same attributes. The following attributes appear in more than one element:
| TABLE 2 | ||
| VALUE | MEANING | |
| “0” | normal object (i.e., not an array) | |
| “2” | array of objects | |
| “4” | array of Booleans | |
| “5” | array of chars | |
| “6” | array of floats | |
| “7” | array of doubles | |
| “8” | array of bytes | |
| “9” | array of shorts | |
| “10” | array of ints | |
| “11” | array of longs | |
In accordance with the best mode of the invention at the time of filing this application, descriptions will now be provided of the various trace elements formatted in compliance with the XML standard:
| traceRoot |
| <!ELEMENT traceRoot (node | processCreate | agentCreate | |
| agentDestroy | (methodCall | methodReturn | objDef | value | classDef | |
| methodDef | tag | code)* | ( traceStart | traceEnd | processSuspend | |
| processResume | option | filter | classDef | classUnload | methodDef | |
| methodEntry | methodExit | methodCall | value | methodReturn | line | |
| objAlloc | gcAlloc | objFree | objMove | gcFinish | threadStart | |
| threadEnd | runtimeInitDone | runtimeShutdown | throw | catch | |
| methodCount)*)> |
As indicated above, traceRoot is the root element for a valid trace XML document as shown in FIG. 2.
| objRef | |
| <!ELEMENT objRef EMPTY> | |
| <!ATTLIST objRef | |
| name CDATA #IMPLIED | |
A string containing the name of the object reference.
| transientObjIdRef CDATA #IMPLIED | |
| objIdRef CDATA #IMPLIED | |
| > | |
This element is used to associate a name with an transientObjId.
| code | |
| <!ELEMENT code EMPTY> | |
| <!ATTLIST code | |
| type CDATA #IMPLIED | |
The code element is used to insert textual lines of code into the XML file. Thus, in the event that lines of code are to be preserved when the XML file is parsed, such lines are inserted into the code element with the result that the parser extracts the lines of code exactly as they appear in the element.
| tag | |
| <!ELEMENT tag EMPTY> | |
| <!ATTLIST tag | |
| name CDATA #IMPLIED | |
The name of the tag.
| value CDATA #IMPLIED | |
| The value of the tag. | |
| > | |
The tag element contains the information that is associated with a specific tag, expressed as name value pairs.
Trace Behaviour
| node | |
| <! ELEMENT node EMPTY> | |
| <!ATTLIST node | |
| nodeId CDATA #REQUIRED | |
| a UUID to uniquely refer to this node. | |
| hostname CDATA #REQUIRED | |
The name of the machine on which the instrumented JVM is running.
| > | |
| processCreate | |
| <!ELEMENT processCreate EMPTY> | |
| <!ATTLIST processCreate | |
| processId CDATA #REQUIRED | |
| a UUID that uniquely identifies this process. | |
| name CDATA #IMPLIED | |
| the name of the process | |
| pid CDATA #REQUIRED | |
| the platform-specific process identification | |
| nodeIdRef CDATA #REQUIRED | |
| a UUID that references the node element that represents the | |
| host machine. | |
| time CDATA #REQUIRED | |
| > |
| agentCreate |
| <!ELEMENT agentCreate EMPTY> |
| <!ATTLIST agentCreate |
| agentId CDATA #REQUIRED |
| a UUID that uniquely identifies this instance of the agent. |
| processIdRef CDATA #REQUIRED |
| a UUID that references the processCreate element that represents |
| the process. |
| agentName CDATA #REQUIRED |
| the name of the agent. |
| time CDATA #REQUIRED |
| agentParameters CDATA #IMPLIED |
The option string that was passed to the agent on the -XrunpiAgent parameter.
| version CDATA #IMPLIED | |
| a string representing the version of the agent | |
| > | |
| agentDestroy | |
| <!ELEMENT agentDestroy EMPTY> | |
| <!ATTLIST agentDestroy | |
| agentIdRef CDATA #REQUIRED | |
| a UUID that references the matching agentCreate element. | |
| time CDATA #REQUIRED | |
| > | |
| traceStart | |
| <!ELEMENT traceStart EMPTY> | |
| <!ATTLIST traceStart | |
| traceId CDATA #REQUIRED | |
| agentIdRef CDATA #REQUIRED | |
| time CDATA #REQUIRED | |
| collationValue CDATA #IMPLIED | |
| > | |
| traceEnd | |
| <!ELEMENT traceEnd EMPTY> | |
| <!ATTLIST traceEnd | |
| traceIdRef CDATA #REQUIRED | |
| time CDATA #REQUIRED | |
| collationValue CDATA #IMPLIED | |
| > | |
| processSuspend | |
| <!ELEMENT processSuspend EMPTY> | |
| <!ATTLIST processSuspend | |
| time CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
This element indicates that the execution of the process has been suspended.
| processResume | |
| <!ELEMENT processResume EMPTY> | |
| <!ATTLIST processResume | |
| time CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
The foregoing element indicates that the execution of the process has resumed.
| option | |
| <!ELEMENT option EMPTY> | |
| <!ATTLIST option | |
| key CDATA #IMPLIED | |
| A string that represents the name of the option. | |
| value CDATA #IMPLIED | |
| A string that represents the value of the option. | |
| traceIdRef CDATA #IMPLIED | |
| > | |
A collection agent 102 may be assigned any number of agent-specific key/value pair options. In addition to the agent performing the specific action as specified by the option, the agent echoes the option through the option element.
| filter | |
| <!ELEMENT filter EMPTY> | |
| <!ATTLIST filter | |
| pattern CDATA #REQUIRED | |
| A prefix or suffix string which is compared against the | |
| class name. (For example, java*”, “sun.com.*”,“*print”.) | |
| genericPattern CDATA #REQUIRED | |
| none |
| The pattern requires an exact match. |
| prefix |
| The pattern has a prefix wildcard and matches anything ending |
| with the pattern. |
| suffix |
| The pattern has a suffix wildcard and matches anything |
| starting with the pattern. |
| mode CDATA #REQUIRED |
| Either include or exclude. |
| traceIdRef CDATA #IMPLIED |
| > |
The class filter is set by a client 104. Filters control which classes are traced using the profiling tool of the present invention. Filters are provided in blocks where a new block of filters completely replaces the current filter definitions. Matching is done by querying the active block of filters until the first match is found based on the supplied pattern. The class is then either included or excluded from the trace based on the mode. If the entire block of filters is traversed and no match is found, then the class is included in the trace. It should be noted that some classes are included in the trace even if they have been filtered out. This occurs if a reference is made to the class from a class that has not been filtered out. For example, the methodCall element can cause this to occur.
Threads
The elements which provide information about threads are set forth in FIG. 4. Other elements point to a THREAD element's transientThreadId to identify the thread they are running in.
| threadStart | |
| <!ELEMENT threadStart EMPTY> | |
| <!ATTLIST threadStart | |
| transientThreadId CDATA #IMPLIED | |
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
| threadEnd | |
| <!ELEMENT threadEnd EMPTY> | |
| <!ATTLIST threadEnd | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
This element indicates the end of the thread identified by the transientThreadIdRef or threadIdRef value.
Classes
The hierarchical relationship of the class elements is set forth in FIG. 5. Definitions of these elements are as follows:
| classDef | |
| <!ELEMENT classDef EMPTY> | |
| <!ATTLIST classDef | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| transientClassId CDATA #IMPLIED | |
| classId CDATA #IMPLIED | |
| sourceName CDATA #IMPLIED | |
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
| methodDef | |
| <!ELEMENT methodDef return? parameter* exception*> | |
| <!ATTLIST methodDef | |
| name CDATA #IMPLIED | |
| TABLE 3 | |
| sigNotation | |
| value | COMMENTS |
| JNI | Currently only JNI signature notation is supported. If this |
| attribute is not present, the value JNI is assumed. JNI | |
| signature notation is defined in the Java Native Interface | |
| Specification | |
| XML | The signature is specified by child elements of this |
| methodDef element. In this case, the signature attribute | |
| may be an empty string. | |
| transientClassIdRef CDATA #IMPLIED | |
| classIdRef CDATA #IMPLIED | |
| methodId CDATA #IMPLIED | |
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
Although technically part of the classDef event, the methodDef element is broken out into a separate element so that it can be optionally output only when referenced.
| Return | |
| <!ELEMENT return EMPTY> | |
| <!ATTLIST return | |
| transientClassIdRef CDATA #IMPLIED | |
| classIdRef CDATA #IMPLIED | |
| > | |
| parameter | |
| <!ELEMENT parameter EMPTY> | |
| <!ATTLIST parameter | |
| name CDATA #IMPLIED |
| A string containing the name of the parameter. |
| transientClassIdRef CDATA #IMPLIED | |
| classIdRef CDATA #IMPLIED | |
| > | |
| exception | |
| <!ELEMENT exception EMPTY> | |
| <!ATTLIST exception | |
| transientClassIdRef CDATA #IMPLIED | |
| classIdRef CDATA #IMPLIED | |
| > | |
| Objects | |
The hierarchical relationship of the object elements is set forth in FIG. 6.
| objAlloc | |
| <!ELEMENT objAlloc EMPTY> | |
| <!ATTLIST objAlloc | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| isArray CDATA #IMPLIED | |
| transientObjId CDATA #IMPLIED | |
| objId CDATA #IMPLIED | |
| size CDATA #IMPLIED | |
Size in number of bytes expressed as an unsigned 32-bit value in string format.
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
This element traces storage allocation. It can be used in conjunction with other elements such as the methodEntry element to identify an associated object.
Methods and Values
The methodEntry and methodExit elements are output when a method is entered, and when the method returns respectively. The methodCall and methodReturn elements are output when a method is about to be called, and after a method returns. It is not necessary for an agent to produce both methodEntry/Exit and methodCall/Return for a each method invocation. The semantics associated with the generation of these elements is agent specific. The hierarchical arrangement of these elements is set forth in FIG. 7.
| methodEntry | |
| <!ELEMENT methodEntry (value*)> | |
| <!ATTLIST methodEntry | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| methodIdRef CDATA #IMPLIED | |
| transientMethodIdRef CDATA #IMPLIED | |
| ticket CDATA #IMPLIED | |
| transientObjIdRef CDATA #IMPLIED | |
| transientClassIdRef CDATA #IMPLIED | |
| classIdRef CDATA #IMPLIED | |
| threadCpuTime CDATA #IMPLIED | |
| sequenceCounter CDATA #IMPLIED | |
| contextData CDATA #IMPLIED | |
| stackDepth CDATA #IMPLIED | |
The current depth of the stack including the current method being executed.
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
| methodExit | |
| <!ELEMENT methodExit (value*)> | |
| <!ATTLIST methodExit | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| ticket CDATA #IMPLIED | |
| threadCpuTime CDATA #IMPLIED | |
| methodIdRef CDATA #IMPLIED | |
| transientMethodIdRef CDATA #IMPLIED | |
| transientObjIdRef CDATA #IMPLIED | |
| transientClassIdRef CDATA #IMPLIED | |
| classIdRef CDATA #IMPLIED | |
| sequenceCounter CDATA #IMPLIED | |
| contextData CDATA #IMPLIED | |
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
| Note: | |
| the optional nested value elements represent parameter values and return value. |
| methodCall | |
| <!ELEMENT methodCall (value*)> | |
| <!ATTLIST methodCall | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| methodIdRef CDATA #IMPLIED | |
| transientMethodIdRef CDATA #IMPLIED | |
| ticket CDATA #IMPLIED | |
| transientObjIdRef CDATA #IMPLIED | |
| transientClassIdRef CDATA #IMPLIED | |
| classIdRef CDATA #IMPLIED | |
| threadCpuTime CDATA #IMPLIED | |
| sequenceCounter CDATA #IMPLIED | |
| contextData CDATA #IMPLIED | |
| stackDepth CDATA #IMPLIED | |
The current depth of the stack including the current method being executed.
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
| Note: | |
| the optional nested value elements represent parameter values. |
| methodReturn | |
| <!ELEMENT methodReturn (value*)> | |
| <!ATTLIST methodReturn | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| ticket CDATA #IMPLIED | |
| threadCpuTime CDATA #IMPLIED | |
| methodIdRef CDATA #IMPLIED | |
| transientMethodIdRef CDATA #IMPLIED | |
| transientObjIdRef CDATA #IMPLIED | |
| transientClassIdRef CDATA #IMPLIED | |
| classIdRef CDATA #IMPLIED | |
| sequenceCounter CDATA #IMPLIED | |
| contextData CDATA #IMPLIED | |
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
| Note: | |
| the optional nested value elements represent parameter values and return value. |
| objDef | |
| <!ELEMENT objDef EMPTY> | |
| <!ATTLIST objDef | |
| objId CDATA #REQUIRED | |
| isArray CDATA #IMPLIED | |
| size CDATA #IMPLIED | |
Size in number of bytes expressed as an unsigned 32-bit value in string format.
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| < | |
The objDef element holds identity information for an object, which can be referred to by elements associated with the object, such as the value element.
| value | |
| <!ELEMENT value pcdata> | |
| <!ATTLIST value | |
| parameterName CDATA #IMPLIED | |
An optional parameter name associated with the value.
An optional reference name associated with the value. For example, a variable name.
| TABLE 4 | |
| Type | Value |
| “serializedObjValueRef” | the value attribute is a reference (format |
| to be defined) to a serialized object located | |
| elsewhere | |
| “serializedObjValue” | the value attribute is a serialized object. |
| “serializedObjValueInline” | the serialized object is located in the pcdata. |
| The value attribute is not used when this type | |
| is specified. | |
| “boolean” | the value attribute is a string representation |
| of a boolean value | |
| “char” | the value attribute is a string representation |
| of a char value | |
| “int” | the value attribute is a string representation |
| of an int value | |
| “long” | the value attribute is a string representation |
| of a long value | |
| “float” | the value attribute is a string representation |
| of a float value | |
| “double” | the value attribute is a string representation |
| of a double value | |
| “string” | the value attribute is a string |
| “opaque” | the value is unknown. The value attribute is |
| not used when this type is specified. | |
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
The value element is used to reference a data value, either for parameter values in a methodCall, or for the return value of a methodReturn.
| methodCount | |
| <!ELEMENT methodCount EMPTY> | |
| <!ATTLIST methodCount | |
| count CDATA #IMPLIED | |
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
The methodCount element tracks the number of times a particular method has been invoked. This element is designed to aid in collecting code coverage information. A methodCount element is produced for every method for every class loaded by the application. The count includes all method invocations regardless of whether tracing has been enabled or not. In other words, the data is collected even before a traceStart and even after a traceEnd. The elements are produced just prior to the agentDestroy element being produced. In addition, an agent may be requested to produce the methodCount elements at any arbitrary time before the agentDestroy element is produced.
Lines
These elements are designed to capture line coverage type information. The hierarchical arrangement of these elements is set forth in FIG. 8.
| lineTable | |
| <!ELEMENT lineTable EMPTY> | |
| <!ATTLIST lineTable | |
| lineTableId CDATA #REQUIRED | |
| methodIdRef CDATA #REQUIRED | |
| sourceName CDATA #IMPLIED | |
| startLine CDATA #IMPLIED | |
| endLine CDATA #IMPLIED | |
| sourceCoordinateType CDATA #IMPLIED | |
| line | |
| <!ELEMENT line EMPTY> | |
| <!ATTLIST line | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| lineTableIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| blockIndex CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
This element is produced for each Coverage Block that is executed.
Memory Management
The gcStart, objFree, classUnload, objMove, and gcFinish elements (along with objAlloc) are for tracing storage management. The hierarchical arrangement of these elements is set forth in FIG. 9.
| GcStart | |
| <!ELEMENT gcStart EMPTY> | |
| <!ATTLIST gcStart | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
This element is produced when garbage collection (GC) is about to start.
| objFree | |
| <!ELEMENT objFree EMPTY> | |
| <!ATTLIST objFree | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| transientObjIdRef CDATA #IMPLIED | |
| objIdRef CDATA #IMPLIED | |
| sequenceCounter CDATA #IMPLIED | |
| contextData CDATA #IMPLIED | |
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
This element is produced when an object is freed.
| classUnload | |
| <!ELEMENT classUnload EMPTY> | |
| <!ATTLIST classUnload | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| transientClassIdRef CDATA #IMPLIED | |
| classIdRef CDATA #IMPLIED | |
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
| objMove | |
| <!ELEMENT objMove EMPTY> | |
| <!ATTLIST objMove | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| transientObjIdRef CDATA #IMPLIED | |
| objIdRef CDATA #IMPLIED | |
| newObjId CDATA #IMPLIED | |
The new object id. See transientObjId for more information
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
This element is produced when an object is moved in the heap, and is always sent between a pair of gcStart and gcFinish elements. When an object is moved, its transientObjId is changed. All future references to this object use the new transientObjId value. To avoid having to map and track these changes, it is much simpler to use the objId to refer to the object, as this id does not change. Note: Because objId, classId, objIdRef and classIdRef are provided, it is anticipated that objMove need not be traced by a client unless low level memory analysis is required.
| gcFinish | |
| <!ELEMENT gcFinish EMPTY> | |
| <!ATTLIST gcFinish | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| totalObjSpace CDATA #IMPLIED | |
Number of used objects on the heap expressed as a 32-bit unsigned value in string format.
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
The throw and catch elements encapsulate the exception behavior of an application. The hierarchical arrangement of these elements is set forth in FIG. 10.
| throw | |
| <!ELEMENT throw EMPTY> | |
| <!ATTLIST throw | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| objIdRef CDATA #IMPLIED | |
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
When an exception is thrown, a throw element is generated for each unfiltered stack frame that processes the exception. Processing the exception includes simply letting the exception percolate to the next stack frame. A throw element is also generated for the stack frame that catches the exception.
| catch | |
| <!ELEMENT catch EMPTY> | |
| <!ATTLIST catch | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| objIdRef CDATA #IMPLIED | |
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
A catch element is generated when the exception reaches an unfiltered stack frame that defines a catch clause for the exception.
Runtime
These elements track the initialization and shutdown of the program runtime. The hierarchical arrangement of these elements is set forth in FIG. 11.
| runtimeInitDone | |
| <!ELEMENT runtimeInitDone EMPTY> | |
| <!ATTLIST runtimeInitDone | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
The runtimeInitDone element is produced when the program runtime environment initialization is complete (e.g., for Java it is produced when the JVM initialization is done).
| runtimeShutdown | |
| <!ELEMENT runtimeShutdown EMPTY> | |
| <!ATTLIST runtimeShutdown | |
| transientThreadIdRef CDATA #IMPLIED | |
| threadIdRef CDATA #IMPLIED | |
| time CDATA #IMPLIED | |
| collationValue CDATA #IMPLIED | |
| traceIdRef CDATA #IMPLIED | |
| > | |
The runtimeShutdown element is produced when the program runtime environment terminates (e.g., for Java it is produced when the JVM being traced is shutting down).
Turning to FIG. 12, parts of a sample trace are shown. The illustrated sample is set forth solely for purposes of illustration and is not intended to limit the scope of the invention. Text denoted as 1200 is a header showing the context of the trace, which provided, for example, nodeId (node id reference) as “60646da0-3298-11d5-a73d-a0c48e000000”. Text denoted as 1202 shows the nested hierarchical structure of the trace elements. For example, <methodEntry threadIdRef=“2” time=“987446612.236409” methodIdRef=“2612” objIdRef=“2627” classIdRef=“2617” threadCpuTime=“0.2303312” stackDepth=“1”/> shows that a method 2612 called object 2627 of class 2617 at time 987446612.236409. Text denoted as 1204 shows the end of this trace.
Variations and modifications of the invention are contemplated. For example, the specific XML code fragments set forth herein are provided for the purpose of establishing a best mode of the invention. A person of ordinary skill in the art may conceive of other modes of expressing the functionality inherent in the disclosed code. Such a person may also conceive of further alternative embodiments of the invention, all of which are believed to fall within the sphere and scope of the invention as defined by the appended claims.
1. A system for providing access to profiling data of trace events of an application over a network, comprising:
a profiling interface for transmitting said trace events of said application; and
a collection agent for receiving and encoding said trace events into representative trace elements, each trace element containing at least one correlation attribute, and in response transmitting said trace elements over said network for at least one analysis client to receive and correlate said trace elements in accordance with said at least one correlation attribute.
2. The system of claim 1, wherein said at least one correlation attribute in each of said trace elements comprises an identifier (Id).
3. The system of claim 1, wherein said at least one correlation attribute in each of said trace elements for relating to a parent trace element comprises an identifier reference (IdRef) of said parent trace element.
4. The method of claim 1, wherein the application is distributed in the network such that a first component of the application executes on a first data processing device of the network and a second component of the application executes on a second data processing device in the network, and wherein the correlation attribute correlates trace elements generated by the first component with trace elements generated by the second component.
5. The method of claim 1, wherein the profiling interface provides a mechanism for exchanging trace events between the collection agent and the at least one analysis agent, wherein the collection agent and the at least one analysis agent are part of a profiling tool.
6. A method of providing access to profiling data of trace elements of an application over a network, comprising:
transmitting said trace events of said application; and
receiving and encoding said trace events into representative trace elements, each trace element containing at least one correlation attribute, and in response transmitting said trace elements over said network for at least one analysis client to receive and correlate said trace elements in accordance with said at least one correlation attribute.
7. The method of claim 6, wherein said at least one correlation attribute in each of said trace elements comprises an identifier (Id).
8. The method of claim 6, wherein said at least one correlation attribute in each of said trace elements for relating to a parent trace element comprises an identifier reference (IdRef) of said parent trace element.
9. The method of claim 6, wherein the application is distributed in the network such that a first component of the application executes on a first data processing device of the network and a second component of the application executes on a second data processing device in the network, and wherein the correlation attribute correlates trace elements generated by the first component with trace elements generated by the second component.
10. A data structure, embodied in a computer readable medium, for providing profiling data in the form of trace elements from collection agents to analysis agents, said data structure comprising trace elements for representing trace events of an application, each of said trace elements comprising at least one correlation attribute for defining a logical hierarchy between said trace elements.
11. The data structure of claim 10, wherein said at least one correlation attribute in each of said trace elements comprises an identifier (Id).
12. The data structure of claim 11, wherein said trace elements are written in an open mark-up language.
13. The data structure of claim 10, wherein said at least one correlation attribute in each of said trace elements for relating to a parent trace element comprises an identifier reference (IdRef) of said parent trace element.
14. The data structure of claim 10, wherein the application is distributed in a network such that a first component of the application executes on a first data processing device of the network and a second component of the application executes on a second data processing device in the network, and wherein the correlation attribute correlates trace elements generated by the first component with trace elements generated by the second component.
15. A collection agent, in a data processing system, for collecting and communicating profiling data of trace events of an application according to a data structure, wherein the improvement comprises:
said data structure comprising trace elements for representing said trace events of an application, each of said trace elements comprising at least one correlation attribute for defining a logical hierarchy between said trace elements.
16. The collection agent of claim 15, wherein said at least one correlation attribute in each of said trace elements comprises an identifier (Id).
17. The collection agent of claim 15, wherein said at least one correlation attribute in each of said trace elements for relating to a parent trace element comprises an identifier reference (IdRef) of said parent trace element.
18. The collection agent of claim 15, wherein the application is distributed in a network such that a first component of the application executes on a first data processing device of the network and a second component of the application executes on a second data processing device in the network, and wherein the correlation attribute correlates trace elements generated by the first component with trace elements generated by the second component.
19. An analysis agent, in a data processing system, for receiving profiling data of trace events of an application according to a data structure, wherein the improvement comprises:
said data structure embodied in a computer readable medium comprising trace elements for representing said trace events of an application, each of said trace elements comprising at least one correlation attribute for defining a logical hierarchy between said trace elements.
20. The analysis agent of claim 19, wherein said at least one correlation attribute in each of said trace elements comprises an identifier (Id).
21. The analysis agent of claim 19, wherein said at least one correlation attribute in each of said trace elements for relation to a parent trace element comprises an identifier reference (IdRef) of said parent trace element.
22. The analysis agent of claim 20, wherein said trace elements are written in an open mark-up language.
23. The analysis agent of claim 19, wherein the application is distributed in a network such that a first component of the application executes on a first data processing device of the network and a second component of the application executes on a second data processing device in the network, and wherein the correlation attribute correlates trace elements generated by the first component with trace elements generated by the second component.
24. An article comprising
a computer-readable data storage medium;
means recorded on the medium for providing access to profiling data of an application over a network;
means recorded on the medium for transmitting trace events of said application; and
means recorded on the medium for receiving and encoding said trace events into corresponding trace elements, each trace element containing at least one correlation attribute, and in response transmitting said trace elements over said network for at least one analysis client to receive and correlate said trace elements in accordance with said at least one correlation attribute.
25. The article of claim 24, wherein said at least one correlation attribute in each of said trace elements comprises an identifier (Id).
26. The article of claim 24, wherein said at least one correlation attribute in each of said trace elements for relating to a parent trace element comprises an identifier reference (IdRef) of said parent trace element.
27. The article of claim 24, wherein the application is distributed in the network such that a first component of the application executes on a first data processing device of the network and a second component of the application executes on a second data processing device in the network, and wherein the correlation attribute correlates trace elements generated by the first component with trace elements generated by the second component.