Intel® Microarchitecture Code Named Silvermont Events
This section provides a reference for the hardware events that can be monitored on the following CPUs:
  • Intel® Atom™ processor based on Intel Microarchitecture Code Named Silvermont
  • Intel® Atom™ processor C2000 family for Server (Code Named: Avoton)
  • Events for Intel® microarchitecture code named Tangier
  • Events for Intel® microarchitecture code named Anniedale
  • CORE
    Event Name Description Additional Info EventType
    INST_RETIRED.ANY This event counts the number of instructions that retire. For instructions that consist of multiple micro-ops, this event counts exactly once, as the last micro-op of the instruction retires. The event continues counting while instructions retire, including during interrupt service routines caused by hardware interrupts, faults or traps. Background: Modern microprocessors employ extensive pipelining and speculative techniques. Since an instruction is sometimes started but never completed, the notion of "retirement" is introduced. A retired instruction is one that commits its state; stated differently, an instruction might be abandoned at some point, and no instruction is truly finished until it retires. This counter measures the number of completed instructions. The fixed event is INST_RETIRED.ANY and the programmable event is INST_RETIRED.ANY_P. IA32_FIXED_CTR0
    Architectural, Fixed
    CoreOnly
    CPU_CLK_UNHALTED.CORE Counts the number of core cycles while the core is not in a halt state. The core enters the halt state when it is running the HLT instruction. This event is a component in many key event ratios. The core frequency may change from time to time. For this reason this event may have a changing ratio with regards to time. In systems with a constant core frequency, this event can give you a measurement of the elapsed time while the core was not in halt state by dividing the event count by the core frequency. This event is architecturally defined and is a designated fixed counter. CPU_CLK_UNHALTED.CORE and CPU_CLK_UNHALTED.CORE_P use the core frequency, which may change from time to time. CPU_CLK_UNHALTED.REF_TSC and CPU_CLK_UNHALTED.REF are not affected by core frequency changes but count as if the core is running at the maximum frequency all the time. The fixed events are CPU_CLK_UNHALTED.CORE and CPU_CLK_UNHALTED.REF_TSC and the programmable events are CPU_CLK_UNHALTED.CORE_P and CPU_CLK_UNHALTED.REF. IA32_FIXED_CTR1
    Architectural, Fixed
    CoreOnly
    CPU_CLK_UNHALTED.REF_TSC Counts the number of reference cycles while the core is not in a halt state. The core enters the halt state when it is running the HLT instruction. This event is a component in many key event ratios. The core frequency may change from time to time. This event is not affected by core frequency changes but counts as if the core is running at the maximum frequency all the time. Divide this event count by the core's maximum frequency to determine the elapsed time while the core was not in a halt state. This event is architecturally defined and is a designated fixed counter. CPU_CLK_UNHALTED.CORE and CPU_CLK_UNHALTED.CORE_P use the core frequency, which may change from time to time. CPU_CLK_UNHALTED.REF_TSC and CPU_CLK_UNHALTED.REF are not affected by core frequency changes but count as if the core is running at the maximum frequency all the time. The fixed events are CPU_CLK_UNHALTED.CORE and CPU_CLK_UNHALTED.REF_TSC and the programmable events are CPU_CLK_UNHALTED.CORE_P and CPU_CLK_UNHALTED.REF. IA32_FIXED_CTR2
    Architectural, Fixed
    CoreOnly
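    The fixed counters above combine into common derived metrics. A minimal Python sketch (the counter values below are made-up samples, not measured data; on Linux they could come from a tool such as perf):

```python
# Derived metrics from the fixed counters described above.
# INST_RETIRED.ANY      -> IA32_FIXED_CTR0
# CPU_CLK_UNHALTED.CORE -> IA32_FIXED_CTR1
# CPU_CLK_UNHALTED.REF_TSC -> IA32_FIXED_CTR2

def cpi(inst_retired_any, cpu_clk_unhalted_core):
    """Cycles per instruction: CPU_CLK_UNHALTED.CORE / INST_RETIRED.ANY."""
    return cpu_clk_unhalted_core / inst_retired_any

def unhalted_seconds(cpu_clk_unhalted_ref_tsc, ref_hz):
    """REF_TSC counts at a fixed reference rate regardless of frequency
    scaling, so dividing by that rate gives elapsed non-halted time."""
    return cpu_clk_unhalted_ref_tsc / ref_hz

print(cpi(1_000_000, 1_500_000))           # 1.5 cycles/instruction
print(unhalted_seconds(2_400_000, 2.4e9))  # 0.001 s of non-halted time
```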
    BR_INST_RETIRED.ALL_BRANCHES ALL_BRANCHES counts the number of any branch instructions retired. Branch prediction predicts the branch target and enables the processor to begin executing instructions long before the branch true execution path is known. All branches utilize the branch prediction unit (BPU) for prediction. This unit predicts the target address not only based on the EIP of the branch but also based on the execution path through which execution reached this EIP. The BPU can efficiently predict the following branch types: conditional branches, direct calls and jumps, indirect calls and jumps, returns. EventSel=C4H UMask=00H
    Counter=0,1
    PEBS:[PreciseEventingIP]
    Architectural
    CoreOnly
    BR_MISP_RETIRED.ALL_BRANCHES ALL_BRANCHES counts the number of any mispredicted branch instructions retired. This umask is an architecturally defined event. This event counts the number of retired branch instructions that were mispredicted by the processor, categorized by type. A branch misprediction occurs when the processor predicts that the branch would be taken, but it is not, or vice-versa. When the misprediction is discovered, all the instructions executed in the wrong (speculative) path must be discarded, and the processor must start fetching from the correct path. EventSel=C5H UMask=00H
    Counter=0,1
    PEBS:[PreciseEventingIP]
    Architectural
    CoreOnly
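    The two architectural branch events above are typically combined into a misprediction ratio. A sketch with hypothetical sample counts:

```python
# Branch misprediction ratio:
# BR_MISP_RETIRED.ALL_BRANCHES / BR_INST_RETIRED.ALL_BRANCHES

def misprediction_ratio(br_misp_retired, br_inst_retired):
    """Fraction of retired branches that were mispredicted."""
    return br_misp_retired / br_inst_retired

print(misprediction_ratio(5_000, 200_000))  # 0.025 -> 2.5% mispredicted
```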
    CPU_CLK_UNHALTED.CORE_P This event counts the number of core cycles while the core is not in a halt state. The core enters the halt state when it is running the HLT instruction. In mobile systems the core frequency may change from time to time. For this reason this event may have a changing ratio with regards to time. EventSel=3CH UMask=00H
    Counter=0,1
    Architectural
    CoreOnly
    CPU_CLK_UNHALTED.REF This event counts the number of bus cycles that the core is not in a halt state. The core enters the halt state when it is running the HLT instruction. In mobile systems the core frequency may change from time. This event is not affected by core frequency changes but counts as if the core is running at the maximum frequency all the time. EventSel=3CH UMask=01H
    Counter=0,1
    Architectural
    CoreOnly
    INST_RETIRED.ANY_P This event counts the number of instructions that retire. For instructions that consist of multiple micro-ops, this event counts the retirement of the last micro-op of the instruction. The counter continues counting during hardware interrupts, traps, and inside interrupt handlers. EventSel=C0H UMask=00H
    Counter=0,1
    Architectural
    CoreOnly
    LONGEST_LAT_CACHE.MISS This event counts the number of L2 cache misses. EventSel=2EH UMask=41H
    Counter=0,1
    Architectural
    CoreOnly
    LONGEST_LAT_CACHE.REFERENCE This event counts requests originating from the core that reference a cache line in the L2 cache. EventSel=2EH UMask=4FH
    Counter=0,1
    Architectural
    CoreOnly
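    The EventSel/UMask pairs listed throughout this section map directly onto the Linux perf "raw" event encoding (config = umask << 8 | eventsel, usable as `perf stat -e rXXXX`). A sketch; the helper below is illustrative, not part of any library:

```python
# Build a perf raw-event config value from the EventSel/UMask fields
# documented in this section.

def raw_config(eventsel, umask, edge=False):
    """Pack EventSel and UMask into the IA32_PERFEVTSELx layout used
    by perf raw events; bit 18 is the edge-detect bit."""
    cfg = (umask << 8) | eventsel
    if edge:
        cfg |= 1 << 18
    return cfg

# LONGEST_LAT_CACHE.MISS: EventSel=2EH UMask=41H
print(f"r{raw_config(0x2E, 0x41):x}")  # r412e
# PAGE_WALKS.D_SIDE_WALKS: EventSel=05H UMask=01H EdgeDetect=1
print(f"r{raw_config(0x05, 0x01, edge=True):x}")  # r40105
```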
    BACLEARS.ALL The BACLEARS event counts the number of times the front end is resteered, mainly when the Branch Prediction Unit cannot provide a correct prediction and this is corrected by the Branch Address Calculator at the front end. The BACLEARS.ALL event counts the number of baclears for any type of branch. EventSel=E6H UMask=01H
    Counter=0,1
    CoreOnly
    BACLEARS.COND The BACLEARS event counts the number of times the front end is resteered, mainly when the Branch Prediction Unit cannot provide a correct prediction and this is corrected by the Branch Address Calculator at the front end. The BACLEARS.COND event counts the number of JCC (Jump on Conditional Code) baclears. EventSel=E6H UMask=10H
    Counter=0,1
    CoreOnly
    BACLEARS.RETURN The BACLEARS event counts the number of times the front end is resteered, mainly when the Branch Prediction Unit cannot provide a correct prediction and this is corrected by the Branch Address Calculator at the front end. The BACLEARS.RETURN event counts the number of RETURN baclears. EventSel=E6H UMask=08H
    Counter=0,1
    CoreOnly
    BR_INST_RETIRED.ALL_TAKEN_BRANCHES ALL_TAKEN_BRANCHES counts the number of all taken branch instructions retired. Branch prediction predicts the branch target and enables the processor to begin executing instructions long before the branch true execution path is known. All branches utilize the branch prediction unit (BPU) for prediction. This unit predicts the target address not only based on the EIP of the branch but also based on the execution path through which execution reached this EIP. The BPU can efficiently predict the following branch types: conditional branches, direct calls and jumps, indirect calls and jumps, returns. EventSel=C4H UMask=80H
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    BR_INST_RETIRED.CALL CALL counts the number of near CALL branch instructions retired. Branch prediction predicts the branch target and enables the processor to begin executing instructions long before the branch true execution path is known. All branches utilize the branch prediction unit (BPU) for prediction. This unit predicts the target address not only based on the EIP of the branch but also based on the execution path through which execution reached this EIP. The BPU can efficiently predict the following branch types: conditional branches, direct calls and jumps, indirect calls and jumps, returns. EventSel=C4H UMask=F9H
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    BR_INST_RETIRED.FAR_BRANCH FAR counts the number of far branch instructions retired. Branch prediction predicts the branch target and enables the processor to begin executing instructions long before the branch true execution path is known. All branches utilize the branch prediction unit (BPU) for prediction. This unit predicts the target address not only based on the EIP of the branch but also based on the execution path through which execution reached this EIP. The BPU can efficiently predict the following branch types: conditional branches, direct calls and jumps, indirect calls and jumps, returns. EventSel=C4H UMask=BFH
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    BR_INST_RETIRED.IND_CALL IND_CALL counts the number of near indirect CALL branch instructions retired. Branch prediction predicts the branch target and enables the processor to begin executing instructions long before the branch true execution path is known. All branches utilize the branch prediction unit (BPU) for prediction. This unit predicts the target address not only based on the EIP of the branch but also based on the execution path through which execution reached this EIP. The BPU can efficiently predict the following branch types: conditional branches, direct calls and jumps, indirect calls and jumps, returns. EventSel=C4H UMask=FBH
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    BR_INST_RETIRED.JCC JCC counts the number of conditional branch (JCC) instructions retired. Branch prediction predicts the branch target and enables the processor to begin executing instructions long before the branch true execution path is known. All branches utilize the branch prediction unit (BPU) for prediction. This unit predicts the target address not only based on the EIP of the branch but also based on the execution path through which execution reached this EIP. The BPU can efficiently predict the following branch types: conditional branches, direct calls and jumps, indirect calls and jumps, returns. EventSel=C4H UMask=7EH
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    BR_INST_RETIRED.NON_RETURN_IND NON_RETURN_IND counts the number of near indirect JMP and near indirect CALL branch instructions retired. Branch prediction predicts the branch target and enables the processor to begin executing instructions long before the branch true execution path is known. All branches utilize the branch prediction unit (BPU) for prediction. This unit predicts the target address not only based on the EIP of the branch but also based on the execution path through which execution reached this EIP. The BPU can efficiently predict the following branch types: conditional branches, direct calls and jumps, indirect calls and jumps, returns. EventSel=C4H UMask=EBH
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    BR_INST_RETIRED.REL_CALL REL_CALL counts the number of near relative CALL branch instructions retired. Branch prediction predicts the branch target and enables the processor to begin executing instructions long before the branch true execution path is known. All branches utilize the branch prediction unit (BPU) for prediction. This unit predicts the target address not only based on the EIP of the branch but also based on the execution path through which execution reached this EIP. The BPU can efficiently predict the following branch types: conditional branches, direct calls and jumps, indirect calls and jumps, returns. EventSel=C4H UMask=FDH
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    BR_INST_RETIRED.RETURN RETURN counts the number of near RET branch instructions retired. Branch prediction predicts the branch target and enables the processor to begin executing instructions long before the branch true execution path is known. All branches utilize the branch prediction unit (BPU) for prediction. This unit predicts the target address not only based on the EIP of the branch but also based on the execution path through which execution reached this EIP. The BPU can efficiently predict the following branch types: conditional branches, direct calls and jumps, indirect calls and jumps, returns. EventSel=C4H UMask=F7H
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    BR_INST_RETIRED.TAKEN_JCC TAKEN_JCC counts the number of taken conditional branch (JCC) instructions retired. Branch prediction predicts the branch target and enables the processor to begin executing instructions long before the branch true execution path is known. All branches utilize the branch prediction unit (BPU) for prediction. This unit predicts the target address not only based on the EIP of the branch but also based on the execution path through which execution reached this EIP. The BPU can efficiently predict the following branch types: conditional branches, direct calls and jumps, indirect calls and jumps, returns. EventSel=C4H UMask=FEH
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    BR_MISP_RETIRED.IND_CALL IND_CALL counts the number of mispredicted near indirect CALL branch instructions retired. This event counts the number of retired branch instructions that were mispredicted by the processor, categorized by type. A branch misprediction occurs when the processor predicts that the branch would be taken, but it is not, or vice-versa. When the misprediction is discovered, all the instructions executed in the wrong (speculative) path must be discarded, and the processor must start fetching from the correct path. EventSel=C5H UMask=FBH
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    BR_MISP_RETIRED.JCC JCC counts the number of mispredicted conditional branches (JCC) instructions retired. This event counts the number of retired branch instructions that were mispredicted by the processor, categorized by type. A branch misprediction occurs when the processor predicts that the branch would be taken, but it is not, or vice-versa. When the misprediction is discovered, all the instructions executed in the wrong (speculative) path must be discarded, and the processor must start fetching from the correct path. EventSel=C5H UMask=7EH
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    BR_MISP_RETIRED.NON_RETURN_IND NON_RETURN_IND counts the number of mispredicted near indirect JMP and near indirect CALL branch instructions retired. This event counts the number of retired branch instructions that were mispredicted by the processor, categorized by type. A branch misprediction occurs when the processor predicts that the branch would be taken, but it is not, or vice-versa. When the misprediction is discovered, all the instructions executed in the wrong (speculative) path must be discarded, and the processor must start fetching from the correct path. EventSel=C5H UMask=EBH
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    BR_MISP_RETIRED.RETURN RETURN counts the number of mispredicted near RET branch instructions retired. This event counts the number of retired branch instructions that were mispredicted by the processor, categorized by type. A branch misprediction occurs when the processor predicts that the branch would be taken, but it is not, or vice-versa. When the misprediction is discovered, all the instructions executed in the wrong (speculative) path must be discarded, and the processor must start fetching from the correct path. EventSel=C5H UMask=F7H
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    BR_MISP_RETIRED.TAKEN_JCC TAKEN_JCC counts the number of mispredicted taken conditional branch (JCC) instructions retired. This event counts the number of retired branch instructions that were mispredicted by the processor, categorized by type. A branch misprediction occurs when the processor predicts that the branch would be taken, but it is not, or vice-versa. When the misprediction is discovered, all the instructions executed in the wrong (speculative) path must be discarded, and the processor must start fetching from the correct path. EventSel=C5H UMask=FEH
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    CORE_REJECT_L2Q.ALL Counts the number of core requests (demand and L1 prefetchers) rejected by the L2Q due to a full or nearly full condition, which likely indicates back pressure from the L2Q. It also counts requests that would have gone directly to the XQ, but are rejected due to a full or nearly full condition, indicating back pressure from the IDI link. The L2Q may also reject transactions from a core to ensure fairness between cores, or to delay a core's dirty eviction when the address conflicts with incoming external snoops. (Note that L2 prefetcher requests that are dropped are not counted by this event.) EventSel=31H UMask=00H
    Counter=0,1
    CoreOnly
    CPU_CLK_UNHALTED.THREAD Counts the number of core cycles while the core is not in a halt state. The core enters the halt state when it is running the HLT instruction. This event is a component in many key event ratios. The core frequency may change from time to time. For this reason this event may have a changing ratio with regards to time. In systems with a constant core frequency, this event can give you a measurement of the elapsed time while the core was not in halt state by dividing the event count by the core frequency. This event is architecturally defined and is a designated fixed counter. CPU_CLK_UNHALTED.CORE and CPU_CLK_UNHALTED.CORE_P use the core frequency, which may change from time to time. CPU_CLK_UNHALTED.REF_TSC and CPU_CLK_UNHALTED.REF are not affected by core frequency changes but count as if the core is running at the maximum frequency all the time. The fixed events are CPU_CLK_UNHALTED.CORE and CPU_CLK_UNHALTED.REF_TSC and the programmable events are CPU_CLK_UNHALTED.CORE_P and CPU_CLK_UNHALTED.REF. IA32_FIXED_CTR1
    Fixed
    CoreOnly
    CYCLES_DIV_BUSY.ALL Cycles the divider is busy. This event counts the cycles when the divide unit is unable to accept a new divide UOP because it is busy processing a previously dispatched UOP. The cycles will be counted irrespective of whether or not another divide UOP is waiting to enter the divide unit (from the RS). This event might count cycles while a divide is in progress even if the RS is empty. The divide instruction is one of the longest latency instructions in the machine. Hence, it has a special event associated with it to help determine if divides are delaying the retirement of instructions. EventSel=CDH UMask=01H
    Counter=0,1
    CoreOnly
    DECODE_RESTRICTION.PREDECODE_WRONG Counts the number of times a decode restriction reduced the decode throughput due to wrong instruction length prediction. EventSel=E9H UMask=01H
    Counter=0,1
    CoreOnly
    FETCH_STALL.ALL Counts cycles that fetch is stalled due to any reason. That is, the decoder queue is able to accept bytes, but the fetch unit is unable to provide bytes. This will include cycles due to an ITLB miss, ICache miss and other events. EventSel=86H UMask=3FH
    Counter=0,1
    CoreOnly
    FETCH_STALL.ICACHE_FILL_PENDING_CYCLES Counts cycles that fetch is stalled due to an outstanding ICache miss. That is, the decoder queue is able to accept bytes, but the fetch unit is unable to provide bytes due to an ICache miss. Note: this event is not the same as the total number of cycles spent retrieving instruction cache lines from the memory hierarchy. EventSel=86H UMask=04H
    Counter=0,1
    CoreOnly
    FETCH_STALL.ITLB_FILL_PENDING_CYCLES Counts cycles that fetch is stalled due to an outstanding ITLB miss. That is, the decoder queue is able to accept bytes, but the fetch unit is unable to provide bytes due to an ITLB miss. Note: this event is not the same as page walk cycles to retrieve an instruction translation. EventSel=86H UMask=02H
    Counter=0,1
    CoreOnly
    ICACHE.ACCESSES This event counts all instruction fetches, not including most uncacheable fetches. EventSel=80H UMask=03H
    Counter=0,1
    CoreOnly
    ICACHE.HIT This event counts all instruction fetches from the instruction cache. EventSel=80H UMask=01H
    Counter=0,1
    CoreOnly
    ICACHE.MISSES This event counts all instruction fetches that miss the Instruction cache or produce memory requests. This includes uncacheable fetches. An instruction fetch miss is counted only once and not once for every cycle it is outstanding. EventSel=80H UMask=02H
    Counter=0,1
    CoreOnly
    L2_REJECT_XQ.ALL This event counts the number of demand and prefetch transactions that the L2 XQ rejects due to a full or near full condition which likely indicates back pressure from the IDI link. The XQ may reject transactions from the L2Q (non-cacheable requests), BBS (L2 misses) and WOB (L2 write-back victims). EventSel=30H UMask=00H
    Counter=0,1
    CoreOnly
    MACHINE_CLEARS.ALL Machine clears happen when something occurs in the machine that requires the hardware to take special care to get the right answer. When such a condition is signaled on an instruction, the front end of the machine is notified that it must restart, so no more instructions will be decoded from the current path. All instructions "older" than this one will be allowed to finish. This instruction and all "younger" instructions must be cleared, since they must not be allowed to complete. Essentially, the hardware waits until the problematic instruction is the oldest instruction in the machine. This means all older instructions are retired, and all pending stores (from older instructions) are completed. Then the new path of instructions from the front end are allowed to start into the machine. There are many conditions that might cause a machine clear (including the receipt of an interrupt, or a trap or a fault). All those conditions (including but not limited to MACHINE_CLEARS.MEMORY_ORDERING, MACHINE_CLEARS.SMC, and MACHINE_CLEARS.FP_ASSIST) are captured in the ALL event. In addition, some conditions can be specifically counted (i.e. SMC, MEMORY_ORDERING, FP_ASSIST). However, the sum of the SMC, MEMORY_ORDERING, and FP_ASSIST machine clears will not necessarily equal the count of ALL. EventSel=C3H UMask=08H
    Counter=0,1
    CoreOnly
    MACHINE_CLEARS.FP_ASSIST This event counts the number of times that pipeline stalled due to FP operations needing assists. EventSel=C3H UMask=04H
    Counter=0,1
    CoreOnly
    MACHINE_CLEARS.MEMORY_ORDERING This event counts the number of times that pipeline was cleared due to memory ordering issues. EventSel=C3H UMask=02H
    Counter=0,1
    CoreOnly
    MACHINE_CLEARS.SMC This event counts the number of times that a program writes to a code section. Self-modifying code causes a severe penalty in all Intel® architecture processors. EventSel=C3H UMask=01H
    Counter=0,1
    CoreOnly
    MEM_UOPS_RETIRED.ALL_LOADS This event counts the number of load ops retired. EventSel=04H UMask=40H
    Counter=0,1
    CoreOnly
    MEM_UOPS_RETIRED.ALL_STORES This event counts the number of store ops retired. EventSel=04H UMask=80H
    Counter=0,1
    CoreOnly
    MEM_UOPS_RETIRED.DTLB_MISS_LOADS This event counts the number of load ops retired that had DTLB miss. EventSel=04H UMask=08H
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    MEM_UOPS_RETIRED.HITM This event counts the number of load ops retired that got data from the other core or from the other module. EventSel=04H UMask=20H
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    MEM_UOPS_RETIRED.L1_MISS_LOADS This event counts the number of load ops retired that miss in L1 Data cache. Note that prefetch misses will not be counted. EventSel=04H UMask=01H
    Counter=0,1
    CoreOnly
    MEM_UOPS_RETIRED.L2_HIT_LOADS This event counts the number of load ops retired that hit in the L2. EventSel=04H UMask=02H
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    MEM_UOPS_RETIRED.L2_MISS_LOADS This event counts the number of load ops retired that miss in the L2. EventSel=04H UMask=04H
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    MEM_UOPS_RETIRED.UTLB_MISS This event counts the number of load ops retired that had UTLB miss. EventSel=04H UMask=10H
    Counter=0,1
    CoreOnly
    MS_DECODED.MS_ENTRY Counts the number of times the MSROM starts a flow of UOPS. It does not count every time a UOP is read from the microcode ROM. The most common case that this counts is when a micro-coded instruction is encountered by the front end of the machine. Other cases include when an instruction encounters a fault, trap, or microcode assist of any sort. The event will count MSROM startups for UOPS that are speculative, and subsequently cleared by branch mispredict or machine clear. Background: UOPS are produced by two mechanisms. Either they are generated by hardware that decodes instructions into UOPS, or they are delivered by a ROM (called the MSROM) that holds UOPS associated with a specific instruction. MSROM UOPS might also be delivered in response to some condition such as a fault or other exceptional condition. This event is an excellent mechanism for detecting instructions that require the use of MSROM instructions. EventSel=E7H UMask=01H
    Counter=0,1
    CoreOnly
    NO_ALLOC_CYCLES.ALL The NO_ALLOC_CYCLES.ALL event counts the number of cycles when the front-end does not provide any instructions to be allocated for any reason. This event indicates the cycles where an allocation stall occurs, and no UOPS are allocated in that cycle. EventSel=CAH UMask=3FH
    Counter=0,1
    CoreOnly
    NO_ALLOC_CYCLES.MISPREDICTS Counts the number of cycles when no uops are allocated and the alloc pipe is stalled waiting for a mispredicted jump to retire. After the misprediction is detected, the front end will start immediately, but the allocate pipe stalls until the mispredicted branch retires. EventSel=CAH UMask=04H
    Counter=0,1
    CoreOnly
    NO_ALLOC_CYCLES.NOT_DELIVERED The NO_ALLOC_CYCLES.NOT_DELIVERED event is used to measure front-end inefficiencies, i.e., when the front-end of the machine is not delivering micro-ops to the back-end and the back-end is not stalled. This event can be used to identify if the machine is truly front-end bound. When this event occurs, it is an indication that the front-end of the machine is operating at less than its theoretical peak performance. Background: We can think of the processor pipeline as being divided into two broader parts: front-end and back-end. The front-end is responsible for fetching the instruction, decoding it into micro-ops (uops) in a machine-understandable format, and putting them into a micro-op queue to be consumed by the back-end. The back-end then takes these micro-ops and allocates the required resources. When all resources are ready, the micro-ops are executed. If the back-end is not ready to accept micro-ops from the front-end, then we do not want to count these as front-end bottlenecks. However, whenever we have bottlenecks in the back-end, we will have allocation unit stalls that eventually force the front-end to wait until the back-end is ready to receive more UOPS. This event counts the cycles only when the back-end is requesting more uops and the front-end is not able to provide them. Some examples of conditions that cause front-end inefficiencies are: ICache misses, ITLB misses, and decoder restrictions that limit the front-end bandwidth. EventSel=CAH UMask=50H
    Counter=0,1
    CoreOnly
    NO_ALLOC_CYCLES.RAT_STALL Counts the number of cycles when no uops are allocated and a RAT stall is asserted. EventSel=CAH UMask=20H
    Counter=0,1
    CoreOnly
    NO_ALLOC_CYCLES.ROB_FULL Counts the number of cycles when no uops are allocated and the ROB is full (less than 2 entries available). EventSel=CAH UMask=01H
    Counter=0,1
    CoreOnly
    PAGE_WALKS.CYCLES This event counts every cycle when a data (D) page walk or instruction (I) page walk is in progress. Since a page walk implies a TLB miss, the approximate cost of a TLB miss can be determined from this event. EventSel=05H UMask=03H
    Counter=0,1
    CoreOnly
    PAGE_WALKS.D_SIDE_CYCLES This event counts every cycle when a D-side (walks due to a load) page walk is in progress. Page walk duration divided by number of page walks is the average duration of page-walks. EventSel=05H UMask=01H
    Counter=0,1
    CoreOnly
    PAGE_WALKS.D_SIDE_WALKS This event counts when a data (D) page walk is completed or started. Since a page walk implies a TLB miss, the number of TLB misses can be counted by counting the number of pagewalks. EventSel=05H UMask=01H EdgeDetect=1
    Counter=0,1
    CoreOnly
    PAGE_WALKS.I_SIDE_CYCLES This event counts every cycle when an I-side (walks due to an instruction fetch) page walk is in progress. Page walk duration divided by number of page walks is the average duration of page-walks. EventSel=05H UMask=02H
    Counter=0,1
    CoreOnly
    PAGE_WALKS.I_SIDE_WALKS This event counts when an instruction (I) page walk is completed or started. Since a page walk implies a TLB miss, the number of TLB misses can be counted by counting the number of pagewalks. EventSel=05H UMask=02H EdgeDetect=1
    Counter=0,1
    CoreOnly
    PAGE_WALKS.WALKS This event counts when a data (D) page walk or an instruction (I) page walk is completed or started. Since a page walk implies a TLB miss, the number of TLB misses can be counted by counting the number of pagewalks. EventSel=05H UMask=03H EdgeDetect=1
    Counter=0,1
    CoreOnly
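    As the entries above note, the CYCLES variants divided by the edge-detected WALKS variants give the average page-walk duration. A sketch with hypothetical sample counts:

```python
# Average page-walk duration: PAGE_WALKS.CYCLES / PAGE_WALKS.WALKS
# (the same ratio works per side with the D_SIDE_* or I_SIDE_* pairs).

def avg_walk_cycles(page_walks_cycles, page_walks_walks):
    """Average number of core cycles spent per page walk."""
    return page_walks_cycles / page_walks_walks

print(avg_walk_cycles(30_000, 1_000))  # 30.0 cycles per walk
```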
    REHABQ.ANY_LD This event counts the number of load uops reissued from Rehabq. EventSel=03H UMask=40H
    Counter=0,1
    CoreOnly
    REHABQ.ANY_ST This event counts the number of store uops reissued from Rehabq. EventSel=03H UMask=80H
    Counter=0,1
    CoreOnly
    REHABQ.LD_BLOCK_ST_FORWARD This event counts the number of retired loads that were prohibited from receiving forwarded data from the store because of address mismatch. EventSel=03H UMask=01H
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    REHABQ.LD_BLOCK_STD_NOTREADY This event counts the cases where a forward was technically possible, but did not occur because the store data was not available at the right time. EventSel=03H UMask=02H
    Counter=0,1
    CoreOnly
    REHABQ.LD_SPLITS This event counts the number of retired loads that experienced cache line boundary splits. EventSel=03H UMask=08H
    Counter=0,1
    PEBS:[PreciseEventingIP]
    CoreOnly
    REHABQ.LOCK This event counts the number of retired memory operations with lock semantics. These are either implicit locked instructions such as the XCHG instruction or instructions with an explicit LOCK prefix (0xF0). EventSel=03H UMask=10H
    Counter=0,1
    CoreOnly
    REHABQ.ST_SPLITS This event counts the number of retired stores that experienced cache line boundary splits. EventSel=03H UMask=04H
    Counter=0,1
    CoreOnly
    REHABQ.STA_FULL This event counts the number of retired stores that are delayed because there is not a store address buffer available. EventSel=03H UMask=20H
    Counter=0,1
    CoreOnly
    RS_FULL_STALL.ALL Counts the number of cycles the Alloc pipeline is stalled when any one of the RSs (IEC, FPC and MEC) is full. This event is a superset of all the individual RS stall event counts. EventSel=CBH UMask=1FH
    Counter=0,1
    CoreOnly
    RS_FULL_STALL.MEC Counts the number of cycles the allocation pipeline is stalled waiting for a free MEC reservation station entry. Cycles should be counted appropriately for cracked ops; e.g., for a cracked load-op, the load portion is sent to the MEC. EventSel=CBH UMask=01H
    Counter=0,1
    CoreOnly
    UOPS_RETIRED.ALL This event counts the number of micro-ops retired. The processor decodes complex macro instructions into a sequence of simpler micro-ops. Most instructions are composed of one or two micro-ops. Some instructions are decoded into longer sequences such as repeat instructions, floating point transcendental instructions, and assists. In some cases micro-op sequences are fused or whole instructions are fused into one micro-op. See other UOPS_RETIRED events for differentiating retired fused and non-fused micro-ops. EventSel=C2H UMask=10H
    Counter=0,1
    CoreOnly
    UOPS_RETIRED.MS This event counts the number of micro-ops retired that were supplied from MSROM. EventSel=C2H UMask=01H
    Counter=0,1
    CoreOnly
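Because UOPS_RETIRED.ALL and INST_RETIRED.ANY both count at retirement, their ratio gives the average number of micro-ops per retired instruction, and UOPS_RETIRED.MS divided by UOPS_RETIRED.ALL gives the fraction supplied by the MSROM. A sketch of these derived ratios (the function name and sample counts are illustrative):

```python
def uop_ratios(uops_all, uops_ms, inst_retired):
    """Derive micro-op ratios from retirement counts.

    uops_all:     UOPS_RETIRED.ALL
    uops_ms:      UOPS_RETIRED.MS (micro-ops supplied by the MSROM)
    inst_retired: INST_RETIRED.ANY
    """
    uops_per_inst = uops_all / inst_retired if inst_retired else 0.0
    ms_fraction = uops_ms / uops_all if uops_all else 0.0
    return uops_per_inst, ms_fraction

# Hypothetical sample counts
upi, ms = uop_ratios(uops_all=1_500_000, uops_ms=30_000, inst_retired=1_200_000)
print(f"{upi:.2f} uops/inst, {ms:.1%} from MSROM")  # → 1.25 uops/inst, 2.0% from MSROM
```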
    UNCORE
    OFFCORE
    OFFCORE_RESPONSE:request=ANY_CODE_RD: response=L2_MISS.ANY Counts any code reads (demand & prefetch) that miss L2. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1680000044H Offcore
    OFFCORE_RESPONSE:request=ANY_CODE_RD: response=L2_MISS.HITM_OTHER_CORE Counts any code reads (demand & prefetch) that hit in the other module, where modified copies were found in the other core's L1 cache. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1000000044H Offcore
    OFFCORE_RESPONSE:request=ANY_CODE_RD: response=L2_MISS.HIT_OTHER_CORE_NO_FWD Counts any code reads (demand & prefetch) that miss L2 where the snoops to sibling cores hit in either E or S state and the line is not forwarded. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=400000044H Offcore
    OFFCORE_RESPONSE:request=ANY_CODE_RD: response=L2_MISS.SNOOP_MISS Counts any code reads (demand & prefetch) that miss L2 with a snoop miss response. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=200000044H Offcore
    OFFCORE_RESPONSE:request=ANY_CODE_RD: response=ANY_RESPONSE Counts any code reads (demand & prefetch) that have any response type. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=10044H Offcore
    OFFCORE_RESPONSE:request=ANY_RFO: response=L2_MISS.ANY Counts any RFOs (demand & prefetch) that miss L2. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1680000022H Offcore
    OFFCORE_RESPONSE:request=ANY_RFO: response=L2_MISS.HITM_OTHER_CORE Counts any RFOs (demand & prefetch) that hit in the other module, where modified copies were found in the other core's L1 cache. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1000000022H Offcore
    OFFCORE_RESPONSE:request=ANY_RFO: response=L2_MISS.HIT_OTHER_CORE_NO_FWD Counts any RFOs (demand & prefetch) that miss L2 where the snoops to sibling cores hit in either E or S state and the line is not forwarded. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=400000022H Offcore
    OFFCORE_RESPONSE:request=ANY_RFO: response=L2_MISS.SNOOP_MISS Counts any RFOs (demand & prefetch) that miss L2 with a snoop miss response. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=200000022H Offcore
    OFFCORE_RESPONSE:request=ANY_RFO: response=ANY_RESPONSE Counts any RFOs (demand & prefetch) that have any response type. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=10022H Offcore
    OFFCORE_RESPONSE:request=ANY_DATA_RD: response=L2_MISS.ANY Counts any data reads (demand & prefetch) that miss L2. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1680003091H Offcore
    OFFCORE_RESPONSE:request=ANY_DATA_RD: response=L2_MISS.HITM_OTHER_CORE Counts any data reads (demand & prefetch) that hit in the other module, where modified copies were found in the other core's L1 cache. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1000003091H Offcore
    OFFCORE_RESPONSE:request=ANY_DATA_RD: response=L2_MISS.HIT_OTHER_CORE_NO_FWD Counts any data reads (demand & prefetch) that miss L2 where the snoops to sibling cores hit in either E or S state and the line is not forwarded. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=400003091H Offcore
    OFFCORE_RESPONSE:request=ANY_DATA_RD: response=L2_MISS.SNOOP_MISS Counts any data reads (demand & prefetch) that miss L2 with a snoop miss response. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=200003091H Offcore
    OFFCORE_RESPONSE:request=ANY_DATA_RD: response=ANY_RESPONSE Counts any data reads (demand & prefetch) that have any response type. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=13091H Offcore
    OFFCORE_RESPONSE:request=STREAMING_STORES: response=L2_MISS.ANY Counts streaming stores that miss L2. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1680004800H Offcore
    OFFCORE_RESPONSE:request=ANY_REQUEST: response=L2_MISS.HITM_OTHER_CORE Counts any requests that hit in the other module, where modified copies were found in the other core's L1 cache. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1000008008H Offcore
    OFFCORE_RESPONSE:request=ANY_REQUEST: response=L2_MISS.HIT_OTHER_CORE_NO_FWD Counts any requests that miss L2 where the snoops to sibling cores hit in either E or S state and the line is not forwarded. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=400008008H Offcore
    OFFCORE_RESPONSE:request=ANY_REQUEST: response=L2_MISS.SNOOP_MISS Counts any requests that miss L2 with a snoop miss response. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=200008008H Offcore
    OFFCORE_RESPONSE:request=ANY_REQUEST: response=ANY_RESPONSE Counts any requests that have any response type. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=18008H Offcore
    OFFCORE_RESPONSE:request=PF_L1_DATA_RD: response=L2_MISS.ANY Counts DCU hardware prefetcher data reads that miss L2. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1680002000H Offcore
    OFFCORE_RESPONSE:request=PF_L1_DATA_RD: response=L2_MISS.HITM_OTHER_CORE Counts DCU hardware prefetcher data reads that hit in the other module, where modified copies were found in the other core's L1 cache. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1000002000H Offcore
    OFFCORE_RESPONSE:request=PF_L1_DATA_RD: response=L2_MISS.HIT_OTHER_CORE_NO_FWD Counts DCU hardware prefetcher data reads that miss L2 where the snoops to sibling cores hit in either E or S state and the line is not forwarded. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=400002000H Offcore
    OFFCORE_RESPONSE:request=PF_L1_DATA_RD: response=L2_MISS.SNOOP_MISS Counts DCU hardware prefetcher data reads that miss L2 with a snoop miss response. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=200002000H Offcore
    OFFCORE_RESPONSE:request=PF_L1_DATA_RD: response=ANY_RESPONSE Counts DCU hardware prefetcher data reads that have any response type. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=12000H Offcore
    OFFCORE_RESPONSE:request=PARTIAL_WRITES: response=L2_MISS.ANY Counts demand RFO requests to write to partial cache lines that miss L2. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1680000100H Offcore
    OFFCORE_RESPONSE:request=PARTIAL_READS: response=L2_MISS.ANY Counts demand reads of partial cache lines (including UC and WC) that miss L2. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1680000080H Offcore
    OFFCORE_RESPONSE:request=PF_L2_CODE_RD: response=L2_MISS.ANY Counts code reads generated by L2 prefetchers that miss L2. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1680000040H Offcore
    OFFCORE_RESPONSE:request=PF_L2_CODE_RD: response=L2_MISS.HIT_OTHER_CORE_NO_FWD Counts code reads generated by L2 prefetchers that miss L2 where the snoops to sibling cores hit in either E or S state and the line is not forwarded. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=400000040H Offcore
    OFFCORE_RESPONSE:request=PF_L2_CODE_RD: response=L2_MISS.SNOOP_MISS Counts code reads generated by L2 prefetchers that miss L2 with a snoop miss response. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=200000040H Offcore
    OFFCORE_RESPONSE:request=PF_L2_RFO: response=L2_MISS.ANY Counts RFO requests generated by L2 prefetchers that miss L2. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1680000020H Offcore
    OFFCORE_RESPONSE:request=PF_L2_RFO: response=L2_MISS.HITM_OTHER_CORE Counts RFO requests generated by L2 prefetchers that hit in the other module, where modified copies were found in the other core's L1 cache. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1000000020H Offcore
    OFFCORE_RESPONSE:request=PF_L2_RFO: response=L2_MISS.HIT_OTHER_CORE_NO_FWD Counts RFO requests generated by L2 prefetchers that miss L2 where the snoops to sibling cores hit in either E or S state and the line is not forwarded. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=400000020H Offcore
    OFFCORE_RESPONSE:request=PF_L2_RFO: response=L2_MISS.SNOOP_MISS Counts RFO requests generated by L2 prefetchers that miss L2 with a snoop miss response. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=200000020H Offcore
    OFFCORE_RESPONSE:request=PF_L2_DATA_RD: response=L2_MISS.ANY Counts data cacheline reads generated by L2 prefetchers that miss L2. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1680000010H Offcore
    OFFCORE_RESPONSE:request=PF_L2_DATA_RD: response=L2_MISS.HITM_OTHER_CORE Counts data cacheline reads generated by L2 prefetchers that hit in the other module, where modified copies were found in the other core's L1 cache. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1000000010H Offcore
    OFFCORE_RESPONSE:request=PF_L2_DATA_RD: response=L2_MISS.HIT_OTHER_CORE_NO_FWD Counts data cacheline reads generated by L2 prefetchers that miss L2 where the snoops to sibling cores hit in either E or S state and the line is not forwarded. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=400000010H Offcore
    OFFCORE_RESPONSE:request=PF_L2_DATA_RD: response=L2_MISS.SNOOP_MISS Counts data cacheline reads generated by L2 prefetchers that miss L2 with a snoop miss response. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=200000010H Offcore
    OFFCORE_RESPONSE:request=COREWB: response=L2_MISS.ANY Counts writebacks (modified to exclusive) that miss L2. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1680000008H Offcore
    OFFCORE_RESPONSE:request=COREWB: response=L2_MISS.NO_SNOOP_NEEDED Counts writebacks (modified to exclusive) that miss L2 with no details on snoop-related information. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=80000008H Offcore
    OFFCORE_RESPONSE:request=DEMAND_CODE_RD: response=OUTSTANDING Counts demand and DCU prefetch instruction cacheline reads that are outstanding, per cycle, from the time of the L2 miss until any response is received. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx(1A6H)=4000000004H Offcore
    OFFCORE_RESPONSE:request=DEMAND_CODE_RD: response=L2_MISS.ANY Counts demand and DCU prefetch instruction cacheline reads that miss L2. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1680000004H Offcore
    OFFCORE_RESPONSE:request=DEMAND_CODE_RD: response=L2_MISS.HIT_OTHER_CORE_NO_FWD Counts demand and DCU prefetch instruction cacheline reads that miss L2 where the snoops to sibling cores hit in either E or S state and the line is not forwarded. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=400000004H Offcore
    OFFCORE_RESPONSE:request=DEMAND_CODE_RD: response=L2_MISS.SNOOP_MISS Counts demand and DCU prefetch instruction cacheline reads that miss L2 with a snoop miss response. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=200000004H Offcore
    OFFCORE_RESPONSE:request=DEMAND_CODE_RD: response=ANY_RESPONSE Counts demand and DCU prefetch instruction cacheline reads that have any response type. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=10004H Offcore
    OFFCORE_RESPONSE:request=DEMAND_RFO: response=OUTSTANDING Counts demand and DCU prefetch RFOs that are outstanding, per cycle, from the time of the L2 miss until any response is received. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx(1A6H)=4000000002H Offcore
    OFFCORE_RESPONSE:request=DEMAND_RFO: response=L2_MISS.ANY Counts demand and DCU prefetch RFOs that miss L2. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1680000002H Offcore
    OFFCORE_RESPONSE:request=DEMAND_RFO: response=L2_MISS.HITM_OTHER_CORE Counts demand and DCU prefetch RFOs that hit in the other module, where modified copies were found in the other core's L1 cache. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1000000002H Offcore
    OFFCORE_RESPONSE:request=DEMAND_RFO: response=L2_MISS.HIT_OTHER_CORE_NO_FWD Counts demand and DCU prefetch RFOs that miss L2 where the snoops to sibling cores hit in either E or S state and the line is not forwarded. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=400000002H Offcore
    OFFCORE_RESPONSE:request=DEMAND_RFO: response=L2_MISS.SNOOP_MISS Counts demand and DCU prefetch RFOs that miss L2 with a snoop miss response. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=200000002H Offcore
    OFFCORE_RESPONSE:request=DEMAND_DATA_RD: response=OUTSTANDING Counts demand and DCU prefetch data reads that are outstanding, per cycle, from the time of the L2 miss until any response is received. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx(1A6H)=4000000001H Offcore
    OFFCORE_RESPONSE:request=DEMAND_DATA_RD: response=L2_MISS.ANY Counts demand and DCU prefetch data reads that miss L2. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1680000001H Offcore
    OFFCORE_RESPONSE:request=DEMAND_DATA_RD: response=L2_MISS.HITM_OTHER_CORE Counts demand and DCU prefetch data reads that hit in the other module, where modified copies were found in the other core's L1 cache. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=1000000001H Offcore
    OFFCORE_RESPONSE:request=DEMAND_DATA_RD: response=L2_MISS.HIT_OTHER_CORE_NO_FWD Counts demand and DCU prefetch data reads that miss L2 where the snoops to sibling cores hit in either E or S state and the line is not forwarded. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=400000001H Offcore
    OFFCORE_RESPONSE:request=DEMAND_DATA_RD: response=L2_MISS.SNOOP_MISS Counts demand and DCU prefetch data reads that miss L2 with a snoop miss response. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=200000001H Offcore
    OFFCORE_RESPONSE:request=DEMAND_DATA_RD: response=ANY_RESPONSE Counts demand and DCU prefetch data reads that have any response type. EventSel=(B7H) UMask={01H,02H} MSR_OFFCORE_RSPx{1A6H,1A7H}=10001H Offcore
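The MSR_OFFCORE_RSPx values listed above are the bitwise OR of a request-type mask (low bits) and a response-type mask (high bits). The sketch below reconstructs a few of the table's encodings from per-field masks read off the rows above; treat the bit assignments as inferred from this table, not as an authoritative encoding reference:

```python
# Request-type bits, inferred from the ANY_RESPONSE rows above
REQUEST = {
    "DEMAND_DATA_RD": 0x0001,
    "DEMAND_RFO":     0x0002,
    "DEMAND_CODE_RD": 0x0004,
    "ANY_CODE_RD":    0x0044,  # demand + L2-prefetch code reads
    "ANY_DATA_RD":    0x3091,
}

# Response-type bits, inferred from the same rows
RESPONSE = {
    "ANY_RESPONSE":                  0x0000010000,
    "L2_MISS.SNOOP_MISS":            0x0200000000,
    "L2_MISS.HIT_OTHER_CORE_NO_FWD": 0x0400000000,
    "L2_MISS.HITM_OTHER_CORE":       0x1000000000,
    "L2_MISS.ANY":                   0x1680000000,
}

def offcore_rsp(request, response):
    """Compose an MSR_OFFCORE_RSPx value from request and response masks."""
    return REQUEST[request] | RESPONSE[response]

# Reproduce two encodings from the table
assert offcore_rsp("DEMAND_DATA_RD", "ANY_RESPONSE") == 0x10001
assert offcore_rsp("ANY_CODE_RD", "L2_MISS.ANY") == 0x1680000044
```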