2009-01-09 17:01:53 +11:00
|
|
|
/*
|
2010-02-25 18:09:45 -06:00
|
|
|
* Performance event support - hardware-specific disambiguation
|
2009-01-09 17:01:53 +11:00
|
|
|
*
|
2010-02-25 18:09:45 -06:00
|
|
|
* For now this is a compile-time decision, but eventually it should be
|
|
|
|
* runtime. This would allow multiplatform perf event support for e300 (fsl
|
|
|
|
* embedded perf counters) plus server/classic, and would accommodate
|
|
|
|
* devices other than the core which provide their own performance counters.
|
|
|
|
*
|
|
|
|
* Copyright 2010 Freescale Semiconductor, Inc.
|
2009-01-09 17:01:53 +11:00
|
|
|
*
|
|
|
|
* This program is free software; you can redistribute it and/or
|
|
|
|
* modify it under the terms of the GNU General Public License
|
|
|
|
* as published by the Free Software Foundation; either version
|
|
|
|
* 2 of the License, or (at your option) any later version.
|
|
|
|
*/
|
2009-06-17 21:52:09 +10:00
|
|
|
|
2009-06-17 21:50:04 +10:00
|
|
|
#ifdef CONFIG_PPC_PERF_CTRS
|
2010-02-25 18:09:45 -06:00
|
|
|
#include <asm/perf_event_server.h>
|
2009-06-17 21:50:04 +10:00
|
|
|
#endif
|
|
|
|
|
2010-02-25 18:09:45 -06:00
|
|
|
#ifdef CONFIG_FSL_EMB_PERF_EVENT
|
|
|
|
#include <asm/perf_event_fsl_emb.h>
|
|
|
|
#endif
|
2010-05-20 07:47:21 +02:00
|
|
|
|
|
|
|
#ifdef CONFIG_PERF_EVENTS
|
|
|
|
#include <asm/ptrace.h>
|
|
|
|
#include <asm/reg.h>
|
|
|
|
|
2012-06-26 01:01:36 +00:00
|
|
|
/*
|
|
|
|
* Overload regs->result to specify whether we should use the MSR (result
|
|
|
|
* is zero) or the SIAR (result is non zero).
|
|
|
|
*/
|
2010-05-20 07:47:21 +02:00
|
|
|
#define perf_arch_fetch_caller_regs(regs, __ip) \
|
|
|
|
do { \
|
2012-06-26 01:01:36 +00:00
|
|
|
(regs)->result = 0; \
|
2010-05-20 07:47:21 +02:00
|
|
|
(regs)->nip = __ip; \
|
powerpc: Reimplement __get_SP() as a function not a define
Li Zhong points out an issue with our current __get_SP()
implementation. If ftrace function tracing is enabled (ie -pg
profiling using _mcount) we spill a stack frame on 64bit all the
time.
If a function calls __get_SP() and later calls a function that is
tail call optimised, we will pop the stack frame and the value
returned by __get_SP() is no longer valid. An example from Li can
be found in save_stack_trace -> save_context_stack:
c0000000000432c0 <.save_stack_trace>:
c0000000000432c0: mflr r0
c0000000000432c4: std r0,16(r1)
c0000000000432c8: stdu r1,-128(r1) <-- stack frame for _mcount
c0000000000432cc: std r3,112(r1)
c0000000000432d0: bl <._mcount>
c0000000000432d4: nop
c0000000000432d8: mr r4,r1 <-- __get_SP()
c0000000000432dc: ld r5,632(r13)
c0000000000432e0: ld r3,112(r1)
c0000000000432e4: li r6,1
c0000000000432e8: addi r1,r1,128 <-- pop stack frame
c0000000000432ec: ld r0,16(r1)
c0000000000432f0: mtlr r0
c0000000000432f4: b <.save_context_stack> <-- tail call optimized
save_context_stack ends up with a stack pointer below the current
one, and it is likely to be scribbled over.
Fix this by making __get_SP() a function which returns the
callers stack frame. Also replace inline assembly which grabs
the stack pointer in save_stack_trace and show_stack with
__get_SP().
This also fixes an issue with perf_arch_fetch_caller_regs().
It currently unwinds the stack once, which will skip a
valid stack frame on a leaf function. With the __get_SP() fixes
in this patch, we never need to unwind the stack frame to get
to the first interesting frame.
We have to export __get_SP() because perf_arch_fetch_caller_regs()
(which is used in modules) calls it from a header file.
Reported-by: Li Zhong <zhong@linux.vnet.ibm.com>
Signed-off-by: Anton Blanchard <anton@samba.org>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2014-10-13 19:41:38 +11:00
|
|
|
(regs)->gpr[1] = __get_SP(); \
|
2010-05-20 07:47:21 +02:00
|
|
|
asm volatile("mfmsr %0" : "=r" ((regs)->msr)); \
|
|
|
|
} while (0)
|
|
|
|
#endif
|