Thanks for your reply.
Probably we have a different understanding of which timestamp we expect to be displayed. I expect the video timestamp to be displayed because the user works based on video frames. Not more or less. If it's 0.08, I expect 0.08 to be displayed. It is the video frame time stamp and independent from any audio track and independent from the frame type (I, B, etc.)
So, I don't understand it, I can only accept it.
EDIT: I don't know which meaning 0.146 s has for the user. I don't see the sense of this computation. It's neither the timestamp from the opened file, nor it's the timestamp as it will be written to the output file when copying/encoding. It's an arbitrary value. But maybe I'm missing something, I don't know.