Hi,
> Obviously I need to learn enough R to be able to do this sort of analysis myself. In the interim, since I’m doing a similar experiment monday (this time varying either classification of ping or the queue-ing algorithm, not the buffer size - I haven’t decided which), we’d love to see your script in the hope I can immediately use it in the class…
Ok. lets see. I created something that should ease up the plotting. See
attached tar archive and sample plot. To summarize there is a Makefile
which plots the data and expects:
- p1.txt and p2.txt as input (which should be the logs from the ping
command)
- awk and sed to be installed and in the path
- GNU R to be installed and in the path
What it does is:
1.) extracts the RTT timeseries ‘hopefully’ covering the download
period, by looking for the ping sequence number (SEQ) with maximum RTT
(MAX) and considering all RTTs from SEQ-100 to SEQ+100.
2.) replacing all timeouted pings with an RTT value of MAX+100ms, where
the 100ms can be configured in the BEGIN clause of the awk script.
3.) plot the two time series.
What it includes:
- extract.timeseries.awk (performs steps 1+2)
- plotit.R (performs step 3)
- Makefile (wrapper)
- p*.txt example input files
> As for these plots, what I’d like:
>
> the Y axis to stay in the the same range of 0 - 1000 ms
(for the combined plot i determine the maximum RTT and use that as an upper bound)
> the X axis to be 100 seconds long, showing the idle period, the spike, the idle period.
(i opted for a 200 second = 200 ping samples period instead)
> the two plots to be directly comparable, using a different color for the ping ‘dots’, and roughly the same start time, so I can overlay them…
already done. Only one output plot.
> 1) start time is uncontrolled (but close enough, given the length of transfer)
you can ‘tune’ the input data and have the input file start at a chosen
point in time that is less than 100 seconds before the maximum to
achieve this.
(I did not want to get into more sophisticated methods for determining
the download period from the data, sofar)
> 4) tcpdump on at least several stations would be VERY interesting. Dario says you were seeing some interesting duplicate acks - I still haven’t verified we’ve actually FIXED the stack enough to get reliable results.
No what we saw is ping reporting “(DUP!)” behind some lines.
> Would you be interested in following up this line of work with a paper w/me/etc?
sure.
best
Fabian