Determine flow size distribution from traffic dump data
Determine flow size distribution from traffic dump data
Hi Everyone, I generated a trace from Wireshark with 5 million packets. I need help analyzing the flow size distribution using a pandas DataFrame. Specifically, I want to find the 50th, 95th, and 99th percentiles of flow sizes. We can group the data based on the 5 tuples (Source_IP, Source_Port, Destination_IP, Destination_Port, Protocol) to identify each flow. Although it might not always be accurate, assuming all packets with the same tuple belong to one flow should work.