The art of time-bending: Data augmentation and early prediction for efficient traffic classification

Research output: Contribution to journal › Article › peer-review

Abstract

Accurate identification of internet traffic is crucial for network management. However, encryption and constant changes in network protocols make it difficult to extract useful features for traffic classification. Limited data availability and a lack of diversity within datasets pose further challenges. To address these issues, we propose a data augmentation technique that uses LSTM networks to create synthetic data points that closely resemble real traffic data, enriching the dataset used for training. We validated the approach experimentally on academic and commercial datasets and found that combining LSTM-generated data with actual traffic data improves classification efficiency. Our classifier, trained on the generated data, showed a performance boost of 6%. Moreover, when classifying with only half of the observation time, and thus half of the signal, our approach achieved a 4% improvement over the original classifier. Including augmented samples in the training set improved both accuracy and F1-score. These findings demonstrate the practical utility of our data augmentation strategy for earlier prediction with improved performance in encrypted traffic classification systems.
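The two ideas in the abstract, mixing synthetic flows into the training set and classifying from only half of the signal, can be illustrated with a minimal sketch. This is not the authors' implementation: the flow representation (a list of per-time-step values), the function names `truncate_flow` and `build_training_set`, and the sample values are all hypothetical, and the synthetic flows are simply passed in rather than produced by an LSTM.

```python
from typing import List, Sequence, Tuple

# Hypothetical representation: one numeric value per time step of a flow.
Flow = List[float]
LabelledFlow = Tuple[Sequence[float], str]


def truncate_flow(flow: Sequence[float], fraction: float = 0.5) -> Flow:
    """Keep only the leading `fraction` of a flow's time steps (at least one)."""
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")
    keep = max(1, int(len(flow) * fraction))
    return list(flow[:keep])


def build_training_set(
    real: Sequence[LabelledFlow],
    synthetic: Sequence[LabelledFlow],
    fraction: float = 0.5,
) -> List[Tuple[Flow, str]]:
    """Merge real and synthetic labelled flows, truncating each one
    so a classifier can be trained for early prediction."""
    combined = list(real) + list(synthetic)
    return [(truncate_flow(flow, fraction), label) for flow, label in combined]


# Illustrative values only; synthetic flows would come from a generator model.
real = [([100.0, 1500.0, 40.0, 1500.0], "video")]
synthetic = [([90.0, 1400.0, 60.0, 1450.0], "video")]
train = build_training_set(real, synthetic, fraction=0.5)
```

With `fraction=0.5`, each four-step flow is reduced to its first two steps, so any downstream classifier sees half of the signal for both real and synthetic samples.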

Original language: English
Article number: 124166
Journal: Expert Systems with Applications
Volume: 252
State: Published - 15 Oct 2024

Keywords

  • Data augmentation
  • Internet traffic classification
  • Long Short-Term Memory (LSTM) networks
