To run LogOnline, first install the dependencies:

```bash
pip install -r requirements.txt
```
In this paper, all experiments are conducted on two public log datasets, namely HDFS and BGL.
However, we do not include them in this repo due to their size.
The original HDFS and BGL datasets can be obtained from loghub.
Another recent survey on log-based anomaly detection maintains a collection of parsed log data, which can be obtained from this website.
You are encouraged to download these datasets before running our method. The downloaded files should be placed under the `/data` folder, in a subdirectory named after the dataset. For example, the parsed HDFS file `HDFS.log_structured.csv` should be placed at `/data/HDFS/HDFS.log_structured.csv`.
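For reference, the expected layout is sketched below (assuming the BGL file follows the same naming pattern as the loghub parsed output):

```
data/
├── HDFS/
│   └── HDFS.log_structured.csv
└── BGL/
    └── BGL.log_structured.csv
```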
The normality detection model is trained separately using the notebook in `src/autoencoder.ipynb`.
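As a rough illustration of this step, here is a minimal sketch of a sequence autoencoder trained to reconstruct normal log windows, with the reconstruction error serving as a normality signal. The architecture, dimensions, and training details below are placeholder assumptions and do not reflect the exact model in `src/autoencoder.ipynb`:

```python
import os
import torch
import torch.nn as nn

# Minimal sketch only: an LSTM autoencoder over windows of log-event embeddings.
# All sizes and hyperparameters are illustrative placeholders.
class LogAutoencoder(nn.Module):
    def __init__(self, input_dim=300, hidden_dim=128):
        super().__init__()
        self.encoder = nn.LSTM(input_dim, hidden_dim, batch_first=True)
        self.decoder = nn.LSTM(hidden_dim, input_dim, batch_first=True)

    def forward(self, x):
        encoded, _ = self.encoder(x)        # (batch, seq_len, hidden_dim)
        decoded, _ = self.decoder(encoded)  # reconstruct the input sequence
        return decoded

model = LogAutoencoder()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
criterion = nn.MSELoss()

# Dummy stand-in for real windows of normal log embeddings: (batch, seq, dim).
normal_log_windows = [torch.randn(8, 20, 300) for _ in range(10)]

for batch in normal_log_windows:
    optimizer.zero_grad()
    loss = criterion(model(batch), batch)  # reconstruction error on normal data
    loss.backward()
    optimizer.step()

os.makedirs("checkpoint", exist_ok=True)
torch.save(model.state_dict(), "checkpoint/autoencoder.pth")  # file name is illustrative
```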
Afterwards, the parameter files of the trained normality detection model are placed under the `checkpoint/` directory, where the anomaly detection model loads them during the evaluation phase.
For ease of use, we have already placed checkpoints for both the HDFS and BGL datasets under the `checkpoint/` directory.
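Assuming the checkpoints are standard PyTorch state dicts, you can inspect one like this (the file name below is hypothetical; use the actual files shipped in `checkpoint/`):

```python
import torch

# Hypothetical file name; substitute one of the files actually present in checkpoint/.
state_dict = torch.load("checkpoint/HDFS_autoencoder.pth", map_location="cpu")
print(state_dict.keys())  # list the saved parameter tensors
```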
You can run our proposed LogOnline with the following command:

```bash
bash scripts/run_unilog.sh
```
The generated log is written to a file under the `/log` directory.
You can modify the shell script to try different experimental settings, such as the dataset used, the session size and window size, the number of LSTM layers, and so on.
Check the possible parameters by running:

```bash
python src/train.py --help
```
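For example, a modified invocation inside `scripts/run_unilog.sh` might look like the following; every flag name here is a hypothetical placeholder, so verify the real parameter names against the `--help` output above before editing:

```bash
# Hypothetical flags for illustration only; check `python src/train.py --help`
# for the actual parameter names.
python src/train.py \
    --dataset BGL \
    --window_size 20 \
    --num_layers 2
```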
If you find this code helpful for your work or research, please cite:
```bibtex
@INPROCEEDINGS{LogOnline,
  author={Wang, Xuheng and Song, Jiaxing and Zhang, Xu and Tang, Junshu and Gao, Weihe and Lin, Qingwei},
  booktitle={2023 38th IEEE/ACM International Conference on Automated Software Engineering (ASE)},
  title={LogOnline: A Semi-Supervised Log-Based Anomaly Detector Aided with Online Learning Mechanism},
  year={2023},
  pages={141-152},
  doi={10.1109/ASE56229.2023.00043}
}
```