AI Malware Guardian's detection models were trained using publicly available research datasets. We gratefully acknowledge the authors of these datasets and their contributions to the security research community. All datasets are used strictly in accordance with their respective licenses.
Full title: COMISET: Dataset for the analysis of malicious events in Windows systems
Authors: Pérez-Sánchez, A., Palacios, R., & López, G. (2025)
Institution: Comillas Pontifical University
License: Creative Commons Attribution 4.0 International (CC BY 4.0)
Usage: COMISET was used to train the behavioral pattern detection component of AI Malware Guardian. The original dataset was not modified or redistributed; only derived model weights are incorporated into the product.
Citation:
Pérez-Sánchez, A., Palacios, R., & López, G. (2025). COMISET: Dataset for the analysis of malicious events in Windows systems [Data set]. Zenodo. https://doi.org/10.5281/zenodo.15375146
Full title: Binary-30K: A Large-Scale Multi-Platform Binary Dataset for Machine Learning Research
Author: Bommarito, Michael J., II (2025)
Dataset: huggingface.co/datasets/mjbommar/binary-30k-tokenized
Paper: github.com/mjbommar/binary-bpe-paper
License: Creative Commons Attribution 4.0 International (CC BY 4.0)
Usage: Binary-30K was used to train the baseline model for static PE file anomaly detection in AI Malware Guardian. The original dataset was not modified or redistributed; only derived model weights are incorporated into the product.
Citation:
@article{bommarito2025binary30k,
title={Binary-30K: A Large-Scale Multi-Platform Binary Dataset for Machine Learning Research},
author={Bommarito, Michael J., II},
journal={arXiv preprint},
year={2025},
url={https://github.com/mjbommar/binary-bpe-paper}
}
The Creative Commons Attribution 4.0 International license requires that attribution be provided when the licensed work is shared. The full text of the CC BY 4.0 license is available at creativecommons.org/licenses/by/4.0/legalcode.
AI Malware Guardian is built using open-source software components. The following notices are provided to fulfill the requirements of each component's license. A complete NOTICES.txt file is also bundled alongside the application in the installation directory.
Component: ONNX Runtime
Copyright © Microsoft Corporation. All rights reserved.
Licensed under the MIT License.
Source: github.com/microsoft/onnxruntime
Component: Tauri
Copyright © 2019–2024 Tauri Programme within The Commons Conservancy.
Licensed under either the MIT License or the Apache License 2.0.
Source: github.com/tauri-apps/tauri
This product also includes the following open-source libraries, each licensed under the MIT License and/or Apache License 2.0. Full copyright notices are listed in the bundled NOTICES.txt file.