6k.txt | Almost
This article details a project that uses a 6,000-sample (6k) dataset to build a malware triage system. It uses radare2 for analysis and Elasticsearch to store results, highlighting a practical approach to building a "homelab" malware database.
Other search results refer to a "6k example" reference dataset extracted from Reddit by Zhang et al. (2020), used to evaluate bias in machine learning models.
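As a rough illustration of the radare2-plus-Elasticsearch triage pipeline mentioned in the article summary, the sketch below builds one record per sample and optionally indexes it. It is not the article's code: the function names, the record fields, the index name `malware-triage`, and the local URL are all assumptions made for illustration. It also assumes the third-party `r2pipe` and `elasticsearch` Python packages (client version 7.15 or later, for the `document=` argument) plus a radare2 install and a reachable Elasticsearch node.

```python
import hashlib
import json


def build_triage_doc(sample_bytes, r2_info=None):
    """Assemble one JSON-serializable record for a sample (hypothetical schema)."""
    return {
        "sha256": hashlib.sha256(sample_bytes).hexdigest(),
        "size": len(sample_bytes),
        # Whatever radare2 reported for the file; empty if analysis was skipped.
        "r2_info": r2_info or {},
    }


def analyze_with_radare2(path):
    """Return radare2's file-info JSON for `path`. Requires r2pipe and radare2."""
    import r2pipe  # third-party; imported lazily so the rest runs without it

    r2 = r2pipe.open(path)
    try:
        return r2.cmdj("ij")  # `ij` prints binary/file information as JSON
    finally:
        r2.quit()


def store(doc, index="malware-triage", url="http://localhost:9200"):
    """Index the record in Elasticsearch, keyed by hash so re-runs overwrite."""
    from elasticsearch import Elasticsearch  # third-party, lazily imported

    es = Elasticsearch(url)
    es.index(index=index, id=doc["sha256"], document=doc)


if __name__ == "__main__":
    # Runs without radare2 or Elasticsearch: only the record is built and printed.
    print(json.dumps(build_triage_doc(b"example bytes"), indent=2))
```

Using the SHA-256 as the document ID is one way to keep a sample database free of duplicates when the same file is submitted twice; a real setup would likely add more radare2 output (imports, strings, sections) to each record.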