Vulnerability Description
A vulnerability in the ArxivReader class of the run-llama/llama_index repository, versions up to v0.12.22.post1, allows for MD5 hash collisions when generating filenames for downloaded papers. This can lead to data loss as papers with identical titles but different contents may overwrite each other, preventing some papers from being processed for AI model training. The issue is resolved in version 0.12.28.
CVSS Score
MEDIUM
Affected Products
| Vendor | Product | Versions |
|---|---|---|
| Llamaindex | Llamaindex | < 0.12.28 |
Related Weaknesses (CWE)
References
- https://github.com/run-llama/llama_index/commit/0008041e8dde8e519621388e5d6f558bPatch
- https://huntr.com/bounties/80182c3a-876f-422f-8bac-38267e0345d6ExploitThird Party Advisory
- https://huntr.com/bounties/80182c3a-876f-422f-8bac-38267e0345d6ExploitThird Party Advisory
FAQ
What is CVE-2025-3044?
CVE-2025-3044 is a vulnerability with a CVSS score of 5.3 (MEDIUM). A vulnerability in the ArxivReader class of the run-llama/llama_index repository, versions up to v0.12.22.post1, allows for MD5 hash collisions when generating filenames for downloaded papers. This ca...
How severe is CVE-2025-3044?
CVE-2025-3044 has been rated MEDIUM with a CVSS base score of 5.3/10. Review the CVSS metrics above for detailed severity breakdown.
Is there a patch for CVE-2025-3044?
Check the references section above for vendor advisories and patch information. Affected products include: Llamaindex Llamaindex.