High Precision Folder Matching (HPFM) for faster open source scanning

scanosss · 6/30/2025, 10:59:30 AM
SCANOSS just added a new feature to its CLI (scanoss-py): High Precision Folder Matching (HPFM).

Instead of matching every file individually, HPFM analyses the full structure of a scanned folder—file names, directory layout, extensions, and contents—to quickly determine which open source component it most likely represents.
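To make the idea concrete, here is a minimal Python sketch of what a folder-level fingerprint could look like. This is not SCANOSS's actual algorithm, which the post does not describe; the function name `folder_fingerprint` and the returned fields are made up for illustration. It also uses exact SHA-256 digests, which only detect identical folders. Matching altered or incomplete components would need similarity-preserving hashes instead.

```python
import hashlib
from pathlib import Path


def folder_fingerprint(root: str) -> dict:
    """Summarise a folder by its file names, layout, extensions and contents.

    Hypothetical illustration only, not the HPFM implementation.
    """
    root_path = Path(root)
    # Sort by relative path so the result does not depend on filesystem order.
    files = sorted(
        (p for p in root_path.rglob("*") if p.is_file()),
        key=lambda p: p.relative_to(root_path).as_posix(),
    )

    names = hashlib.sha256()     # file names only
    layout = hashlib.sha256()    # relative directory of each file
    contents = hashlib.sha256()  # per-file content digests
    extensions = {}              # extension -> number of files

    for path in files:
        rel = path.relative_to(root_path)
        names.update(rel.name.encode() + b"\0")
        layout.update(rel.parent.as_posix().encode() + b"\0")
        contents.update(hashlib.sha256(path.read_bytes()).digest())
        ext = path.suffix.lower()
        extensions[ext] = extensions.get(ext, 0) + 1

    return {
        "file_count": len(files),
        "names": names.hexdigest(),
        "layout": layout.hexdigest(),
        "contents": contents.hexdigest(),
        "extensions": extensions,
    }
```

Keeping the name, layout and content digests separate means two copies of a component with the same structure but edited files still agree on `names` and `layout` and differ only on `contents`.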

Think of it like scanning a car: traditional scanning lists all the parts; HPFM tells you “this is a Peugeot 2008.”

Why it’s useful:

– Speeds up scanning in high-volume environments
– Reduces noise by focusing on the dominant component
– Can improve the accuracy of follow-up scans by providing a “hint”
– Helps detect altered or incomplete components

Available now in scanoss-py.

Docs/GitHub: https://github.com/scanoss/scanoss.py

CLI: scanoss-py folder-hash <directory>

Would love to hear feedback, suggestions, or use cases this might help with.

Comments (1)

scanosss · 5h ago
Small fix on the CLI usage, thanks to those who flagged it.

The correct command is:

scanoss-py folder-scan <directory>

folder-scan (fs) --> Scan the given directory using folder hashing

folder-hash (fh) --> Produce a folder hash for the given directory
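For anyone who wants to call these two subcommands from a script, a small Python wrapper might look like the sketch below. The helper names `build_command` and `run_folder_command` are hypothetical. The only things taken from this thread are the `scanoss-py` executable and the `folder-scan` / `folder-hash` subcommands with a directory argument; no other flags or any output format are assumed.

```python
import shutil
import subprocess

# Subcommands named in this thread; nothing else is assumed about the CLI.
SUBCOMMANDS = {"folder-scan", "folder-hash"}


def build_command(subcommand: str, directory: str) -> list:
    """Return the argument list for one of the two folder subcommands."""
    if subcommand not in SUBCOMMANDS:
        raise ValueError(f"unsupported subcommand: {subcommand}")
    return ["scanoss-py", subcommand, directory]


def run_folder_command(subcommand: str, directory: str):
    """Run the command if scanoss-py is on PATH, otherwise return None."""
    if shutil.which("scanoss-py") is None:
        return None  # CLI not installed on this machine
    # check=False: leave exit-code handling to the caller.
    return subprocess.run(
        build_command(subcommand, directory),
        capture_output=True,
        text=True,
        check=False,
    )
```

Passing the arguments as a list rather than a shell string avoids quoting problems with directory names that contain spaces.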