[PATCH] D59539: [llvm-exegesis] Option to lobotomize dbscan (PR40880)
Clement Courbet via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Mar 26 02:31:47 PDT 2019
courbet added a comment.
> To reword: because if I do simple clustering by opcode, I will then need to add yet another
> "stabilization" step: for each cluster, check that every measurement is a neighbor of all
> the other points in that cluster, and if they are not, mark the cluster as noise.
> (well, not every vs. every, just the lower/upper triangle excluding the diagonal)
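For reference, the pairwise check described in the quote could look roughly like the sketch below. It is only an illustration under simplifying assumptions: each measurement is reduced to a single scalar, "neighbor" means "within Epsilon", and the names `isNeighbour` and `isClusterStable` are hypothetical, not existing llvm-exegesis functions.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical neighbor test: two scalar measurements are neighbors when
// they differ by at most Epsilon. The real tool compares full measurement
// vectors; this is a simplified stand-in.
bool isNeighbour(double A, double B, double Epsilon) {
  return std::fabs(A - B) <= Epsilon;
}

// Returns true if every point in the cluster is a neighbor of every other
// point. Only the strict upper triangle (J > I) is visited, because the
// neighbor relation is symmetric and the diagonal is trivially true.
bool isClusterStable(const std::vector<double> &Cluster, double Epsilon) {
  assert(Epsilon >= 0.0 && "Epsilon must be non-negative");
  for (std::size_t I = 0; I < Cluster.size(); ++I)
    for (std::size_t J = I + 1; J < Cluster.size(); ++J)
      if (!isNeighbour(Cluster[I], Cluster[J], Epsilon))
        return false; // The caller would mark this cluster as noise.
  return true;
}
```

The check is quadratic in the cluster size, but its result does not depend on the order in which the points are visited.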
OK, I see, thanks. To sum up my understanding: in some areas, two clusters that should be separate are so noisy that a dense region connects them, so even a small epsilon will not separate them. You want to reject these merged clusters based on the variance of the points within the cluster.
One suggestion I have is to compute the variance within each cluster (this can be done incrementally as points are added to the cluster) and reject clusters whose variance exceeds a certain threshold. What do you think?
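To make the incremental part concrete, here is a minimal sketch using Welford's online algorithm. It assumes one scalar per measurement and a caller-supplied threshold; `RunningVariance`, `shouldRejectCluster` and `MaxVariance` are hypothetical names for illustration, not part of the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: running mean and variance of one measurement
// dimension, updated one point at a time (Welford's online algorithm).
class RunningVariance {
public:
  void add(double X) {
    ++N;
    const double Delta = X - Mean;
    Mean += Delta / static_cast<double>(N);
    M2 += Delta * (X - Mean); // Uses the already-updated mean.
  }
  std::size_t count() const { return N; }
  double mean() const { return Mean; }
  // Population variance; defined as 0 for an empty accumulator.
  double variance() const { return N ? M2 / static_cast<double>(N) : 0.0; }

private:
  std::size_t N = 0;
  double Mean = 0.0;
  double M2 = 0.0; // Sum of squared deviations from the current mean.
};

// Returns true if the cluster's variance exceeds MaxVariance, i.e. the
// cluster would be rejected. MaxVariance is an assumed tuning parameter.
bool shouldRejectCluster(const std::vector<double> &Points,
                         double MaxVariance) {
  assert(MaxVariance >= 0.0 && "threshold must be non-negative");
  RunningVariance RV;
  for (double P : Points)
    RV.add(P);
  return RV.variance() > MaxVariance;
}
```

In an actual clustering loop, `add` would be called as each point joins the cluster, so no second pass over the points is needed. The exact sums are the same whatever order the points arrive in, so the result can differ only by floating-point rounding.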
> I can do that instead, maybe that would even be better than this (no dependency on measurement ordering).
Yes, I would really like to avoid the dependence on the ordering.
Repository:
rL LLVM
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D59539/new/
https://reviews.llvm.org/D59539