<html>
    <head>
      <base href="https://bugs.llvm.org/">
    </head>
    <body><table border="1" cellspacing="0" cellpadding="8">
        <tr>
          <th>Bug ID</th>
          <td><a class="bz_bug_link 
          bz_status_NEW "
   title="NEW - [llvm-exegesis] Analysis: smarter clustering?"
   href="https://bugs.llvm.org/show_bug.cgi?id=40880">40880</a>
          </td>
        </tr>

        <tr>
          <th>Summary</th>
          <td>[llvm-exegesis] Analysis: smarter clustering?
          </td>
        </tr>

        <tr>
          <th>Product</th>
          <td>tools
          </td>
        </tr>

        <tr>
          <th>Version</th>
          <td>trunk
          </td>
        </tr>

        <tr>
          <th>Hardware</th>
          <td>PC
          </td>
        </tr>

        <tr>
          <th>OS</th>
          <td>Linux
          </td>
        </tr>

        <tr>
          <th>Status</th>
          <td>NEW
          </td>
        </tr>

        <tr>
          <th>Severity</th>
          <td>enhancement
          </td>
        </tr>

        <tr>
          <th>Priority</th>
          <td>P
          </td>
        </tr>

        <tr>
          <th>Component</th>
          <td>llvm-exegesis
          </td>
        </tr>

        <tr>
          <th>Assignee</th>
          <td>unassignedbugs@nondot.org
          </td>
        </tr>

        <tr>
          <th>Reporter</th>
          <td>lebedev.ri@gmail.com
          </td>
        </tr>

        <tr>
          <th>CC</th>
          <td>clement.courbet@gmail.com, gchatelet@google.com, llvm-bugs@lists.llvm.org
          </td>
        </tr></table>
      <p>
        <div>
        <pre>Even with all the fixes, I'm still seeing some confusing behavior.
Even with `-analysis-clustering-epsilon=0.001`, instructions with very
different measurements (e.g. rthroughput of 1 vs rthroughput of 2) end up
in the same cluster. I'm not sure if it is a bug or not, but I'm very
sure this is not the behavior one wants.

Has anyone else seen this? I'm not sure how to provide a reproducer
without uploading the entire measurement set.

Any thoughts on this? 
Perhaps the clustering algo is simply misimplemented?

Or maybe some other algorithm should be used?
E.g. go through every SchedWrite class, group all the measurements
for all the instructions that are within this SchedWrite class into
a cluster, then split the outliers off into their own clusters?</pre>
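        <pre>A minimal sketch of the per-SchedWrite idea above, in Python rather than
llvm-exegesis code. Everything in it is an assumption made for illustration:
the (instruction, sched_class, value) tuple layout, the function name
`cluster_by_sched_class`, and the outlier rule (distance from the class
median above a fixed tolerance). The report does not say how outliers
should be detected.

```python
from collections import defaultdict
from statistics import median
import math


def cluster_by_sched_class(measurements, tolerance=0.1):
    """Group measurements per sched class, then split off the outliers.

    measurements: list of (instruction, sched_class, value) tuples.
    This layout is hypothetical, not the llvm-exegesis benchmark format.
    Returns a list of (sched_class, [instructions]) clusters.
    """
    # Step 1: one bucket per sched class.
    by_class = defaultdict(list)
    for instr, sched_class, value in measurements:
        by_class[sched_class].append((instr, value))

    clusters = []
    for sched_class, points in by_class.items():
        # Step 2: use the median as a robust center for the class.
        center = median(value for _, value in points)
        core, outliers = [], []
        for instr, value in points:
            # Assumed outlier rule: absolute distance from the median.
            if math.isclose(value, center, rel_tol=0.0, abs_tol=tolerance):
                core.append(instr)
            else:
                outliers.append(instr)
        clusters.append((sched_class, core))
        # Step 3: every outlier becomes its own single-element cluster.
        clusters.extend((sched_class, [instr]) for instr in outliers)
    return clusters
```

With made-up values where one instruction in a class measures 2.0 and the
others measure about 1.0, the 2.0 instruction lands in its own cluster
instead of being merged with the rest of the class.</pre>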
        </div>
      </p>


    </body>
</html>