Geometric methods of accelerating triangle-inequality-based k-means.

dc.contributor.advisorHamerly, Gregory James, 1977-
dc.creatorRysavy, Petr. 1991-
dc.date.accessioned2016-01-14T17:41:15Z
dc.date.available2016-01-14T17:41:15Z
dc.date.created2015-12
dc.date.issued2015-11-13
dc.date.submittedDecember 2015
dc.date.updated2016-01-14T17:41:15Z
dc.description.abstractOne of the most frequent ways how to cluster data is k-means. The standard way of solving the problem is iterative Lloyd's algorithm. This algorithm performs many redundant calculations. Elkan's and Hamerly's algorithms, the heap algorithm and many others eliminate this redundancy by maintaining a set of upper and lower bounds. The goal of this thesis is to further improve the runtime of those algorithms. Namely we improve the way how those bounds are maintained between iterations. By tighter updates of lower bounds we can further decrease the number of distance calculations. The other improvements stated in the thesis include elimination of centroids from the innermost loop of the algorithms when the bounds do not help. The common property of those two proposals is that they require only calculations that are done once per iteration. We also solve a problem that is left as open in the heap algorithm.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/2104/9570
dc.language.isoen
dc.rights.accessrightsWorldwide access.
dc.subjectClustering. K-means. Triangle inequality.
dc.titleGeometric methods of accelerating triangle-inequality-based k-means.
dc.typeThesis
dc.type.materialtext
thesis.degree.departmentBaylor University. Dept. of Computer Science.
thesis.degree.grantorBaylor University
thesis.degree.levelMasters
thesis.degree.nameM.S.

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
RYSAVY-THESIS-2015.pdf
Size:
2.69 MB
Format:
Adobe Portable Document Format
No Thumbnail Available
Name:
Petr_Rysavy_copyrightandavailabilityform.pdf
Size:
1.12 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
LICENSE.txt
Size:
1.95 KB
Format:
Plain Text
Description: