학술논문
A Comparative Analysis of Embedding Techniques and Clustering Algorithms on Benchmark Datasets
이용수 0
- 영문명
- 발행기관
- 한국공공가치학회
- 저자명
- Min Seo Park Aaditya Yadav Amaan Dhada Donghoon Kim Ikshita Yadav Junkyung Lee
- 간행물 정보
- 『Journal of Public Value』Vol. 9, 69~84쪽, 전체 16쪽
- 주제분류
- 사회과학 > 사회복지학
- 파일형태
- 발행일자
- 2025.06.30

국문 초록
Purpose: To systematically evaluate and compare the effectiveness of various embedding techniques when combined with different clustering algorithms across diverse benchmark datasets, providing practical guidance for method selection based on dataset characteristics.
Method: We conducted comprehensive experiments using 12 embedding techniques (including UMAP, t-SNE, PCA, Isomap, and others) com-bined with 12 clustering algorithms (including K-Means, Gaussian Mixture Models, GenieClust, and others) across multiple dataset collections from ClustBench. Performance was evaluated using Normalized Clustering Accuracy (NCA) and Adjusted Rand Index (AR) as primary metrics.
Results: UMAP emerged as the top-performing embedding technique across all evaluation metrics, followed closely by t-SNE. GenieClust demonstrated superior performance among clustering algorithms, with Gaussian Mixture Models ranking second. The combination of Base embedding with GenieClust achieved the highest average performance, while computationally expensive embedding techniques generally outperformed simpler methods at the cost of scalability.
Conclusion: No single embedding-clustering combination dominates universally across all datasets. The study reveals important tradeoffs be-tween computational complexity and clustering performance, with UMAP and GenieClust showing consistently strong results. Method selection should be based on dataset characteristics, computational constraints, and performance requirements.
영문 초록
목차
1. Introduction
2. Related Work
3. Methodology
4. Experimental Setup
5. Results
6. Limitations
7. Conclusion
8. References
키워드
해당간행물 수록 논문
참고문헌
최근 이용한 논문
교보eBook 첫 방문을 환영 합니다!
신규가입 혜택 지급이 완료 되었습니다.
바로 사용 가능한 교보e캐시 1,000원 (유효기간 7일)
지금 바로 교보eBook의 다양한 콘텐츠를 이용해 보세요!
