본문 바로가기

추천 검색어

실시간 인기 검색어

학술논문

A Comparative Analysis of Embedding Techniques and Clustering Algorithms on Benchmark Datasets

이용수  0

영문명
발행기관
한국공공가치학회
저자명
Min Seo Park Aaditya Yadav Amaan Dhada Donghoon Kim Ikshita Yadav Junkyung Lee
간행물 정보
『Journal of Public Value』Vol. 9, 69~84쪽, 전체 16쪽
주제분류
사회과학 > 사회복지학
파일형태
PDF
발행일자
2025.06.30
4,720

구매일시로부터 72시간 이내에 다운로드 가능합니다.
이 학술논문 정보는 (주)교보문고와 각 발행기관 사이에 저작물 이용 계약이 체결된 것으로, 교보문고를 통해 제공되고 있습니다.

1:1 문의
논문 표지

국문 초록

Purpose: To systematically evaluate and compare the effectiveness of various embedding techniques when combined with different clustering algorithms across diverse benchmark datasets, providing practical guidance for method selection based on dataset characteristics. Method: We conducted comprehensive experiments using 12 embedding techniques (including UMAP, t-SNE, PCA, Isomap, and others) com-bined with 12 clustering algorithms (including K-Means, Gaussian Mixture Models, GenieClust, and others) across multiple dataset collections from ClustBench. Performance was evaluated using Normalized Clustering Accuracy (NCA) and Adjusted Rand Index (AR) as primary metrics. Results: UMAP emerged as the top-performing embedding technique across all evaluation metrics, followed closely by t-SNE. GenieClust demonstrated superior performance among clustering algorithms, with Gaussian Mixture Models ranking second. The combination of Base embedding with GenieClust achieved the highest average performance, while computationally expensive embedding techniques generally outperformed simpler methods at the cost of scalability. Conclusion: No single embedding-clustering combination dominates universally across all datasets. The study reveals important tradeoffs be-tween computational complexity and clustering performance, with UMAP and GenieClust showing consistently strong results. Method selection should be based on dataset characteristics, computational constraints, and performance requirements.

영문 초록

목차

1. Introduction
2. Related Work
3. Methodology
4. Experimental Setup
5. Results
6. Limitations
7. Conclusion
8. References

키워드

해당간행물 수록 논문

참고문헌

교보eBook 첫 방문을 환영 합니다!

신규가입 혜택 지급이 완료 되었습니다.

바로 사용 가능한 교보e캐시 1,000원 (유효기간 7일)
지금 바로 교보eBook의 다양한 콘텐츠를 이용해 보세요!

교보e캐시 1,000원
TOP
인용하기
APA

Min Seo Park,Aaditya Yadav,Amaan Dhada,Donghoon Kim,Ikshita Yadav,Junkyung Lee. (2025).A Comparative Analysis of Embedding Techniques and Clustering Algorithms on Benchmark Datasets. Journal of Public Value, (), 69-84

MLA

Min Seo Park,Aaditya Yadav,Amaan Dhada,Donghoon Kim,Ikshita Yadav,Junkyung Lee. "A Comparative Analysis of Embedding Techniques and Clustering Algorithms on Benchmark Datasets." Journal of Public Value, (2025): 69-84

결제완료
e캐시 원 결제 계속 하시겠습니까?
교보 e캐시 간편 결제