본문 바로가기

추천 검색어

실시간 인기 검색어

학술논문

A Comparative Analysis of Embedding Techniques and Clustering Algorithms on Benchmark Datasets

이용수 0

영문명
발행기관
한국공공가치학회
저자명
Min Seo Park Aaditya Yadav Amaan Dhada Donghoon Kim Ikshita Yadav Junkyung Lee
간행물 정보
『Journal of Public Value』Vol. 9, 69~84쪽, 전체 16쪽
주제분류
사회과학 > 사회복지학
파일형태
PDF
발행일자
2025.06.30
이용가능 이용불가
  • sam무제한 이용권 으로 학술논문 이용이 가능합니다.
  • 이 학술논문 정보는 (주)교보문고와 각 발행기관 사이에 저작물 이용 계약이 체결된 것으로, 교보문고를 통해 제공되고 있습니다. 1:1 문의
논문 표지

국문 초록

Purpose: To systematically evaluate and compare the effectiveness of various embedding techniques when combined with different clustering algorithms across diverse benchmark datasets, providing practical guidance for method selection based on dataset characteristics. Method: We conducted comprehensive experiments using 12 embedding techniques (including UMAP, t-SNE, PCA, Isomap, and others) com-bined with 12 clustering algorithms (including K-Means, Gaussian Mixture Models, GenieClust, and others) across multiple dataset collections from ClustBench. Performance was evaluated using Normalized Clustering Accuracy (NCA) and Adjusted Rand Index (AR) as primary metrics. Results: UMAP emerged as the top-performing embedding technique across all evaluation metrics, followed closely by t-SNE. GenieClust demonstrated superior performance among clustering algorithms, with Gaussian Mixture Models ranking second. The combination of Base embedding with GenieClust achieved the highest average performance, while computationally expensive embedding techniques generally outperformed simpler methods at the cost of scalability. Conclusion: No single embedding-clustering combination dominates universally across all datasets. The study reveals important tradeoffs be-tween computational complexity and clustering performance, with UMAP and GenieClust showing consistently strong results. Method selection should be based on dataset characteristics, computational constraints, and performance requirements.

영문 초록

목차

1. Introduction
2. Related Work
3. Methodology
4. Experimental Setup
5. Results
6. Limitations
7. Conclusion
8. References

키워드

해당간행물 수록 논문

참고문헌

최근 이용한 논문
교보eBook 첫 방문을 환영 합니다!

신규가입 혜택 지급이 완료 되었습니다.

바로 사용 가능한 교보e캐시 1,000원 (유효기간 7일)
지금 바로 교보eBook의 다양한 콘텐츠를 이용해 보세요!

교보e캐시 1,000원
TOP
인용하기
APA

Min Seo Park,Aaditya Yadav,Amaan Dhada,Donghoon Kim,Ikshita Yadav,Junkyung Lee. (2025).A Comparative Analysis of Embedding Techniques and Clustering Algorithms on Benchmark Datasets. Journal of Public Value, (), 69-84

MLA

Min Seo Park,Aaditya Yadav,Amaan Dhada,Donghoon Kim,Ikshita Yadav,Junkyung Lee. "A Comparative Analysis of Embedding Techniques and Clustering Algorithms on Benchmark Datasets." Journal of Public Value, (2025): 69-84

sam 이용권 선택
님이 보유하신 이용권입니다.
차감하실 sam이용권을 선택하세요.