- Tài khoản và mật khẩu chỉ cung cấp cho sinh viên, giảng viên, cán bộ của TRƯỜNG ĐẠI HỌC FPT
- Hướng dẫn sử dụng:
Xem Video
.
- Danh mục tài liệu mới:
Tại đây
.
-
Đăng nhập
:
Tại đây
.
Trí tuệ nhân tạo Artificial Intelligence Domain Adaptation Semantic Search
Issue Date:
2023
Publisher:
FPTU Hà Nội
Abstract:
Semantic search is an evolution in both accuracy and flexibility of the search engine. Unlike traditional keyword-based searches, it delves deeper by comprehending the semantics behind words and user queries, resulting in more accurate and relevant search outcomes. In this research, we conduct experiments on a telecommunication domain news dataset to see how its data differs from general model training data and how these differences affect the model feature extraction output and evaluate the model performance in building a search engine.
We will compare different architecture and model used in semantic search task. Furthermore, we conduct investigations of the method to finetune a feature extraction BERT [1] on processed data and the use of Multiple Negative Ranking Loss [2] when the data for the Semantic Textual Similarity training task is not in the ideal format of Premise-Hypothesis-Label.
Finally, we evaluate the model inference performance on a complete pipeline to ensure compelling business requirements. The results show that when data is not ideal, a semantic search model based on Transformers [3] still achieves great retrieval rates on a human-evaluated keywords dataset. We succeeded in creating a highly accurate model while compelling to the required speed on a low-end system with no GPU