A Comparative and Standardization Study of Immunology Terminology Across the Taiwan Strait: A Dual Metrics Perspective Based on Jaccard and Cosine Similarity
This study employs the Jaccard coefficient and cosine similarity to systematically assess the differences and integration of immunology terminology across the Taiwan Strait in terms of both character form and semantics. The results indicate that, at the character level, the similarity increased from 0.3894 to 0.6992 after conversion between traditional and simplified characters and through standardization efforts, representing a rise of 79.53%. This suggests that character form is a primary factor contributing to the differences in terminology across the Strait, but can be effectively bridged through standardization. At the semantic level, there is a high overall consistency (average cosine similarity of 0.687), with 40.8% of terms showing high consistency, 43.8% exhibiting moderate differences in word formation, and 15.4% demonstrating significant semantic divergence, mainly stemming from translation strategies and regional language habits. The study reveals the fundamental characteristic of immunology terminology across the Strait as being “formally different yet semantically similar” and proposes the establishment of a “dual-track governance” strategy to eliminate character barriers through technological means, focus on semantic alignment for low-similarity terms, and promote the creation of a dynamic terminology database and interface support, thereby achieving efficient integration of academic knowledge and barrier-free communication across the Strait.