llmshield.matchers.dicts package
Module contents
Dictionary data files for entity matching.
- Description:
This subpackage contains text files with known entities used for dictionary-based entity detection. These files include lists of cities, countries, organisations, and common English words used to improve entity classification accuracy.
- Data Files:
cities.txt: List of world cities
countries.txt: List of countries and territories
organisations.txt: List of known organisations
english_corpus.txt: Common English words for filtering
- Author:
LLMShield by brainpolo, 2025