Project Wulfila - an electronic edition of the Gothic Bible
Search the text using regular expressions

Search options:

Case-sensitive search

[1] Use hv to transcribe the -character. Transcribing it as "hw" is not possible, as this combination occurs occasionally as two separate phonemes (e.g. the verb şairh-wisan among others). 

[2] Transcription of the thorn-character as "th" is equally problematic (e.g. the verb at-haitan among others).  You can instead:

  • ... type "ş" or "Ş" directly. How you do this depends on your operating system and settings. On a Windows platform you can type special characters using the numerical keypad and ALT-key: thorn is represented by ALT-0254 and ALT-0222 (capital). I don't know if the same or a similar mechanism applies to UNIX/Linux or Mac; of course, one can also copy / paste the characters above. 

  • ... type "c"  or "C" instead. The script will change them to thorn before executing the query (actually, it's the other way round- see [4]).  I chose this character simply because it is one of the few word characters that doesn't occur in the Gothic Bible (or its commonly used transcription). So if you are looking for wairşai wilja şeins you can type "waircai wilja ceins" (without the quotes).

[3] The text is searched on the verse-level, including punctuation. You can enter a word or a regular expression . If you are not familiar with regular expressions, have a look at the syntax, as it differs from classical expressions with wildcards typically used in word processors and online queries. For instance, if you want to find all instances of the lemma "wulfs", you could type "wulf\w*" (without quotes), which is interpreted as "find the string wulf followed by zero or more (= *) word characters (= \w)".

[4] A technical note: for the moment beinig, the search engine is nothing more than a simple loop that checks a flat file using Microsoft VBScript Regular Expressions version 5.5. Unfortunately, this particular implementation of regular expressions is (literally) rather basic and doesn't treat "ş" (nor any other non-English character for that matter) as a word character, which makes the \b en \w operators useless for most languages other than English. This situation is avoided by replacing "ş" by an unused word character in the files to be searched. So any thorn in the search field is replaced by "c", and not the other way round.