TOBAR NA GAEDHILGE 1.14

Ciarán Ó Duibhín, 2023/10/05
Clár cinn / Home page / Page d'accueil / Hauptseite / Главная страница
My email address

What it does
    • What the results may look like
    • What Gaelic texts can be searched
    • Requesting Gaelic forms
    • More about what to expect in a Gaelic index
    • Searching in other languages
    • Translation equivalents
Installation
History
New this time
The texts in detail
Keyboard layouts
Miscellaneous


For clarity, we will generally avoid the ambiguous term “word”, and use instead either “form” or “token”, depending on which is meant.
As an example, in the sentence Ba é an t-arán an t-anlann. there are 6 tokens, but only 5 distinct forms — there are 2 tokens of the form an.

What it does

Tobar na Gaedhilge is a searchable textbase of high-quality 20th-century Gaelic texts (mostly Irish, with some Scottish), best described as "continuity Gaelic" in its several naturally-occurring varieties. The textbase contains over 5.9 million tokens of Gaelic text, and is freely downloadable for installation on a personal computer under MS Windows.  After the software is installed, a form may be requested, and examples of its use may be viewed. The purpose of Tobar na Gaedhilge is to allow the texts to be used as a lexical and grammatical resource. To protect the rights of the authors and publishers, the texts are not made available in continuously readable form.

A summary of some of the user-friendly features of the system:
• texts are grouped into collections, each for a particular naturally-occurring variety of Gaelic
• a complete pick-list of the forms in the chosen collection is presented on-screen
• forms are listed with initial mutations removed, eg. fear, fhear, bhfear are all grouped under fear; while athair, h-athair, n-athair, t-athair are all grouped under athair
• forms are listed with enclitics separated, eg. d'ith is listed under d' and ith; agamsa is listed under agam and sa
• apostrophes are distinguished from quotation marks and are retained in forms, eg. 'ach is not conflated with ach
• forms split at end-of-line in the original text are rejoined in the list, where appropriate
• collocations of forms may be found, either in immediate sequence, or within a sentence-like unit
• an analytic approach is taken to compound forms, eg. sean-amhrán is treated as a sequence of sean, -, and amhrán
• accents may optionally be discounted for searching, eg. comhartha may be made to retrieve both comhartha and cómhartha
• an asterisk may be used as a wild-card to match all or part of a form, eg. beir* or *stin
These features were designed into the program from the beginning, and the texts were prepared with the markup necessary to support them.

Tobar na Gaedhilge is completely free of adware, spyware or other harmful inclusions.

In addition, the textbase contains translations of some of the Gaelic texts into or from English, French, German and Russian. These will be collectively referred to as other languages. Retrieved segments of Gaelic text may be displayed in other languages, if the translations are stored; and further, the forms occurring in the stored texts in other languages may be used to search the textbase.

• What the results may look like

We begin by showing the results of some typical searches, and how to explore those results. We will later show how to request a search.

We may show sentences containing a selected Gaelic form, taking one example each from the Munster, Connacht and Scottish texts. Thereafter, we will draw our examples from the Ulster texts, which form by far the largest part of the material stored. (Contributions of Munster, Connacht or Scottish texts are most welcome.)

Figure 1: We looked in the Munster texts for examples of the form cábóg (a country person). We found two examples in Pádraig Ua Maoileoin, Na hAird Ó Thuaidh, and we show the second example here. Page and line reference is given to the published book.

Screenshot of Munster sentence view

In passing, a small thing to notice in this example is the hyphen in "shráid-eanna". This shows that the token was hyphenated (at the end of a line) in the original book. But the hyphen does not interfere with indexing the sentence, as use of the Innéacsáilte display will confirm.

The navigation panel (at upper right) allows us to move around the retrieved examples. The display panel (at lower right) allows the form in which the sentence is displayed to be controlled. These are described following Figure 6 below.

Figure 2: We looked in the Connacht texts for examples of the sequence of forms lúb gaoil (blood relationship), and we found two examples, both in Séamus Mag Uidhir, Fánaidheacht i gConndae Mhuigheo. We show the first sentence here.

Screenshot of Connacht sentence view

Figure 3: We looked in the Scottish texts for examples of forms beginning with càr. With a request like this for forms matching a general pattern, clearly more than one form may match — we refer to this situation as matching a disjunction of forms. The sentences for each matching form are presented (over all the relevant books) before presenting the sentences for the next matching form. Here, we show a sentence containing the matching form càraich (fix).

Screenshot of Scottish sentence view

Besides viewing complete sentences as above (View: Abairteacha (il-teangthach)), three other ways of viewing results are provided, which are more compact for large quantities of text. (The view can be chosen at the same time as the search forms are chosen.) Output from these views is shown in the next two figures.

Figure 4: The frequency (Minicidheacht) view of the form saoghal (life) in the texts of the current collection.

Screenshot of frequency view

Scrolling may be necessary to reveal the information for all the texts. If the request matched a disjunction of forms, a navigation option AthFhocal (next form) will proceed to the next matching form. A navigation option Réidh (finished) leaves the results and returns to the search screen.

Figure 5: A keyword-in-context view (KWIC) of the form athair (father) in Séamus 'ac Grianna, Thiar i dTír Chonaill. The first batch of occurrences is shown.

Screenshot of concordance view

For the KWIC view, the navigation panel at the upper right allows us to move down a screenful (Síos), up a screenful (Suas), to start of text (Suas go bárr), to end of text (Síos go bun). These options allow us to move around move only within the current text, and (for a disjunction of matching forms) only for the current form.

To move to the next book of the collection, we use AthLeabhar. If the request matched several forms in disjunction, AthFhocal will take us to the first book for next matching form. The Réidh (finished) option leaves the results and returns to the search screen. Some of the navigation options are provided with keyboard shortcuts, which may prove convenient for frequent use. (The AthThlámán option is equivalent to Síos.)

It is not currently possible to move to the previous book or to a previous matching form. But we can get back to the start of the retrieved results by choosing Réidh and returning to the results without altering the previous search request.

Figure 5a: A monolingual by sentence view (Abairteacha (aon-teangthach)) of the form athair (father) in Séamus 'ac Grianna, Thiar i dTír Chonaill. The first batch of occurrences is shown.

Screenshot of monolingual sentence view

In the Abairteacha (aon-teangthach) view, whole sentences are shown, rather than single lines as in the KWIC view. The first batch of sentences is shown, and the navigation panel allows us to move down a batch (Síos) or move to the start of text (Suas go bárr). The further options move up a batch (Suas) and move to end of text (Síos go bun) are not yet properly programmed. These options allow us to move around move only within the current text, and (for a disjunction of matching forms) only for the current form.

The monolingual option differs from the multilingual by sentence (Abairteacha (il-teangthach)) view which we have seen in earlier by-sentence figures, and which is the default option. The multilingual view shows only one sentence at a time, but it offers other option for the display of that sentence.

Figure 6: A multilingual by sentence view (Abairteacha (il-teangthach)) of the form oidhreógach (ice) in Seosamh 'ac Grianna, Pádraic Ó Conaire agus Aistí Eile. The first (and, as it happens, only) such sentence in this text is displayed.

Screenshot of multilingual sentence view

Here, the navigation panel allows us to move to the next retrieved sentence (Síos), to the previous retrieved sentence (Suas), to the first retrieved sentence of the text (Suas go bárr), or to the last retrieved sentence of the text (Síos go bun). These options allow us to move around move only within the current text, and (for a disjunction of matching forms) only for the current form.

In either of the by sentence views, we use AthLeabhar to move to the next book of the collection. If the request matched several forms in disjunction, AthFhocal will take us to the first book for next matching form. The Réidh (finished) option leaves the results and returns to the search screen. Some of the navigation options are provided with keyboard shortcuts, which may prove convenient for frequent use. (The AthAbairt option is equivalent to Síos.)

It is not currently possible, in either of the by sentence views, to move to the previous book or to a previous matching form. But we can get back to the start of the retrieved results by choosing Réidh and returning to the results without altering the previous search request.

The extra facilities available in the multilingual by-sentence option are activated by the display panel at the lower right. Firstly, it allows the text of the sentence in the main language (ie. the language of the index in use) to be shown in a choice of ways: plain and uncorrected (Foillsighthe); plain but corrected (Lom); including mark-up (Marcáilte); or as a list of the tokens by which it is indexed (Innéacsáilte). The Innéacsáilte option allows you to see which index terms will fetch this sentence. This will allow you to see how the text has been tokenized and indexed.

To explain what is meant by "correction": this is limited to obvious errors. An isolated spelling variant which is at odds with consistent usage in the rest of the same book may also be corrected; but beyond this, we make no attempt at normalization of forms which are not clearly in error. Corrections to Gaelic text may also include restoring the wording of the manuscript where known. And correction of a text in a language other than Gaelic may be used to bring that text closer to the edition used by the Gaelic translater. None of these correction processes can be guaranteed to have been applied exhaustively. The corrections are applied to the indexes; and to the displayed sentences under the Lom option. The uncorrected book text will always be displayed under the Foillsighthe option.

As an example, the misprint cómhhartha occurs in Ben-Hur on page 337, and should clearly be corrected and indexed as cómhartha. So if we search for cómhartha, we will find the relevant sentence, and if we view it under the default Lom option, we will see the token as cómhartha. But if viewed under the Foillsighthe option, we will see it, uncorrected, as cómhhartha. Under the Marcáilte option, we will see the complete markup as cómh[h]artha. And under the Innéacsáilte option, we will see it confirmed that the token is indexed as cómhartha.

Secondly, in the multilingual by-sentence option, the display panel allows the translation of the sentence into other languages to be shown, if stored. To do this, tick the names of the desired languages, visible on the sentence-level displays above, labelled Béarla (English) and Fraincis (French) and Gearmáinis (German) and Rúisis (Russian). If the name of the language is italicised, this means that a translation in that language is not available for the sentence to be displayed. At present, translations or originals are held only for texts in the Ulaidh collection, and only for a proportion of this: English, 2.39M tokens; French, 0.90M tokens; German, 0.92M tokens; Russian, 0.14M tokens. In these other languages, the sentence is always shown plain and corrected.

Figure 7: Sentences containing the form creafadaigh (shaking); the first of two examples found in Seosamh 'ac Grianna, Seideán Bruithne/Amy Foster. English and French translations are available and are shown.

Screenshot of Gaelic sentences with translations

When a sentence is shown in several languages, the choice of display between plain (Lom); including mark-up (Marcáilte); as a list of the tokens by which it is indexed (Innéacsáilte); or plain and uncorrected exactly as printed (Foillsighte) applies only to the language of the index used. In other languages the sentence is shown plain.

The page and line number is shown for the displayed sentence. The page and line number are taken from the edition used, and they can be displayed only if the computer-readable version of the text is paginated — see the list of texts below for this information. All the Gaelic texts are paginated, but some in other languages, texts which have been obtained from various sources may not be paginated.

Several other options on the navigation panel are common to the KWIC and both by-sentence views.

Cóipeáil (copy) copies the current display to a textfile, which by default, is called samplaí.txt and is placed in the My Documents folder, and the copied material is appended to it. Comhad Cóipeála (copy file) allows the name and location of the file to be changed, and also the mode from append to overwrite (but it will revert to appending after the first copy). The darker elongated panel shown in Figure 6 is the result of clicking Comhad Cóipeála.

Retrieval of a collocation may require the creation of workfiles. These will be placed in the temporary workspace folder, if possible. In the unlikely event that this is not possible, you will be prompted for the name of a folder to hold workfiles; you could, for example, place them on a memory stick.

• What Gaelic texts can be searched

Gaelic is found in several slightly different forms, and the texts are organized into collections to reflect this and to keep each collection fairly homogeneous in language.  The five collections supplied are (as of 2023/10/04):

  • Ulaidh (Ulster)           Index: Gaedhilg. 17 authors;  74 books;  57,012 forms;  5,132,477 tokens
  • Connachta                 Index: Gaedhilge.  8 authors;  10 books;  22,982 forms;    406,316 tokens
  • Mumhain (Munster)  Index: Gaolainn.   6 authors;   6 books;  13,375 forms;    226,404 tokens
  • Alba (Scotland)          Index: Gàidhlig.    4 authors;   5 books;    8,021 forms;   115,996 tokens
  • Oirthear (Eastern)     Index: Gaodhlag.   2 regions;    3 books;    8,700 forms;  104,842 tokens

At the present stage of development, the Ulaidh collection is much larger than any of the others.

The identities of the texts in all collections are listed in full below.

Searching may be restricted to any chosen subset of the texts of a collection, by deselecting temporarily individual texts or authors.

Each collection has a pre-compiled index associated with it, made from the forms found in the relevant books, and in which requests for forms are looked up. The statistics just given for the collections refer to these indexes — indexes of forms found in the Gaelic texts. This does not mean that the forms in the index are exclusively Gaelic forms; rather they will reflect whatever the Gaelic texts contain, including borrowings, phrases and quotations in other languages.

Much more detail will be given later about what forms may be found in the indexes.

 

Figure 8: This is the program's opening screen, and the first task is to choose the desired collection and index.

Screenshot of opening screen

The program should show a list of the available collections, as in blue above — together with some statistics of the highlighted collection (below) and a pick list of its index (on the right). If there are no collections listed, you may be in the wrong folder, and you can browse (using Cuirtear Lorg) to a different folder. The indexes available to the highlighted collection are shown in brown. When you have marked the desired collection and chosen your index to it, click on Isteach to enter the collection/index.

Amach is to exit the program.

Treoir is for help.

• Requesting Gaelic forms

When a collection has been selected, and Isteach clicked, the display changes to that shown in the next figure, which allows you to type forms and to choose a view, among other things.

Figure 9: Requesting Gaelic forms.

Screenshot of requesting forms

Before entering our own forms, however, first notice that this screen allows you to go back and change to a different collection and index, by using the Athruigh button on the Cnuasacht panel. And also, that you may see which authors and books are included in the current collection and index by using the Athruigh button on the Leabharthaí panel, and you may choose to select temporarily a subset of those books. (Note that selecting a subset of a collection is not reflected in the pick-list of forms, which remains that for the whole collection.)

From the Radharc (views) panel, you may choose your view of the results: frequencies (Minicidheacht); a keyword-in-context concordance (KWIC); batched sentences (Abairteacha (aon-teangthach)); or single sentences with extra display options (Abairteacha (il-teangthach)). Samples of each view have already been shown above.

And now we come to the Focal panel, where the desired form or forms may be typed into the box provided, or may be inserted there by double-clicking them on the pick list, which is a displayed segment of the collection's index. The pick list accommodates itself to the existing contents of the box, as a guide to what words are available.

If typing into the box, any accented character should be pre-composed, not a combination — e.g. type á as normal, NOT a followed by a combining acute accent.

As an alternative to typing it into the box, a search form may also be chosen by double-clicking it on the pick list, or it may be pasted from the Windows clipboard (Ctrl/V).  The new form will be appended to anything already in the box (and which is not selected); if the existing material in the box is already selected — which is the default — the new search form will replace it. TIP: when working between the box and the picklist, you can cycle the focus around these and the other screen items by repeatedly pressing the TAB key. While an item has focus, any content which is selected will be visibly highlighted.

In the box you may put:
• a form, such as oidhreógach or saoghal or athair, as used in our previous examples
• two or more forms occurring together (eg. lúb gaoil)
• any form may contain a wild-card (*), that is, an asterisk which matches any number of letters, including none.  For example, all forms with a particular stem may be sought (eg. beir*), or all forms with a particular termination (eg. *stin).

If the box contains more than one form (ie. there is a space within it), you are asked to choose between seeking the forms directly adjacent and in the given order; or within the same sentence in any order. (Note however that the KWIC display mode is unsuitable when asking for consecutive forms; in this case, the monolingual by sentence mode is substituted.)

It is even possible to give one or more of the forms as simply the asterisk (*), which matches any form; the search is then assumed to be a consecutive one. (But avoid giving * as the final form, as the search will be unnecessarily slow.) As we will see below (under demutation) a hyphen is, in most circumstances, counted as a separate form, so search for sean-bhean as three forms: sean - bean (as well as sean bean and seanbhean to cover any unhyphenated instances).

You may tick the Gan beinn ar an tsíneadh fhada checkbox if you want to include forms which differ from that requested only by the presence or absence of an accent, eg. comhradh with this checkbox ticked will match comhrádh, cómhradh and cómhrádh as well. Actually, ticking this box also includes forms which differ from that requested:
• by the presence of ANY accent;
• by a difference of CASE, though this affects only lemma indexes and the German form index, as case differences are already neutralized in other form indexes;
• by the intrusion of certain non-alphanumeric characters, such as £, %, period (indicating abbreviation), apostrophe (indicating elision), hyphen (indicating anonymisation), etc. For example, 2 will retrieve also £2 or 2% or or ; but note that many Gaelic forms containing apostrophes or hyphens are already treated as compounds and indexed as two or more separate parts, as explained under decompounding below.

To type accented vowels, use your normal method of doing this under Windows. For information about keying accented letters under Windows, look here, or see the section "Keyboards layouts" near the end of this file. (But you will not require the support for dotted consonants offered by these keyboards, as lenition is always indicated by suffixing the letter h in Tobar na Gaedhilge.) Your method of typing accented characters should result in precomposed characters, as most methods do, rather than in separate combining accents.

When all this is complete, you may click the OK button to produce the results.

Further hints on the choice of forms for searching will come in the next section of this document.

• More about what to expect in a Gaelic index

Here are some pointers regarding what kinds of forms are worth requesting.

When a form is requested, it is matched against a pre-compiled index of forms from the chosen collection.  A scrolling alphabetic listing of the current index is shown, and will indicate what forms are available. For Gaelic, this index consists of forms which are aggregated in a number of ways to increase coverage:

• lowercased: the forms in a Gaelic index have been converted to lowercase by replacing any capital letters by small letters; this even applies to proper names. Any capital letters you include in your request will also be so converted.

• decliticised: common enclitics, such as d' in d'ól, or 's in 'seadh, or -sa in agam-sa, are treated as separate forms in the index (d' + ól; 's + eadh; agam + - + sa), and should also be detached in your request. Enclitics are normally signalled in running text by a hyphen or an apostrophe. But when there is no overt signal in similar cases (eg. agamsa, seadh), the splitting in the index will have been performed manually and is unlikely to have been exhaustive.

A number of common contracted forms have been indexed under their parts, e.g. 'na (from ina) under ' and a; 'na (from chun an) under 'n and a'; 'na (from chun na) under ' and na; ab (from a ba) under a and b, or under a and b'; gurab under gur and a and b; and many other similar cases. This aspect is to be made more rigorous.

• decompounded: very few forms containing a hyphen have been admitted to the indexes — a list of these can be obtained by searching for *-*. Rather, most hyphenated forms have been treated atomistically in the indexes, and are found by seeking their parts, including the hyphen, eg. leith-phighinn by seeking the three items leith and - and pighinn, with checking of the "consecutive" option ("Díreach i ndiaidh a chéile").

• demutated: initial mutations are removed from forms in the index; so, for example, fear, fhear and bhfear are all indexed as fear, while t-olc, n-olc and holc are all indexed as olc — but, where the mutation is permanent, it is retained, e.g. chugam, thart (in one of its senses), (go) dtí. You may have noticed the benefits of demutation and decliticisation in our athair example above. An initial mutation does not leave any trace in the index; and this is also true of any hyphen which is only part of an initial mutation. When typing Gaelic forms to be searched for, remember to remove initial mutations, unless they are a permanent part of the form. Removal of initial mutations may seem counter-intuitive when requesting a sequence of forms (eg. ár athair), but it is nonetheless required.

The forms in the Gaelic forms index are not lemmatized, ie. terminally inflected forms, such as fear, fir, feara, must be searched for separately — although the wild card may often be used to advantage to retrieve the several related forms in disjunction.

Finally, note that the index is based on the forms of the text after they have been subjected to a limited and controlled degree of correction, as explained just below Figure 6 above.

• Searching in other languages

As well as searching the textbase for Gaelic words, indexes have been created in four additional languages, based on the forms found in translations or originals in these languages of some of the Gaelic texts. As stated earlier, these translations/originals are at present confined to the Ulaidh collection, and cover only a proportion of it: Béarla (English), 2.37M tokens; Fraincis (French), 0.87M tokens; Gearmáinis (German), 0.74M tokens. There is also a tiny amount of Russian, 0.09M tokens.

Generally, such texts have been independently translated into Gaelic and into the other non-original languages; the original language in most such cases has been English, but there are examples of French (La Terre qui Meurt; Pêcheur d'Islande) and of Russian (Записки охотника). Translation of a Gaelic original is found only into Russian (extracts from Ó Neamh go h-Árainn and from Fallaing Shíoda).

Note that the Gaelic version of the text is considered pivotal. Consequently, substantial amounts of material absent from a Gaelic text will not be included in other language versions of that text either. For example, the Gaelic translation of Turgenev's Записки охотника contains only about half the stories in the original, and only these stories will be included in other languages. Also, in the other language collections, texts are grouped according to the author or translater of the Gaelic version.

Figure 10: Search of the English index of the Ulaidh collection for the form bunch. An example is shown from Ben-Hur, and the English and Gaelic and French and German of the sentence is displayed.

Screenshot of English sentences with translations

Figure 11: Search of the French index of the Ulaidh collection for the form accroché. An example is shown from Iascaire Inse Tuile, and the French and Gaelic and English and German of the sentence is displayed.

Screenshot of French sentences with translations

Figure 12: Search of the German index of the Ulaidh collection for the form Knurren. An example is shown from Scairt an Dúthchais, and the German and Gaelic and English and French and Russian of the sentence is displayed.

Screenshot of German sentences with translations

Figure 12a: Search of the Russian index of the Ulaidh collection for the form лошадь. An example is shown from Scéalta Sealgaire, and the Russian and Gaelic and English of the sentence is displayed.

Screenshot of Russian sentences with translations

To search using another language, select the Ulaidh collection, and then the appropriate language. You will then have a further choice between Foirmeacha (forms) and Lemmata (lemmas), because a rough and ready lemmatization has been applied to the English, French, German and Russian texts, resulting in two indexes for each of these languages. The statistics for these indexes are, at 2023/10/04:

  • Ulaidh (Ulster)         Index: Béarla (foirmeacha). 7 Gaelic writers; 23 books;     46,928 forms;    2,375,425 tokens
  •                                   Index: Béarla (lemmata).  7 Gaelic writers;   23 books;   36,319 lemmas;    2,366,875 tokens
  •                                   Index: Fraincis (foirmeacha). 3 Gaelic writers;  9 books;    34,709 forms;    869,635 tokens
  •                                   Index: Fraincis (lemmata).  3 Gaelic writers;   9 books;    16,468 lemmas;    867,783 tokens
  •                                   Index: Gearmáinis (foirmeacha). 4 Gaelic writers;  10 books;   52,982 forms;   847,353 tokens
  •                                   Index: Gearmáinis (lemmata).  4 Gaelic writers;   10 books;    32,170 lemmas;   851,994 tokens
  •                                   Index: Rúisis (foirmeacha). 3 Gaelic writers;  4 parts of books;  21,544 forms;   92,991 tokens
  •                                   Index: Rúisis (lemmata).  3 Gaelic writers;   4 parts of books;  11,350 lemmas;  92,908 tokens

The form indexes for all languages except German are lowercased, even for proper names, and when typing a request, any uppercase letters are automatically converted to lowercase.

In the German form index, the initial letter of a noun (or a name) retains its case — actually, the initial letter takes its case from the lemma assigned to it, which is usually the same thing (lemmatization is discussed below); and when typing a request, case remains as typed. Only forms matching the case of the request are retrieved, unless the option Gan beinn ar an tsíneadh fhada is active, when forms which differ only in case will also be found.

In form indexes, enclitics are separated (eg. English 's, 've, n't, French l', m', German 's (gibt's), 'n (so'n)). Hyphenated words are generally decomponded, eg. French garde-robe; but this policy has not been consistently applied to English, where eg. decompounded water-tight is found as well as unitary water-tight and watertight. In the German index, decompounding of (hyphenless) words, eg. weitergehen, has not been attempted.

Besides a forms index, for additional languages there are also lemma indexes. Using the English lemmas index, a request for man will match the forms man or men; while, using the French lemmas index, a request for homme will match the forms homme or hommes; or using the German lemmas index, a request for Mann will match forms Mann, Mannes, Manne, Männer, Männern. (A Gaelic lemmas index is under construction.)

When discussing a lemmatized index, the form vs token distinction is still relevant, but should be understood as referring to lemma(-form) vs (lemma-)token. The token count for a lemma index will differ slightly from the token count for a form index of the same texts. Among the reasons for this, a form token may give rise to two lemma tokens, eg. the English form "cannot" gives lemmas "can" and "not"; or the German form "im" gives lemmas "in" and "die". Conversely, lemma indexes exclude "foreign" items, which also makes them more "mono-lingual" than form indexes.

Figure 13: A KWIC view of the lemma listen in Gadaidheacht le Láimh Láidir, according to our English lemma index. The corresponding material, in Gaelic and any other languages in which it is available, may be inspected, one example at a time, in the sentence view.

Screenshot of KWIC using English lemma index

Figure 14: A KWIC view of the lemma abandonner (to abandon) in Ben-Hur, according to our French lemma index. The corresponding material, in Gaelic and any other languages in which it is available, may be inspected, one example at a time, in the sentence view.

Screenshot of KWIC using French lemma index

Figure 15: A KWIC view of the lemma Baum (tree) in Scairt an Dúthchais, according to our German lemma index. The corresponding material, in Gaelic and any other languages in which it is available, may be inspected, one example at a time, in the sentence view.

Screenshot of KWIC using German lemma index

Figure 15a: A KWIC view of the lemma тёмный (dark) in Scéalta Sealgaire, according to our Russian lemma index. The corresponding material, in Gaelic and any other languages in which it is available, may be inspected, one example at a time, in the sentence view.

Screenshot of KWIC using Russian lemma index

It is important to understand that our lemmatization of English and French and German and Russian has been performed automatically, using the Tree Tagger. This software is among the best of its kind, and lemmatization would have been impractical without it, but, as with all statistical operations, a percentage of errors is inevitable, and some remain despite much manual post-checking. The Russian lemma index has further benefitted from the use of Sharoff's lemmatisation tool, and of Usachev's morphology file and of the morfer.ru site, as described here.

In our lemma indexes, an initial capital has been retained in appropriate lemmas, mostly names (as well as for nouns in the German lemma index); and when typing a request into a lemma index, no changes are made to the case typed. Therefore it matters whether the letters you type in your request to a lemma index are small letters or capitals. Keep an eye on the scrolling alphabetic list for guidance on what lemmas are available and when you should use a capital letter. If you wish to retrieve both upper- and lowercase lemmas matching your request, use the option Gan beinn ar an tsíneadh fhada

When displaying a sentence found through a lemma index, the Innéacsáilte display option shows the sentence in the first language as a list of the lemmas with which it is indexed.

Many lemmas are ambiguous, e.g. in English: pack or stamp or well or lie or bound or back; or in French: pas or tendre or vague or fin. Ideally, we would like to retrieve only the desired sense of an ambiguous lemma, and, since version 1.5, we have tried to separate the senses by part of speech, using four broad categories of N (noun), V (verb), J (adjective) and Z (other). Thus for example a request for English lemma well will be asked to choose between N, V and Z; a request for French lemma vague will be asked to choose between J and N. This may help in many cases, but not in others; for the English lemma lie, for example, a more useful division would be into recline and untruth, rather then into noun and verb. Further development in this area may be expected in future versions.

• Translation equivalents

A related innovation is the calculation of translation equivalents. Ideally, given a lemma in the source language, this would produce a listing of the relatively most common lemmas in the corresponding sentences of the target language (lemma to lemma). For the present, however, the Gaelic lemmas index is small, and none of the texts contributing to it have translations stored, so this facility is not of use when Gaelic is the source language. But it would still be useful with Gaelic as the target language, and to enable this in the short term, we modify the ideal procedure to use instead the forms of the target language (lemma to form).

This technique has potential, but is limited at present by the amount of parallel text available for each language pair (the numbers below are the numbers of Gaelic form tokens in the shared texts):

EnglishFrenchGermanRussian
Gaelic2,850,000903,000919,000142,000
English903,000919,000128,000
French590,00045,000
German45,000

This facility is offered — where the source language is other than Gaelic — by means of a fifth output view, named Freagar-fhocla. Any of the supported languages, including Gaelic, may be used as the target language. The calculation may take a few moments. The resulting display is a list of target-language forms, each accompanied by a score, and sorted on these scores (the user may have it re-sorted alphabetically on the forms themselves). The scores — which are not raw token counts — may range from 99,999,999 down to 100,000, and measure how common the target-language form is in the neighbourhood of the source-language lemma in comparison with the whole of the target language corpus.

Note: if a subset of source-language texts is selected, only that subset contributes to the calculation.

From v1.14 (2023), this facility is also offered with Gaelic as the source language. However at this stage the results will be disappointing, because none of the Gaelic texts which have been completely lemmatized has a translation into any other language. The only results, therefore, come from lightly-lemmatized texts, and are unlikely to be meaningful. Among the words which may be tested with this facility is . Gaelic may of course be used as the target language.

Figure 15: Search for Gaelic forms collocating with the English lemma child, in the Ulster texts.

Screenshot of translation equivalents

The chosen source-language lemma defines a set of sentences in the source-language corpus — those sentences in which it occurs — and a corresponding set of sentences in the target-language corpus — those sentences which translate them. This "select part" of the target-language corpus is studied, looking for forms (freagar-fhocla, word-equivalents) which are more frequent there than in the target-language corpus on average.

If the source-language lemma is uncommon (read: selects less than one-thousandth part of the source language corpus), a warning is issued that the results may not be statistically useful, but no impediment is placed on calculating them.

Each target-language form is given a score by comparing its (relative) frequency in the select part with its (relative) frequency in the whole corpus. If a form turns out to be equally frequent in the select part as in the whole corpus, it is given a score of 100,000; if it turns out to be twice as frequent in the select part as in the whole corpus, it is assigned 200,000; and so on. Forms less frequent in the select part than in the whole corpus are discarded as uninteresting, so that 100,000 is the minimum score among those retained. At the other end of the range, the score 99,999,999 is assigned to any form which is 100 times or more as common in the select part as in the whole corpus.

Even if a target-language form scores within the range 100000..99999999, it is still omitted from the displayed list if its (absolute) frequency is small. This is intended to overcome random collocations, which would decline naturally as more text is added, but may mask more significant data while they remain. An empirical lower cutoff for a form is applied when its (absolute) frequency in the select part is less than the square root of one-tenth of its (absolute) frequency in the whole corpus.

Results are still poor enough with the amount of text available, but will improve as the quantity increases. Even at the present time, however, it may be of interest to input English lemmas from the following list, and to compare the results with the content of existing English–Irish dictionaries, noting what is found in the dictionaries but absent in the corpus, as well as what relevant equivalents are found in the corpus but not in the dictionaries: smoke, minute, also, yet, dog, ice, bee, garden, help, interest, gravel, cave, busy, cell, kitchen, open.

Here, for comparison, are the forms in French, German and Russian which correlate most strongly with the English lemma child, showing also which books contribute to the figures, and the frequency of the lemma child in the English version of each book:

Screenshot of translation equivalents

Screenshot of translation equivalents

Screenshot of translation equivalents

Installation

It is suggested that you uninstall any earlier version of Tobar na Gaedhilge before installing this one. Uninstallation may be performed using the Start/Programs menu, or using the Add/Remove Programs control panel. If you want to retain older versions, you may install this one to a different folder, eg. to C:\Program Files (x86)\Tobar2023 (just change the folder name when prompted during installation).

System requirements for the current version are:

If you have Windows, any version from Win 2000 through XP, Vista, Windows 7, 8, 10 or later, download (tarraing anuas chugat) the file TOBAR2023.EXE to any folder, and double-click on it to install Tobar na Gaedhilge.

You may receive worrying messages when you try to install this program. For example, Windows may warn you that it is an "unrecognized app", which consequently "might put your PC at risk". MS Edge may say it "isn't commonly downloaded" and "make sure you trust it". Chrome may go further and describe it as "dangerous". Don't worry. This is what they say about the download of a file with the .exe extension, and about which they know nothing. Other systems raise no objections to this program, and it has been passed and whitelisted by MalWareBytes. You may safely click "Run anyway" or "Keep" or whatever, to ignore this spurious warning. There is no danger.

If you have only Windows 95 or 98 or ME, see here for an older version; if you have only MS-DOS or Win 3.1, see here for a much older version.

I am hopeful that Tobar na Gaedhilge can now be run on MAC and on LINUX, if you download and install the CrossOver software. I haven't got the hardware to verify this myself, but feedback has been positive. CrossOver is free to use for a fortnight, which should allow you to verify that you are satisfied with it; to continue to use it in perpetuity costs around £38. Check the system requirements button to find the latest applicable version of CrossOver for your system. Of course, CrossOver will also enable you to run other Windows programs, not only Tobar na Gaedhilge. There is no need to buy a copy of Windows. See also here.

Since version 1.8 of Tobar na Gaedhilge, installation uses the Inno installer, and may take a few minutes.

By default, installation is to the folder C:\Program Files\Tobar, where the following files will be created:

The texts: 144 files, with names bearing the extension .MRK, containing the texts. These files are not intended to be used in any way other than through the program TOBAR. The full list of texts, with acknowledgements to those who originally produced them in machine-readable form, is given later in this file.

The indexes to the collections (each index consists of four files):

A number of files containing optional keyboard layouts:

To run the textbase, after installing it, use the shortcut "Tobar na Gaedhilge" already placed on your Start/Programs menu. Or double-click on the TOBAR.EXE icon, which you will find in the Program Files (x86)\Tobar folder (you may, if you find it convenient, create a short-cut by dragging this icon to your desktop).

History

Version 1.14 released October 2023.

Version 1.13 released May 2021.

Version 1.12 released September 2019.

Version 1.11 released August 2018.

Version 1.10 released August 2017.

Version 1.9 released August 2016.

Version 1.8 released September 2015.

Version 1.7 released August 2014.

Version 1.6 released September 2012.

Version 1.5 released October 2010.

Version 1.4 released May 2009.

Version 1.3 released September 2006.

Version 1.2 released November 2004.

Version 1.1 released November 2003.

Version 1.0 released as a native Windows application in February 2002.

The system was previously an MS-DOS application, known as GAELDICT or FOCAL, of which five versions were released between 1995 and 1999.

New this time

The texts in detail

In selecting and preparing texts for Tobar na Gaedhilge, the emphasis is on authenticity, accuracy and added-value.

Authenticity: Texts chosen are written in the early to middle 20th century, by native speakers of Gaelic (or occasionally, by non-native speakers who modelled their speech on well-defined local Gaelic). Our policy is to adhere as closely as possible to the language in which the authors wrote them.  In general, the earliest editions have been preferred, and standardized or school editions avoided. Contexts may be displayed exactly as found in the book, but are by default displayed in the same form as used in creating the indexes, viz. after two kinds of corrections may be applied: (1) undisputed errors may be corrected; (2) deviations from the manuscript may be corrected, where the manuscript is available — which is fortunately the case for some of the books published by Oifig an tSoláthair. Use of manuscripts is a major exercise and is only in its initial stages; texts to which it has been applied will be noted in the list below. Earlier serialized publication may be treated in the same way as manuscripts. Beyond these cases, we do not seek to change the text in any way, for example, we avoid regularization, even within a text, and we retain any spelling which is arguably a valid representation of the spoken word. While much of the information so retained will be random variation, other some will contain important clues to the detailed pronunciation. Particularly important are indications of elision, where the spelling shows them. Also important are words where the presence and absence of a vowel length mark are equally common, showing that no distinction is signified by the length mark in such cases. The borderline between significant and non-significant variation may be identified later, but should not be imposed in advance.

Accuracy: Texts are either typed through the keyboard, or scanned. The input may be performed specificially for Tobar na Gaedhilge, or texts may be donated by individuals or projects, to all of whom we are grateful, and who are acknowledged under "Original e-text" in the lists below. The contribution of Foclóir Stairiúil na Nua-Ghaeilge at Acadamh Ríoga na hÉireann, Dublin, is particularly significant. On incorporation into Tobar na Gaedhilge, all texts are subject to continuous correction of transcription errors, as these are uncovered.

Added-value: It has proved possible to augment the texts, in their stored medium, in various ways which add to their value. Indexing is based on a version of the text in which undisputed errors are corrected. The Gaelic versions are aligned with the versions in other languages, where these are available to us. Lemmatization, in a rough-and-ready form, has been applied to the versions in languages other than Gaelic. New possibilities for analysis arise from these enhancements. Future objectives include lemmatization of the Gaelic texts, and addition of aligned sound files. 

A full list of texts at the current version follows. The text identification scheme used in this list is developed from that used by Foclóir Stairiúil na Nua-Ghaeilge, but the higher numbers may diverge.

ULAIDH:

LU006: Na Rosa go Bráthach; Mághnus 'ac Comhghaill (1885–1965)
Publisher: Oifig an tSoláthair, Dublin, 1939 (we used: An Cúigiú Cló, 1946)
Original e-text: Rita Nic Aodha Bhuí
Added: 2009
Manuscript: mn009, not consulted
LU010: Na Glúnta Rosannacha; Niall Ó Domhnaill (1908–1995)
Publisher: Oifig an tSoláthair, Dublin, 1952
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1997
Manuscript: not available
LU013: Troid Bhaile an Droichid; Séamus 'ac an Bháird (1871–1951)
Publisher: Connradh na Gaedhilge, Dublin, 1907 (we used: Preas Dhún Dealgan, Dundalk, undated, 1920s)
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1995 or 1996
Manuscript: not available
LU016: Eoghan Ruadh Ó Néill; Seosamh 'ac Grianna (1900–1990)
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, 1931
Original e-text: Rita Nic Aodha Bhuí, for FNG
Added: 2004
Manuscript: not available
LU018: Dochartach Dhuibhlionna; Seosamh 'ac Grianna (1900–1990)
Publisher: Cú Uladh, Dublin, undated (1925)
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1995 or 1996
Manuscript: not available
LU019: An Grádh agus an Ghruaim; Seosamh 'ac Grianna (1900–1990)
Publisher: C S Ó Fallamhain, Dublin, 1929
Original e-text: Ciarán Ó Duibhín
Added: 2003
Manuscript: not available
LU020: Pádraic Ó Conaire agus Aistí Eile; Seosamh 'ac Grianna (1900–1990)
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, 1936
Original e-text: Ciarán Ó Duibhín
Added: 1995 or 1996
Manuscript: not available
LU021: Mo Bhealach Féin / Dá mBíodh Ruball ar an Éan; Seosamh 'ac Grianna (1900–1990)
Publisher: Oifig an tSoláthair, Dublin, 1940
Original e-text: Ciarán Ó Duibhín
Added: 2018
Manuscript: mm028, not consulted
LU023: Indé agus Indiu; Seaghán 'ac Meanman (1886–1962)
Publisher: C S Ó Fallamhain, Dublin, 1929
Original e-text: Ciarán Ó Duibhín
Added: 1999
Manuscript: not available
LU024: Fear Siubhail; Seaghán 'ac Meanman (1886–1962)
Publisher: Preas Dhún Dealgan, Dundalk, 1924 (we used: Oifig Díolta Foillseacháin Rialtais, Dublin, 1931)
Original e-text: Ciarán Ó Duibhín
Added: 1997
Manuscript: not available
Tá píosaí Béarla in 1924, a bhfuil Gaedhilg ortha in 1931. Is beag má tá duifir sa chuid eile.
LU025: Sgéalta Goiride Geimhridh; Seaghán 'ac Meanman (1886–1962)
Publisher: Clódhanna Teo, 1918 (we mainly used: Preas Dhún Dealgan, Dundalk, 1922)
Original e-text: Ciarán Ó Duibhín
Added: 2002
Manuscript: not available
The stories had earlier been previously published in the "Weekly Freeman" and the "Claidheamh Soluis".
Despite what is said in the foreword, the 1922 edition is considerably changed from the 1918 one, and is much better overall.
We base our text on the 1922 edition modified by consultation of the 1918 edition, rather than vice versa.
LU026: Ó Chamhaoir go Clap-Sholas; Seaghán 'ac Meanman (1886–1962)
Publisher: Oifig an tSoláthair, Dublin, 1940
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1995 or 1996
Manuscript: mo001, not consulted
LU027: Mám as mo Mhála; Seaghán 'ac Meanman (1886–1962)
Publisher: Oifig an tSoláthair, Dublin, 1940
Original e-text: Ciarán Ó Duibhín
Added: 2002
Manuscript: mm009, not consulted
LU028: Mám Eile as an Mhála Chéadna; Seaghán 'ac Meanman (1886–1962)
Publisher: Oifig an tSoláthair, Dublin, 1954
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1995 or 1996
Manuscript: not available
LU029: Crathadh an Phocáin; Seaghán 'ac Meanman (1886–1962)
Publisher: Oifig an tSoláthair, Dublin, 1955
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1995 or 1996
Manuscript: not available
LU032: Saoghal Corrach; Séamus 'ac Grianna (1889–1969)
Publisher: An Press Náisiúnta, Dublin, undated (1945)
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1995 or 1996
Manuscript: not available
LU033: Mo Dhá Róisín; Séamus 'ac Grianna (1889–1969)
Publisher: Preas Dhún Dealgan, Dundalk, undated (1921)
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1995 or 1996
Manuscript: not available
LU034: Nuair a Bhí Mé Óg; Séamus 'ac Grianna (1889–1969)
Publisher: Clólucht an Talbóidigh, Dublin, 1942
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1995 or 1996
Manuscript: not available
LU035: Cioth is Dealán; Séamus 'ac Grianna (1889–1969)
Publisher: Preas Dhún Dealgan, Dundalk, undated (1926)
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1995 or 1996
Manuscript: not available
LU036: Caisleáin Óir; Séamus 'ac Grianna (1889–1969)
Publisher: Preas Dhún Dealgan, Dundalk, 1924
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1995 or 1996
Manuscript: not available
LU037: Thiar i dTír Chonaill; Séamus 'ac Grianna (1889–1969)
Publisher: Faoi Chomhartha na dTrí gCoinneal, Dublin, 1940
Original e-text: Ciarán Ó Duibhín
Added: 1995 or 1996
Manuscript: not available
LU038: Bean Ruadh de Dhálach; Séamus 'ac Grianna (1889–1969)
Publisher: Oifig an tSoláthair, Dublin, 1966
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 2002
Manuscript: not available
LU039: Micheál Ruadh; Séamus 'ac Grianna (1889–1969)
Publisher: Preas Dhún Dealgan, Dundalk, undated (1925)
Original e-text: Ailbhe Ó Corráin, then of Queen's University Belfast
Added: 1995 or 1996
Manuscript: not available
LU040: Rann na Feirste; Séamus 'ac Grianna (1889–1969)
Publisher: An Press Náisiúnta, undated (1942)
Original e-text: Ciarán Ó Duibhín
Added: 1998
Manuscript: not available
LU041: An Clár is an Fhoireann; Séamus 'ac Grianna (1889–1969)
Publisher: Oifig an tSoláthair, Dublin, 1955
Original e-text: Ciarán Ó Duibhín
Added: 1998
Manuscript: not available
LU044: Le Clap-Sholus; Séamus 'ac Grianna (1889–1969)
Publisher: Oifig an tSoláthair, Dublin, 1967
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1995 or 1996
Manuscript: not available
LU045: Oidhche Shamhraidh; Séamus 'ac Grianna (1889–1969)
Publisher: Oifig an tSoláthair, Dublin, 1968
Original e-text: Ciarán Ó Duibhín
Added: 2012
Manuscript: not available
LU047: Scéal Úr agus Sean-Scéal; Séamus 'ac Grianna (1889–1969)
Publisher: Oifig an tSoláthair, Dublin, 1945 (we used: Oifig an tSoláthair, Dublin, 1950)
Original e-text: Ciarán Ó Duibhín
Added: 1995 or 1996
Manuscript: not available
LU050: An Teach nár Tógadh; Séamus 'ac Grianna (1889–1969)
Publisher: Oifig an tSoláthair, Dublin, 1948
Original e-text: Ciarán Ó Duibhín
Added: 1995 or 1996
Manuscript: not available
LU054: Cloich Cheann-Fhaolaidh; Séamus Ó Searcaigh (1886–1965)
Publisher: M.H. Gill & Son Ltd, Dublin, 1908
Original e-text: Rita Nic Aodha Bhuí, for FNG
Added: 2004
Manuscript: not available
LU056: Thiar i nGleann Ceo; Tadhg Ó Rabhartaigh (1909–1982)
Publisher: Oifig an tSoláthair, Dublin, 1953
Original e-text: Seosamh Ó Labhraí, Coláiste Ollscoile Naomh Muire, Béal Feirste
Added: 2002
Manuscript: not available
LU057: Mian na Marbh; Tadhg Ó Rabhartaigh (1909–1982)
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, 1937
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1995 or 1996
Manuscript: mm019, not consulted
LU065: Rácáil agus Scuabadh; Seaghán 'ac Meanman (1886–1962)
Publisher: Oifig an tSoláthair, Dublin, 1955
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1995 or 1996
Manuscript: not available
LU071: Fallaing Shíoda; Séamus 'ac Grianna (1889–1969)
Publisher: Oifig an tSoláthair, Dublin, 1956
Original e-text: Ciarán Ó Duibhín
Added: 2016
Manuscript: not available
LU072: Cáitheamh na dTonn; Pádraig Ó Gallchobhair (1893–1961)
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, 1934
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1995 or 1996
Manuscript: mc011, not consulted
LU073: Na Lochlannaigh; Seosamh 'ac Grianna (1900–1990)
Publisher: Oifig an tSoláthair, Dublin, 1938
Original e-text: Lá, Belfast
Added: 1995 or 1996
Manuscript: mn007, not consulted
LU075: Ó Neamh go h-Árainn; Séamus 'ac Grianna (1889–1969)
Publisher: Oifig an tSoláthair, Dublin, 1953 (previously, Irish Press)
Original e-text: Ciarán Ó Duibhín
Added: 2015
Manuscript: not available
LU079: Sgéilíní na Finne; Aindrias Ó Baoighill (1888–1972)
Publisher: C S Ó Fallamhain, Dublin, undated (1928)
Original e-text: Ciarán Ó Duibhín
Added: 1999
Manuscript: not available
LU089: Filí Gan Iomrádh; Seosamh 'ac Grianna (1900–1990)
Publisher: An t-Ultach, 1926
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 2018
Manuscript: not available
LU092: Bruighean Feille; Niall Ó Domhnaill (1908-1995)
Publisher: Oifig Díolta Foillseacháin Rialtais, 1934
Original e-text: Ciarán Ó Duibhín
Added: 2017
Manuscript: mb025, not consulted
LU105: Buaidh na Tuigse; Séamus Ó Searcaigh (1886–1965)
Publisher: Oifig an tSoláthair, Dublin, 1940
Original e-text: Brian Mac Lochlainn
Added: 2010
Manuscript: mb028, consulted(?)
LU106: Crann an Eolais, An Bhachlóg; Seaghán Mac Meanman (1886–1962)
Publisher: Brún agus Ó Nualláin Teór., Dublin, c1939
Original e-text: Pádraig Ó Mianáin
Added: 2014
Manuscript: not available
LU107: Crann an Eolais, An Duilleog; Seaghán Mac Meanman (1886–1962)
Publisher: Brún agus Ó Nualláin Teór., Dublin, c1939
Original e-text: Pádraig Ó Mianáin
Added: 2014
Manuscript: not available
LU108: Crann an Eolais, An Chraobh; Seaghán Mac Meanman (1886–1962)
Publisher: Brún agus Ó Nualláin Teór., Dublin, c1939
Original e-text: Pádraig Ó Mianáin
Added: 2014
Manuscript: not available
LU109: Crann an Eolais, An Bláth; Seaghán Mac Meanman (1886–1962)
Publisher: Brún agus Ó Nualláin Teór., Dublin, c1939
Original e-text: Pádraig Ó Mianáin
Added: 2014
Manuscript: not available
LU110: Crann an Eolais, An Toradh; Seaghán Mac Meanman (1886–1962)
Publisher: Brún agus Ó Nualláin Teór., Dublin, c1940
Original e-text: Pádraig Ó Mianáin
Added: 2015
Manuscript: not available
LU111: Árd-Léightheoir Gaedhilge, Cuid 1; Seaghán Mac Meanman (1886–1962)
Publisher: Brún agus Ó Nualláin Teór., Dublin, 1937
Original e-text: Pádraig Ó Mianáin
Added: 2012
Manuscript: not available
LU112: Árd-Léightheoir Gaedhilge, Cuid 2; Seaghán Mac Meanman (1886–1962)
Publisher: Brún agus Ó Nualláin Teór., Dublin, 1939
Original e-text: Pádraig Ó Mianáin
Added: 2012
Manuscript: not available
U032: Dírbheathaisnéis, Niall 'ac Giolla Bhrighde (1861–1942), (ed. Liam Ó Connacháin)
Publisher: Brún agus Ó Nualláin Teór., Dublin, undated (1940)
Original e-text: Pádraig Ó Mianáin
Added: 2009
Manuscript: not available
U043: Scéalta Johnny Sheimisín, (ed.) Niall Ó Domhnaill (1908–1995)
Publisher: Comhaltas Uladh, Belfast & Dundalk, 1948
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 2006
Manuscript: not available
U131: Scéal Hiúdaí Sheáinín; Eoghan Ó Domhnaill (1908–1966)
Publisher: Oifig an tSoláthair, 1940
Original e-text: Rita Nic Aodha Bhuí
Added: 2009
Manuscript: not available
A221: Scéalta Sealgaire; trans. Máighréad Nic Mhaicín (1899–1983)
Publisher: Oifig an tSoláthair, 1954
Original e-text: Rita Nic Aodha Bhuí
Added: 2009
Manuscript: ms022, lightly consulted
AU002: Gadaidheacht le Láimh Láidir; trans. Domhnall 'ac Grianna (1894–1962)
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, 1936
Original e-text: Brian Mac Lochlainn
Added: 2004
Manuscript: mg001, not consulted
AU010: Teach an Chrochadóra; trans. Seosamh 'ac Grianna (1900–1990)      ÚR!
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, 1935
Original e-text: Ciarán Ó Duibhín
Added: 2023
Manuscript: mt009, not consulted
AU011: 'Teacht fríd an tSeagal; trans. Seosamh 'ac Grianna (1900–1990)
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, gan dáta (1934)
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 2009
Manuscript: mt015, heavily consulted
AU012: Séideán Bruithne / Amy Foster; trans. Seosamh 'ac Grianna (1900–1990)
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, 1935
Original e-text: Nollaig Ó hUrmoltaigh, then of Queen's University, Belfast
Added: 1995 or 1996
Manuscript: ms044, not consulted
AU013: Muinntir an Oileáin; trans. Seosamh 'ac Grianna (1900–1990)
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, 1935
Original e-text: Rita Nic Aodha Bhuí
Added: 2004
Manuscript: mm032, not consulted
AU017: Éire agus Éireannaigh i Muirthéacht na Frainnce; trans. Séamus 'ac Grianna (1889–1969)
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, 1933
Original e-text: Ciarán Ó Duibhín
Added: 2018
Manuscript: me012 (Gaelic), me011 (Roman typescript), not consulted
AU018: Scairt an Dúthchais; trans. Niall Ó Domhnaill (1908–1995)
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, 1932
Original e-text: Ciarán Ó Duibhín
Added: 1997
Manuscript: not available
AU019: Ise; trans. Niall Ó Domhnaill (1908–1995)
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, 1933
Original e-text: Rita Nic Aodha Bhuí
Added: 2012
Manuscript: ms053, not consulted
AU020: Ben-Hur; trans. Seosamh 'ac Grianna (1900–1990)
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, 1933
Original e-text: Ciarán Ó Duibhín, using facilities at Irish Studies, University of Ulster, Coleraine
Added: 1998
Manuscript: mb019, not consulted
AU021: Eachtraí Sherlock Holmes; trans. Proinnsias Ó Brógáin (1905–1997)
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, 1936
Original e-text: Ciarán Ó Duibhín
Added: 1999
Manuscript: ms078, not consulted
AU022: Iascaire Inse Tuile; trans. Séamus 'ac Grianna (1889–1969)
Publisher: Oifig an tSoláthair, Dublin, 1952
Original e-text: Rita Nic Aodha Bhuí
Added: 2003
Manuscript: not available
AU023: Dith Céille Almayer; trans. Seosamh 'ac Grianna (1900–1990)
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, 1936
Original e-text: Rita Nic Aodha Bhuí
Added: 2003
Manuscript: md026, not consulted
AU024: Faoi Chrann Smola; trans. Séamus 'ac Grianna (1889–1969)
Publisher: Oifig Díolta Foillseacháin Rialtais, 1934
Original e-text: Brian Mac Lochlainn
Added: 2003
Manuscript: mf004, not consulted
AU025: Uaigheanna Chill Mhóirne; trans. Domhnall 'ac Grianna (1894–1962)
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, 1933
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 2006
Manuscript: mu002 (Gaelic), mu001 (Roman), not consulted
AU027: Néall Dearg; trans. Niall 'ac Suibhne (1895–1949)
Publisher: Oifig Díolta Foillseacháin Rialtais, Dublin, 1935
Original e-text: Brian Mac Lochlainn
Added: 2006
Manuscript: mn014, not consulted
AU031: Ivanhoe; trans. Seosamh 'ac Grianna (1900–1990)
Publisher: Oifig Díolta Foillseacháin Rialtais, 1937
Original e-text: Ciarán Ó Duibhín
Added: 2010
Manuscript: not available
AU032: An Máirnéalach Dubh; trans. Seosamh 'ac Grianna (1900–1990)
Publisher: Oifig Díolta Foillseacháin Rialtais, 1933
Original e-text: Ciarán Ó Duibhín
Added: 2014
Manuscript: ma060 (typescript), not consulted
AU033: Caiphtín Blood; trans. Séamus 'ac Grianna (1889–1969)
Publisher: Oifig Díolta Foillseacháin Rialtais, 1937
Original e-text: Brian Mac Lochlainn
Added: 2014
Manuscript: not available
AU034: An Bealach Achtuighthe; trans. Séamus 'ac Grianna (1889–1969)
Publisher: Oifig Díolta Foillseacháin Rialtais, 1937
Original e-text: Ciarán Ó Duibhín
Added: 2015
Manuscript: ma015, not consulted
AU035: Flatha na h-Alban; trans. Domhnall 'ac Grianna (1894–1962)
Publisher: Oifig Díolta Foillseacháin Rialtais, 1937
Original e-text: Ciarán Ó Duibhín
Added: 2017
Manuscript: mf031, not consulted
AU037: Filéad na Bainríoghaine; trans. Séamus 'ac Grianna (1889–1969)
Publisher: Oifig an tSoláthair, 1940
Original e-text: Ciarán Ó Duibhín
Added: 2018
Manuscript: mf016, not consulted
AU038: Lisín; trans. Séamus 'ac Grianna (1889–1969)
Publisher: Oifig an tSoláthair, 1939
Original e-text: Ciarán Ó Duibhín
Added: 2019
Manuscript: ml015, not consulted

ULAIDH/BÉARLA:

Note: English translations of the following texts are currently in print:
• LU034, Nuair a Bhí Mé Óg ("When I was Young: a Gaelic Childhood", A.J. Hughes, 1998, ISBN-13978-1899047574)
• LU021, Mo Bhealach Féin ("This Road of Mine", Mícheál Ó hAodha, 2020, ISBN-?)

AU002B: Robbery Under Arms; Rolf Boldrewood
Original e-text: Alan R. Light, via Project Gutenberg
Pagination: none
Added: 2004
AU010B: Hangman's House; Donn Byrne      ÚR!
Original e-text: Ciarán Ó Duibhín
Pagination: Sampson Low, no date (nlt 1936), pp v–x, 3–406
Added: 2023
AU011B: Comin' thro' the Rye; Helen Mathers
Original e-text: Ciarán Ó Duibhín
Pagination: The Alma Taylor Edition, Herbert Jenkins Limited, 159th edition (pp 9–379)
Added: 2009
AU012B: Typhoon / Amy Foster; Joseph Conrad
Original e-text: Judith Boss, Omaha, Nebraska, via Project Gutenberg
Pagination: none (Typhoon); Falk, Doubleday, New York, 1919, pp 155–214 (Amy Foster)
Added: 1998 (Typhoon), 1997 (Amy Foster)
AU013B: Islanders; Peadar O'Donnell
Original e-text: Ciarán Ó Duibhín
Pagination: Jonathan Cape, no date, pp 5–224
Added: 2004
AU017B: Ireland and Irishmen in the French Revolution; Richard Hayes (1878–1958)
Original e-text: Ciarán Ó Duibhín
Pagination: Phoenix, Dublin, 1932 (pp iii–302)
Added: 2018
AU018B: The Call of the Wild; Jack London
Original e-text: Oxford Text Archive
Pagination: Library of America, 1982 (pp 5–86)
Added: 1997
AU019B: She; Henry Rider-Haggard
Original e-text: John Bickers, and Dagny, et al, via Project Gutenberg
Pagination: Longmans Green, 1887 (pp 1–317)
Added: 2012
AU020B: Ben-Hur; Lew Wallace
Original e-text: Virginia Tech, apparently via Project Gutenberg
Pagination: none
Added: 1998
AU021B: The Memoirs of Sherlock Holmes; Arthur Conan-Doyle
Original e-text: Roger Squires
Pagination: none
Added: 1999
AU022B: Iceland Fisherman; Pierre Loti, English translation
Original e-text: I believe this version began as the Project Gutenberg text by Dagny and John Bickers, which is based on the Cadiot translation, as published without attribution by Collier or Appleton in 1902. However, the omissions in that translation have since been filled in "Tobar", and many other changes of wording made, by some process which I cannot recall. If anyone recognizes the present "Tobar" English text, please contact me.
Pagination: none
Added: 2003
AU023B: Almayer's Folly; Joseph Conrad
Original e-text: David Price, via Project Gutenberg
Pagination: none
Added: 2003
AU024B: Autumn Glory; René Bazin, English translation by Ellen Waugh
Original e-text: Hélène de Mink, "renebazin", and the Online Distributed Proofreading Team (http://www.pgdp.net), via Project Gutenberg
Pagination: Jarrolds (no date, 1929?), pp 7–291
Added: 2012
AU025B: The Graves of Kilmorna; Canon Sheehan
Original e-text: Ciarán Ó Duibhín
Pagination: Longmans Green (1914), pp 3–373
Added: 2006
AU027B: Red Cloud; General Sir William F Butler
Original e-text: Ciarán Ó Duibhín
Pagination: pp 1–220
Added: 2006
AU031B: Ivanhoe; Walter Scott
Original e-text: Anders Thulin, via Wiretap and Project Gutenberg
Pagination: none
Added: 2010
AU032B: The Nigger of the Narcissus; Joseph Conrad
Original e-text: Jeffrey Triggs, via Oxford Text Archive. This is based on the Penguin Classics edition of 1988. Several alterations have been made to bring it into line with the 1921 text of Heinemann's "Works of Joseph Conrad", which is closer to both the Gaelic and French translations.
Pagination: Penguin Classics, pp 1–128
Added: 2014
AU033B: Captain Blood; Rafael Sabatini
Original e-text: Project Gutenberg
Pagination: Grosset & Dunlap (1922), pp 3–356
Added: 2014
AU034B: The Right of Way; Gilbert Parker
Original e-text: David Widger, via Project Gutenberg
Pagination: Nelson (c1911), pp 9–378
Added: 2015
AU035B: The Scottish Chiefs; Jane Porter
Original e-text: Ciarán Ó Duibhín
Pagination: Richard Edward King (no date), pp 1–350
Added: 2017
AU037B: The Queen's Fillet; Canon Sheehan
Original e-text: archive.org
Pagination: Phoenix (no date, ?1911), pp 3–376
Added: 2018
AU038B: Lisheen; Canon Sheehan
Original e-text: archive.org
Pagination: Phoenix (no date, ?1907), pp 3–454
Added: 2019
A221B: A Sportsman's Sketches; Ivan Turgenev, English translation by Constance Garnett
Only those stories which have been translated into Irish Gaelic are included.
Original e-text: The University of Adelaide Library, https://ebooks.adelaide.edu.au/t/turgenev/ivan/t93s/
Pagination: Heinemann (1895) partial, pp 1.1–292 & 2.1–284
Added: 2016

ULAIDH/FRAINCIS:

AU012F: Typhon; Joseph Conrad, French translation by André Gide
Original e-text: ATILF, CNRS, Nancy (FranText) (http://atilf.atilf.fr/artis/nvlbiblio.htm, http://www.frantext.fr/)
               Amy Foster; Joseph Conrad, French translation by Georges Jean-Aubry
Original e-text: Wikilivres (http://www.wikilivres.info/wiki/Amy_Foster)
Pagination: Oeuvres complètes, pp 11–154 (Typhon); Falk, Gallimard, 1934, pp 155–204 (Amy Foster)
Added: 2004
AU018F: L'Appel sauvage; Jack London, French translation by Frédéric Klein
Original e-text: Ciarán Ó Duibhín
Pagination: Phébus (2003), pp 7–142
Added: 2012
AU019F: Elle; Henry Rider Haggard, French translation by Jacques Hillemacher
Original e-text: Ciarán Ó Duibhín
Pagination: Georges Crès (1920), pp 1–491
Added: 2015
AU020F: Ben-Hur; Lew Wallace, French translation by Philippe Mazoyer
Original e-text: Ciarán Ó Duibhín
Pagination: Presses Pocket (no date), pp 1–514
Added: 2009
AU022F: Pêcheur d'Islande; Pierre Loti
Original e-text: L'Institut National de la Langue Française
Pagination: INaLF, cf. Calmann-Lévy (1886), pp 3–319
Added: 2003
AU023F: La Folie Almayer; Joseph Conrad, French translation by Anne-Marie Soulac
Original e-text: Ciarán Ó Duibhín
Pagination: La Pléiade (1982), pp 5–180
Added: 2014
AU024F: La Terre qui Meurt; René Bazin
Original e-text: Ciarán Ó Duibhín
Pagination: Calmann-Lévy (c1903), pp 1–336
Added: 2003
AU031F: Ivanhoé: Walter Scott, French translation by Alexandre Dumas père
Original e-text: Jean-Yves Dupuis, Bibliothèque électronique du Québec
Pagination: BeQ, pp 11–896
Added: 2012
AU032F: Le Nègre du "Narcisse": Joseph Conrad, French translation by Robert d'Humières revised by Maurice-Paul Gautier
Original e-text: Ciarán Ó Duibhín
Pagination: La Pléiade (1982), pp 499–643
Added: 2014

ULAIDH/GEARMÁINIS:

AU012D: Taifun / Amy Foster; Joseph Conrad, German translation by Elise Eckert
Original e-text: Ciarán Ó Duibhín
Pagination: Engelhorn (1908), pp 3–168
Added: 2016
AU018D: Der Ruf der Wildnis; Jack London, German translation by Rainer von Savigny
Original e-text: Ciarán Ó Duibhín
Pagination: Artemis & Winkler (1991), pp 7–117
Added: 2014
AU020D: Ben Hur; Lew Wallace, German translation by Günter Jürgensmeier
Original e-text: Ciarán Ó Duibhín
Pagination: Deutcher Taschenbuch Verlag, 2010, pp 7–596
On the occasions when the Jürgensmeier translation omits material, or re-orders material significantly, the corresponding sentences may be taken from the German translation of Karl Wilding (1906, marked on output by ¹...¹), or from that of Matthäus Winter (1906, marked by ²...²). All page and line numbers relate to the Jürgensmeier translation.
Added: 2014
AU022D: Islandfischer; Pierre Loti, German translation by Carmen Sylva
Original e-text: Projekt GaGa (http://www.gaga.net/pgdp/default.php)
Pagination: Emil Strauss, Bonn (1898), pp 1–333
Added: 2012
AU023D: Almayers Wahn; Joseph Conrad, German translation by Günther Danehl
Original e-text: Ciarán Ó Duibhín
Pagination: Fischer Taschenbuch Verlag (1972), pp 9–159
Added: 2016
AU025D: Die Gräber von Kilmorna; Canon Sheehan, German translation by Oskar Jacob
Original e-text: Ciarán Ó Duibhín
Pagination: Benziger (no date), pp 7–382
Added: 2021
AU032D: Der Nigger vom Narzissus; Joseph Conrad, German translation by Ernst W Freißler
Original e-text: Ciarán Ó Duibhín
Pagination: S Fischer Verlag (1951), pp 5–162
Added: 2019
AU033D: Captain Blood; Rafael Sabatini, German translation by Joachim Pente
Original e-text: Ciarán Ó Duibhín
Pagination: Unionsverlag (2014), pp 7–399
Added: 2016
AU037D: Das Haarband de Königin; Canon Sheehan, German translation by Oskar Jacob      ÚR!
Some sections of the text which do not translate the original are omitted.
Original e-text: Ciarán Ó Duibhín
Pagination: Benziger (1919), pp 9–482
Added: 2023
AU038D: Lisheen; Canon Sheehan, German translation by Oskar Jacob
Original e-text: Ciarán Ó Duibhín
Pagination: Benziger (1914), pp 9–476
Added: 2021

ULAIDH/RÚISIS:

LU071R: Шёлковая шаль (4 scéal as "Fallaing Shíoda"), Russian translation by Анна Александровнa Коростелёвa
Original e-text: http://web.archive.org/web/20050225010305/http://irish.ru:80/index.php?referer=34&show=liter/moira03
http://web.archive.org/web/20050225010109/http://irish.ru:80/index.php?referer=34&show=liter/moira04
http://web.archive.org/web/20050224181211/http://irish.ru:80/index.php?referer=34&show=liter/fiddler01
http://web.archive.org/web/20050302011503/http://irish.ru:80/index.php?referer=34&show=liter/fiddler02
http://web.archive.org/web/20050225005851/http://irish.ru:80/index.php?referer=34&show=liter/moira05
with some amendments taken from http://www.philol.msu.ru/~school/sem_irish_t_mojra.zip
Pagination: none
Added: 2016
LU075R: Всадник на белой лошади (1 scéal as "Ó Neamh go h-Árainn"), Russian translation by Анна Александровнa Коростелёвa
Original e-text: http://web.archive.org/web/20050224181320/http://irish.ru:80/index.php?referer=34&show=liter/horseman01
Pagination: none
Added: 2015
A221R: Записки охотника: Иван Сергеевич Тургенев
Only those stories which have been translated into Irish Gaelic are included.
Original e-text: http://az.lib.ru/t/turgenew_i_s/text_0080.shtml
Pagination: imposed from Глазунов (1909), partial, pp 1–385 (http://dlib.rsl.ru/viewer/01003763369#?page=1)
Added: 2016
AU018R: Зов предков; Jack London, Russian translation by Зинаида Вершинина
Original e-text: http://az.lib.ru/l/london_d/text_1903_the_call_of_the_wild.shtml
Pagination: none
Added: 2017

CONNACHTA:

LC001: Obair is Luadhainn; Colm Ó Gaora (1887–1954)
Publisher: Oifig Díolta Foillseacháin Rialtais, 1937
Original e-text: Ciarán Ó Duibhín
Added: 2016
Manuscript: ms004, not consulted
LC023: Feamainn Bhealtaine; Máirtín Ó Diréain (1910–1988)
Publisher: An Clóchomhar, 1961
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1997
Manuscript: not available
LC026: Eochaidh Mac Rí 'n-Éirinn; Micheál Ó Máille (c1880–1911)
Publisher: Connradh na Gaedhilge, 1912
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 2015
Manuscript: not available
LC027: An Mothall Sin Ort; Seán Ó Ruadháin (1883–1966)
Publisher: Oifig an tSoláthair, 1967
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1997
Manuscript: not available
(Almost) restored to the text of the original articles in "Irish Press" and "Comhar".
LC046: Pádhraic Mháire Bhán; Seán Ó Ruadháin (1883–1966)
Publisher: Oifig Díolta Foillseacháin Rialtais, 1935
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 2015
Manuscript: mp002 (typescript), not consulted
LC093: Fánaidheacht i gConndae Mhuigheo; Séamus Mag Uidhir (1902–1969)
Publisher: Oifig an tSoláthair, 1944
Original e-text: Peter K Griffin
Added: 2006
Manuscript: not available
This book is an abridged collection of pieces previously published in "An tÉireannach".
LC095: Bairbre Ruadh; Pádraic Ó Conaire (1882–1928)
Publisher: Connradh na Gaedhilge, 1908
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 2015
Manuscript: not available
C047: Amhráin Chlainne Gaedheal; Mícheál (c1880–1911)& Tomás (1880–1938) Ó Máille
Publisher: Connradh na Gaedhilge, 1925
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 2015
Manuscript: not available
C053: An Fibín; Colm Ó Gaora (1887–1954)
Publisher: Connradh na Gaedhilge, 1905
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 2015
Manuscript: not available
C055: Amhráin Chuilm de Bhailís; ed. Pádraic Ó Domhnalláin (1884–1960), Eoin Mac Néill (1867–1945)
Publisher: Connradh na Gaedhilge, 1904
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 2015
Manuscript: not available

MUMHAIN:

LM015: Cormac Ua Conaill; Pádraig Ó Duinnín (1860–1934)
Publisher: Connradh na Gaedhilge, 1901
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 2015
Manuscript: not available
LM016: Mo Sgéal Féin; Peadar Ua Laoghaire (1839–1920)
Publisher: Brún agus Ó Nóláin Teo, 1915
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 2017
Manuscript: not available
LM054: Timcheall Chinn Sléibhe; Seán Ó Dálaigh (1861–1940)
Publisher: Oifig an tSoláthair, 1933
Original e-text: Aindí Coyle
Added: 2006
Manuscript: not available
LM066: Na hAird Ó Thuaidh; Pádraig Ua Maoileoin (1913–2002)
Publisher: Sáirséal agus Dill, Dublin, 1960
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 1997
Manuscript: not available
LM102: Tadhg Gabha; Séamus Ua Dubhghaill (1855–1929)
Publisher: Connradh na Gaedhilge, 1904
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 2015
Manuscript: not available
M170: Sgeulaidheacht Chúige Mumhan: Pádraig Ó Laoghaire (1870–1896)
Publisher: Pádraig Ua Briain, 1895
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 2015
Manuscript: not available

ALBA:

COMPANAC: Companach na Cloinne; Iain Mac Phàidein (1850–1935)
Publisher: Eneas MacKay, Stirling, 1912
Original e-text: Ciarán Ó Duibhín
Added: 1997
Manuscript: not available
TRIDEALB: Tri Dealbhan Cluiche; Alasdair Caimbeul (c1944–)
Publisher: Cló Ostaig, An t-Eilean Sgìtheanach, 1990
Original e-text: Caoimhín Ó Donnghaile, Sabhal Mòr Ostaig
Added: 1997
Manuscript: not available
BODACH: Am Measg nam Bodach; various authors
Publisher: An Comunn Gàidhealach, Glaschu, 1938
Original e-text: Ciarán Ó Duibhín
Added: 1998
Manuscript: not available
SEANCH: Seanchaidh na Tràghad; Iain Mac Cormaic (1860–1947)
Publisher: Eneas MacKay, Stirling, 1911
Original e-text: Ciarán Ó Duibhín
Added: 1998
Manuscript: not available
AIRIGH: Seanchaidh na h-Àirigh; Iain Mac Cormaic (1860–1947)
Publisher: Eneas MacKay, Stirling, 1911
Original e-text: Ciarán Ó Duibhín
Added: 2014
Manuscript: not available

OIRTHEAR:

U018: Sgéalta Mhuintir Luinigh; eag. Éamonn Ó Tuathail (lgh 1–178)
Publisher: Irish Folklore Institute, 1933
Original e-text: Foclóir na Nua-Ghaeilge, Acadamh Ríoga na hÉireann, Dublin
Added: 2010
Manuscript: not available
U039: Sgéalaidhe Óirghiall; eag. Seosamh Laoide
Publisher: Connradh na Gaedhilge, Dublin, 1905
Original e-text: Foclóir na Nua-Ghaeilge, Dublin
Added: 2015
Manuscript: not available
U136: Mion-chaint na Midhe; Pádraig Ua Dubhthaigh
Publisher: Connradh na Gaedhilge, Dublin, 1905
Original e-text: Foclóir na Nua-Ghaeilge, Dublin
Added: 2016
Manuscript: not available

Keyboard layouts

If you are not already using a satisfactory method of keyboarding Gaelic text,
you may find one of the Gaelic keyboard layouts which are included with Tobar na Gaedhilge useful.
However, the use of these layouts and of Tobar na Gaedhilge are completely independent of one another.

These layouts provide keying of the pre-composed accented characters required for the Gaelic languages,
as well as the most common such characters for several other languages, on standard US and UK keyboard hardware.

Information and instructions regarding three layouts may be found in the directory into which Tobar na Gaedhilge was installed.  The layouts are:

Miscellaneous

• Citing Tobar na Gaedhilge

If you publish results which have benefitted from the use of Tobar na Gaedhilge, it would be appreciated if you would cite us among your sources. A suggested form of citation for this version is:

Ciarán Ó Duibhín, Tobar na Gaedhilge, version 1.14 (2023), Gaelic textbase and retrieval system for use under MS Windows, freely downloadable from http://www.smo.uhi.ac.uk/~oduibhin/tobar/index.htm

• WARNING: Always check the sources

Although great care has been taken to make the textbase accurate, and it is constantly being improved, it is advisable not to draw linguistic conclusions without checking examples against the original printed matter. Remember that you should use the Foillsighthe display to see the original published text without any corrections at all, rather than the default Lom display which incorporates corrections and in a few cases takes account of the manuscript.

• Using Tobar na Gaedhilge on college networks

Colleges wishing to make Tobar na Gaedhilge available on their student computing networks are welcome to do so.

• Are some letters being displayed as "empty boxes"?

If this is happening, it may help to install on your computer the latest versions of the Segoe UI font (used for the alphabetic menu of available words) and the Consolas font (used for the results).

• Like to help? Here's how

The compiler intends to continue to add Ulster texts to the textbase. He would welcome volunteers to:

For guidance on how to prepare a text, see here, but the only thing that really matters is to put each book line on a separate line of the file. The rest of the process can be dealt with centrally.