diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 00000000..416bf3c7 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,2 @@ +*.tf -linguist-detectable +jquery.js -linguist-vendored diff --git a/app/config.yaml b/app/config.yaml index da27447b..17734cbb 100644 --- a/app/config.yaml +++ b/app/config.yaml @@ -44,7 +44,7 @@ dataDisplay: docs: docBase: '{docRoot}/{repo}' docExt: '' - docPage: 0_home + docPage: '' docRoot: https://{org}.github.io featurePage: 0_home interfaceDefaults: {} diff --git a/bhsa-clariah-ineo.yml b/bhsa-clariah-ineo.yml new file mode 100644 index 00000000..d226ac7c --- /dev/null +++ b/bhsa-clariah-ineo.yml @@ -0,0 +1,166 @@ +intro: >- + This is the text-fabric representation of the Hebrew Bible Database, + containing the text of the Hebrew Bible augmented with linguistic annotations. +properties: + access: + - link: https://creativecommons.org/licenses/by-nc/4.0/ + title: CC-BY-NC + community: + - title: >- + The Slack community in etcbc-vu has a high question-answering and + problem solving potential. If you need an invite, ask for it who is + already part of it, and if you do not know one, ask one the contact + persons + development: + - link: https://dans.knaw.nl/en/ + title: DANS + - link: https://di.huc.knaw.nl + title: Humanities Cluster - Digital Infrastructure + - link: http://etcbc.nl/ + title: ETCBC + - title: >- + Eep Talstra, Constantijn Sikkel, Willem van Peursen, Dirk Roorda, Cody Kingham, Martijn Naaijer + generalContact: + - link: http://etcbc.nl/contact/ + title: ETCBC Contact + informationTypes: + - '1' + intro: Biblia Hebraica Stuttgartensia Amstelodamensis + languages: + - Hebrew + - Aramaic + - English + learn: + - label: >- + There is an extensive set of tutorials for working with the BHSA by + means of Text-Fabric. 
+ link: https://github.com/ETCBC/bhsa/tree/master/tutorial + title: Repository + - link: >- + https://nbviewer.jupyter.org/github/ETCBC/bhsa/blob/master/tutorial/start.ipynb + title: Entry point + link: https://github.com/ETCBC/bhsa/ + mediaTypes: + - 'text ' + problemContact: + - link: https://pure.knaw.nl/portal/nl/persons/dirk-roorda + title: Dr. Dirk Roorda + programmingLanguages: + - link: https://www.python.org + title: Python 3.6 + researchActivities: + - '1' + - '1.1' + - 1.1.4 + - 1.1.7 + - 1.7.1 + - 2.1.4 + - 2.4.1 + - '5.1' + - '6' + researchContact: + - link: https://research.vu.nl/en/persons/eep-talstra + title: Prof. dr. Eep Talstra + - link: + title: Prof. dr. Willem van Peursen + researchDomains: + - '11.15' + - '11.17' + - '19.3' + resourceHost: + - link: https://etcbc.github.io/bhsa/ + title: ETCBC Github + resourceOwner: + - link: http://etcbc.nl/ + title: ETCBC + resourceTypes: + - Data + sourceCodeLocation: + - link: https://github.com/ETCBC/bhsa/ + standards: + - link: https://pypi.org/project/text-fabric/ + title: 'Text-Fabric ' + status: + - Active + versions: + - link: https://github.com/ETCBC/bhsa/releases/tag/v1.7.3 + title: 1.7.3 +relatedProjects: + - 'LinkSyr: Linking Syriac Data' +relatedResources: + - This resource is not (yet) available +slug: bhsa +tabs: + learn: + body: "## Learn\nDifferent ways to explore this dataset are supported.\n\n•\tUsing the website SHEBANQ for users that do not want to use the resource programmatically: you can execute linguistic queries and save and publish them.\n\n![](https://cdn.sanity.io/images/0v602vuh/production/be69557154a0a694960f71b4045fd6673b2a694e-3120x3364.png?auto=format&fit=crop&dpr=1&fit=fill&q=80&w=1400)\n\n•\tUse the Text-Fabric browser. You need Python, but you do not have to program in it. 
You can execute queries in your browser, served by a local webserver.\n\n![](https://cdn.sanity.io/images/0v602vuh/production/d959213a1276b09c9eddfdb03302f353c8f7a8e2-3154x2698.png?auto=format&fit=crop&dpr=1&fit=fill&q=80&w=700)\n\n•\tUse Text-Fabric as a library. You need to program in Python. You can build data workflows, and you can write exploratory Jupyter notebooks, by which you have ultimate control over the data, and powerful methods to render parts of the corpus in rich displays.\n\n![](https://cdn.sanity.io/images/0v602vuh/production/fbd4a1c6fe6396280a742e9146d2b21c6160eee9-2264x3398.png?auto=format&fit=crop&dpr=1&fit=fill&q=80&w=700)\n\n* Text-Fabric is on the [Python Package Index](https://pypi.org/project/text-fabric/) and can be installed by means of pip. Once Text-Fabric is installed, it will fetch a working copy of the data to your computer when it needs it. You can also obtain the data directly from [GitHub](https://github.com/etcbc/bhsa/).\n\n* There is an extensive set of tutorials for working with the BHSA by means of Text-Fabric.\n* Repo: https://github.com/annotation/tutorials/tree/master/bhsa\n* Entry point: https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/bhsa/start.ipynb" + mentions: + body: "## Publications\n*\t[Coding the Hebrew Bible](https://doi.org/10.1163/24523666-01000011)\n*\t[The Hebrew Bible as Data: Laboratory – Sharing – Experiences](https://doi.org/10.5334/bbi.18 ). CLARIN in the Low Countries, Ch. 18. \n" + overview: + body: >+ + ## Overview + + * This [text-fabric + ](https://annotation.github.io/text-fabric/tf)representation of the Hebrew + Bible Database contains the text of the Hebrew Bible augmented with + linguistic annotations compiled by the [Eep Talstra Centre for Bible and + Computer](http://etcbc.nl/), VU University Amsterdam. 
+ + * The text is based on the [Biblia Hebraica + Stuttgartensia](https://www.academic-bible.com/en/online-bibles/biblia-hebraica-stuttgartensia-bhs/read-the-bible-text/) + edited by Karl Elliger and Wilhelm Rudolph, Fifth Revised Edition, edited + by Adrian Schenker, © 1977 and 1997 Deutsche Bibelgesellschaft, Stuttgart. + + * The [text-fabric ](https://annotation.github.io/text-fabric/tf)version + has been prepared by Dirk Roorda, [Data Archiving and Networked + Services](https://dans.knaw.nl/nl), with support from Martijn Naaijer, + Cody Kingham, and Constantijn Sikkel. + + * The data is available in more formats. In the SHEBANQ subdirectory you + find data in MQL format and in MYSQL format that directly goes into the + [SHEBANQ website](http://shebanq.ancient-data.org/). + + * In the + [bigTables](https://github.com/ETCBC/bhsa/blob/master/programs/bigTables.ipynb) + you find ways to export the complete data as one big table, and store it + in R format or in Pandas format. The notebooks + [bigTablesP](https://github.com/ETCBC/bhsa/blob/master/programs/bigTablesP.ipynb) + and + [bigTablesR](https://github.com/ETCBC/bhsa/blob/master/programs/bigTablesR.ipynb) + show you a few things that you can do in R and Pandas. + + bodyMore: > + This dataset contains a precise transcription of the Codex Leningradensis. + It follows the Biblia Hebraica Stuttgartensia. The text is augmented with + linguistic annotations, from lemmatization and morphology, to syntax and + discourse structures. + + + All this data is represented in such a way that you can compute with it. + Text and annotations are transparently encoded in plain text files. The + Python library Text-Fabric offers a browsing/searching/computing interface + to this data. The website https://shebanq.ancient-data.org is based on the + very same data. Text-Fabric also supports the publishing of your own + results so that others can use it alongside the main dataset. 
+ + + The data is licensed by the [CC-BY-NC + license](https://creativecommons.org/licenses/by-nc/4.0/). This means that + you can do everything you want with it, provided you give attribution and + you do not use it commercially. For commercial use you have to contact the + German Bible Society. As long as you stay within these restrictions, you + may select, copy and modify this data in all quantities you like, and also + re-publish it under whatever license, provided the new license does not + permit commercial re-use. + + + ### Provenance + + The source data resides on a server of the ETCBC, managed by Constantijn + Sikkel. He makes that data available as an MQL database dump, together + with supplementary data files. From there it is transported to this GitHub + repo by means of a [pipeline](https://github.com/ETCBC/pipeline). This + dataset contains several versions of the BHSA, from 2011 till now. When + you navigate to a version, you'll see more information about that version + and its provenance. For all versions the + [pipeline](https://github.com/ETCBC/pipeline) has been followed. 
+title: BHSA diff --git a/programs/versionMappings.ipynb b/programs/versionMappings.ipynb index 89080475..bd90b3ae 100644 --- a/programs/versionMappings.ipynb +++ b/programs/versionMappings.ipynb @@ -271,6 +271,16 @@ "# Computing" ] }, + { + "cell_type": "code", + "execution_count": 1, + "metadata": {}, + "outputs": [], + "source": [ + "%load_ext autoreload\n", + "%autoreload 2" + ] + }, { "cell_type": "code", "execution_count": 2, @@ -280,7 +290,7 @@ "import os\n", "import collections\n", "from functools import reduce\n", - "from utils import caption\n", + "from tf.dataset.nodemaps import caption\n", "from tf.fabric import Fabric" ] }, @@ -293,13 +303,14 @@ }, { "cell_type": "code", - "execution_count": 23, + "execution_count": 3, "metadata": {}, "outputs": [], "source": [ "REPO = os.path.expanduser(\"~/github/etcbc/bhsa\")\n", "baseDir = \"{}/tf\".format(REPO)\n", "tempDir = \"{}/_temp\".format(REPO)\n", + "SILENT = \"auto\"\n", "\n", "versions = \"\"\"\n", " 3\n", @@ -338,7 +349,7 @@ }, { "cell_type": "code", - "execution_count": 24, + "execution_count": 4, "metadata": { "lines_to_next_cell": 2 }, @@ -348,27 +359,21 @@ "output_type": "stream", "text": [ "..............................................................................................\n", - ". 5m 27s Version -> 2017 <- loading ... .\n", + ". 0.00s Version -> 2017 <- loading ... .\n", "..............................................................................................\n", - "This is Text-Fabric 9.1.7\n", + "This is Text-Fabric 10.2.0\n", "Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html\n", "\n", "115 features found and 0 ignored\n", - " 0.00s loading features ...\n", - " | 0.00s Dataset without structure sections in otext:no structure functions in the T-API\n", - " 18s All features loaded/computed - for details use TF.isLoaded()\n", "..............................................................................................\n", - ". 
5m 45s Version -> 2021 <- loading ... .\n", + ". 1.85s Version -> 2021 <- loading ... .\n", "..............................................................................................\n", - "This is Text-Fabric 9.1.7\n", + "This is Text-Fabric 10.2.0\n", "Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html\n", "\n", - "115 features found and 0 ignored\n", - " 0.00s loading features ...\n", - " | 0.00s Dataset without structure sections in otext:no structure functions in the T-API\n", - " 22s All features loaded/computed - for details use TF.isLoaded()\n", + "116 features found and 0 ignored\n", "..............................................................................................\n", - ". 6m 07s All versions loaded .\n", + ". 3.69s All versions loaded .\n", "..............................................................................................\n" ] } @@ -379,10 +384,10 @@ "for v in versions:\n", " for (param, value) in versionInfo.get(v, versionInfo[\"\"]).items():\n", " globals()[param] = value\n", - " caption(4, \"Version -> {} <- loading ...\".format(v))\n", - " TF[v] = Fabric(locations=\"{}/{}\".format(baseDir, v), modules=[\"\"])\n", + " caption(4, \"Version -> {} <- loading ...\".format(v), silent=SILENT)\n", + " TF[v] = Fabric(locations=\"{}/{}\".format(baseDir, v), modules=[\"\"], silent=SILENT)\n", " api[v] = TF[v].load(\"{} {}\".format(OCC, LEX)) # noqa F821\n", - "caption(4, \"All versions loaded\")" + "caption(4, \"All versions loaded\", silent=SILENT)" ] }, { @@ -396,7 +401,7 @@ }, { "cell_type": "code", - "execution_count": 25, + "execution_count": 5, "metadata": {}, "outputs": [], "source": [ @@ -404,7 +409,7 @@ " for (param, value) in versionInfo.get(v, versionInfo[\"\"]).items():\n", " globals()[param] = value\n", " api[v].makeAvailableIn(globals())\n", - " caption(4, \"Active version is now -> {} <-\".format(v))" + " caption(4, \"Active version is now -> {} <-\".format(v), silent=SILENT)" ] }, { @@ 
-416,7 +421,7 @@ }, { "cell_type": "code", - "execution_count": 26, + "execution_count": 6, "metadata": {}, "outputs": [ { @@ -424,13 +429,11 @@ "output_type": "stream", "text": [ "..............................................................................................\n", - ". 6m 09s Active version is now -> 2017 <- .\n", + ". 7.49s Active version is now -> 2017 <- .\n", "..............................................................................................\n", - "| 6m 09s \t 426584 slots\n", "..............................................................................................\n", - ". 6m 09s Active version is now -> 2021 <- .\n", - "..............................................................................................\n", - "| 6m 09s \t 426590 slots\n" + ". 7.49s Active version is now -> 2021 <- .\n", + "..............................................................................................\n" ] } ], @@ -496,7 +499,7 @@ }, { "cell_type": "code", - "execution_count": 27, + "execution_count": 7, "metadata": {}, "outputs": [], "source": [ @@ -549,7 +552,7 @@ }, { "cell_type": "code", - "execution_count": 28, + "execution_count": 8, "metadata": {}, "outputs": [ { @@ -557,29 +560,28 @@ "output_type": "stream", "text": [ "..............................................................................................\n", - ". 6m 14s Masking lexemes .\n", + ". 12s Masking lexemes .\n", "..............................................................................................\n", "..............................................................................................\n", - ". 6m 14s Active version is now -> 2017 <- .\n", + ". 12s Active version is now -> 2017 <- .\n", "..............................................................................................\n", "..............................................................................................\n", - ". 
6m 15s Active version is now -> 2021 <- .\n", - "..............................................................................................\n", - "| 6m 16s Done\n" + ". 13s Active version is now -> 2021 <- .\n", + "..............................................................................................\n" ] } ], "source": [ "lexemes = {}\n", "\n", - "caption(4, \"Masking lexemes\")\n", + "caption(4, \"Masking lexemes\", silent=SILENT)\n", "for v in versions:\n", " activate(v)\n", " lexemes[v] = collections.OrderedDict()\n", " for n in F.otype.s(\"word\"):\n", " lex = Fs(LEX).v(n) # noqa F821\n", " lexemes[v][n] = (lex, mask(lex, trans=0), mask(lex))\n", - "caption(0, \"Done\")" + "caption(0, \"Done\", silent=SILENT)" ] }, { @@ -626,7 +628,7 @@ }, { "cell_type": "code", - "execution_count": 29, + "execution_count": 9, "metadata": {}, "outputs": [], "source": [ @@ -652,7 +654,7 @@ }, { "cell_type": "code", - "execution_count": 30, + "execution_count": 10, "metadata": {}, "outputs": [], "source": [ @@ -936,7 +938,7 @@ }, { "cell_type": "code", - "execution_count": 31, + "execution_count": 11, "metadata": { "lines_to_end_of_cell_marker": 2 }, @@ -1083,14 +1085,14 @@ }, { "cell_type": "code", - "execution_count": 32, + "execution_count": 12, "metadata": {}, "outputs": [], "source": [ "def edgesFromMaps():\n", " edges.clear()\n", " for ((v, w), mp) in sorted(mappings.items()):\n", - " caption(4, \"Make edge from slot mapping {} => {}\".format(v, w))\n", + " caption(4, \"Make edge from slot mapping {} => {}\".format(v, w), silent=SILENT)\n", "\n", " edge = {}\n", " dm = dissimilarity[(v, w)]\n", @@ -2964,7 +2966,7 @@ }, { "cell_type": "code", - "execution_count": 33, + "execution_count": 13, "metadata": {}, "outputs": [], "source": [ @@ -2977,7 +2979,7 @@ }, { "cell_type": "code", - "execution_count": 34, + "execution_count": 14, "metadata": {}, "outputs": [ { @@ -3117,7 +3119,7 @@ }, { "cell_type": "code", - "execution_count": 35, + "execution_count": 15, 
"metadata": {}, "outputs": [ { @@ -3125,7 +3127,7 @@ "output_type": "stream", "text": [ "..............................................................................................\n", - ". 6m 40s Make edge from slot mapping 2017 => 2021 .\n", + ". 47s Make edge from slot mapping 2017 => 2021 .\n", "..............................................................................................\n" ] } @@ -3143,7 +3145,7 @@ }, { "cell_type": "code", - "execution_count": 36, + "execution_count": 16, "metadata": {}, "outputs": [], "source": [ @@ -3153,7 +3155,7 @@ }, { "cell_type": "code", - "execution_count": 37, + "execution_count": 17, "metadata": {}, "outputs": [], "source": [ @@ -3169,12 +3171,12 @@ }, { "cell_type": "code", - "execution_count": 38, + "execution_count": 18, "metadata": {}, "outputs": [], "source": [ "def makeNodeMapping(nodeType, v, w, force=False):\n", - " caption(2, \"Mapping {} nodes {} ==> {}\".format(nodeType, v, w))\n", + " caption(2, \"Mapping {} nodes {} ==> {}\".format(nodeType, v, w), silent=SILENT)\n", " mapKey = (v, w)\n", " edge = edges[mapKey]\n", "\n", @@ -3186,7 +3188,8 @@ " mapping = {}\n", " diag = {}\n", " caption(\n", - " 0, \"Extending slot mapping {} ==> {} for {} nodes\".format(*mapKey, nodeType)\n", + " 0, \"Extending slot mapping {} ==> {} for {} nodes\".format(*mapKey, nodeType),\n", + " silent=SILENT\n", " )\n", " for n in api[v].F.otype.s(nodeType):\n", " slots = api[v].E.oslots.s(n)\n", @@ -3251,17 +3254,17 @@ "\n", " diagnosis.setdefault(mapKey, {})[nodeType] = diag\n", " nodeMapping.setdefault(mapKey, {})[nodeType] = mapping\n", - " caption(0, \"\\tDone\")" + " caption(0, \"\\tDone\", silent=SILENT)" ] }, { "cell_type": "code", - "execution_count": 39, + "execution_count": 19, "metadata": {}, "outputs": [], "source": [ "def exploreNodeMapping(nodeType, v, w, force=False):\n", - " caption(4, \"Statistics for {} ==> {} ({})\".format(v, w, nodeType))\n", + " caption(4, \"Statistics for {} ==> {} ({})\".format(v, w, 
nodeType), silent=SILENT)\n", " mapKey = (v, w)\n", " diag = diagnosis[mapKey][nodeType]\n", " total = len(diag)\n", @@ -3273,19 +3276,19 @@ " for (n, dia) in diag.items():\n", " reasons[dia] += 1\n", "\n", - " caption(0, \"\\t{:<30} : {:6.2f}% {:>7}x\".format(\"TOTAL\", 100, total))\n", + " caption(0, \"\\t{:<30} : {:6.2f}% {:>7}x\".format(\"TOTAL\", 100, total), silent=SILENT)\n", " for stat in statLabels:\n", " statLabel = statLabels[stat]\n", " amount = reasons[stat]\n", " if amount == 0:\n", " continue\n", " perc = 100 * amount / total\n", - " caption(0, \"\\t{:<30} : {:6.2f}% {:>7}x\".format(statLabel, perc, amount))" + " caption(0, \"\\t{:<30} : {:6.2f}% {:>7}x\".format(statLabel, perc, amount), silent=SILENT)" ] }, { "cell_type": "code", - "execution_count": 40, + "execution_count": 20, "metadata": { "lines_to_next_cell": 2 }, @@ -3297,197 +3300,123 @@ "\n", "**********************************************************************************************\n", "* *\n", - "* 6m 46s Mapping book nodes 2017 ==> 2021 *\n", + "* 53s Mapping book nodes 2017 ==> 2021 *\n", "* *\n", "**********************************************************************************************\n", "\n", - "| 6m 46s Extending slot mapping 2017 ==> 2021 for book nodes\n", - "| 6m 57s \tDone\n", "..............................................................................................\n", - ". 6m 57s Statistics for 2017 ==> 2021 (book) .\n", + ". 
1m 02s Statistics for 2017 ==> 2021 (book) .\n", "..............................................................................................\n", - "| 6m 57s \tTOTAL : 100.00% 39x\n", - "| 6m 57s \tunique, perfect : 100.00% 39x\n", "\n", "**********************************************************************************************\n", "* *\n", - "* 6m 57s Mapping chapter nodes 2017 ==> 2021 *\n", + "* 1m 02s Mapping chapter nodes 2017 ==> 2021 *\n", "* *\n", "**********************************************************************************************\n", "\n", - "| 6m 57s Extending slot mapping 2017 ==> 2021 for chapter nodes\n", - "| 6m 59s \tDone\n", "..............................................................................................\n", - ". 6m 59s Statistics for 2017 ==> 2021 (chapter) .\n", + ". 1m 04s Statistics for 2017 ==> 2021 (chapter) .\n", "..............................................................................................\n", - "| 6m 59s \tTOTAL : 100.00% 929x\n", - "| 6m 59s \tunique, perfect : 100.00% 929x\n", "\n", "**********************************************************************************************\n", "* *\n", - "* 6m 59s Mapping lex nodes 2017 ==> 2021 *\n", + "* 1m 04s Mapping lex nodes 2017 ==> 2021 *\n", "* *\n", "**********************************************************************************************\n", "\n", - "| 6m 59s Extending slot mapping 2017 ==> 2021 for lex nodes\n", - "| 7m 14s \tDone\n", "..............................................................................................\n", - ". 7m 14s Statistics for 2017 ==> 2021 (lex) .\n", + ". 
1m 11s Statistics for 2017 ==> 2021 (lex) .\n", "..............................................................................................\n", - "| 7m 14s \tTOTAL : 100.00% 9233x\n", - "| 7m 14s \tunique, perfect : 99.79% 9214x\n", - "| 7m 14s \tunique, imperfect : 0.13% 12x\n", - "| 7m 14s \tmultiple, cleanly composed : 0.02% 2x\n", - "| 7m 14s \tmultiple, non-perfect : 0.05% 5x\n", "\n", "**********************************************************************************************\n", "* *\n", - "* 7m 14s Mapping verse nodes 2017 ==> 2021 *\n", + "* 1m 11s Mapping verse nodes 2017 ==> 2021 *\n", "* *\n", "**********************************************************************************************\n", "\n", - "| 7m 14s Extending slot mapping 2017 ==> 2021 for verse nodes\n", - "| 7m 16s \tDone\n", "..............................................................................................\n", - ". 7m 16s Statistics for 2017 ==> 2021 (verse) .\n", + ". 1m 12s Statistics for 2017 ==> 2021 (verse) .\n", "..............................................................................................\n", - "| 7m 16s \tTOTAL : 100.00% 23213x\n", - "| 7m 16s \tunique, perfect : 100.00% 23213x\n", "\n", "**********************************************************************************************\n", "* *\n", - "* 7m 16s Mapping half_verse nodes 2017 ==> 2021 *\n", + "* 1m 12s Mapping half_verse nodes 2017 ==> 2021 *\n", "* *\n", "**********************************************************************************************\n", "\n", - "| 7m 16s Extending slot mapping 2017 ==> 2021 for half_verse nodes\n", - "| 7m 19s \tDone\n", "..............................................................................................\n", - ". 7m 19s Statistics for 2017 ==> 2021 (half_verse) .\n", + ". 
1m 13s Statistics for 2017 ==> 2021 (half_verse) .\n", "..............................................................................................\n", - "| 7m 19s \tTOTAL : 100.00% 45180x\n", - "| 7m 19s \tunique, perfect : 100.00% 45178x\n", - "| 7m 19s \tunique, imperfect : 0.00% 2x\n", "\n", "**********************************************************************************************\n", "* *\n", - "* 7m 19s Mapping sentence nodes 2017 ==> 2021 *\n", + "* 1m 13s Mapping sentence nodes 2017 ==> 2021 *\n", "* *\n", "**********************************************************************************************\n", "\n", - "| 7m 19s Extending slot mapping 2017 ==> 2021 for sentence nodes\n", - "| 7m 22s \tDone\n", "..............................................................................................\n", - ". 7m 22s Statistics for 2017 ==> 2021 (sentence) .\n", + ". 1m 14s Statistics for 2017 ==> 2021 (sentence) .\n", "..............................................................................................\n", - "| 7m 22s \tTOTAL : 100.00% 63711x\n", - "| 7m 22s \tunique, perfect : 99.76% 63559x\n", - "| 7m 22s \tunique, imperfect : 0.15% 94x\n", - "| 7m 22s \tmultiple, cleanly composed : 0.05% 35x\n", - "| 7m 22s \tmultiple, non-perfect : 0.04% 23x\n", "\n", "**********************************************************************************************\n", "* *\n", - "* 7m 22s Mapping sentence_atom nodes 2017 ==> 2021 *\n", + "* 1m 15s Mapping sentence_atom nodes 2017 ==> 2021 *\n", "* *\n", "**********************************************************************************************\n", "\n", - "| 7m 22s Extending slot mapping 2017 ==> 2021 for sentence_atom nodes\n", - "| 7m 24s \tDone\n", "..............................................................................................\n", - ". 7m 24s Statistics for 2017 ==> 2021 (sentence_atom) .\n", + ". 
1m 16s Statistics for 2017 ==> 2021 (sentence_atom) .\n", "..............................................................................................\n", - "| 7m 24s \tTOTAL : 100.00% 64486x\n", - "| 7m 24s \tunique, perfect : 99.78% 64342x\n", - "| 7m 24s \tunique, imperfect : 0.14% 92x\n", - "| 7m 24s \tmultiple, cleanly composed : 0.07% 44x\n", - "| 7m 24s \tmultiple, non-perfect : 0.01% 8x\n", "\n", "**********************************************************************************************\n", "* *\n", - "* 7m 24s Mapping clause nodes 2017 ==> 2021 *\n", + "* 1m 16s Mapping clause nodes 2017 ==> 2021 *\n", "* *\n", "**********************************************************************************************\n", "\n", - "| 7m 24s Extending slot mapping 2017 ==> 2021 for clause nodes\n", - "| 7m 27s \tDone\n", "..............................................................................................\n", - ". 7m 27s Statistics for 2017 ==> 2021 (clause) .\n", + ". 1m 17s Statistics for 2017 ==> 2021 (clause) .\n", "..............................................................................................\n", - "| 7m 27s \tTOTAL : 100.00% 88101x\n", - "| 7m 27s \tunique, perfect : 99.85% 87967x\n", - "| 7m 27s \tunique, imperfect : 0.09% 75x\n", - "| 7m 27s \tmultiple, cleanly composed : 0.06% 50x\n", - "| 7m 27s \tmultiple, non-perfect : 0.01% 9x\n", "\n", "**********************************************************************************************\n", "* *\n", - "* 7m 27s Mapping clause_atom nodes 2017 ==> 2021 *\n", + "* 1m 17s Mapping clause_atom nodes 2017 ==> 2021 *\n", "* *\n", "**********************************************************************************************\n", "\n", - "| 7m 27s Extending slot mapping 2017 ==> 2021 for clause_atom nodes\n", - "| 7m 30s \tDone\n", "..............................................................................................\n", - ". 
7m 30s Statistics for 2017 ==> 2021 (clause_atom) .\n", + ". 1m 18s Statistics for 2017 ==> 2021 (clause_atom) .\n", "..............................................................................................\n", - "| 7m 30s \tTOTAL : 100.00% 90669x\n", - "| 7m 30s \tunique, perfect : 99.86% 90543x\n", - "| 7m 30s \tunique, imperfect : 0.08% 69x\n", - "| 7m 30s \tmultiple, cleanly composed : 0.06% 55x\n", - "| 7m 30s \tmultiple, non-perfect : 0.00% 2x\n", "\n", "**********************************************************************************************\n", "* *\n", - "* 7m 30s Mapping phrase nodes 2017 ==> 2021 *\n", + "* 1m 18s Mapping phrase nodes 2017 ==> 2021 *\n", "* *\n", "**********************************************************************************************\n", "\n", - "| 7m 30s Extending slot mapping 2017 ==> 2021 for phrase nodes\n", - "| 7m 34s \tDone\n", "..............................................................................................\n", - ". 7m 34s Statistics for 2017 ==> 2021 (phrase) .\n", + ". 
1m 20s Statistics for 2017 ==> 2021 (phrase) .\n", "..............................................................................................\n", - "| 7m 34s \tTOTAL : 100.00% 253187x\n", - "| 7m 34s \tunique, perfect : 99.94% 253042x\n", - "| 7m 34s \tunique, imperfect : 0.04% 93x\n", - "| 7m 34s \tmultiple, cleanly composed : 0.02% 50x\n", - "| 7m 34s \tmultiple, non-perfect : 0.00% 2x\n", "\n", "**********************************************************************************************\n", "* *\n", - "* 7m 34s Mapping phrase_atom nodes 2017 ==> 2021 *\n", + "* 1m 20s Mapping phrase_atom nodes 2017 ==> 2021 *\n", "* *\n", "**********************************************************************************************\n", "\n", - "| 7m 34s Extending slot mapping 2017 ==> 2021 for phrase_atom nodes\n", - "| 7m 37s \tDone\n", "..............................................................................................\n", - ". 7m 37s Statistics for 2017 ==> 2021 (phrase_atom) .\n", + ". 1m 22s Statistics for 2017 ==> 2021 (phrase_atom) .\n", "..............................................................................................\n", - "| 7m 37s \tTOTAL : 100.00% 267519x\n", - "| 7m 37s \tunique, perfect : 99.95% 267396x\n", - "| 7m 37s \tunique, imperfect : 0.03% 84x\n", - "| 7m 37s \tmultiple, cleanly composed : 0.01% 39x\n", "\n", "**********************************************************************************************\n", "* *\n", - "* 7m 37s Mapping subphrase nodes 2017 ==> 2021 *\n", + "* 1m 22s Mapping subphrase nodes 2017 ==> 2021 *\n", "* *\n", "**********************************************************************************************\n", "\n", - "| 7m 37s Extending slot mapping 2017 ==> 2021 for subphrase nodes\n", - "| 7m 39s \tDone\n", - "..............................................................................................\n", - ". 
7m 39s Statistics for 2017 ==> 2021 (subphrase) .\n", "..............................................................................................\n", - "| 7m 39s \tTOTAL : 100.00% 113784x\n", - "| 7m 39s \tunique, perfect : 63.33% 72063x\n", - "| 7m 39s \tmultiple, one perfect : 36.49% 41522x\n", - "| 7m 39s \tunique, imperfect : 0.00% 3x\n", - "| 7m 39s \tmultiple, cleanly composed : 0.00% 4x\n", - "| 7m 39s \tmultiple, non-perfect : 0.15% 168x\n", - "| 7m 39s \tnot mapped : 0.02% 24x\n" + ". 1m 22s Statistics for 2017 ==> 2021 (subphrase) .\n", + "..............................................................................................\n" ] } ], @@ -3515,14 +3444,14 @@ }, { "cell_type": "code", - "execution_count": 41, + "execution_count": 21, "metadata": {}, "outputs": [], "source": [ "def writeMaps():\n", " for ((v1, v2), edge) in sorted(edges.items()):\n", " fName = \"omap@{}-{}\".format(v1, v2)\n", - " caption(4, \"Write edge as TF feature {}\".format(fName))\n", + " caption(4, \"Write edge as TF feature {}\".format(fName), silent=SILENT)\n", "\n", " edgeFeatures = {fName: edge}\n", " metaData = {\n", @@ -3541,12 +3470,13 @@ " nodeFeatures={},\n", " edgeFeatures=edgeFeatures,\n", " metaData=metaData,\n", + " silent=SILENT,\n", " )" ] }, { "cell_type": "code", - "execution_count": 42, + "execution_count": 23, "metadata": {}, "outputs": [ { @@ -3554,25 +3484,24 @@ "output_type": "stream", "text": [ "..............................................................................................\n", - ". 7m 45s Write mappings as TF edges .\n", + ". 2m 46s Write mappings as TF edges .\n", "..............................................................................................\n", - "| 7m 45s \t 2017 ==> 2021\n", "..............................................................................................\n", - ". 7m 45s Write edge as TF feature omap@2017-2021 .\n", + ". 
2m 46s Write edge as TF feature omap@2017-2021 .\n", "..............................................................................................\n", "..............................................................................................\n", - ". 7m 45s Active version is now -> 2021 <- .\n", + ". 2m 46s Active version is now -> 2021 <- .\n", "..............................................................................................\n", " 0.00s Exporting 0 node and 1 edge and 0 config features to ~/github/etcbc/bhsa/tf/2021:\n", - " | 3.44s T omap@2017-2021 to ~/github/etcbc/bhsa/tf/2021\n", - " 3.44s Exported 0 node features and 1 edge features and 0 config features to ~/github/etcbc/bhsa/tf/2021\n" + " | 1.76s T omap@2017-2021 to ~/github/etcbc/bhsa/tf/2021\n", + " 1.76s Exported 0 node features and 1 edge features and 0 config features to ~/github/etcbc/bhsa/tf/2021\n" ] } ], "source": [ - "caption(4, \"Write mappings as TF edges\")\n", + "caption(4, \"Write mappings as TF edges\", silent=SILENT)\n", "for (v1, v2) in sorted(mappings.keys()):\n", - " caption(0, \"\\t {:>4} ==> {:<4}\".format(v1, v2))\n", + " caption(0, \"\\t {:>4} ==> {:<4}\".format(v1, v2), silent=SILENT)\n", "\n", "writeMaps()" ] @@ -3604,7 +3533,7 @@ "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", - "version": "3.10.0" + "version": "3.10.4" }, "widgets": { "application/vnd.jupyter.widget-state+json": { diff --git a/tf/2021/omap@2017-2021.tf b/tf/2021/omap@2017-2021.tf index d0d3de9b..736d555d 100644 --- a/tf/2021/omap@2017-2021.tf +++ b/tf/2021/omap@2017-2021.tf @@ -5,7 +5,7 @@ @see=https://github.com/ETCBC/bhsa/blob/master/programs/versionMappings.ipynb @valueType=int @writtenBy=Text-Fabric -@dateWritten=2021-12-09T14:51:51Z +@dateWritten=2022-08-23T09:39:37Z 1 2 diff --git a/tutorial/cookbook/accents.py b/tutorial/cookbook/accents.py deleted file mode 100644 index 018a0a41..00000000 --- a/tutorial/cookbook/accents.py +++ /dev/null @@ -1,110 +0,0 
@@ -# --- -# jupyter: -# jupytext: -# text_representation: -# extension: .py -# format_name: light -# format_version: '1.5' -# jupytext_version: 1.11.4 -# kernelspec: -# display_name: Python 3 -# language: python -# name: python3 -# --- - -# # Word sets by accents -# -# We make some classes of words, defined by the accents they contain, and save them as sets, to be used in queries. - -import re - -from tf.app import use -from tf.lib import writeSets - -A = use("bhsa:clone", hoist=globals()) - -# We define the accents and create a regular expression out of them. - -A_ACCENTS = set("04 24 33 63 70 71 72 73 74 93 94".split()) - -A_PAT = "|".join(A_ACCENTS) -A_RE = re.compile(f"(?:{A_PAT})") -A_RE - -# We make two sets of words: words that contain one or more accents in `A_ACCENTS` and words that don't. -# -# The first set we call `word_a` and the other set `word_non_a`. -# -# We go through all words of the whole corpus. - -# + -wordA = set() -wordNonA = set() - -A.indent(reset=True) -A.info("Classifying words") - -for w in F.otype.s("word"): - translit = F.g_word.v(w) - if A_RE.search(translit): - wordA.add(w) - else: - wordNonA.add(w) - -A.info(f"word_a has {len(wordA):>6} members") -A.info(f"word_non_a has {len(wordNonA):>6} members") -# - - -# Collect the sets in a dictionary that assigns names to them: - -accents = dict( - word_a=wordA, - word_non_a=wordNonA, -) - -# Test the set in a query: - -query = """ -book book=Genesis - word_a - g_cons~^(?![KL]$) - trailer~[^&] -""" -results = A.search(query, sets=accents) -A.table(results, end=5) -A.table(results, end=5, fmt="text-trans-full") - -query = """ -book book=Genesis - word_non_a - g_cons~^(?![KL]$) - trailer~[^&] -""" -results = A.search(query, sets=accents) -A.table(results, end=5) -A.table(results, end=5, fmt="text-trans-full") - -# Now save the sets as a TF file in your Downloads folder (if you want it in an other place, -# tweak the variable `SET_DIR` below. 
-# -# We use the TF helper function -# [`writeSets`](https://annotation.github.io/text-fabric/tf/lib.html#tf.lib.writeSets) -# to do the work. - -# + -SET_DIR = "~/Downloads" - -writeSets(accents, f"{SET_DIR}/accents") -# - - -# Check: - -# !ls -l ~/Downloads/accents - -# Now you can use this set in the text-fabric browser by saying: -# -# ```sh -# text-fabric bhsa --sets=~/Downloads/accents -# ``` - -# ![tfbrowser](accentsScreenshot.png) diff --git a/tutorial/cookbook/export.ipynb b/tutorial/cookbook/export.ipynb new file mode 100644 index 00000000..181caf01 --- /dev/null +++ b/tutorial/cookbook/export.ipynb @@ -0,0 +1,53 @@ +{ + "cells": [ + { + "cell_type": "code", + "execution_count": 4, + "id": "aad674cb-5dbc-4959-b5a8-a2aba6f5fe8c", + "metadata": {}, + "outputs": [], + "source": [ + "from tf.app import use\n", + "A = use(\"etcbc/bhsa\", hoist=globals())\n", + "\n", + "jussive1='''\n", + "clause\n", + " word lex=>L= language=Hebrew\n", + " < word language=Hebrew sp=verb vt=impf ps=p1|p2|p3 vbe#H=\n", + "'''\n", + "results = A.search(jussive1)\n", + "\n", + "A.export(results)" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "id": "562542c5-951c-4160-b3d1-7976e8d8d120", + "metadata": {}, + "outputs": [], + "source": [] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3 (ipykernel)", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.10.4" + } + }, + "nbformat": 4, + "nbformat_minor": 5 +} diff --git a/tutorial/cookbook/namedEntity.py b/tutorial/cookbook/namedEntity.py deleted file mode 100644 index 956e3516..00000000 --- a/tutorial/cookbook/namedEntity.py +++ /dev/null @@ -1,101 +0,0 @@ -# --- -# jupyter: -# jupytext: -# text_representation: -# extension: .py -# format_name: light -# 
format_version: '1.5' -# jupytext_version: 1.11.4 -# kernelspec: -# display_name: Python 3 -# language: python -# name: python3 -# --- - -# # Named Entities in the BHSA -# -# For prelimanaries, such as installing Text-Fabric and using it, consult the -# [start tutorial](https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/bhsa/start.ipynb) -# -# We show how to fetch person/place/people/measure names from the BHSA data - -import os -from tf.app import use - -A = use("bhsa", hoist=globals()) - -# If you expand the triangle in front of BHSA above, you see which features have been loaded. -# -# We need [nametype](https://etcbc.github.io/bhsa/features/nametype/) specifically. -# It is a mapping from word numbers to types of proper names. -# -# Here is a frequency distribution of its values: - -F.nametype.freqList() - -# We query the measure names (`mens`): - -# + -query = """ -word nametype=mens -""" - -results = A.search(query) -# - - -A.table(results) - -# The frequency list promised 30 results but we see only 20. That is because there are also other things that have a name type: lexemes: - -# + -queryL = """ -lex nametype=mens -""" - -resultsL = A.search(queryL) -# - - -A.table(resultsL) - -# Let's make a data file of all words that have a name type. -# We'll produce a tab-separated file with a bit of extra information. - -# + -query = """ -word nametype gloss* -""" - -results = A.search(query) -# - - -A.table(results, end=10) - -A.show(results, start=10000, end=10003) - -A.export(results, toFile="namedEntities.tsv") - -# !head -n 20 ~/Downloads/namedEntities.tsv - -# Note that this file is in UTF16 with a byte order that is chosen such that the file opens without issue in Excel. 
-# -# If you want to read the file by Python, it works like this: - -# + -filePath = os.path.expanduser("~/Downloads/namedEntities.tsv") - -i = 0 -limit = 20 - -with open(filePath, encoding="utf16") as fh: - for line in fh: - i += 1 - cells = line.rstrip("\n").split("\t") - print(i, cells) - if i > limit: - break -# - - -# See also the documentation of the -# [export function](https://annotation.github.io/text-fabric/tf/advanced/display.html#tf.advanced.display.export) - -# CC-BY Dirk Roorda diff --git a/tutorial/data:~/github/etcbc/participants/actor/tf/2021/actor.tf b/tutorial/data:~/github/etcbc/participants/actor/tf/2021/actor.tf index 1060210d..3c14f398 100644 --- a/tutorial/data:~/github/etcbc/participants/actor/tf/2021/actor.tf +++ b/tutorial/data:~/github/etcbc/participants/actor/tf/2021/actor.tf @@ -7,7 +7,7 @@ @valueType=str @writtenBy=Text-Fabric @writtenBy=Text-Fabric -@dateWritten=2022-01-31T19:21:39Z +@dateWritten=2022-08-23T09:16:53Z 943201 JHWH JHWH diff --git a/tutorial/data:~/github/etcbc/participants/actor/tf/2021/coref.tf b/tutorial/data:~/github/etcbc/participants/actor/tf/2021/coref.tf index 1a7d5c2b..d7dc2969 100644 --- a/tutorial/data:~/github/etcbc/participants/actor/tf/2021/coref.tf +++ b/tutorial/data:~/github/etcbc/participants/actor/tf/2021/coref.tf @@ -7,7 +7,7 @@ @valueType=str @writtenBy=Text-Fabric @writtenBy=Text-Fabric -@dateWritten=2022-01-31T19:21:39Z +@dateWritten=2022-08-23T09:16:53Z 63021 1317253 63029 943206 diff --git a/tutorial/data:~/github/etcbc/participants/actor/tf/2021/prs_actor.tf b/tutorial/data:~/github/etcbc/participants/actor/tf/2021/prs_actor.tf index 4c898795..4dc3c3d2 100644 --- a/tutorial/data:~/github/etcbc/participants/actor/tf/2021/prs_actor.tf +++ b/tutorial/data:~/github/etcbc/participants/actor/tf/2021/prs_actor.tf @@ -7,7 +7,7 @@ @valueType=str @writtenBy=Text-Fabric @writtenBy=Text-Fabric -@dateWritten=2022-01-31T19:21:39Z +@dateWritten=2022-08-23T09:16:53Z 63021 >HRN 63029 >HRN BN >HRN BN JFR>L diff 
--git a/tutorial/map.ipynb b/tutorial/map.ipynb index 688feff8..da1d2c54 100644 --- a/tutorial/map.ipynb +++ b/tutorial/map.ipynb @@ -128,7 +128,7 @@ { "data": { "text/html": [ - "TF-app: ~/text-fabric-data/etcbc/bhsa/app" + "TF-app: ~/text-fabric-data/github/etcbc/bhsa/app" ], "text/plain": [ "" @@ -140,7 +140,7 @@ { "data": { "text/html": [ - "data: ~/text-fabric-data/etcbc/bhsa/tf/2021" + "data: ~/text-fabric-data/github/etcbc/bhsa/tf/2021" ], "text/plain": [ "" @@ -152,7 +152,7 @@ { "data": { "text/html": [ - "data: ~/text-fabric-data/etcbc/phono/tf/2021" + "data: ~/text-fabric-data/github/etcbc/phono/tf/2021" ], "text/plain": [ "" @@ -164,7 +164,7 @@ { "data": { "text/html": [ - "data: ~/text-fabric-data/etcbc/parallels/tf/2021" + "data: ~/text-fabric-data/github/etcbc/parallels/tf/2021" ], "text/plain": [ "" @@ -173,64 +173,21 @@ "metadata": {}, "output_type": "display_data" }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "This is Text-Fabric 9.2.3\n", - "Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html\n", - "\n", - "122 features found and 0 ignored\n" - ] - }, { "data": { "text/html": [ - "Text-Fabric: Text-Fabric API 9.2.3, etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
\n", + "Text-Fabric: Text-Fabric API 10.2.0, etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
\n", "
Parallel Passages\n", "
\n", "\n", "
\n", "
\n", - "crossref\n", + "crossref\n", "
\n", "
int
\n", - "
\n", - " 🆗 links between similar passages\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
BHSA Data: Constantijn Sikkel; Parallels Notebook: Dirk Roorda, Martijn Naaijer
\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:40:46Z
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
Parallels notebook, see https://github.com/ETCBC/parallels
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " 🆗 links between similar passages\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", @@ -241,8156 +198,667 @@ "\n", "
\n", "
\n", - "book\n", + "book\n", "
\n", "
str
\n", - "
\n", - " ✅ book name in Latin (Genesis; Numeri; Reges1; ...)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ book name in Latin (Genesis; Numeri; Reges1; ...)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "book@ll\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:55Z
\n", - "
\n", + " ✅ book name in amharic (ኣማርኛ)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "chapter\n", "
\n", + "
int
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ chapter number (1; 2; 3; ...)\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "code\n", "
\n", + "
int
\n", + "\n", + " ✅ identifier of a clause atom relationship (0; 74; 367; ...)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "book@ll\n", + "det\n", "
\n", "
str
\n", - "
\n", - " ✅ book name in amharic (ኣማርኛ)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ determinedness of phrase(atom) (det; und; NA.)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "domain\n", "
\n", + "
str
\n", + "\n", + " ✅ text type of clause (? (Unknown); N (narrative); D (discursive); Q (Quotation).)\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:20:27Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "freq_lex\n", "
\n", + "
int
\n", + "\n", + " ✅ frequency of lexemes\n", "\n", - "
\n", - "
encoders:
\n", - "
Dirk Roorda (TF)
\n", "
\n", "\n", - "
\n", - "
language:
\n", - "
ኣማርኛ
\n", + "
\n", + "
\n", + "function\n", "
\n", + "
str
\n", + "\n", + " ✅ syntactic function of phrase (Cmpl; Objc; Pred; ...)\n", "\n", - "
\n", - "
languageCode:
\n", - "
am
\n", "
\n", "\n", - "
\n", - "
languageEnglish:
\n", - "
amharic
\n", + "
\n", + "
\n", + "g_cons\n", "
\n", + "
str
\n", + "\n", + " ✅ word consonantal-transliterated (B R>CJT BR> >LHJM ...)\n", "\n", - "
\n", - "
provenance:
\n", - "
book names from wikipedia and other sources
\n", "
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", + "
\n", + "
\n", + "g_cons_utf8\n", "
\n", + "
str
\n", + "\n", + " ✅ word consonantal-Hebrew (ב ראשׁית ברא אלהים)\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "g_lex\n", "
\n", + "
str
\n", + "\n", + " ✅ lexeme pointed-transliterated (B.:- R;>CIJT B.@R@> >:ELOH ...)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "chapter\n", + "g_lex_utf8\n", "
\n", - "
int
\n", - "
\n", - " ✅ chapter number (1; 2; 3; ...)\n", - "
\n", + "
str
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ lexeme pointed-Hebrew (בְּ רֵאשִׁית בָּרָא אֱלֹה)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "g_word\n", "
\n", + "
str
\n", + "\n", + " ✅ word pointed-transliterated (B.:- R;>CI73JT B.@R@74> >:ELOHI92JM)\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:55Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "g_word_utf8\n", "
\n", + "
str
\n", + "\n", + " ✅ word pointed-Hebrew (בְּ רֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים)\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", + "
\n", + "
\n", + "gloss\n", "
\n", + "
str
\n", + "\n", + " 🆗 english translation of lexeme (beginning create god(s))\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "gn\n", "
\n", + "
str
\n", + "\n", + " ✅ grammatical gender (m; f; NA; unknown.)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "code\n", + "label\n", "
\n", - "
int
\n", - "
\n", - " ✅ identifier of a clause atom relationship (0; 74; 367; ...)\n", - "
\n", + "
str
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ (half-)verse label (half verses: A; B; C; verses: GEN 01,02)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "language\n", "
\n", + "
str
\n", + "\n", + " ✅ of word or lexeme (Hebrew; Aramaic.)\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:56Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "lex\n", "
\n", + "
str
\n", + "\n", + " ✅ lexeme consonantal-transliterated (B R>CJT/ BR>[ >LHJM/)\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", + "
\n", + "
\n", + "lex_utf8\n", "
\n", + "
str
\n", + "\n", + " ✅ lexeme consonantal-Hebrew (ב ראשׁית֜ ברא אלהים֜)\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "ls\n", "
\n", + "
str
\n", + "\n", + " ✅ lexical set, subclassification of part-of-speech (card; ques; mult)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "det\n", + "nametype\n", "
\n", "
str
\n", - "
\n", - " ✅ determinedness of phrase(atom) (det; und; NA.)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", + " ⚠️ named entity type (pers; mens; gens; topo; ppde.)\n", + "\n", "
\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", + "
\n", + "
\n", + "nme\n", "
\n", + "
str
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " ✅ nominal ending consonantal-transliterated (absent; n/a; JM, ...)\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:56Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "nu\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " ✅ grammatical number (sg; du; pl; NA; unknown.)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "number\n", "
\n", + "
int
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ sequence number of an object within its context\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "domain\n", + "otype\n", "
\n", "
str
\n", - "
\n", - " ✅ text type of clause (? (Unknown); N (narrative); D (discursive); Q (Quotation).)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " \n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "pargr\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:57Z
\n", - "
\n", + " 🆗 hierarchical paragraph number (1; 1.2; 1.2.3.4; ...)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "pdp\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ phrase dependent part-of-speech (art; verb; subs; nmpr, ...)\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "pfm\n", "
\n", + "
str
\n", + "\n", + " ✅ preformative consonantal-transliterated (absent; n/a; J, ...)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "freq_lex\n", + "prs\n", "
\n", - "
int
\n", - "
\n", - " ✅ frequency of lexemes\n", - "
\n", + "
str
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ pronominal suffix consonantal-transliterated (absent; n/a; W; ...)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "prs_gn\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:24:45Z
\n", - "
\n", + " ✅ pronominal suffix gender (m; f; NA; unknown.)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "prs_nu\n", "
\n", + "
str
\n", "\n", - "
\n", - "
provenance:
\n", - "
computed on the basis of the ETCBC core set of features
\n", - "
\n", + " ✅ pronominal suffix number (sg; du; pl; NA; unknown.)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "prs_ps\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ pronominal suffix person (p1; p2; p3; NA; unknown.)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "function\n", + "ps\n", "
\n", "
str
\n", - "
\n", - " ✅ syntactic function of phrase (Cmpl; Objc; Pred; ...)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ grammatical person (p1; p2; p3; NA; unknown.)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "qere\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:57Z
\n", - "
\n", + " ✅ word pointed-transliterated masoretic reading correction\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "qere_trailer\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ interword material -pointed-transliterated (Masoretic correction)\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "qere_trailer_utf8\n", "
\n", + "
str
\n", + "\n", + " ✅ interword material -pointed-transliterated (Masoretic correction)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_cons\n", + "qere_utf8\n", "
\n", "
str
\n", - "
\n", - " ✅ word consonantal-transliterated (B R>CJT BR> >LHJM ...)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ word pointed-Hebrew masoretic reading correction\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "rank_lex\n", "
\n", + "
int
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:57Z
\n", - "
\n", + " ✅ ranking of lexemes based on freqnuecy\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "rela\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ linguistic relation between clause/(sub)phrase(atom) (ADJ; MOD; ATR; ...)\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "sp\n", "
\n", + "
str
\n", + "\n", + " ✅ part-of-speech (art; verb; subs; nmpr, ...)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_cons_utf8\n", + "st\n", "
\n", "
str
\n", - "
\n", - " ✅ word consonantal-Hebrew (ב ראשׁית ברא אלהים)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ state of a noun (a (absolute); c (construct); e (emphatic).)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "tab\n", "
\n", + "
int
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:58Z
\n", - "
\n", + " ✅ clause atom: its level in the linguistic embedding\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "trailer\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ interword material pointed-transliterated (& 00 05 00_P ...)\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "trailer_utf8\n", "
\n", + "
str
\n", + "\n", + " ✅ interword material pointed-Hebrew (־ ׃)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_lex\n", + "txt\n", "
\n", "
str
\n", - "
\n", - " ✅ lexeme pointed-transliterated (B.:- R;>CIJT B.@R@> >:ELOH ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " ✅ text type of clause and surrounding (repetion of ? N D Q as in feature domain)\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:58Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "typ\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " ✅ clause/phrase(atom) type (VP; NP; Ellp; Ptcp; WayX)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "uvf\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ univalent final consonant consonantal-transliterated (absent; N; J; ...)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_lex_utf8\n", + "vbe\n", "
\n", "
str
\n", - "
\n", - " ✅ lexeme pointed-Hebrew (בְּ רֵאשִׁית בָּרָא אֱלֹה)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " ✅ verbal ending consonantal-transliterated (n/a; W; ...)\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:59Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "vbs\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " ✅ root formation consonantal-transliterated (absent; n/a; H; ...)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "verse\n", "
\n", + "
int
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ verse number\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_word\n", + "voc_lex\n", "
\n", "
str
\n", - "
\n", - " ✅ word pointed-transliterated (B.:- R;>CI73JT B.@R@74> >:ELOHI92JM)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " ✅ vocalized lexeme pointed-transliterated (B.: R;>CIJT BR> >:ELOHIJM)\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:04Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "voc_lex_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " ✅ vocalized lexeme pointed-Hebrew (בְּ רֵאשִׁית ברא אֱלֹהִים)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "vs\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ verbal stem (qal; piel; hif; apel; pael)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_word_utf8\n", + "vt\n", "
\n", "
str
\n", - "
\n", - " ✅ word pointed-Hebrew (בְּ רֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ verbal tense (perf; impv; wayq; infc)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "mother\n", "
\n", + "
none
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:04Z
\n", - "
\n", + " ✅ linguistic dependency between textual objects\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "oslots\n", "
\n", + "
none
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " \n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + "
\n", + "
\n", "\n", - "
\n", - "
\n", - "\n", + "
Phonetic Transcriptions\n", + "
\n", "\n", "
\n", "
\n", - "gloss\n", + "phono\n", "
\n", "
str
\n", - "
\n", - " 🆗 english translation of lexeme (beginning create god(s))\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", + " 🆗 phonological transcription (bᵊ rēšˌîṯ bārˈā ʔᵉlōhˈîm)\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:13Z
\n", + "
\n", + "
\n", + "phono_trailer\n", "
\n", + "
str
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", + " 🆗 interword material in phonological transcription\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", + "
\n", + "
\n" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "\n", - "\n" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - } - ], - "source": [ - "N = use(\"etcbc/bhsa\")" - ] - }, - { - "cell_type": "markdown", - "metadata": { - "tags": [] - }, - "source": [ - "## Load the available version of the participant features\n", - "\n", - "We have forked Christian's repo to `etcbc/participants`, so make sure to clone it to your computer:\n", - "\n", - "```\n", - "cd ~/github/etcbc\n", - "git clone https://github.com/ETCBC/participants\n", - "```" - ] - }, - { - "cell_type": "code", - "execution_count": 6, - "metadata": {}, - "outputs": [], - "source": [ - "LOCATION = \"data:~/github/etcbc/participants/actor/tf\"" - ] - }, - { - "cell_type": "markdown", - "metadata": { - "tags": [] - }, - "source": [ - "Now we can load the actor features for version `c`." - ] - }, - { - "cell_type": "code", - "execution_count": 7, - "metadata": {}, - "outputs": [ - { - "name": "stdout", - "output_type": "stream", - "text": [ - "This is Text-Fabric 9.2.3\n", - "Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html\n", - "\n", - "3 features found and 0 ignored\n", - " 0.00s Not all of the warp features otype and oslots are present in\n", - "~/github/etcbc/participants/actor/tf/c\n", - " 0.00s Only the Feature and Edge APIs will be enabled\n", - " 0.00s Warp feature \"otext\" not found. Working without Text-API\n", - "\n" - ] - }, - { - "data": { - "text/html": [ - "Text-Fabric: Text-Fabric API 9.2.3, no app configured
Data: ~/github/etcbc/participants/actor/tf/c
Features:
\n", - "
TF dataset (unspecified)\n", - "
\n", - "\n", - "
\n", - "
\n", - "actor\n", - "
\n", - "
str
\n", - "
\n", - " Participant references for words, subphrases and phrases. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
coreVersion:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2020-05-11T13:34:09Z
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "prs_actor\n", - "
\n", - "
str
\n", - "
\n", - " Participant references for pronominal suffixes. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
coreVersion:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2020-05-11T13:34:13Z
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "coref\n", - "
\n", - "
none
\n", - "
\n", - " Edges to co-referring actors on chapter-level. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
coreVersion:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2020-05-11T13:34:16Z
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - } - ], - "source": [ - "P = use(LOCATION, version=\"c\")" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "By clicking the triangles you can find more information about these features." - ] - }, - { - "cell_type": "markdown", - "metadata": { - "tags": [] - }, - "source": [ - "## Upgrade the participant features\n", - "\n", - "We are going to upgrade the participant features from version `c` to version `2021`.\n", - "\n", - "For that, we use [tf.dataset.nodemaps.Versions](https://annotation.github.io/text-fabric/tf/dataset/nodemaps.html#tf.dataset.nodemaps.Versions).\n", - "\n", - "We initialize the Versions object with two text-fabric api objects:" - ] - }, - { - "cell_type": "code", - "execution_count": 8, - "metadata": {}, - "outputs": [], - "source": [ - "apis = {\"2021\": N.api, \"c\": P.api}\n", - "\n", - "V = Versions(apis, \"c\", \"2021\")" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Finally we migrate the features from \"c\" to \"2021\" and save them in the correct location.\n", - "\n", - "We skip the `otext` feature, since it is a special config feature, not a data feature made by Christian." 
- ] - }, - { - "cell_type": "code", - "execution_count": 9, - "metadata": { - "tags": [] - }, - "outputs": [ - { - "name": "stdout", - "output_type": "stream", - "text": [ - " 0.00s Exporting 2 node and 1 edge and 0 config features to data:~/github/etcbc/participants/actor/tf/2021:\n", - " | 0.00s T actor to data:~/github/etcbc/participants/actor/tf/2021\n", - " | 0.00s T prs_actor to data:~/github/etcbc/participants/actor/tf/2021\n", - " | 0.05s T coref to data:~/github/etcbc/participants/actor/tf/2021\n", - " 0.06s Exported 2 node features and 1 edge features and 0 config features to data:~/github/etcbc/participants/actor/tf/2021\n" - ] - } - ], - "source": [ - "V.migrateFeatures((\"actor\", \"coref\", \"prs_actor\"), location=LOCATION)" - ] - }, - { - "cell_type": "markdown", - "metadata": { - "tags": [] - }, - "source": [ - "## Load the upgraded module\n", - "\n", - "Now we are in a position that we can load version 2021 of the BHSA together with the migrated module of participant features.\n", - "Note that we we point Text-Fabric to the forked repo (`etcbc` instead of `ch-jensen`) and then to\n", - "our local clone (`:clone`)." 
- ] - }, - { - "cell_type": "code", - "execution_count": 10, - "metadata": {}, - "outputs": [ - { - "data": { - "text/html": [ - "TF-app: ~/text-fabric-data/etcbc/bhsa/app" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/etcbc/bhsa/tf/2021" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/github/etcbc/participants/actor/tf/2021" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/etcbc/phono/tf/2021" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/etcbc/parallels/tf/2021" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "This is Text-Fabric 9.2.3\n", - "Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html\n", - "\n", - "125 features found and 0 ignored\n" - ] - }, - { - "data": { - "text/html": [ - "Text-Fabric: Text-Fabric API 9.2.3, etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
\n", - "
Parallel Passages\n", - "
\n", - "\n", - "
\n", - "
\n", - "crossref\n", - "
\n", - "
int
\n", - "
\n", - " 🆗 links between similar passages\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
BHSA Data: Constantijn Sikkel; Parallels Notebook: Dirk Roorda, Martijn Naaijer
\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:40:46Z
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
Parallels notebook, see https://github.com/ETCBC/parallels
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "\n", - "
BHSA = Biblia Hebraica Stuttgartensia Amstelodamensis\n", - "
\n", - "\n", - "
\n", - "
\n", - "book\n", - "
\n", - "
str
\n", - "
\n", - " ✅ book name in Latin (Genesis; Numeri; Reges1; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:55Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "book@ll\n", - "
\n", - "
str
\n", - "
\n", - " ✅ book name in amharic (ኣማርኛ)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:20:27Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
language:
\n", - "
ኣማርኛ
\n", - "
\n", - "\n", - "
\n", - "
languageCode:
\n", - "
am
\n", - "
\n", - "\n", - "
\n", - "
languageEnglish:
\n", - "
amharic
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
book names from wikipedia and other sources
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "chapter\n", - "
\n", - "
int
\n", - "
\n", - " ✅ chapter number (1; 2; 3; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:55Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "code\n", - "
\n", - "
int
\n", - "
\n", - " ✅ identifier of a clause atom relationship (0; 74; 367; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:56Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "det\n", - "
\n", - "
str
\n", - "
\n", - " ✅ determinedness of phrase(atom) (det; und; NA.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:56Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "domain\n", - "
\n", - "
str
\n", - "
\n", - " ✅ text type of clause (? (Unknown); N (narrative); D (discursive); Q (Quotation).)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:57Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "freq_lex\n", - "
\n", - "
int
\n", - "
\n", - " ✅ frequency of lexemes\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:24:45Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
computed on the basis of the ETCBC core set of features
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "function\n", - "
\n", - "
str
\n", - "
\n", - " ✅ syntactic function of phrase (Cmpl; Objc; Pred; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:57Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "g_cons\n", - "
\n", - "
str
\n", - "
\n", - " ✅ word consonantal-transliterated (B R>CJT BR> >LHJM ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:57Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "g_cons_utf8\n", - "
\n", - "
str
\n", - "
\n", - " ✅ word consonantal-Hebrew (ב ראשׁית ברא אלהים)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:58Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "g_lex\n", - "
\n", - "
str
\n", - "
\n", - " ✅ lexeme pointed-transliterated (B.:- R;>CIJT B.@R@> >:ELOH ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:58Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "g_lex_utf8\n", - "
\n", - "
str
\n", - "
\n", - " ✅ lexeme pointed-Hebrew (בְּ רֵאשִׁית בָּרָא אֱלֹה)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:59Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "g_word\n", - "
\n", - "
str
\n", - "
\n", - " ✅ word pointed-transliterated (B.:- R;>CI73JT B.@R@74> >:ELOHI92JM)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:04Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "g_word_utf8\n", - "
\n", - "
str
\n", - "
\n", - " ✅ word pointed-Hebrew (בְּ רֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:04Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "gloss\n", - "
\n", - "
str
\n", - "
\n", - " 🆗 english translation of lexeme (beginning create god(s))\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:13Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "gn\n", - "
\n", - "
str
\n", - "
\n", - " ✅ grammatical gender (m; f; NA; unknown.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:05Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "label\n", - "
\n", - "
str
\n", - "
\n", - " ✅ (half-)verse label (half verses: A; B; C; verses: GEN 01,02)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:06Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "language\n", - "
\n", - "
str
\n", - "
\n", - " ✅ of word or lexeme (Hebrew; Aramaic.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:13Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "lex\n", - "
\n", - "
str
\n", - "
\n", - " ✅ lexeme consonantal-transliterated (B R>CJT/ BR>[ >LHJM/)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:14Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "lex_utf8\n", - "
\n", - "
str
\n", - "
\n", - " ✅ lexeme consonantal-Hebrew (ב ראשׁית֜ ברא אלהים֜)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "ls\n", - "
\n", - "
str
\n", - "
\n", - " ✅ lexical set, subclassification of part-of-speech (card; ques; mult)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "nametype\n", - "
\n", - "
str
\n", - "
\n", - " ⚠️ named entity type (pers; mens; gens; topo; ppde.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "nme\n", - "
\n", - "
str
\n", - "
\n", - " ✅ nominal ending consonantal-transliterated (absent; n/a; JM, ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:08Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "nu\n", - "
\n", - "
str
\n", - "
\n", - " ✅ grammatical number (sg; du; pl; NA; unknown.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:08Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "number\n", - "
\n", - "
int
\n", - "
\n", - " ✅ sequence number of an object within its context\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:09Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "otype\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "pargr\n", - "
\n", - "
str
\n", - "
\n", - " 🆗 hierarchical paragraph number (1; 1.2; 1.2.3.4; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:22:50Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional paragraph file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "pdp\n", - "
\n", - "
str
\n", - "
\n", - " ✅ phrase dependent part-of-speech (art; verb; subs; nmpr, ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:10Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "pfm\n", - "
\n", - "
str
\n", - "
\n", - " ✅ preformative consonantal-transliterated (absent; n/a; J, ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:11Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "prs\n", - "
\n", - "
str
\n", - "
\n", - " ✅ pronominal suffix consonantal-transliterated (absent; n/a; W; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:11Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "prs_gn\n", - "
\n", - "
str
\n", - "
\n", - " ✅ pronominal suffix gender (m; f; NA; unknown.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:11Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "prs_nu\n", - "
\n", - "
str
\n", - "
\n", - " ✅ pronominal suffix number (sg; du; pl; NA; unknown.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:12Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "prs_ps\n", - "
\n", - "
str
\n", - "
\n", - " ✅ pronominal suffix person (p1; p2; p3; NA; unknown.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:12Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "ps\n", - "
\n", - "
str
\n", - "
\n", - " ✅ grammatical person (p1; p2; p3; NA; unknown.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:12Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "qere\n", - "
\n", - "
str
\n", - "
\n", - " ✅ word pointed-transliterated masoretic reading correction\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:23:29Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional ketiv/qere file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "qere_trailer\n", - "
\n", - "
str
\n", - "
\n", - " ✅ interword material -pointed-transliterated (Masoretic correction)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:23:29Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional ketiv/qere file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "qere_trailer_utf8\n", - "
\n", - "
str
\n", - "
\n", - " ✅ interword material -pointed-transliterated (Masoretic correction)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:23:29Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional ketiv/qere file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "qere_utf8\n", - "
\n", - "
str
\n", - "
\n", - " ✅ word pointed-Hebrew masoretic reading correction\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:23:29Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional ketiv/qere file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "rank_lex\n", - "
\n", - "
int
\n", - "
\n", - " ✅ ranking of lexemes based on frequency\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:24:46Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
computed on the basis of the ETCBC core set of features
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "rela\n", - "
\n", - "
str
\n", - "
\n", - " ✅ linguistic relation between clause/(sub)phrase(atom) (ADJ; MOD; ATR; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:13Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "sp\n", - "
\n", - "
str
\n", - "
\n", - " ✅ part-of-speech (art; verb; subs; nmpr, ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:16Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "st\n", - "
\n", - "
str
\n", - "
\n", - " ✅ state of a noun (a (absolute); c (construct); e (emphatic).)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:14Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "tab\n", - "
\n", - "
int
\n", - "
\n", - " ✅ clause atom: its level in the linguistic embedding\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:16Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "trailer\n", - "
\n", - "
str
\n", - "
\n", - " ✅ interword material pointed-transliterated (& 00 05 00_P ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:01Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "trailer_utf8\n", - "
\n", - "
str
\n", - "
\n", - " ✅ interword material pointed-Hebrew (־ ׃)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:01Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "txt\n", - "
\n", - "
str
\n", - "
\n", - " ✅ text type of clause and surrounding (repetition of ? N D Q as in feature domain)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:16Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "typ\n", - "
\n", - "
str
\n", - "
\n", - " ✅ clause/phrase(atom) type (VP; NP; Ellp; Ptcp; WayX)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:16Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "uvf\n", - "
\n", - "
str
\n", - "
\n", - " ✅ univalent final consonant consonantal-transliterated (absent; N; J; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:17Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "vbe\n", - "
\n", - "
str
\n", - "
\n", - " ✅ verbal ending consonantal-transliterated (n/a; W; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:17Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "vbs\n", - "
\n", - "
str
\n", - "
\n", - " ✅ root formation consonantal-transliterated (absent; n/a; H; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:17Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "verse\n", - "
\n", - "
int
\n", - "
\n", - " ✅ verse number\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:18Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "voc_lex\n", - "
\n", - "
str
\n", - "
\n", - " ✅ vocalized lexeme pointed-transliterated (B.: R;>CIJT BR> >:ELOHIJM)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:16Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "voc_lex_utf8\n", - "
\n", - "
str
\n", - "
\n", - " ✅ vocalized lexeme pointed-Hebrew (בְּ רֵאשִׁית ברא אֱלֹהִים)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:17Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "vs\n", - "
\n", - "
str
\n", - "
\n", - " ✅ verbal stem (qal; piel; hif; apel; pael)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:18Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "vt\n", - "
\n", - "
str
\n", - "
\n", - " ✅ verbal tense (perf; impv; wayq; infc)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:18Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "mother\n", - "
\n", - "
none
\n", - "
\n", - " ✅ linguistic dependency between textual objects\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:22Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "oslots\n", - "
\n", - "
none
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:17Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "\n", - "
etcbc/participants/actor/tf\n", - "
\n", - "\n", - "
\n", - "
\n", - "actor\n", - "
\n", - "
str
\n", - "
\n", - " Participant references for words, subphrases and phrases. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
coreVersion:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-16T11:10:24Z
\n", - "
\n", - "\n", - "
\n", - "
upgraded:
\n", - "
‼️ from version c to 2021
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "prs_actor\n", - "
\n", - "
str
\n", - "
\n", - " Participant references for pronominal suffixes. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
coreVersion:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-16T11:10:24Z
\n", - "
\n", - "\n", - "
\n", - "
upgraded:
\n", - "
‼️ from version c to 2021
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "coref\n", - "
\n", - "
none
\n", - "
\n", - " Edges to co-referring actors on chapter-level. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
coreVersion:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-16T11:10:24Z
\n", - "
\n", - "\n", - "
\n", - "
upgraded:
\n", - "
‼️ from version c to 2021
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "\n", - "
Phonetic Transcriptions\n", - "
\n", - "\n", - "
\n", - "
\n", - "phono\n", - "
\n", - "
str
\n", - "
\n", - " 🆗 phonological transcription (bᵊ rēšˌîṯ bārˈā ʔᵉlōhˈîm)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
BHSA Data: Constantijn Sikkel; Phono Notebook: Dirk Roorda
\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:25:55Z
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
computed by the phono notebook, see https://github.com/ETCBC/phono
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "phono_trailer\n", - "
\n", - "
str
\n", - "
\n", - " 🆗 interword material in phonological transcription\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
BHSA Data: Constantijn Sikkel; Phono Notebook: Dirk Roorda
\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:25:55Z
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
computed by the phono notebook, see https://github.com/ETCBC/phono
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "\n", + "\n" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + } + ], + "source": [ + "N = use(\"etcbc/bhsa\")" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "tags": [] + }, + "source": [ + "## Load the available version of the participant features\n", + "\n", + "We have forked Christian's repo to `etcbc/participants`, so make sure to clone it to your computer:\n", + "\n", + "```\n", + "cd ~/github/etcbc\n", + "git clone https://github.com/ETCBC/participants\n", + "```" + ] + }, + { + "cell_type": "code", + "execution_count": 4, + "metadata": {}, + "outputs": [], + "source": [ + "LOCATION = \"data:~/github/etcbc/participants/actor/tf\"" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "tags": [] + }, + "source": [ + "Now we can load the actor features for version `c`." + ] + }, + { + "cell_type": "code", + "execution_count": 5, + "metadata": {}, + "outputs": [ + { + "data": { + "text/html": [ + "Text-Fabric: Text-Fabric API 10.2.0, no app configured
Data: ~/github/etcbc/participants/actor/tf/c
Features:
\n", + "
TF dataset (unspecified)\n", + "
\n", + "\n", + "
\n", + "
\n", + "actor\n", + "
\n", + "
str
\n", + "\n", + " Participant references for words, subphrases and phrases. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "prs_actor\n", + "
\n", + "
str
\n", + "\n", + " Participant references for pronominal suffixes. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "coref\n", + "
\n", + "
none
\n", + "\n", + " Edges to co-referring actors on chapter-level. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + } + ], + "source": [ + "P = use(LOCATION, version=\"c\")" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "By clicking the triangles you can find more information about these features." + ] + }, + { + "cell_type": "markdown", + "metadata": { + "tags": [] + }, + "source": [ + "## Upgrade the participant features\n", + "\n", + "We are going to upgrade the participant features from version `c` to version `2021`.\n", + "\n", + "For that, we use [tf.dataset.nodemaps.Versions](https://annotation.github.io/text-fabric/tf/dataset/nodemaps.html#tf.dataset.nodemaps.Versions).\n", + "\n", + "We initialize the Versions object with two text-fabric api objects:" + ] + }, + { + "cell_type": "code", + "execution_count": 15, + "metadata": {}, + "outputs": [], + "source": [ + "apis = {\"2021\": N.api, \"c\": P.api}\n", + "\n", + "V = Versions(apis, \"c\", \"2021\")" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Finally we migrate the features from \"c\" to \"2021\" and save them in the correct location.\n", + "\n", + "We skip the `otext` feature, since it is a special config feature, not a data feature made by Christian." + ] + }, + { + "cell_type": "code", + "execution_count": 19, + "metadata": { + "tags": [] + }, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + " 49s start migrating\n", + " 0.03s Done\n" + ] + } + ], + "source": [ + "V.migrateFeatures((\"actor\", \"coref\", \"prs_actor\"), location=LOCATION)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Here it is handy to make the migration a bit more verbose. 
We do it again:" + ] + }, + { + "cell_type": "code", + "execution_count": 20, + "metadata": { + "tags": [] + }, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + " 57s start migrating\n", + " 0.32s All additional features loaded - for details use TF.isLoaded()\n", + " 0.32s Mapping actor (node)\n", + " 0.33s Mapping coref (edge)\n", + " 0.40s Mapping prs_actor (node)\n", + " 0.00s Exporting 2 node and 1 edge and 0 config features to data:~/github/etcbc/participants/actor/tf/2021:\n", + " | 0.00s T actor to data:~/github/etcbc/participants/actor/tf/2021\n", + " | 0.00s T prs_actor to data:~/github/etcbc/participants/actor/tf/2021\n", + " | 0.03s T coref to data:~/github/etcbc/participants/actor/tf/2021\n", + " 0.03s Exported 2 node features and 1 edge features and 0 config features to data:~/github/etcbc/participants/actor/tf/2021\n", + " 0.03s Done\n" + ] + } + ], + "source": [ + "V.migrateFeatures((\"actor\", \"coref\", \"prs_actor\"), location=LOCATION, silent=\"auto\")" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "tags": [] + }, + "source": [ + "## Load the upgraded module\n", + "\n", + "Now we are in a position that we can load version 2021 of the BHSA together with the migrated module of participant features.\n", + "Note that we point Text-Fabric to the forked repo (`etcbc` instead of `ch-jensen`) and then to\n", + "our local clone (`:clone`).\n", + "\n", + "We increase the verbosity, in order to display more metadata of the features."
+ ] + }, + { + "cell_type": "code", + "execution_count": 23, + "metadata": {}, + "outputs": [ + { + "data": { + "text/html": [ + "TF-app: ~/text-fabric-data/github/etcbc/bhsa/app" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/etcbc/bhsa/tf/2021" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/github/etcbc/participants/actor/tf/2021" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/etcbc/phono/tf/2021" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/etcbc/parallels/tf/2021" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "name": "stdout", + "output_type": "stream", + "text": [ + "This is Text-Fabric 10.2.0\n", + "Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html\n", + "\n", + "125 features found and 0 ignored\n", + " 0.67s Dataset without structure sections in otext:no structure functions in the T-API\n", + " 2.18s All features loaded/computed - for details use TF.isLoaded()\n", + " 1.48s All additional features loaded - for details use TF.isLoaded()\n" + ] + }, + { + "data": { + "text/html": [ + "Text-Fabric: Text-Fabric API 10.2.0, etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
\n", + "
Parallel Passages\n", + "
\n", + "\n", + "
\n", + "
\n", + "crossref\n", + "
\n", + "
int
\n", + "\n", + "
\n", + " 🆗 links between similar passages\n", + "
\n", + "\n", + "
\n", + "
author:
\n", + "
BHSA Data: Constantijn Sikkel; Parallels Notebook: Dirk Roorda, Martijn Naaijer
\n", + "
\n", + "\n", + "
\n", + "
coreData:
\n", + "
BHSA
\n", + "
\n", + "\n", + "
\n", + "
dateWritten:
\n", + "
2021-12-09T14:40:46Z
\n", + "
\n", + "\n", + "
\n", + "
provenance:
\n", + "
Parallels notebook, see https://github.com/ETCBC/parallels
\n", + "
\n", + "\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
BHSA = Biblia Hebraica Stuttgartensia Amstelodamensis\n", + "
\n", + "\n", + "
\n", + "
\n", + "book\n", + "
\n", + "
str
\n", + "\n", + "
\n", + " ✅ book name in Latin (Genesis; Numeri; Reges1; ...)\n", + "
\n", + "\n", + "
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", + "\n", + "
\n", + "
dataset:
\n", + "
BHSA
\n", + "
\n", + "\n", + "
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "\n", + "
\n", + "
dateWritten:
\n", + "
2021-12-09T14:17:55Z
\n", + "
\n", + "\n", + "
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "book@ll\n", + "
\n", + "
str
\n", + "\n", + "
\n", + " ✅ book name in amharic (ኣማርኛ)\n", + "
\n", + "\n", + "
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", + "\n", + "
\n", + "
dataset:
\n", + "
BHSA
\n", + "
\n", + "\n", + "
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "\n", + "
\n", + "
dateWritten:
\n", + "
2021-12-09T14:20:27Z
\n", + "
\n", + "\n", + "
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
encoders:
\n", + "
Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
language:
\n", + "
ኣማርኛ
\n", + "
\n", + "\n", + "
\n", + "
languageCode:
\n", + "
am
\n", + "
\n", + "\n", + "
\n", + "
languageEnglish:
\n", + "
amharic
\n", + "
\n", + "\n", + "
\n", + "
provenance:
\n", + "
book names from wikipedia and other sources
\n", + "
\n", + "\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "chapter\n", + "
\n", + "
int
\n", + "\n", + "
\n", + " ✅ chapter number (1; 2; 3; ...)\n", + "
\n", + "\n", + "
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", + "\n", + "
\n", + "
dataset:
\n", + "
BHSA
\n", + "
\n", + "\n", + "
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "\n", + "
\n", + "
dateWritten:
\n", + "
2021-12-09T14:17:55Z
\n", + "
\n", + "\n", + "
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "code\n", + "
\n", + "
int
\n", + "\n", + "
\n", + " ✅ identifier of a clause atom relationship (0; 74; 367; ...)\n", + "
\n", + "\n", + "
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", + "\n", + "
\n", + "
dataset:
\n", + "
BHSA
\n", + "
\n", + "\n", + "
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "\n", + "
\n", + "
dateWritten:
\n", + "
2021-12-09T14:17:56Z
\n", + "
\n", + "\n", + "
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "det\n", + "
\n", + "
str
\n", + "\n", + "
\n", + " ✅ determinedness of phrase(atom) (det; und; NA.)\n", + "
\n", + "\n", + "
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", + "\n", + "
\n", + "
dataset:
\n", + "
BHSA
\n", + "
\n", + "\n", + "
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "\n", + "
\n", + "
dateWritten:
\n", + "
2021-12-09T14:17:56Z
\n", + "
\n", + "\n", + "
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "domain\n", + "
\n", + "
str
\n", + "\n", + "
\n", + " ✅ text type of clause (? (Unknown); N (narrative); D (discursive); Q (Quotation).)\n", + "
\n", + "\n", + "
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", + "\n", + "
\n", + "
dataset:
\n", + "
BHSA
\n", + "
\n", + "\n", + "
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "\n", + "
\n", + "
dateWritten:
\n", + "
2021-12-09T14:17:57Z
\n", + "
\n", + "\n", + "
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "freq_lex\n", + "
\n", + "
int
\n", + "\n", + "
\n", + " ✅ frequency of lexemes\n", + "
\n", + "\n", + "
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", + "\n", + "
\n", + "
dataset:
\n", + "
BHSA
\n", + "
\n", + "\n", + "
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "\n", + "
\n", + "
dateWritten:
\n", + "
2021-12-09T14:24:45Z
\n", + "
\n", + "\n", + "
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
provenance:
\n", + "
computed on the basis of the ETCBC core set of features
\n", + "
\n", + "\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "function\n", + "
\n", + "
str
\n", + "\n", + "
\n", + " ✅ syntactic function of phrase (Cmpl; Objc; Pred; ...)\n", + "
\n", + "\n", + "
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", + "\n", + "
\n", + "
dataset:
\n", + "
BHSA
\n", + "
\n", + "\n", + "
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "\n", + "
\n", + "
dateWritten:
\n", + "
2021-12-09T14:17:57Z
\n", + "
\n", + "\n", + "
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "g_cons\n", + "
\n", + "
str
\n", + "\n", + "
\n", + " ✅ word consonantal-transliterated (B R>CJT BR> >LHJM ...)\n", + "
\n", + "\n", + "
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", + "\n", + "
\n", + "
dataset:
\n", + "
BHSA
\n", + "
\n", + "\n", + "
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "\n", + "
\n", + "
dateWritten:
\n", + "
2021-12-09T14:17:57Z
\n", + "
\n", + "\n", + "
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", "\n", - "/* PROVENANCE */\n", + "
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", "\n", - "div.prov {\n", - "\tmargin: 40px;\n", - "\tpadding: 20px;\n", - "\tborder: 2px solid var(--fog-rim);\n", - "}\n", - "div.pline {\n", - "\tdisplay: flex;\n", - "\tflex-flow: row nowrap;\n", - "\tjustify-content: stretch;\n", - "\talign-items: baseline;\n", - "}\n", - "div.p2line {\n", - "\tmargin-left: 2em;\n", - "\tdisplay: flex;\n", - "\tflex-flow: row nowrap;\n", - "\tjustify-content: stretch;\n", - "\talign-items: baseline;\n", - "}\n", - "div.psline {\n", - "\tdisplay: flex;\n", - "\tflex-flow: row nowrap;\n", - "\tjustify-content: stretch;\n", - "\talign-items: baseline;\n", - "\tbackground-color: var(--gold-mist-back);\n", - "}\n", - "div.pname {\n", - "\tflex: 0 0 5rem;\n", - "\tfont-weight: bold;\n", - "}\n", - "div.pval {\n", - " flex: 1 1 auto;\n", - "}\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", "\n", - "/* KEYBOARD */\n", - ".ccoff {\n", - " background-color: inherit;\n", - "}\n", - ".ccon {\n", - " background-color: yellow ! important;\n", - "}\n", - "/* TF header */\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", + "
\n", "\n", - "summary {\n", - " /* needed to override the normalize.less\n", - " * in the classical jupyter notebook\n", - " */\n", - " display: list-item ! important;\n", - "}\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "g_cons_utf8\n", + "
\n", + "
str
\n", + "\n", + "
\n", + " ✅ word consonantal-Hebrew (ב ראשׁית ברא אלהים)\n", + "
\n", + "\n", + "
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", + "\n", + "
\n", + "
dataset:
\n", + "
BHSA
\n", + "
\n", + "\n", + "
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "\n", + "
\n", + "
dateWritten:
\n", + "
2021-12-09T14:17:58Z
\n", + "
\n", + "\n", + "
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "g_lex\n", + "
\n", + "
str
\n", + "\n", + "
\n", + " ✅ lexeme pointed-transliterated (B.:- R;>CIJT B.@R@> >:ELOH ...)\n", + "
\n", + "\n", + "
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", + "\n", + "
\n", + "
dataset:
\n", + "
BHSA
\n", + "
\n", + "\n", + "
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "\n", + "
\n", + "
dateWritten:
\n", + "
2021-12-09T14:17:58Z
\n", + "
\n", + "\n", + "
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "g_lex_utf8\n", + "
\n", + "
str
\n", + "\n", + "
\n", + " ✅ lexeme pointed-Hebrew (בְּ רֵאשִׁית בָּרָא אֱלֹה)\n", + "
\n", + "\n", + "
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", + "\n", + "
\n", + "
dataset:
\n", + "
BHSA
\n", + "
\n", + "\n", + "
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "\n", + "
\n", + "
dateWritten:
\n", + "
2021-12-09T14:17:59Z
\n", + "
\n", + "\n", + "
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "g_word\n", + "
\n", + "
str
\n", + "\n", + "
\n", + " ✅ word pointed-transliterated (B.:- R;>CI73JT B.@R@74> >:ELOHI92JM)\n", + "
\n", + "\n", + "
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", "\n", - ".fcorpus {\n", - " display: flex;\n", - " flex-flow: column nowrap;\n", - " justify-content: flex-start;\n", - " align-items: flex-start;\n", - " align-content: flex-start;\n", - "}\n", - ".frow {\n", - " display: flex;\n", - " flex-flow: row nowrap;\n", - " justify-content: flex-start;\n", - " align-items: flex-start;\n", - " align-content: flex-start;\n", - "}\n", - ".fmeta {\n", - " display: flex;\n", - " flex-flow: column nowrap;\n", - " justify-content: flex-start;\n", - " align-items: flex-start;\n", - " align-content: flex-start;\n", - "}\n", - ".fmetarow {\n", - " display: flex;\n", - " flex-flow: row nowrap;\n", - " justify-content: flex-start;\n", - " align-items: flex-start;\n", - " align-content: flex-start;\n", - "}\n", - ".fmetakey {\n", - " min-width: 10rem;\n", - " font-family: monospace;\n", - "}\n", - ".fnamecat {\n", - " min-width: 10rem;\n", - "}\n", - ".fnamecat.edge {\n", - " font-weight: bold;\n", - " font-style: italic;\n", - "}\n", - ".fmono {\n", - " font-family: monospace;\n", - "}\n", + "
\n", + "
dataset:
\n", + "
BHSA
\n", + "
\n", "\n", - ":root {\n", - "\t--node: hsla(120, 100%, 20%, 1.0 );\n", - "\t--label: hsla( 0, 100%, 20%, 1.0 );\n", - "\t--tfsechead: hsla( 0, 100%, 25%, 1.0 );\n", - "\t--structure: hsla(120, 100%, 25%, 1.0 );\n", - "\t--features: hsla( 0, 0%, 30%, 1.0 );\n", - " --text-color: hsla( 60, 80%, 10%, 1.0 );\n", - " --lex-color: hsla(220, 90%, 60%, 1.0 );\n", - " --meta-color: hsla( 0, 0%, 90%, 0.7 );\n", - " --meta-width: 3px;\n", - " --border-color-nul: hsla( 0, 0%, 90%, 0.5 );\n", - " --border-color0: hsla( 0, 0%, 90%, 0.9 );\n", - " --border-color1: hsla( 0, 0%, 80%, 0.9 );\n", - " --border-color2: hsla( 0, 0%, 70%, 0.9 );\n", - " --border-color3: hsla( 0, 0%, 80%, 0.8 );\n", - " --border-color4: hsla( 0, 0%, 60%, 0.9 );\n", - " --border-width-nul: 2px;\n", - " --border-width0: 2px;\n", - " --border-width1: 3px;\n", - " --border-width2: 4px;\n", - " --border-width3: 6px;\n", - " --border-width4: 5px;\n", - " --border-width-plain: 2px;\n", - "}\n", - ".hl {\n", - " background-color: var(--hl-strong);\n", - "}\n", - "span.hl {\n", - "\tbackground-color: var(--hl-strong);\n", - "\tborder-width: 0;\n", - "\tborder-radius: 2px;\n", - "\tborder-style: solid;\n", - "}\n", - "div.contnr.hl,div.lbl.hl {\n", - " background-color: var(--hl-strong);\n", - "}\n", - "div.contnr.hl {\n", - " border-color: var(--hl-rim) ! important;\n", - "\tborder-width: 4px ! important;\n", - "}\n", + "
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", "\n", - "span.hlbx {\n", - "\tborder-color: var(--hl-rim);\n", - "\tborder-width: 4px ! important;\n", - "\tborder-style: solid;\n", - "\tborder-radius: 6px;\n", - " padding: 4px;\n", - " margin: 4px;\n", - "}\n", + "
\n", + "
dateWritten:
\n", + "
2021-12-09T14:18:04Z
\n", + "
\n", "\n", - "span.plain {\n", - " display: inline-block;\n", - " white-space: pre-wrap;\n", - "}\n", + "
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", "\n", - ":root {\n", - "\t--hl-strong: hsla( 60, 100%, 70%, 0.9 );\n", - "\t--hl-rim: hsla( 55, 80%, 50%, 1.0 );\n", - "}\n", - "" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ + "
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", "\n", - "\n" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - } - ], - "source": [ - "N = use(\"etcbc/bhsa\", mod=\"etcbc/participants/actor/tf:clone\")" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "If you click the triangles and navigate to the full metadata of the participants features,\n", - "you see a line\n", - "\n", - "```\n", - "upgraded: ‼️ from version c to 2021\n", - "```" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "## Checks\n", - "\n", - "Let's do a few checks to see how well the upgrade process has worked.\n", - "\n", - "First we load the `c` version of the BHSA and Christian's original features." - ] - }, - { - "cell_type": "code", - "execution_count": 11, - "metadata": {}, - "outputs": [ - { - "data": { - "text/html": [ - "TF-app: ~/text-fabric-data/etcbc/bhsa/app" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/etcbc/bhsa/tf/c" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/ch-jensen/participants/actor/tf/c" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/etcbc/phono/tf/c" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/etcbc/parallels/tf/c" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "This is Text-Fabric 9.2.3\n", - "Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html\n", - "\n", - "123 features found and 0 ignored\n" - ] - }, - { - "data": { - "text/html": [ - "Text-Fabric: Text-Fabric API 9.2.3, 
etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
\n", - "
Parallel Passages\n", - "
\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", "\n", "
\n", - "
\n", - "crossref\n", + "
\n", + "g_word_utf8\n", "
\n", - "
int
\n", + "
str
\n", + "\n", "
\n", - " \n", + " ✅ word pointed-Hebrew (בְּ רֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים)\n", "
\n", "\n", "
\n", "
author:
\n", - "
BHSA Data: Constantijn Sikkel; Parallels Notebook: Dirk Roorda, Martijn Naaijer
\n", + "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", - "
coreData:
\n", + "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", - "
coreVersion:
\n", - "
_temp
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:18:08Z
\n", + "
2021-12-09T14:18:04Z
\n", "
\n", "\n", "
\n", - "
source:
\n", - "
Parallels Module
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", - "
ch-jensen/participants/actor/tf\n", - "
\n", + "
\n", "\n", "
\n", "
\n", - "actor\n", + "gloss\n", "
\n", "
str
\n", + "\n", "
\n", - " Participant references for words, subphrases and phrases. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", + " 🆗 english translation of lexeme (beginning create god(s))\n", "
\n", "\n", "
\n", - "
coreData:
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", + "\n", + "
\n", + "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", - "
coreVersion:
\n", - "
c
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", - "
2020-05-11T13:34:09Z
\n", + "
2021-12-09T14:21:13Z
\n", + "
\n", + "\n", + "
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
provenance:
\n", + "
from additional lexicon file provided by the ETCBC
\n", + "
\n", + "\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", @@ -9092,32 +2650,59 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "
\n", + "\n", "
\n", "\n", "
\n", "
\n", - "prs_actor\n", + "gn\n", "
\n", "
str
\n", + "\n", "
\n", - " Participant references for pronominal suffixes. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", + " ✅ grammatical gender (m; f; NA; unknown.)\n", "
\n", "\n", "
\n", - "
coreData:
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", + "\n", + "
\n", + "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", - "
coreVersion:
\n", - "
c
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", - "
2020-05-11T13:34:13Z
\n", + "
2021-12-09T14:18:05Z
\n", + "
\n", + "\n", + "
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", @@ -9125,56 +2710,79 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "
\n", + "\n", "
\n", "\n", "
\n", - "
\n", - "coref\n", + "
\n", + "label\n", "
\n", - "
none
\n", + "
str
\n", + "\n", "
\n", - " Edges to co-referring actors on chapter-level. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", + " ✅ (half-)verse label (half verses: A; B; C; verses: GEN 01,02)\n", "
\n", "\n", "
\n", - "
coreData:
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", + "\n", + "
\n", + "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", - "
coreVersion:
\n", - "
c
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", - "
2020-05-11T13:34:16Z
\n", + "
2021-12-09T14:18:06Z
\n", "
\n", "\n", "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
version:
\n", + "
2021
\n", + "
\n", + "\n", + "
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", - "
BHSA = Biblia Hebraica Stuttgartensia Amstelodamensis\n", - "
\n", + "
\n", "\n", "
\n", "
\n", - "book\n", + "language\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ of word or lexeme (Hebrew; Aramaic.)\n", "
\n", "\n", "
\n", @@ -9194,7 +2802,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:15Z
\n", + "
2021-12-09T14:21:13Z
\n", "
\n", "\n", "
\n", @@ -9208,8 +2816,13 @@ "
\n", "\n", "
\n", + "
provenance:
\n", + "
from additional lexicon file provided by the ETCBC
\n", + "
\n", + "\n", + "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -9222,17 +2835,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "
\n", + "\n", "\n", "\n", "
\n", "
\n", - "book@ll\n", + "lex\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ lexeme consonantal-transliterated (B R>CJT/ BR>[ >LHJM/)\n", "
\n", "\n", "
\n", @@ -9252,7 +2867,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:11:15Z
\n", + "
2021-12-09T14:21:14Z
\n", "
\n", "\n", "
\n", @@ -9262,32 +2877,82 @@ "\n", "
\n", "
encoders:
\n", - "
Dirk Roorda (TF)
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", - "
language:
\n", - "
ኣማርኛ
\n", + "
provenance:
\n", + "
from additional lexicon file provided by the ETCBC
\n", "
\n", "\n", "
\n", - "
languageCode:
\n", - "
am
\n", + "
version:
\n", + "
2021
\n", "
\n", "\n", "
\n", - "
languageEnglish:
\n", - "
amharic
\n", + "
website:
\n", + "
https://shebanq.ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "lex_utf8\n", + "
\n", + "
str
\n", + "\n", + "
\n", + " ✅ lexeme consonantal-Hebrew (ב ראשׁית֜ ברא אלהים֜)\n", + "
\n", + "\n", + "
\n", + "
author:
\n", + "
Eep Talstra Centre for Bible and Computer
\n", + "
\n", + "\n", + "
\n", + "
dataset:
\n", + "
BHSA
\n", + "
\n", + "\n", + "
\n", + "
datasetName:
\n", + "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "\n", + "
\n", + "
dateWritten:
\n", + "
2021-12-09T14:21:15Z
\n", + "
\n", + "\n", + "
\n", + "
email:
\n", + "
shebanq@ancient-data.org
\n", + "
\n", + "\n", + "
\n", + "
encoders:
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", - "
book names from wikipedia and other sources
\n", + "
from additional lexicon file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -9300,17 +2965,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "chapter\n", + "ls\n", "
\n", - "
int
\n", + "
str
\n", + "\n", "
\n", - " \n", + " ✅ lexical set, subclassification of part-of-speech (card; ques; mult)\n", "
\n", "\n", "
\n", @@ -9330,7 +2997,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:15Z
\n", + "
2021-12-09T14:21:15Z
\n", "
\n", "\n", "
\n", @@ -9344,8 +3011,13 @@ "
\n", "\n", "
\n", + "
provenance:
\n", + "
from additional lexicon file provided by the ETCBC
\n", + "
\n", + "\n", + "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -9358,17 +3030,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "code\n", + "nametype\n", "
\n", - "
int
\n", + "
str
\n", + "\n", "
\n", - " \n", + " ⚠️ named entity type (pers; mens; gens; topo; ppde.)\n", "
\n", "\n", "
\n", @@ -9388,7 +3062,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:15Z
\n", + "
2021-12-09T14:21:15Z
\n", "
\n", "\n", "
\n", @@ -9402,8 +3076,13 @@ "
\n", "\n", "
\n", + "
provenance:
\n", + "
from additional lexicon file provided by the ETCBC
\n", + "
\n", + "\n", + "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -9416,17 +3095,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "det\n", + "nme\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ nominal ending consonantal-transliterated (absent; n/a; JM, ...)\n", "
\n", "\n", "
\n", @@ -9446,7 +3127,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:15Z
\n", + "
2021-12-09T14:18:08Z
\n", "
\n", "\n", "
\n", @@ -9461,7 +3142,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -9474,17 +3155,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "domain\n", + "nu\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ grammatical number (sg; du; pl; NA; unknown.)\n", "
\n", "\n", "
\n", @@ -9504,7 +3187,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:19Z
\n", + "
2021-12-09T14:18:08Z
\n", "
\n", "\n", "
\n", @@ -9519,7 +3202,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -9532,17 +3215,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "freq_lex\n", + "number\n", "
\n", "
int
\n", + "\n", "
\n", - " \n", + " ✅ sequence number of an object within its context\n", "
\n", "\n", "
\n", @@ -9562,7 +3247,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:14:58Z
\n", + "
2021-12-09T14:18:09Z
\n", "
\n", "\n", "
\n", @@ -9572,17 +3257,12 @@ "\n", "
\n", "
encoders:
\n", - "
Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
computed addition to core set of features
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -9595,15 +3275,17 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "function\n", + "otype\n", "
\n", "
str
\n", + "\n", "
\n", " \n", "
\n", @@ -9625,7 +3307,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:19Z
\n", + "
2021-12-09T14:21:15Z
\n", "
\n", "\n", "
\n", @@ -9640,7 +3322,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -9653,17 +3335,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "g_cons\n", + "pargr\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " 🆗 hierarchical paragraph number (1; 1.2; 1.2.3.4; ...)\n", "
\n", "\n", "
\n", @@ -9683,7 +3367,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:19Z
\n", + "
2021-12-09T14:22:50Z
\n", "
\n", "\n", "
\n", @@ -9697,8 +3381,13 @@ "
\n", "\n", "
\n", + "
provenance:
\n", + "
from additional paragraph file provided by the ETCBC
\n", + "
\n", + "\n", + "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -9711,17 +3400,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "g_cons_utf8\n", + "pdp\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ phrase dependent part-of-speech (art; verb; subs; nmpr, ...)\n", "
\n", "\n", "
\n", @@ -9741,7 +3432,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:20Z
\n", + "
2021-12-09T14:18:10Z
\n", "
\n", "\n", "
\n", @@ -9756,7 +3447,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -9769,17 +3460,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "g_lex\n", + "pfm\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ preformative consonantal-transliterated (absent; n/a; J, ...)\n", "
\n", "\n", "
\n", @@ -9799,7 +3492,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:21Z
\n", + "
2021-12-09T14:18:11Z
\n", "
\n", "\n", "
\n", @@ -9814,7 +3507,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -9827,17 +3520,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "g_lex_utf8\n", + "prs\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ pronominal suffix consonantal-transliterated (absent; n/a; W; ...)\n", "
\n", "\n", "
\n", @@ -9857,7 +3552,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:22Z
\n", + "
2021-12-09T14:18:11Z
\n", "
\n", "\n", "
\n", @@ -9872,7 +3567,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -9885,17 +3580,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "g_word\n", + "prs_gn\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ pronominal suffix gender (m; f; NA; unknown.)\n", "
\n", "\n", "
\n", @@ -9915,7 +3612,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:34Z
\n", + "
2021-12-09T14:18:11Z
\n", "
\n", "\n", "
\n", @@ -9930,7 +3627,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -9943,17 +3640,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "g_word_utf8\n", + "prs_nu\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ pronominal suffix number (sg; du; pl; NA; unknown.)\n", "
\n", "\n", "
\n", @@ -9973,7 +3672,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:34Z
\n", + "
2021-12-09T14:18:12Z
\n", "
\n", "\n", "
\n", @@ -9988,7 +3687,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10001,17 +3700,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "gloss\n", + "prs_ps\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ pronominal suffix person (p1; p2; p3; NA; unknown.)\n", "
\n", "\n", "
\n", @@ -10031,7 +3732,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2019-01-31T17:40:54Z
\n", + "
2021-12-09T14:18:12Z
\n", "
\n", "\n", "
\n", @@ -10041,12 +3742,12 @@ "\n", "
\n", "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10059,17 +3760,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "gn\n", + "ps\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ grammatical person (p1; p2; p3; NA; unknown.)\n", "
\n", "\n", "
\n", @@ -10089,7 +3792,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:35Z
\n", + "
2021-12-09T14:18:12Z
\n", "
\n", "\n", "
\n", @@ -10104,7 +3807,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10117,17 +3820,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "label\n", + "qere\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ word pointed-transliterated masoretic reading correction\n", "
\n", "\n", "
\n", @@ -10147,7 +3852,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:37Z
\n", + "
2021-12-09T14:23:29Z
\n", "
\n", "\n", "
\n", @@ -10161,8 +3866,13 @@ "
\n", "\n", "
\n", + "
provenance:
\n", + "
from additional ketiv/qere file provided by the ETCBC
\n", + "
\n", + "\n", + "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10175,17 +3885,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "language\n", + "qere_trailer\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ interword material pointed-transliterated (Masoretic correction)\n", "
\n", "\n", "
\n", @@ -10205,7 +3917,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:11:51Z
\n", + "
2021-12-09T14:23:29Z
\n", "
\n", "\n", "
\n", @@ -10215,12 +3927,17 @@ "\n", "
\n", "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
provenance:
\n", + "
from additional ketiv/qere file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10233,17 +3950,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "lex\n", + "qere_trailer_utf8\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ interword material pointed-transliterated (Masoretic correction)\n", "
\n", "\n", "
\n", @@ -10263,7 +3982,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:11:53Z
\n", + "
2021-12-09T14:23:29Z
\n", "
\n", "\n", "
\n", @@ -10273,12 +3992,17 @@ "\n", "
\n", "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
provenance:
\n", + "
from additional ketiv/qere file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10291,17 +4015,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "lex_utf8\n", + "qere_utf8\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ word pointed-Hebrew masoretic reading correction\n", "
\n", "\n", "
\n", @@ -10321,7 +4047,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:11:54Z
\n", + "
2021-12-09T14:23:29Z
\n", "
\n", "\n", "
\n", @@ -10331,12 +4057,17 @@ "\n", "
\n", "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
provenance:
\n", + "
from additional ketiv/qere file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10349,17 +4080,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "ls\n", + "rank_lex\n", "
\n", - "
str
\n", + "
int
\n", + "\n", "
\n", - " \n", + " ✅ ranking of lexemes based on frequency\n", "
\n", "\n", "
\n", @@ -10379,7 +4112,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:11:55Z
\n", + "
2021-12-09T14:24:46Z
\n", "
\n", "\n", "
\n", @@ -10389,12 +4122,17 @@ "\n", "
\n", "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
provenance:
\n", + "
computed on the basis of the ETCBC core set of features
\n", "
\n", "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10407,17 +4145,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "nametype\n", + "rela\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ linguistic relation between clause/(sub)phrase(atom) (ADJ; MOD; ATR; ...)\n", "
\n", "\n", "
\n", @@ -10437,7 +4177,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2019-01-31T17:40:54Z
\n", + "
2021-12-09T14:18:13Z
\n", "
\n", "\n", "
\n", @@ -10447,12 +4187,12 @@ "\n", "
\n", "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10465,17 +4205,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "nme\n", + "sp\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ part-of-speech (art; verb; subs; nmpr, ...)\n", "
\n", "\n", "
\n", @@ -10495,7 +4237,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:41Z
\n", + "
2021-12-09T14:21:16Z
\n", "
\n", "\n", "
\n", @@ -10509,8 +4251,13 @@ "
\n", "\n", "
\n", + "
provenance:
\n", + "
from additional lexicon file provided by the ETCBC
\n", + "
\n", + "\n", + "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10523,17 +4270,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "nu\n", + "st\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ state of a noun (a (absolute); c (construct); e (emphatic).)\n", "
\n", "\n", "
\n", @@ -10553,7 +4302,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:42Z
\n", + "
2021-12-09T14:18:14Z
\n", "
\n", "\n", "
\n", @@ -10568,7 +4317,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10581,17 +4330,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "number\n", + "tab\n", "
\n", "
int
\n", + "\n", "
\n", - " \n", + " ✅ clause atom: its level in the linguistic embedding\n", "
\n", "\n", "
\n", @@ -10611,7 +4362,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:43Z
\n", + "
2021-12-09T14:18:16Z
\n", "
\n", "\n", "
\n", @@ -10626,7 +4377,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10639,17 +4390,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "otype\n", + "trailer\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ interword material pointed-transliterated (& 00 05 00_P ...)\n", "
\n", "\n", "
\n", @@ -10669,7 +4422,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:11:56Z
\n", + "
2021-12-09T14:18:01Z
\n", "
\n", "\n", "
\n", @@ -10679,12 +4432,12 @@ "\n", "
\n", "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10697,17 +4450,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "pargr\n", + "trailer_utf8\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ interword material pointed-Hebrew (־ ׃)\n", "
\n", "\n", "
\n", @@ -10727,7 +4482,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:13:35Z
\n", + "
2021-12-09T14:18:01Z
\n", "
\n", "\n", "
\n", @@ -10737,12 +4492,12 @@ "\n", "
\n", "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10755,17 +4510,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "pdp\n", + "txt\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ text type of clause and surrounding (repetition of ? N D Q as in feature domain)\n", "
\n", "\n", "
\n", @@ -10785,7 +4542,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:46Z
\n", + "
2021-12-09T14:18:16Z
\n", "
\n", "\n", "
\n", @@ -10800,7 +4557,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10813,17 +4570,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "pfm\n", + "typ\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ clause/phrase(atom) type (VP; NP; Ellp; Ptcp; WayX)\n", "
\n", "\n", "
\n", @@ -10843,7 +4602,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:46Z
\n", + "
2021-12-09T14:18:16Z
\n", "
\n", "\n", "
\n", @@ -10858,7 +4617,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10871,17 +4630,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "prs\n", + "uvf\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ univalent final consonant consonantal-transliterated (absent; N; J; ...)\n", "
\n", "\n", "
\n", @@ -10901,7 +4662,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:47Z
\n", + "
2021-12-09T14:18:17Z
\n", "
\n", "\n", "
\n", @@ -10916,7 +4677,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10929,17 +4690,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "prs_gn\n", + "vbe\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ verbal ending consonantal-transliterated (n/a; W; ...)\n", "
\n", "\n", "
\n", @@ -10959,7 +4722,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:48Z
\n", + "
2021-12-09T14:18:17Z
\n", "
\n", "\n", "
\n", @@ -10974,7 +4737,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -10987,17 +4750,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "prs_nu\n", + "vbs\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ root formation consonantal-transliterated (absent; n/a; H; ...)\n", "
\n", "\n", "
\n", @@ -11017,7 +4782,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:49Z
\n", + "
2021-12-09T14:18:17Z
\n", "
\n", "\n", "
\n", @@ -11032,7 +4797,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -11045,17 +4810,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "prs_ps\n", + "verse\n", "
\n", - "
str
\n", + "
int
\n", + "\n", "
\n", - " \n", + " ✅ verse number\n", "
\n", "\n", "
\n", @@ -11075,7 +4842,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:50Z
\n", + "
2021-12-09T14:18:18Z
\n", "
\n", "\n", "
\n", @@ -11090,7 +4857,7 @@ "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -11103,17 +4870,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "ps\n", + "voc_lex\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ vocalized lexeme pointed-transliterated (B.: R;>CIJT BR> >:ELOHIJM)\n", "
\n", "\n", "
\n", @@ -11133,7 +4902,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:50Z
\n", + "
2021-12-09T14:21:16Z
\n", "
\n", "\n", "
\n", @@ -11147,8 +4916,13 @@ "
\n", "\n", "
\n", + "
provenance:
\n", + "
from additional lexicon file provided by the ETCBC
\n", + "
\n", + "\n", + "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -11161,17 +4935,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "qere\n", + "voc_lex_utf8\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ vocalized lexeme pointed-Hebrew (בְּ רֵאשִׁית ברא אֱלֹהִים)\n", "
\n", "\n", "
\n", @@ -11191,7 +4967,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:13:50Z
\n", + "
2021-12-09T14:21:17Z
\n", "
\n", "\n", "
\n", @@ -11201,12 +4977,17 @@ "\n", "
\n", "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "\n", + "
\n", + "
provenance:
\n", + "
from additional lexicon file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -11219,17 +5000,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "qere_trailer\n", + "vs\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ verbal stem (qal; piel; hif; apel; pael)\n", "
\n", "\n", "
\n", @@ -11249,7 +5032,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:13:50Z
\n", + "
2021-12-09T14:18:18Z
\n", "
\n", "\n", "
\n", @@ -11259,12 +5042,12 @@ "\n", "
\n", "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -11277,17 +5060,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", "
\n", - "qere_trailer_utf8\n", + "vt\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " ✅ verbal tense (perf; impv; wayq; infc)\n", "
\n", "\n", "
\n", @@ -11307,7 +5092,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:13:50Z
\n", + "
2021-12-09T14:18:18Z
\n", "
\n", "\n", "
\n", @@ -11317,12 +5102,12 @@ "\n", "
\n", "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -11335,17 +5120,19 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "\n", "\n", "
\n", - "
\n", - "qere_utf8\n", + "
\n", + "mother\n", "
\n", - "
str
\n", + "
none
\n", + "\n", "
\n", - " \n", + " ✅ linguistic dependency between textual objects\n", "
\n", "\n", "
\n", @@ -11365,7 +5152,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:13:50Z
\n", + "
2021-12-09T14:18:22Z
\n", "
\n", "\n", "
\n", @@ -11375,12 +5162,12 @@ "\n", "
\n", "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -11393,15 +5180,17 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "
\n", "\n", "
\n", - "
\n", - "rank_lex\n", + "
\n", + "oslots\n", "
\n", - "
int
\n", + "
none
\n", + "\n", "
\n", " \n", "
\n", @@ -11423,7 +5212,7 @@ "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:15:00Z
\n", + "
2021-12-09T14:21:17Z
\n", "
\n", "\n", "
\n", @@ -11433,17 +5222,12 @@ "\n", "
\n", "
encoders:
\n", - "
Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
computed addition to core set of features
\n", + "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", - "
c
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -11456,57 +5240,125 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "\n", + "\n", "
\n", "\n", + " \n", + "\n", + "\n", + "
etcbc/participants/actor/tf\n", + "
\n", + "\n", "
\n", "
\n", - "rela\n", + "actor\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " Participant references for words, subphrases and phrases. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", "
\n", "\n", "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", + "
coreData:
\n", + "
BHSA
\n", "
\n", "\n", "
\n", - "
dataset:
\n", + "
coreVersion:
\n", + "
c
\n", + "
\n", + "\n", + "
\n", + "
dateWritten:
\n", + "
2021-12-16T11:10:24Z
\n", + "
\n", + "\n", + "
\n", + "
upgraded:
\n", + "
‼️ from version c to 2021
\n", + "
\n", + "\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "prs_actor\n", + "
\n", + "
str
\n", + "\n", + "
\n", + " Participant references for pronominal suffixes. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", + "
\n", + "\n", + "
\n", + "
coreData:
\n", "
BHSA
\n", "
\n", "\n", "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
coreVersion:
\n", + "
c
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", - "
2018-10-08T15:07:53Z
\n", + "
2021-12-16T11:10:24Z
\n", "
\n", "\n", "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
upgraded:
\n", + "
‼️ from version c to 2021
\n", + "
\n", + "\n", + "
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "coref\n", + "
\n", + "
none
\n", + "\n", + "
\n", + " Edges to co-referring actors on chapter-level. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", + "
\n", + "\n", + "
\n", + "
coreData:
\n", + "
BHSA
\n", "
\n", "\n", "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
coreVersion:
\n", + "
c
\n", "
\n", "\n", "
\n", - "
version:
\n", - "
c
\n", + "
dateWritten:
\n", + "
2021-12-16T11:10:24Z
\n", "
\n", "\n", "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
upgraded:
\n", + "
‼️ from version c to 2021
\n", "
\n", "\n", "
\n", @@ -11514,57 +5366,50 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "
\n", + "\n", "\n", "\n", + " \n", + "\n", + "\n", + "
Phonetic Transcriptions\n", + "
\n", + "\n", "
\n", "
\n", - "sp\n", + "phono\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " 🆗 phonological transcription (bᵊ rēšˌîṯ bārˈā ʔᵉlōhˈîm)\n", "
\n", "\n", "
\n", "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", + "
BHSA Data: Constantijn Sikkel; Phono Notebook: Dirk Roorda
\n", "
\n", "\n", "
\n", - "
dataset:
\n", + "
coreData:
\n", "
BHSA
\n", "
\n", "\n", "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", "
dateWritten:
\n", - "
2018-10-08T15:11:57Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
2021-12-09T14:25:55Z
\n", "
\n", "\n", "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
provenance:
\n", + "
computed by the phono notebook, see https://github.com/ETCBC/phono
\n", "
\n", "\n", "
\n", "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
2021
\n", "
\n", "\n", "
\n", @@ -11572,1028 +5417,1390 @@ "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "
\n", + "\n", "\n", "\n", "
\n", "
\n", - "st\n", + "phono_trailer\n", "
\n", "
str
\n", + "\n", "
\n", - " \n", + " 🆗 interword material in phonological transcription\n", "
\n", "\n", "
\n", "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", + "
BHSA Data: Constantijn Sikkel; Phono Notebook: Dirk Roorda
\n", "
\n", "\n", "
\n", - "
dataset:
\n", + "
coreData:
\n", "
BHSA
\n", "
\n", "\n", "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
dateWritten:
\n", + "
2021-12-09T14:25:55Z
\n", "
\n", "\n", "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:54Z
\n", + "
provenance:
\n", + "
computed by the phono notebook, see https://github.com/ETCBC/phono
\n", "
\n", "\n", "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
version:
\n", + "
2021
\n", "
\n", "\n", "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
writtenBy:
\n", + "
Text-Fabric
\n", + "
\n", + "\n", + "
\n", + "
\n", + "\n", "
\n", "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", + " \n", + "\n" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", + "\n" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + } + ], + "source": [ + "N = use(\"etcbc/bhsa\", mod=\"etcbc/participants/actor/tf:clone\", silent=\"verbose\")" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "If you click the triangles and navigate to the full metadata of the participants features,\n", + "you see a line\n", + "\n", + "```\n", + "upgraded: ‼️ from version c to 2021\n", + "```" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Checks\n", + "\n", + "Let's do a few checks to see how well the upgrade process has worked.\n", + "\n", + "First we load the `c` version of the BHSA and Christian's original features." + ] + }, + { + "cell_type": "code", + "execution_count": 24, + "metadata": {}, + "outputs": [ + { + "data": { + "text/html": [ + "TF-app: ~/text-fabric-data/github/etcbc/bhsa/app" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/etcbc/bhsa/tf/c" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/ch-jensen/participants/actor/tf/c" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/etcbc/phono/tf/c" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/etcbc/parallels/tf/c" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "Text-Fabric: Text-Fabric API 10.2.0, etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
\n", + "
Parallel Passages\n", + "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "crossref\n", "
\n", + "
int
\n", "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", + " \n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + "
\n", + "
\n", "\n", - " \n", - " \n", - "\n", + "
ch-jensen/participants/actor/tf\n", + "
\n", "\n", "
\n", "
\n", - "trailer\n", + "actor\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:27Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " Participant references for words, subphrases and phrases. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "trailer_utf8\n", + "prs_actor\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " Participant references for pronominal suffixes. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "coref\n", "
\n", + "
none
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:28Z
\n", - "
\n", + " Edges to co-referring actors on chapter-level. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + "
\n", + "
\n", "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", + "
BHSA = Biblia Hebraica Stuttgartensia Amstelodamensis\n", + "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "book\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "txt\n", + "book@ll\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " \n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:58Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "chapter\n", "
\n", + "
int
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " \n", "\n", - "
\n", - "
version:
\n", - "
c
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "code\n", "
\n", + "
int
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "typ\n", + "det\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " \n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:58Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "domain\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " \n", "\n", - "
\n", - "
version:
\n", - "
c
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "freq_lex\n", "
\n", + "
int
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "uvf\n", + "function\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:59Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "g_cons\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", + " \n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "g_cons_utf8\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "vbe\n", + "g_lex\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " \n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "g_lex_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:08:00Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "g_word\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", + " \n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "g_word_utf8\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "vbs\n", + "gloss\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " \n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "gn\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:08:00Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "label\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", + " \n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "language\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "verse\n", + "lex\n", "
\n", - "
int
\n", - "
\n", - " \n", - "
\n", + "
str
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " \n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "lex_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:08:01Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "ls\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", + " \n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "nametype\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "voc_lex\n", + "nme\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " \n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "nu\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2019-01-31T17:40:54Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "number\n", "
\n", + "
int
\n", "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", + " \n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "otype\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "voc_lex_utf8\n", + "pargr\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " \n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "pdp\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2019-01-31T17:40:55Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "pfm\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", + " \n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "prs\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "vs\n", + "prs_gn\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " \n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "prs_nu\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:08:01Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "prs_ps\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", - "
\n", - "
version:
\n", - "
c
\n", + "
\n", + "
\n", + "ps\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "qere\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "vt\n", + "qere_trailer\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " \n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "qere_trailer_utf8\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:08:02Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "qere_utf8\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", - "
\n", - "
version:
\n", - "
c
\n", + "
\n", + "
\n", + "rank_lex\n", "
\n", + "
int
\n", + "\n", + " \n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "rela\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", - "
\n", - "mother\n", + "
\n", + "sp\n", "
\n", - "
none
\n", - "
\n", - " \n", - "
\n", + "
str
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " \n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "st\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:08:09Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "tab\n", "
\n", + "
int
\n", + "\n", + " \n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", - "
\n", - "
version:
\n", - "
c
\n", + "
\n", + "
\n", + "trailer\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "trailer_utf8\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", - "
\n", - "oslots\n", + "
\n", + "txt\n", "
\n", - "
none
\n", - "
\n", - " \n", - "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", + "
\n", + "
\n", + "typ\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:11:57Z
\n", + "
\n", + "
\n", + "uvf\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "vbe\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
version:
\n", - "
c
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "vbs\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "
\n", + "verse\n", "
\n", + "
int
\n", "\n", - "
\n", - "
\n", + " \n", "\n", - "
Phonetic Transcriptions\n", - "
\n", + "
\n", "\n", "
\n", "
\n", - "phono\n", + "voc_lex\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
BHSA Data: Constantijn Sikkel; Phono Notebook: Dirk Roorda
\n", - "
\n", + " \n", "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
coreVersion:
\n", - "
_temp
\n", + "
\n", + "
\n", + "voc_lex_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:16:04Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
source:
\n", - "
Phono Notebook applied to BHSA Data
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "vs\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "phono_trailer\n", + "vt\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
BHSA Data: Constantijn Sikkel; Phono Notebook: Dirk Roorda
\n", + " \n", + "\n", "
\n", "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", + "
\n", + "
\n", + "mother\n", "
\n", + "
none
\n", + "\n", + " \n", "\n", - "
\n", - "
coreVersion:
\n", - "
_temp
\n", "
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:16:04Z
\n", + "
\n", + "
\n", + "oslots\n", "
\n", + "
none
\n", + "\n", + " \n", "\n", - "
\n", - "
source:
\n", - "
Phono Notebook applied to BHSA Data
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "\n", + "
Phonetic Transcriptions\n", + "
\n", + "\n", + "
\n", + "
\n", + "phono\n", + "
\n", + "
str
\n", + "\n", + " \n", + "\n", + "
\n", + "\n", + "
\n", + "
\n", + "phono_trailer\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", " \n", @@ -13035,6 +7242,14 @@ ".ccon {\n", " background-color: yellow ! important;\n", "}\n", + ".ccon,.ccoff {\n", + " padding-right: 0.1rem;\n", + " padding-left: 0.1rem;\n", + "}\n", + ".ccline {\n", + " font-size: xx-large;\n", + " font-weight: bold;\n", + "}\n", "/* TF header */\n", "\n", "summary {\n", @@ -13197,7 +7412,7 @@ }, { "cell_type": "code", - "execution_count": 12, + "execution_count": 25, "metadata": {}, "outputs": [ { @@ -13209,7 +7424,7 @@ } ], "source": [ - "N.load(\"omap@c-2021\", silent=True)\n", + "N.load(\"omap@c-2021\", silent=\"deep\")\n", "N.isLoaded(\"omap@c-2021\")\n", "\n", "hiddenTypes=\"half_verse,sentence_atom,clause,clause_atom\"\n", @@ -13229,7 +7444,7 @@ }, { "cell_type": "code", - "execution_count": 13, + "execution_count": 26, "metadata": {}, "outputs": [ { @@ -13238,7 +7453,7 @@ "{'phrase_atom', 'subphrase'}" ] }, - "execution_count": 13, + "execution_count": 26, "metadata": {}, "output_type": "execute_result" } @@ -13249,7 +7464,7 @@ }, { "cell_type": "code", - "execution_count": 14, + "execution_count": 27, "metadata": {}, "outputs": [ { @@ -13258,7 +7473,7 @@ "{'phrase_atom', 'subphrase'}" ] }, - "execution_count": 14, + "execution_count": 27, "metadata": {}, "output_type": "execute_result" } @@ -13276,7 +7491,7 @@ }, { "cell_type": "code", - "execution_count": 15, + "execution_count": 28, "metadata": {}, "outputs": [ { @@ -13327,15 +7542,15 @@ }, { "cell_type": "code", - "execution_count": 16, + "execution_count": 29, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ - " 0.18s 7 results\n", - " 0.17s 9 results\n" + " 0.09s 7 results\n", + " 0.09s 9 results\n" ] } ], @@ -13351,7 +7566,7 @@ }, { "cell_type": "code", - "execution_count": 17, + "execution_count": 30, "metadata": {}, "outputs": [ { @@ -13412,7 +7627,7 @@ }, { "cell_type": "code", - "execution_count": 18, + "execution_count": 31, "metadata": {}, "outputs": [ { @@ -13453,7 +7668,7 @@ }, { "cell_type": "code", - 
"execution_count": 19, + "execution_count": 32, "metadata": {}, "outputs": [ { @@ -13462,7 +7677,7 @@ "((1181957, None),)" ] }, - "execution_count": 19, + "execution_count": 32, "metadata": {}, "output_type": "execute_result" } @@ -13475,7 +7690,7 @@ }, { "cell_type": "code", - "execution_count": 20, + "execution_count": 33, "metadata": {}, "outputs": [ { @@ -13522,7 +7737,7 @@ }, { "cell_type": "code", - "execution_count": 21, + "execution_count": 34, "metadata": {}, "outputs": [], "source": [ @@ -13532,7 +7747,7 @@ }, { "cell_type": "code", - "execution_count": 22, + "execution_count": 35, "metadata": {}, "outputs": [], "source": [ @@ -13547,7 +7762,7 @@ }, { "cell_type": "code", - "execution_count": 23, + "execution_count": 36, "metadata": {}, "outputs": [ { @@ -13564,7 +7779,7 @@ " ('subphrase', 'phrase_atom'): 1621})" ] }, - "execution_count": 23, + "execution_count": 36, "metadata": {}, "output_type": "execute_result" } @@ -13583,7 +7798,7 @@ }, { "cell_type": "code", - "execution_count": 24, + "execution_count": 37, "metadata": {}, "outputs": [ { @@ -13591,8 +7806,8 @@ "output_type": "stream", "text": [ "word - subphrase \n", - " 0.16s 471 results\n", - " 0.16s 471 results\n", + " 0.09s 471 results\n", + " 0.08s 471 results\n", "good: 471\n", "bad : 0\n", "Good:\n", @@ -13602,8 +7817,8 @@ "----------------------------------------\n", "\n", "word - phrase_atom \n", - " 0.33s 20188 results\n", - " 0.35s 20254 results\n", + " 0.17s 20188 results\n", + " 0.16s 20254 results\n", "good: 3785\n", "bad : 16403\n", "Good:\n", @@ -13617,8 +7832,8 @@ "----------------------------------------\n", "\n", "word - word \n", - " 0.47s 19884 results\n", - " 0.49s 19884 results\n", + " 0.22s 19884 results\n", + " 0.22s 19884 results\n", "good: 19884\n", "bad : 0\n", "Good:\n", @@ -13628,8 +7843,8 @@ "----------------------------------------\n", "\n", "phrase_atom - phrase_atom \n", - " 0.35s 34215 results\n", - " 0.23s 34404 results\n", + " 0.16s 34215 results\n", + " 0.16s 
34404 results\n", "good: 745\n", "bad : 33470\n", "Good:\n", @@ -13643,8 +7858,8 @@ "----------------------------------------\n", "\n", "phrase_atom - subphrase \n", - " 0.14s 1599 results\n", - " 0.14s 1621 results\n", + " 0.06s 1599 results\n", + " 0.07s 1621 results\n", "good: 220\n", "bad : 1379\n", "Good:\n", @@ -13658,8 +7873,8 @@ "----------------------------------------\n", "\n", "subphrase - subphrase \n", - " 0.11s 1086 results\n", - " 0.06s 1086 results\n", + " 0.05s 1086 results\n", + " 0.04s 1086 results\n", "good: 1086\n", "bad : 0\n", "Good:\n", @@ -13745,7 +7960,7 @@ }, { "cell_type": "code", - "execution_count": 25, + "execution_count": 38, "metadata": {}, "outputs": [], "source": [ @@ -13758,7 +7973,7 @@ }, { "cell_type": "code", - "execution_count": 26, + "execution_count": 39, "metadata": {}, "outputs": [], "source": [ @@ -13771,7 +7986,7 @@ }, { "cell_type": "code", - "execution_count": 27, + "execution_count": 40, "metadata": {}, "outputs": [ { @@ -13808,7 +8023,7 @@ }, { "cell_type": "code", - "execution_count": 28, + "execution_count": 41, "metadata": {}, "outputs": [ { @@ -13864,7 +8079,7 @@ }, { "cell_type": "code", - "execution_count": 29, + "execution_count": 42, "metadata": {}, "outputs": [], "source": [ @@ -13877,7 +8092,7 @@ }, { "cell_type": "code", - "execution_count": 30, + "execution_count": 43, "metadata": {}, "outputs": [], "source": [ @@ -13890,7 +8105,7 @@ }, { "cell_type": "code", - "execution_count": 31, + "execution_count": 44, "metadata": {}, "outputs": [ { @@ -13914,7 +8129,7 @@ }, { "cell_type": "code", - "execution_count": 32, + "execution_count": 45, "metadata": {}, "outputs": [ { @@ -13961,7 +8176,7 @@ }, { "cell_type": "code", - "execution_count": 33, + "execution_count": 46, "metadata": {}, "outputs": [], "source": [ @@ -13974,7 +8189,7 @@ }, { "cell_type": "code", - "execution_count": 34, + "execution_count": 47, "metadata": {}, "outputs": [], "source": [ @@ -13987,7 +8202,7 @@ }, { "cell_type": "code", - 
"execution_count": 35, + "execution_count": 48, "metadata": {}, "outputs": [ { @@ -14024,7 +8239,7 @@ }, { "cell_type": "code", - "execution_count": 36, + "execution_count": 49, "metadata": {}, "outputs": [ { @@ -14139,7 +8354,7 @@ "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", - "version": "3.10.2" + "version": "3.10.4" }, "widgets": { "application/vnd.jupyter.widget-state+json": { diff --git a/tutorial/search.ipynb b/tutorial/search.ipynb index 7e69b0d1..ad2dffe5 100644 --- a/tutorial/search.ipynb +++ b/tutorial/search.ipynb @@ -115,7 +115,7 @@ { "data": { "text/html": [ - "TF-app: ~/text-fabric-data/etcbc/bhsa/app" + "TF-app: ~/text-fabric-data/github/etcbc/bhsa/app" ], "text/plain": [ "" @@ -127,7 +127,7 @@ { "data": { "text/html": [ - "data: ~/text-fabric-data/etcbc/bhsa/tf/2021" + "data: ~/text-fabric-data/github/etcbc/bhsa/tf/2021" ], "text/plain": [ "" @@ -139,7 +139,7 @@ { "data": { "text/html": [ - "data: ~/text-fabric-data/etcbc/phono/tf/2021" + "data: ~/text-fabric-data/github/etcbc/phono/tf/2021" ], "text/plain": [ "" @@ -151,7 +151,7 @@ { "data": { "text/html": [ - "data: ~/text-fabric-data/etcbc/parallels/tf/2021" + "data: ~/text-fabric-data/github/etcbc/parallels/tf/2021" ], "text/plain": [ "" @@ -160,64 +160,21 @@ "metadata": {}, "output_type": "display_data" }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "This is Text-Fabric 9.3.2\n", - "Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html\n", - "\n", - "122 features found and 0 ignored\n" - ] - }, { "data": { "text/html": [ - "Text-Fabric: Text-Fabric API 9.3.2, etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
\n", + "Text-Fabric: Text-Fabric API 10.2.4, etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
\n", "
Parallel Passages\n", "
\n", "\n", "
\n", "
\n", - "crossref\n", + "crossref\n", "
\n", "
int
\n", - "
\n", - " 🆗 links between similar passages\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
BHSA Data: Constantijn Sikkel; Parallels Notebook: Dirk Roorda, Martijn Naaijer
\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:40:46Z
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
Parallels notebook, see https://github.com/ETCBC/parallels
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " 🆗 links between similar passages\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", @@ -228,3500 +185,598 @@ "\n", "
\n", "
\n", - "book\n", + "book\n", "
\n", "
str
\n", - "
\n", - " ✅ book name in Latin (Genesis; Numeri; Reges1; ...)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ book name in Latin (Genesis; Numeri; Reges1; ...)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "book@ll\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:55Z
\n", - "
\n", + " ✅ book name in amharic (ኣማርኛ)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "chapter\n", "
\n", + "
int
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ chapter number (1; 2; 3; ...)\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "code\n", "
\n", + "
int
\n", + "\n", + " ✅ identifier of a clause atom relationship (0; 74; 367; ...)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "book@ll\n", + "det\n", "
\n", "
str
\n", - "
\n", - " ✅ book name in amharic (ኣማርኛ)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ determinedness of phrase(atom) (det; und; NA.)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "domain\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:20:27Z
\n", - "
\n", + " ✅ text type of clause (? (Unknown); N (narrative); D (discursive); Q (Quotation).)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "freq_lex\n", "
\n", + "
int
\n", "\n", - "
\n", - "
language:
\n", - "
ኣማርኛ
\n", - "
\n", + " ✅ frequency of lexemes\n", "\n", - "
\n", - "
languageCode:
\n", - "
am
\n", "
\n", "\n", - "
\n", - "
languageEnglish:
\n", - "
amharic
\n", + "
\n", + "
\n", + "function\n", "
\n", + "
str
\n", "\n", - "
\n", - "
provenance:
\n", - "
book names from wikipedia and other sources
\n", - "
\n", + " ✅ syntactic function of phrase (Cmpl; Objc; Pred; ...)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "g_cons\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ word consonantal-transliterated (B R>CJT BR> >LHJM ...)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "chapter\n", - "
\n", - "
int
\n", - "
\n", - " ✅ chapter number (1; 2; 3; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", + "g_cons_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " ✅ word consonantal-Hebrew (ב ראשׁית ברא אלהים)\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:55Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "g_lex\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " ✅ lexeme pointed-transliterated (B.:- R;>CIJT B.@R@> >:ELOH ...)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "g_lex_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ lexeme pointed-Hebrew (בְּ רֵאשִׁית בָּרָא אֱלֹה)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "code\n", - "
\n", - "
int
\n", - "
\n", - " ✅ identifier of a clause atom relationship (0; 74; 367; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", + "g_word\n", "
\n", + "
str
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " ✅ word pointed-transliterated (B.:- R;>CI73JT B.@R@74> >:ELOHI92JM)\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:56Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "g_word_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " ✅ word pointed-Hebrew (בְּ רֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "gloss\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " 🆗 english translation of lexeme (beginning create god(s))\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "det\n", + "gn\n", "
\n", "
str
\n", - "
\n", - " ✅ determinedness of phrase(atom) (det; und; NA.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " ✅ grammatical gender (m; f; NA; unknown.)\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:56Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "label\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " ✅ (half-)verse label (half verses: A; B; C; verses: GEN 01,02)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "language\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ of word or lexeme (Hebrew; Aramaic.)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "domain\n", + "lex\n", "
\n", "
str
\n", - "
\n", - " ✅ text type of clause (? (Unknown); N (narrative); D (discursive); Q (Quotation).)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " ✅ lexeme consonantal-transliterated (B R>CJT/ BR>[ >LHJM/)\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:57Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "lex_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " ✅ lexeme consonantal-Hebrew (ב ראשׁית֜ ברא אלהים֜)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "ls\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ lexical set, subclassification of part-of-speech (card; ques; mult)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "freq_lex\n", + "nametype\n", "
\n", - "
int
\n", - "
\n", - " ✅ frequency of lexemes\n", - "
\n", + "
str
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ⚠️ named entity type (pers; mens; gens; topo; ppde.)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "nme\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:24:45Z
\n", - "
\n", + " ✅ nominal ending consonantal-transliterated (absent; n/a; JM, ...)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "nu\n", "
\n", + "
str
\n", "\n", - "
\n", - "
provenance:
\n", - "
computed on the basis of the ETCBC core set of features
\n", - "
\n", + " ✅ grammatical number (sg; du; pl; NA; unknown.)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "number\n", "
\n", + "
int
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ sequence number of an object within its context\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "function\n", + "otype\n", "
\n", "
str
\n", - "
\n", - " ✅ syntactic function of phrase (Cmpl; Objc; Pred; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " \n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:57Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "pargr\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " 🆗 hierarchical paragraph number (1; 1.2; 1.2.3.4; ...)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "pdp\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ phrase dependent part-of-speech (art; verb; subs; nmpr, ...)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_cons\n", + "pfm\n", "
\n", "
str
\n", - "
\n", - " ✅ word consonantal-transliterated (B R>CJT BR> >LHJM ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " ✅ preformative consonantal-transliterated (absent; n/a; J, ...)\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:57Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "prs\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " ✅ pronominal suffix consonantal-transliterated (absent; n/a; W; ...)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "prs_gn\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ pronominal suffix gender (m; f; NA; unknown.)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_cons_utf8\n", + "prs_nu\n", "
\n", "
str
\n", - "
\n", - " ✅ word consonantal-Hebrew (ב ראשׁית ברא אלהים)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " ✅ pronominal suffix number (sg; du; pl; NA; unknown.)\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:58Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "prs_ps\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " ✅ pronominal suffix person (p1; p2; p3; NA; unknown.)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "ps\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ grammatical person (p1; p2; p3; NA; unknown.)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_lex\n", + "qere\n", "
\n", "
str
\n", - "
\n", - " ✅ lexeme pointed-transliterated (B.:- R;>CIJT B.@R@> >:ELOH ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " ✅ word pointed-transliterated masoretic reading correction\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:58Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "qere_trailer\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " ✅ interword material -pointed-transliterated (Masoretic correction)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "qere_trailer_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ interword material -pointed-transliterated (Masoretic correction)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_lex_utf8\n", + "qere_utf8\n", "
\n", "
str
\n", - "
\n", - " ✅ lexeme pointed-Hebrew (בְּ רֵאשִׁית בָּרָא אֱלֹה)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " ✅ word pointed-Hebrew masoretic reading correction\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:59Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "rank_lex\n", "
\n", + "
int
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " ✅ ranking of lexemes based on frequency\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "rela\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ linguistic relation between clause/(sub)phrase(atom) (ADJ; MOD; ATR; ...)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_word\n", + "sp\n", "
\n", "
str
\n", - "
\n", - " ✅ word pointed-transliterated (B.:- R;>CI73JT B.@R@74> >:ELOHI92JM)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " ✅ part-of-speech (art; verb; subs; nmpr, ...)\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:04Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "st\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " ✅ state of a noun (a (absolute); c (construct); e (emphatic).)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "tab\n", "
\n", + "
int
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ clause atom: its level in the linguistic embedding\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_word_utf8\n", + "trailer\n", "
\n", "
str
\n", - "
\n", - " ✅ word pointed-Hebrew (בְּ רֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ interword material pointed-transliterated (& 00 05 00_P ...)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "trailer_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:04Z
\n", - "
\n", + " ✅ interword material pointed-Hebrew (־ ׃)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "txt\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ text type of clause and surrounding (repetition of ? N D Q as in feature domain)\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "typ\n", "
\n", + "
str
\n", + "\n", + " ✅ clause/phrase(atom) type (VP; NP; Ellp; Ptcp; WayX)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "gloss\n", + "uvf\n", "
\n", "
str
\n", - "
\n", - " 🆗 English translation of lexeme (beginning create god(s))\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ univalent final consonant consonantal-transliterated (absent; N; J; ...)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "vbe\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:13Z
\n", - "
\n", + " ✅ verbal ending consonantal-transliterated (n/a; W; ...)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "vbs\n", "
\n", + "
str
\n", "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", + " ✅ root formation consonantal-transliterated (absent; n/a; H; ...)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "verse\n", "
\n", + "
int
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ verse number\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "gn\n", + "voc_lex\n", "
\n", "
str
\n", - "
\n", - " ✅ grammatical gender (m; f; NA; unknown.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " ✅ vocalized lexeme pointed-transliterated (B.: R;>CIJT BR> >:ELOHIJM)\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:05Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "voc_lex_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " ✅ vocalized lexeme pointed-Hebrew (בְּ רֵאשִׁית ברא אֱלֹהִים)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "vs\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ verbal stem (qal; piel; hif; apel; pael)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "label\n", + "vt\n", "
\n", "
str
\n", - "
\n", - " ✅ (half-)verse label (half verses: A; B; C; verses: GEN 01,02)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ verbal tense (perf; impv; wayq; infc)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "mother\n", "
\n", + "
none
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:06Z
\n", - "
\n", + " ✅ linguistic dependency between textual objects\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "oslots\n", "
\n", + "
none
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " \n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + "
\n", + "
\n", "\n", - "
\n", - "
\n", - "\n", + "
Phonetic Transcriptions\n", + "
\n", "\n", "
\n", "
\n", - "language\n", + "phono\n", "
\n", "
str
\n", - "
\n", - " ✅ of word or lexeme (Hebrew; Aramaic.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", + " 🆗 phonological transcription (bᵊ rēšˌîṯ bārˈā ʔᵉlōhˈîm)\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:13Z
\n", + "
\n", + "
\n", + "phono_trailer\n", "
\n", + "
str
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "lex\n", - "
\n", - "
str
\n", - "
\n", - " ✅ lexeme consonantal-transliterated (B R>CJT/ BR>[ >LHJM/)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:14Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "lex_utf8\n", - "
\n", - "
str
\n", - "
\n", - " ✅ lexeme consonantal-Hebrew (ב ראשׁית֜ ברא אלהים֜)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "ls\n", - "
\n", - "
str
\n", - "
\n", - " ✅ lexical set, subclassification of part-of-speech (card; ques; mult)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "nametype\n", - "
\n", - "
str
\n", - "
\n", - " ⚠️ named entity type (pers; mens; gens; topo; ppde.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "nme\n", - "
\n", - "
str
\n", - "
\n", - " ✅ nominal ending consonantal-transliterated (absent; n/a; JM, ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:08Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "nu\n", - "
\n", - "
str
\n", - "
\n", - " ✅ grammatical number (sg; du; pl; NA; unknown.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:08Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "number\n", - "
\n", - "
int
\n", - "
\n", - " ✅ sequence number of an object within its context\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:09Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "otype\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "pargr\n", - "
\n", - "
str
\n", - "
\n", - " 🆗 hierarchical paragraph number (1; 1.2; 1.2.3.4; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:22:50Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional paragraph file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "pdp\n", - "
\n", - "
str
\n", - "
\n", - " ✅ phrase dependent part-of-speech (art; verb; subs; nmpr, ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:10Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "pfm\n", - "
\n", - "
str
\n", - "
\n", - " ✅ preformative consonantal-transliterated (absent; n/a; J, ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:11Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "prs\n", - "
\n", - "
str
\n", - "
\n", - " ✅ pronominal suffix consonantal-transliterated (absent; n/a; W; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:11Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "prs_gn\n", - "
\n", - "
str
\n", - "
\n", - " ✅ pronominal suffix gender (m; f; NA; unknown.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:11Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "prs_nu\n", - "
\n", - "
str
\n", - "
\n", - " ✅ pronominal suffix number (sg; du; pl; NA; unknown.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:12Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "prs_ps\n", - "
\n", - "
str
\n", - "
\n", - " ✅ pronominal suffix person (p1; p2; p3; NA; unknown.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:12Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "ps\n", - "
\n", - "
str
\n", - "
\n", - " ✅ grammatical person (p1; p2; p3; NA; unknown.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:12Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "qere\n", - "
\n", - "
str
\n", - "
\n", - " ✅ word pointed-transliterated masoretic reading correction\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:23:29Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional ketiv/qere file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "qere_trailer\n", - "
\n", - "
str
\n", - "
\n", - " ✅ interword material -pointed-transliterated (Masoretic correction)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:23:29Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional ketiv/qere file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "qere_trailer_utf8\n", - "
\n", - "
str
\n", - "
\n", - " ✅ interword material -pointed-transliterated (Masoretic correction)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:23:29Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional ketiv/qere file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "qere_utf8\n", - "
\n", - "
str
\n", - "
\n", - " ✅ word pointed-Hebrew masoretic reading correction\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:23:29Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional ketiv/qere file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "rank_lex\n", - "
\n", - "
int
\n", - "
\n", - " ✅ ranking of lexemes based on frequency\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:24:46Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
computed on the basis of the ETCBC core set of features
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "rela\n", - "
\n", - "
str
\n", - "
\n", - " ✅ linguistic relation between clause/(sub)phrase(atom) (ADJ; MOD; ATR; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:13Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "sp\n", - "
\n", - "
str
\n", - "
\n", - " ✅ part-of-speech (art; verb; subs; nmpr, ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:16Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "st\n", - "
\n", - "
str
\n", - "
\n", - " ✅ state of a noun (a (absolute); c (construct); e (emphatic).)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:14Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "tab\n", - "
\n", - "
int
\n", - "
\n", - " ✅ clause atom: its level in the linguistic embedding\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:16Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "trailer\n", - "
\n", - "
str
\n", - "
\n", - " ✅ interword material pointed-transliterated (& 00 05 00_P ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:01Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "trailer_utf8\n", - "
\n", - "
str
\n", - "
\n", - " ✅ interword material pointed-Hebrew (־ ׃)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:01Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "txt\n", - "
\n", - "
str
\n", - "
\n", - " ✅ text type of clause and surrounding (repetition of ? N D Q as in feature domain)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:16Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "typ\n", - "
\n", - "
str
\n", - "
\n", - " ✅ clause/phrase(atom) type (VP; NP; Ellp; Ptcp; WayX)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:16Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "uvf\n", - "
\n", - "
str
\n", - "
\n", - " ✅ univalent final consonant consonantal-transliterated (absent; N; J; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:17Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "vbe\n", - "
\n", - "
str
\n", - "
\n", - " ✅ verbal ending consonantal-transliterated (n/a; W; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:17Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "vbs\n", - "
\n", - "
str
\n", - "
\n", - " ✅ root formation consonantal-transliterated (absent; n/a; H; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:17Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "verse\n", - "
\n", - "
int
\n", - "
\n", - " ✅ verse number\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:18Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "voc_lex\n", - "
\n", - "
str
\n", - "
\n", - " ✅ vocalized lexeme pointed-transliterated (B.: R;>CIJT BR> >:ELOHIJM)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:16Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "voc_lex_utf8\n", - "
\n", - "
str
\n", - "
\n", - " ✅ vocalized lexeme pointed-Hebrew (בְּ רֵאשִׁית ברא אֱלֹהִים)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:17Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "vs\n", - "
\n", - "
str
\n", - "
\n", - " ✅ verbal stem (qal; piel; hif; apel; pael)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:18Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "vt\n", - "
\n", - "
str
\n", - "
\n", - " ✅ verbal tense (perf; impv; wayq; infc)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:18Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "mother\n", - "
\n", - "
none
\n", - "
\n", - " ✅ linguistic dependency between textual objects\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:22Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "oslots\n", - "
\n", - "
none
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:17Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "\n", - "
Phonetic Transcriptions\n", - "
\n", - "\n", - "
\n", - "
\n", - "phono\n", - "
\n", - "
str
\n", - "
\n", - " 🆗 phonological transcription (bᵊ rēšˌîṯ bārˈā ʔᵉlōhˈîm)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
BHSA Data: Constantijn Sikkel; Phono Notebook: Dirk Roorda
\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:25:55Z
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
computed by the phono notebook, see https://github.com/ETCBC/phono
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "phono_trailer\n", - "
\n", - "
str
\n", - "
\n", - " 🆗 interword material in phonological transcription\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
BHSA Data: Constantijn Sikkel; Phono Notebook: Dirk Roorda
\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:25:55Z
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
computed by the phono notebook, see https://github.com/ETCBC/phono
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " 🆗 interword material in phonological transcription\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", @@ -4806,7 +1861,7 @@ "name": "stdout", "output_type": "stream", "text": [ - " 0.43s 6 results\n" + " 0.42s 6 results\n" ] } ], @@ -5482,7 +2537,7 @@ "name": "stdout", "output_type": "stream", "text": [ - " 0.43s 4 results\n" + " 0.44s 4 results\n" ] }, { @@ -5916,7 +2971,7 @@ "name": "stdout", "output_type": "stream", "text": [ - " 0.89s 10638 results\n" + " 0.86s 10638 results\n" ] } ], diff --git a/tutorial/share.ipynb b/tutorial/share.ipynb index e14f12b3..31c0c4a6 100644 --- a/tutorial/share.ipynb +++ b/tutorial/share.ipynb @@ -121,13 +121,13 @@ }, { "cell_type": "code", - "execution_count": 3, + "execution_count": 4, "metadata": {}, "outputs": [ { "data": { "text/html": [ - "TF-app: ~/text-fabric-data/etcbc/bhsa/app" + "TF-app: ~/text-fabric-data/github/etcbc/bhsa/app" ], "text/plain": [ "" @@ -139,7 +139,7 @@ { "data": { "text/html": [ - "data: ~/text-fabric-data/etcbc/bhsa/tf/2021" + "data: ~/text-fabric-data/github/etcbc/bhsa/tf/2021" ], "text/plain": [ "" @@ -151,7 +151,7 @@ { "data": { "text/html": [ - "data: ~/text-fabric-data/etcbc/lingo/heads/tf/2021" + "data: ~/text-fabric-data/github/etcbc/lingo/heads/tf/2021" ], "text/plain": [ "" @@ -163,7 +163,7 @@ { "data": { "text/html": [ - "data: ~/text-fabric-data/etcbc/valence/tf/2021" + "data: ~/text-fabric-data/github/etcbc/valence/tf/2021" ], "text/plain": [ "" @@ -175,7 +175,7 @@ { "data": { "text/html": [ - "data: ~/text-fabric-data/etcbc/phono/tf/2021" + "data: ~/text-fabric-data/github/etcbc/phono/tf/2021" ], "text/plain": [ "" @@ -187,7 +187,7 @@ { "data": { "text/html": [ - "data: ~/text-fabric-data/etcbc/parallels/tf/2021" + "data: ~/text-fabric-data/github/etcbc/parallels/tf/2021" ], "text/plain": [ "" @@ -196,64 +196,21 @@ "metadata": {}, "output_type": "display_data" }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "This is Text-Fabric 9.2.3\n", - "Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html\n", - "\n", - "135 features found and 0 
ignored\n" - ] - }, { "data": { "text/html": [ - "Text-Fabric: Text-Fabric API 9.2.3, etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
\n", + "Text-Fabric: Text-Fabric API 10.2.0, etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
\n", "
Parallel Passages\n", "
\n", "\n", "
\n", "
\n", - "crossref\n", + "crossref\n", "
\n", "
int
\n", - "
\n", - " 🆗 links between similar passages\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
BHSA Data: Constantijn Sikkel; Parallels Notebook: Dirk Roorda, Martijn Naaijer
\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:40:46Z
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
Parallels notebook, see https://github.com/ETCBC/parallels
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " 🆗 links between similar passages\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", @@ -264,9753 +221,2740 @@ "\n", "
\n", "
\n", - "book\n", + "book\n", "
\n", "
str
\n", - "
\n", - " ✅ book name in Latin (Genesis; Numeri; Reges1; ...)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ book name in Latin (Genesis; Numeri; Reges1; ...)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "book@ll\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:55Z
\n", - "
\n", + " ✅ book name in amharic (ኣማርኛ)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "chapter\n", "
\n", + "
int
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ chapter number (1; 2; 3; ...)\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "code\n", "
\n", + "
int
\n", + "\n", + " ✅ identifier of a clause atom relationship (0; 74; 367; ...)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "book@ll\n", + "det\n", "
\n", "
str
\n", - "
\n", - " ✅ book name in amharic (ኣማርኛ)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ determinedness of phrase(atom) (det; und; NA.)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "domain\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:20:27Z
\n", - "
\n", + " ✅ text type of clause (? (Unknown); N (narrative); D (discursive); Q (Quotation).)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "freq_lex\n", "
\n", + "
int
\n", "\n", - "
\n", - "
language:
\n", - "
ኣማርኛ
\n", - "
\n", + " ✅ frequency of lexemes\n", "\n", - "
\n", - "
languageCode:
\n", - "
am
\n", "
\n", "\n", - "
\n", - "
languageEnglish:
\n", - "
amharic
\n", + "
\n", + "
\n", + "function\n", "
\n", + "
str
\n", "\n", - "
\n", - "
provenance:
\n", - "
book names from wikipedia and other sources
\n", - "
\n", + " ✅ syntactic function of phrase (Cmpl; Objc; Pred; ...)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "g_cons\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ word consonantal-transliterated (B R>CJT BR> >LHJM ...)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "chapter\n", - "
\n", - "
int
\n", - "
\n", - " ✅ chapter number (1; 2; 3; ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", + "g_cons_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " ✅ word consonantal-Hebrew (ב ראשׁית ברא אלהים)\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:55Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "g_lex\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " ✅ lexeme pointed-transliterated (B.:- R;>CIJT B.@R@> >:ELOH ...)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "g_lex_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ lexeme pointed-Hebrew (בְּ רֵאשִׁית בָּרָא אֱלֹה)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "code\n", + "g_word\n", "
\n", - "
int
\n", - "
\n", - " ✅ identifier of a clause atom relationship (0; 74; 367; ...)\n", - "
\n", + "
str
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ word pointed-transliterated (B.:- R;>CI73JT B.@R@74> >:ELOHI92JM)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "g_word_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:56Z
\n", - "
\n", + " ✅ word pointed-Hebrew (בְּ רֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "gloss\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " 🆗 english translation of lexeme (beginning create god(s))\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "gn\n", "
\n", + "
str
\n", + "\n", + " ✅ grammatical gender (m; f; NA; unknown.)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "det\n", + "label\n", "
\n", "
str
\n", - "
\n", - " ✅ determinedness of phrase(atom) (det; und; NA.)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ (half-)verse label (half verses: A; B; C; verses: GEN 01,02)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "language\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:56Z
\n", - "
\n", + " ✅ of word or lexeme (Hebrew; Aramaic.)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "lex\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ lexeme consonantal-transliterated (B R>CJT/ BR>[ >LHJM/)\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "lex_utf8\n", "
\n", + "
str
\n", + "\n", + " ✅ lexeme consonantal-Hebrew (ב ראשׁית֜ ברא אלהים֜)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "domain\n", + "ls\n", "
\n", "
str
\n", - "
\n", - " ✅ text type of clause (? (Unknown); N (narrative); D (discursive); Q (Quotation).)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ lexical set, subclassification of part-of-speech (card; ques; mult)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "nametype\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:57Z
\n", - "
\n", + " ⚠️ named entity type (pers; mens; gens; topo; ppde.)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "nme\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ nominal ending consonantal-transliterated (absent; n/a; JM, ...)\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "nu\n", "
\n", + "
str
\n", + "\n", + " ✅ grammatical number (sg; du; pl; NA; unknown.)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "freq_lex\n", + "number\n", "
\n", "
int
\n", - "
\n", - " ✅ frequency of lexemes\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ sequence number of an object within its context\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "otype\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:24:45Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "pargr\n", "
\n", + "
str
\n", "\n", - "
\n", - "
provenance:
\n", - "
computed on the basis of the ETCBC core set of features
\n", - "
\n", + " 🆗 hierarchical paragraph number (1; 1.2; 1.2.3.4; ...)\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "pdp\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ✅ phrase dependent part-of-speech (art; verb; subs; nmpr, ...)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "function\n", + "pfm\n", "
\n", "
str
\n", - "
\n", - " ✅ syntactic function of phrase (Cmpl; Objc; Pred; ...)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ preformative consonantal-transliterated (absent; n/a; J, ...)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "prs\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:57Z
\n", - "
\n", + " ✅ pronominal suffix consonantal-transliterated (absent; n/a; W; ...)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "prs_gn\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ pronominal suffix gender (m; f; NA; unknown.)\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "prs_nu\n", "
\n", + "
str
\n", + "\n", + " ✅ pronominal suffix number (sg; du; pl; NA; unknown.)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_cons\n", + "prs_ps\n", "
\n", "
str
\n", - "
\n", - " ✅ word consonantal-transliterated (B R>CJT BR> >LHJM ...)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ pronominal suffix person (p1; p2; p3; NA; unknown.)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "ps\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:57Z
\n", - "
\n", + " ✅ grammatical person (p1; p2; p3; NA; unknown.)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "qere\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ word pointed-transliterated masoretic reading correction\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "qere_trailer\n", "
\n", + "
str
\n", + "\n", + " ✅ interword material -pointed-transliterated (Masoretic correction)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_cons_utf8\n", + "qere_trailer_utf8\n", "
\n", "
str
\n", - "
\n", - " ✅ word consonantal-Hebrew (ב ראשׁית ברא אלהים)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ interword material -pointed-transliterated (Masoretic correction)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "qere_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:58Z
\n", - "
\n", + " ✅ word pointed-Hebrew masoretic reading correction\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "rank_lex\n", "
\n", + "
int
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ ranking of lexemes based on frequency\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "rela\n", "
\n", + "
str
\n", + "\n", + " ✅ linguistic relation between clause/(sub)phrase(atom) (ADJ; MOD; ATR; ...)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_lex\n", + "sp\n", "
\n", "
str
\n", - "
\n", - " ✅ lexeme pointed-transliterated (B.:- R;>CIJT B.@R@> >:ELOH ...)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ part-of-speech (art; verb; subs; nmpr, ...)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "st\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:58Z
\n", - "
\n", + " ✅ state of a noun (a (absolute); c (construct); e (emphatic).)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "tab\n", "
\n", + "
int
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ clause atom: its level in the linguistic embedding\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "trailer\n", "
\n", + "
str
\n", + "\n", + " ✅ interword material pointed-transliterated (& 00 05 00_P ...)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_lex_utf8\n", + "trailer_utf8\n", "
\n", "
str
\n", - "
\n", - " ✅ lexeme pointed-Hebrew (בְּ רֵאשִׁית בָּרָא אֱלֹה)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ interword material pointed-Hebrew (־ ׃)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "txt\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:17:59Z
\n", - "
\n", + " ✅ text type of clause and surrounding (repetition of ? N D Q as in feature domain)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "typ\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ clause/phrase(atom) type (VP; NP; Ellp; Ptcp; WayX)\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "uvf\n", "
\n", + "
str
\n", + "\n", + " ✅ univalent final consonant consonantal-transliterated (absent; N; J; ...)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_word\n", + "vbe\n", "
\n", "
str
\n", - "
\n", - " ✅ word pointed-transliterated (B.:- R;>CI73JT B.@R@74> >:ELOHI92JM)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ verbal ending consonantal-transliterated (n/a; W; ...)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "vbs\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:04Z
\n", - "
\n", + " ✅ root formation consonantal-transliterated (absent; n/a; H; ...)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "verse\n", "
\n", + "
int
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ verse number\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "voc_lex\n", "
\n", + "
str
\n", + "\n", + " ✅ vocalized lexeme pointed-transliterated (B.: R;>CIJT BR> >:ELOHIJM)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "g_word_utf8\n", + "voc_lex_utf8\n", "
\n", "
str
\n", - "
\n", - " ✅ word pointed-Hebrew (בְּ רֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ✅ vocalized lexeme pointed-Hebrew (בְּ רֵאשִׁית ברא אֱלֹהִים)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "vs\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:04Z
\n", - "
\n", + " ✅ verbal stem (qal; piel; hif; apel; pael)\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "vt\n", "
\n", + "
str
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ✅ verbal tense (perf; impv; wayq; infc)\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "mother\n", "
\n", + "
none
\n", + "\n", + " ✅ linguistic dependency between textual objects\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", - "
\n", - "gloss\n", + "
\n", + "oslots\n", "
\n", - "
str
\n", - "
\n", - " 🆗 english translation of lexeme (beginning create god(s))\n", - "
\n", + "
none
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " \n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + "
\n", + "
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:13Z
\n", - "
\n", + "
etcbc/lingo/heads/tf\n", + "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "heads\n", "
\n", + "
none
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " \n", "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", "
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", + "
\n", + "
\n", + "noun_heads\n", "
\n", + "
none
\n", + "\n", + " \n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "prep_obj\n", "
\n", + "
none
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", + "
\n", + "
\n", + "\n", + "
Phonetic Transcriptions\n", + "
\n", + "\n", "
\n", "
\n", - "gn\n", + "phono\n", "
\n", "
str
\n", - "
\n", - " ✅ grammatical gender (m; f; NA; unknown.)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " 🆗 phonological transcription (bᵊ rēšˌîṯ bārˈā ʔᵉlōhˈîm)\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "phono_trailer\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:05Z
\n", - "
\n", + " 🆗 interword material in phonological transcription\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + "
\n", + "
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + "
etcbc/valence/tf\n", + "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "cfunction\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ❗️ corrected phrase function, only present for phrases that were in a correction sheet\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "label\n", + "f_correction\n", "
\n", "
str
\n", - "
\n", - " ✅ (half-)verse label (half verses: A; B; C; verses: GEN 01,02)\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " ❗️ whether the phrase function has been manually corrected\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "grammatical\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:06Z
\n", - "
\n", + " ❗️ constituent role main classification\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "language\n", - "
\n", - "
str
\n", - "
\n", - " ✅ of word or lexeme (Hebrew; Aramaic.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:13Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "lex\n", + "lexical\n", "
\n", "
str
\n", - "
\n", - " ✅ lexeme consonantal-transliterated (B R>CJT/ BR>[ >LHJM/)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:14Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ❗️ additional lexical characteristics\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "lex_utf8\n", + "original\n", "
\n", "
str
\n", - "
\n", - " ✅ lexeme consonantal-Hebrew (ב ראשׁית֜ ברא אלהים֜)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ❗️ default value before enrichment logic has been applied\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "ls\n", + "predication\n", "
\n", "
str
\n", - "
\n", - " ✅ lexical set, subclassification of part-of-speech (card; ques; mult)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ❗️ verbal function main classification\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "nametype\n", + "s_manual\n", "
\n", "
str
\n", - "
\n", - " ⚠️ named entity type (pers; mens; gens; topo; ppde.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
from additional lexicon file provided by the ETCBC
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ❗️ whether the generated enrichment features have been manually changed\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "nme\n", + "semantic\n", "
\n", "
str
\n", - "
\n", - " ✅ nominal ending consonantal-transliterated (absent; n/a; JM, ...)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:08Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ❗️ additional semantic characteristics\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "nu\n", + "sense\n", "
\n", "
str
\n", - "
\n", - " ✅ grammatical number (sg; du; pl; NA; unknown.)\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:08Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "number\n", - "
\n", - "
int
\n", - "
\n", - " ✅ sequence number of an object within its context\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:18:09Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " ❗️ sense label of verb occurrences (d-; i.; -p; d-; ...)\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "otype\n", + "valence\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2021-12-09T14:21:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", "\n", - "
\n", - "
version:
\n", - "
2021
\n", - "
\n", + " ❗️ verbal valence main classification\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + "
\n", + "
\n" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "\n", - "\n" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "
Text-Fabric API: names N F E L T S C TF directly usable

" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - } - ], - "source": [ - "A = use('etcbc/bhsa', mod=\"etcbc/lingo/heads/tf,etcbc/valence/tf\", hoist=globals())" - ] - }, - { - "cell_type": "markdown", - "metadata": { - "tags": [] - }, - "source": [ - "You see that the features from the **etcbc/valence/tf** and **etcbc/lingo/heads/tf** modules have been added to the mix.\n", - "\n", - "## ETCBC Valence\n", - "\n", - "Click the triangle before **etcbc/valence/tf** to see what features have been contributed.\n", - "\n", - "Note that edge features are in **_bold italic_**.\n", - "\n", - "Let's find out more about *sense*.\n", - "\n", - "You can start with clicking the triangle afte \"sense str\" above.\n", - "It tells you where the feature comes from, and it shows you the context where it has been constructed.\n", - "You might go there to see additional documentation.\n", - "\n", - "But we can also dive directly into its data:" - ] - }, - { - "cell_type": "code", - "execution_count": 4, - "metadata": {}, - "outputs": [ - { - "data": { - "text/plain": [ - "(('--', 17941),\n", - " ('d-', 9975),\n", - " ('-p', 6537),\n", - " ('-i', 3604),\n", - " ('-c', 3231),\n", - " ('dp', 1899),\n", - " ('dc', 1002),\n", - " ('di', 918),\n", - " ('l.', 876),\n", - " ('i.', 630),\n", - " ('n.', 532),\n", - " ('-b', 64),\n", - " ('db', 61),\n", - " ('c.', 57),\n", - " ('k.', 54))" - ] - }, - "execution_count": 4, - "metadata": {}, - "output_type": "execute_result" - } - ], - "source": [ - "F.sense.freqList()" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Which nodes have a sense feature?" 
- ] - }, - { - "cell_type": "code", - "execution_count": 5, - "metadata": {}, - "outputs": [ - { - "data": { - "text/plain": [ - "{'word'}" - ] - }, - "execution_count": 5, - "metadata": {}, - "output_type": "execute_result" - } - ], - "source": [ - "{F.otype.v(n) for n in N.walk() if F.sense.v(n)}" - ] - }, - { - "cell_type": "code", - "execution_count": 6, - "metadata": {}, - "outputs": [ - { - "name": "stdout", - "output_type": "stream", - "text": [ - " 0.24s 47381 results\n" - ] - } - ], - "source": [ - "results = A.search(\n", - " \"\"\"\n", - "word sense\n", - "\"\"\"\n", - ")" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Let's show some of the rarer sense values:" - ] - }, - { - "cell_type": "code", - "execution_count": 7, - "metadata": {}, - "outputs": [ - { - "name": "stdout", - "output_type": "stream", - "text": [ - " 0.29s 54 results\n" - ] - } - ], - "source": [ - "results = A.search(\n", - " \"\"\"\n", - "word sense=k.\n", - "\"\"\"\n", - ")" - ] - }, - { - "cell_type": "code", - "execution_count": 8, - "metadata": {}, - "outputs": [ - { - "data": { - "text/html": [ - "\n", - "\n", - "\n", - "\n", - "\n", - "
npword
1Genesis 4:17יִּקְרָא֙
2Genesis 13:16שַׂמְתִּ֥י
3Genesis 32:13שַׂמְתִּ֤י
4Genesis 34:31יַעֲשֶׂ֖ה
5Genesis 48:20יְשִֽׂמְךָ֣
" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - } - ], - "source": [ - "A.table(results, end=5)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "If we do a pretty display, the `sense` feature shows up." - ] - }, - { - "cell_type": "code", - "execution_count": 9, - "metadata": {}, - "outputs": [ - { - "data": { - "text/html": [ - "

result 1

" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "
verse:1414485
sentence:1172591
clause:427946
phrase:652729
1943 וַ
phrase:652730
sense=d-
phrase:652731
phrase:652732
sentence:1172592
clause:427947
phrase:652733
1948 וַ
phrase:652734
sense=--
sentence:1172593
clause:427948
phrase:652735
1950 וַ
phrase:652736
sense=d-
phrase:652737
sentence:1172594
clause:427949
phrase:652738
1954 וַֽ
phrase:652739
sense=--
clause:427950
phrase:652740
sense=d-
phrase:652741
sentence:1172595
clause:427951
phrase:652742
1958 וַ
phrase:652743
sense=k.
phrase:652744
" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - } - ], - "source": [ - "A.show(results, start=1, end=1, withNodes=True)" - ] - }, - { - "cell_type": "markdown", - "metadata": { - "tags": [] - }, - "source": [ - "## Lingo heads\n", - "If you click the triangle before **etcbc/lingo/heads/tf** you see what features it contributes.\n", - "Unfortunately, the authors have not provided a description of this feature, but if you click\n", - "on the triangle after *heads* none, you see where the feature comes from and who has made it.\n", - "\n", - "Moreover, the fact that *heads* is in italics makes clear that it is an edge feature.\n", - "\n", - "Let's use it in a query:\n", - "Now, `heads` is an edge feature, we cannot directly make it visible in pretty displays, but we can use it in queries.\n", - "\n", - "We also want to make the feature `sense` visible, so we mention the feature in the query, without restricting the results." - ] - }, - { - "cell_type": "code", - "execution_count": 10, - "metadata": {}, - "outputs": [ - { - "name": "stdout", - "output_type": "stream", - "text": [ - " 0.87s 402 results\n" - ] - } - ], - "source": [ - "results = A.search(\n", - " \"\"\"\n", - "book book=Genesis\n", - " chapter chapter=1\n", - " clause\n", - " phrase\n", - " -heads> word sense*\n", - "\"\"\"\n", - ")" - ] - }, - { - "cell_type": "code", - "execution_count": 11, - "metadata": {}, - "outputs": [ - { - "data": { - "text/html": [ - "

result 1

" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "
book Genesis
book=Genesis
" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "
chapter Genesis 1
book=Genesischapter=1
" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "

result 2

" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "
book Genesis
book=Genesis
" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "
chapter Genesis 1
book=Genesischapter=1
" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - } - ], - "source": [ - "A.show(results, start=1, end=2)" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "Note how the words that are **_heads_** of their phrases are highlighted within their phrases." - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "# Participants\n", - "\n", - "Now we are going to add another promising module, provided by Christian Canu Højgaard, from this repo:\n", - "[participants](https://github.com/ch-jensen/participants).\n", - "\n", - "Let's do it in the straightforward way:" - ] - }, - { - "cell_type": "code", - "execution_count": 12, - "metadata": {}, - "outputs": [ - { - "data": { - "text/html": [ - "TF-app: ~/text-fabric-data/etcbc/bhsa/app" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/etcbc/bhsa/tf/2021" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/etcbc/lingo/heads/tf/2021" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/etcbc/valence/tf/2021" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "The requested data is not available offline\n", - "\t~/text-fabric-data/ch-jensen/participants/actor/tf not found\n", - "rate limit is 5000 requests per hour, with 5000 left for this hour\n", - "\tconnecting to online GitHub repo ch-jensen/participants ... 
connected\n", - "No directory actor/tf/2021 in #9671910a329c069cfd3d366526ea816de57666dcWill try something else\n", - "\tFailed" - ] - }, - { - "name": "stderr", - "output_type": "stream", - "text": [ - "No directory actor/tf/2021 in #9671910a329c069cfd3d366526ea816de57666dc\tFailed" - ] - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/etcbc/phono/tf/2021" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/etcbc/parallels/tf/2021" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "name": "stderr", - "output_type": "stream", - "text": [ - "There were problems with loading data.\n", - "The Text-Fabric API has not been loaded!\n", - "The app \"etcbc/bhsa\" will not work!\n" - ] - } - ], - "source": [ - "A = use(\n", - " 'etcbc/bhsa',\n", - " mod=(\n", - " \"etcbc/lingo/heads/tf\",\n", - " \"etcbc/valence/tf\",\n", - " \"ch-jensen/participants/actor/tf\"\n", - " ),\n", - " hoist=globals(),\n", - ")" - ] - }, - { - "cell_type": "markdown", - "metadata": {}, - "source": [ - "The features are not there!\n", - "\n", - "If we have a look on Github in this repo we see under\n", - "[actor/tf](https://github.com/ch-jensen/participants/tree/master/actor/tf)\n", - "the directory `c` only. Christian has produced his features against version `c` of the BHSA.\n", - "\n", - "Ok, then we go back, and run our command for version `c`." 
- ] - }, - { - "cell_type": "code", - "execution_count": 17, - "metadata": {}, - "outputs": [ - { - "data": { - "text/html": [ - "TF-app: ~/text-fabric-data/etcbc/bhsa/app" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/etcbc/bhsa/tf/c" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/etcbc/lingo/heads/tf/c" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/etcbc/valence/tf/c" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/ch-jensen/participants/actor/tf/c" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/etcbc/phono/tf/c" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "data": { - "text/html": [ - "data: ~/text-fabric-data/etcbc/parallels/tf/c" - ], - "text/plain": [ - "" - ] - }, - "metadata": {}, - "output_type": "display_data" - }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "This is Text-Fabric 9.2.3\n", - "Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html\n", - "\n", - "136 features found and 0 ignored\n" - ] - }, - { - "data": { - "text/html": [ - "Text-Fabric: Text-Fabric API 9.2.3, etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
\n", - "
Parallel Passages\n", - "
\n", - "\n", - "
\n", - "
\n", - "crossref\n", - "
\n", - "
int
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
BHSA Data: Constantijn Sikkel; Parallels Notebook: Dirk Roorda, Martijn Naaijer
\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
coreVersion:
\n", - "
_temp
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:18:08Z
\n", - "
\n", - "\n", - "
\n", - "
source:
\n", - "
Parallels Module
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "\n", - "
ch-jensen/participants/actor/tf\n", - "
\n", - "\n", - "
\n", - "
\n", - "actor\n", - "
\n", - "
str
\n", - "
\n", - " Participant references for words, subphrases and phrases. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
coreVersion:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2020-05-11T13:34:09Z
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "prs_actor\n", - "
\n", - "
str
\n", - "
\n", - " Participant references for pronominal suffixes. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
coreVersion:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2020-05-11T13:34:13Z
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "coref\n", - "
\n", - "
none
\n", - "
\n", - " Edges to co-referring actors on chapter-level. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
coreVersion:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2020-05-11T13:34:16Z
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "\n", - "
BHSA = Biblia Hebraica Stuttgartensia Amstelodamensis\n", - "
\n", - "\n", - "
\n", - "
\n", - "book\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "book@ll\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:11:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
language:
\n", - "
ኣማርኛ
\n", - "
\n", - "\n", - "
\n", - "
languageCode:
\n", - "
am
\n", - "
\n", - "\n", - "
\n", - "
languageEnglish:
\n", - "
amharic
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
book names from wikipedia and other sources
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "chapter\n", - "
\n", - "
int
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "code\n", - "
\n", - "
int
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "det\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:15Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "domain\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:19Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "freq_lex\n", - "
\n", - "
int
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:14:58Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
computed addition to core set of features
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "function\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:19Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "g_cons\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:19Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "g_cons_utf8\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:20Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "g_lex\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:21Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "g_lex_utf8\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:22Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "g_word\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:34Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "g_word_utf8\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:34Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "gloss\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2019-01-31T17:40:54Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "gn\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:35Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "label\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:37Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "language\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:11:51Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "lex\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:11:53Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "lex_utf8\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:11:54Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "ls\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:11:55Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "nametype\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2019-01-31T17:40:54Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "nme\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:41Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "nu\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:42Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "number\n", - "
\n", - "
int
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:43Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "otype\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:11:56Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "pargr\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:13:35Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "pdp\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:46Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "pfm\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:46Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "prs\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:47Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "prs_gn\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:48Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "prs_nu\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:49Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "prs_ps\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:50Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "ps\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:50Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "qere\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:13:50Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "qere_trailer\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:13:50Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "qere_trailer_utf8\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:13:50Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "qere_utf8\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:13:50Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "rank_lex\n", - "
\n", - "
int
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:15:00Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
provenance:
\n", - "
computed addition to core set of features
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "rela\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:53Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "sp\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:11:57Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", - "\n", - "
\n", - "
\n", - "
\n", - "\n", - "
\n", - "
\n", - "st\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", + "@font-face {\n", + " font-family: \"SantakkuM\";\n", + " src: local('SantakkuM'),\n", + " url('/server/static/fonts/SantakkuM.woff') format('woff'),\n", + " url('https://github.com/annotation/text-fabric/blob/master/tf/server/static/fonts/SantakkuM.woff?raw=true') format('woff');\n", + "}\n", + "/* bypassing some classical notebook settings */\n", + "div#notebook {\n", + " line-height: unset;\n", + "}\n", + "/* neutral text */\n", + ".txtn,.txtn a:visited,.txtn a:link {\n", + " font-family: sans-serif;\n", + " font-size: medium;\n", + " direction: ltr;\n", + " unicode-bidi: embed;\n", + " text-decoration: none;\n", + " color: var(--text-color);\n", + "}\n", + "/* transcription text */\n", + ".txtt,.txtt a:visited,.txtt a:link {\n", + " font-family: monospace;\n", + " font-size: medium;\n", + " direction: ltr;\n", + " unicode-bidi: embed;\n", + " text-decoration: none;\n", + " color: var(--text-color);\n", + "}\n", + "/* source text */\n", + ".txto,.txto a:visited,.txto a:link {\n", + " font-family: serif;\n", + " font-size: medium;\n", + " direction: ltr;\n", + " unicode-bidi: embed;\n", + " text-decoration: none;\n", + " color: var(--text-color);\n", + "}\n", + "/* phonetic text */\n", + ".txtp,.txtp a:visited,.txtp a:link {\n", + " font-family: Gentium, sans-serif;\n", + " font-size: medium;\n", + " direction: ltr;\n", + " unicode-bidi: embed;\n", + " text-decoration: none;\n", + " color: var(--text-color);\n", + "}\n", + "/* original script text */\n", + ".txtu,.txtu a:visited,.txtu a:link {\n", + " font-family: Gentium, sans-serif;\n", + " font-size: medium;\n", + " text-decoration: none;\n", + " color: var(--text-color);\n", + "}\n", + "/* hebrew */\n", + ".txtu.hbo,.lex.hbo {\n", + " font-family: \"Ezra SIL\", \"SBL Hebrew\", sans-serif;\n", + " font-size: large;\n", + " direction: rtl ! 
important;\n", + " unicode-bidi: embed;\n", + "}\n", + "/* syriac */\n", + ".txtu.syc,.lex.syc {\n", + " font-family: \"Estrangelo Edessa\", sans-serif;\n", + " font-size: medium;\n", + " direction: rtl ! important;\n", + " unicode-bidi: embed;\n", + "}\n", + "/* neo aramaic */\n", + ".txtu.cld,.lex.cld {\n", + " font-family: \"CharisSIL-R\", sans-serif;\n", + " font-size: medium;\n", + " direction: ltr ! important;\n", + " unicode-bidi: embed;\n", + "}\n", + "/* standard arabic */\n", + ".txtu.ara,.lex.ara {\n", + " font-family: \"AmiriQuran\", sans-serif;\n", + " font-size: large;\n", + " direction: rtl ! important;\n", + " unicode-bidi: embed;\n", + "}\n", + "/* cuneiform */\n", + ".txtu.akk,.lex.akk {\n", + " font-family: Santakku, sans-serif;\n", + " font-size: large;\n", + " direction: ltr ! important;\n", + " unicode-bidi: embed;\n", + "}\n", + "/* greek */\n", + ".txtu.grc,.lex.grc a:link {\n", + " font-family: Gentium, sans-serif;\n", + " font-size: medium;\n", + " direction: ltr ! important;\n", + " unicode-bidi: embed;\n", + "}\n", + "a:hover {\n", + " text-decoration: underline | important;\n", + " color: #0000ff | important;\n", + "}\n", + ".ltr {\n", + " direction: ltr ! important;\n", + " unicode-bidi: embed;\n", + "}\n", + ".rtl {\n", + " direction: rtl ! 
important;\n", + " unicode-bidi: embed;\n", + "}\n", + ".ubd {\n", + " unicode-bidi: embed;\n", + "}\n", + ".col {\n", + " display: inline-block;\n", + "}\n", + ".features {\n", + " font-family: monospace;\n", + " font-size: medium;\n", + " font-weight: bold;\n", + " color: var(--features);\n", + " display: flex;\n", + " flex-flow: column nowrap;\n", + " justify-content: flex-start;\n", + " align-items: flex-start;\n", + " align-content: flex-start;\n", + " padding: 2px;\n", + " margin: 2px;\n", + " direction: ltr;\n", + " unicode-bidi: embed;\n", + " border: var(--meta-width) solid var(--meta-color);\n", + " border-radius: var(--meta-width);\n", + "}\n", + ".features div,.features span {\n", + " padding: 0;\n", + " margin: -2px 0;\n", + "}\n", + ".features .f {\n", + " font-family: sans-serif;\n", + " font-size: small;\n", + " font-weight: normal;\n", + " color: #5555bb;\n", + "}\n", + ".features .xft {\n", + " color: #000000;\n", + " background-color: #eeeeee;\n", + " font-size: medium;\n", + " margin: 2px 0px;\n", + "}\n", + ".features .xft .f {\n", + " color: #000000;\n", + " background-color: #eeeeee;\n", + " font-size: small;\n", + " font-weight: normal;\n", + "}\n", + ".tfsechead {\n", + " font-family: sans-serif;\n", + " font-size: small;\n", + " font-weight: bold;\n", + " color: var(--tfsechead);\n", + " unicode-bidi: embed;\n", + " text-align: start;\n", + "}\n", + ".structure {\n", + " font-family: sans-serif;\n", + " font-size: small;\n", + " font-weight: bold;\n", + " color: var(--structure);\n", + " unicode-bidi: embed;\n", + " text-align: start;\n", + "}\n", + ".comments {\n", + " display: flex;\n", + " justify-content: flex-start;\n", + " align-items: flex-start;\n", + " align-content: flex-start;\n", + " flex-flow: column nowrap;\n", + "}\n", + ".nd, a:link.nd {\n", + " font-family: sans-serif;\n", + " font-size: small;\n", + " color: var(--node);\n", + " vertical-align: super;\n", + " direction: ltr ! 
important;\n", + " unicode-bidi: embed;\n", + "}\n", + ".lex {\n", + " color: var(--lex-color);;\n", + "}\n", + ".children,.children.ltr {\n", + " display: flex;\n", + " border: 0;\n", + " background-color: #ffffff;\n", + " justify-content: flex-start;\n", + " align-items: flex-start;\n", + " align-content: flex-start;\n", + "}\n", + ".children.stretch {\n", + " align-items: stretch;\n", + "}\n", + ".children.hor {\n", + " flex-flow: row nowrap;\n", + "}\n", + ".children.hor.wrap {\n", + " flex-flow: row wrap;\n", + "}\n", + ".children.ver {\n", + " flex-flow: column nowrap;\n", + "}\n", + ".children.ver.wrap {\n", + " flex-flow: column wrap;\n", + "}\n", + ".contnr {\n", + " width: fit-content;\n", + " display: flex;\n", + " justify-content: flex-start;\n", + " align-items: flex-start;\n", + " align-content: flex-start;\n", + " flex-flow: column nowrap;\n", + " background: #ffffff none repeat scroll 0 0;\n", + " padding: 10px 2px 2px 2px;\n", + " margin: 16px 2px 2px 2px;\n", + " border-style: solid;\n", + " font-size: small;\n", + "}\n", + ".contnr.trm {\n", + " background-attachment: local;\n", + "}\n", + ".contnr.cnul {\n", + " padding: 0;\n", + " margin: 0;\n", + " border-style: solid;\n", + " font-size: xx-small;\n", + "}\n", + ".contnr.cnul,.lbl.cnul {\n", + " border-color: var(--border-color-nul);\n", + " border-width: var(--border-width-nul);\n", + " border-radius: var(--border-width-nul);\n", + "}\n", + ".contnr.c0,.lbl.c0 {\n", + " border-color: var(--border-color0);\n", + " border-width: var(--border-width0);\n", + " border-radius: var(--border-width0);\n", + "}\n", + ".contnr.c1,.lbl.c1 {\n", + " border-color: var(--border-color1);\n", + " border-width: var(--border-width1);\n", + " border-radius: var(--border-width1);\n", + "}\n", + ".contnr.c2,.lbl.c2 {\n", + " border-color: var(--border-color2);\n", + " border-width: var(--border-width2);\n", + " border-radius: var(--border-width2);\n", + "}\n", + ".contnr.c3,.lbl.c3 {\n", + " border-color: 
var(--border-color3);\n", + " border-width: var(--border-width3);\n", + " border-radius: var(--border-width3);\n", + "}\n", + ".contnr.c4,.lbl.c4 {\n", + " border-color: var(--border-color4);\n", + " border-width: var(--border-width4);\n", + " border-radius: var(--border-width4);\n", + "}\n", + "span.plain {\n", + " display: inline-block;\n", + " white-space: pre-wrap;\n", + "}\n", + ".plain {\n", + " background-color: #ffffff;\n", + "}\n", + ".plain.l,.contnr.l,.contnr.l>.lbl {\n", + " border-left-style: dotted\n", + "}\n", + ".plain.r,.contnr.r,.contnr.r>.lbl {\n", + " border-right-style: dotted\n", + "}\n", + ".plain.lno,.contnr.lno,.contnr.lno>.lbl {\n", + " border-left-style: none\n", + "}\n", + ".plain.rno,.contnr.rno,.contnr.rno>.lbl {\n", + " border-right-style: none\n", + "}\n", + ".plain.l {\n", + " padding-left: 4px;\n", + " margin-left: 2px;\n", + " border-width: var(--border-width-plain);\n", + "}\n", + ".plain.r {\n", + " padding-right: 4px;\n", + " margin-right: 2px;\n", + " border-width: var(--border-width-plain);\n", + "}\n", + ".lbl {\n", + " font-family: monospace;\n", + " margin-top: -24px;\n", + " margin-left: 20px;\n", + " background: #ffffff none repeat scroll 0 0;\n", + " padding: 0 6px;\n", + " border-style: solid;\n", + " display: block;\n", + " color: var(--label)\n", + "}\n", + ".lbl.trm {\n", + " background-attachment: local;\n", + " margin-top: 2px;\n", + " margin-left: 2px;\n", + " padding: 2px 2px;\n", + " border-style: none;\n", + "}\n", + ".lbl.cnul {\n", + " font-size: xx-small;\n", + "}\n", + ".lbl.c0 {\n", + " font-size: small;\n", + "}\n", + ".lbl.c1 {\n", + " font-size: small;\n", + "}\n", + ".lbl.c2 {\n", + " font-size: medium;\n", + "}\n", + ".lbl.c3 {\n", + " font-size: medium;\n", + "}\n", + ".lbl.c4 {\n", + " font-size: large;\n", + "}\n", + ".occs, a:link.occs {\n", + " font-size: small;\n", + "}\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + "/* PROVENANCE */\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", + "div.prov {\n", + "\tmargin: 40px;\n", + "\tpadding: 20px;\n", + "\tborder: 2px solid var(--fog-rim);\n", + "}\n", + "div.pline {\n", + "\tdisplay: flex;\n", + "\tflex-flow: row nowrap;\n", + "\tjustify-content: stretch;\n", + "\talign-items: baseline;\n", + "}\n", + "div.p2line {\n", + "\tmargin-left: 2em;\n", + "\tdisplay: flex;\n", + "\tflex-flow: row nowrap;\n", + "\tjustify-content: stretch;\n", + "\talign-items: baseline;\n", + "}\n", + "div.psline {\n", + "\tdisplay: flex;\n", + "\tflex-flow: row nowrap;\n", + "\tjustify-content: stretch;\n", + "\talign-items: baseline;\n", + "\tbackground-color: var(--gold-mist-back);\n", + "}\n", + "div.pname {\n", + "\tflex: 0 0 5rem;\n", + "\tfont-weight: bold;\n", + "}\n", + "div.pval {\n", + " flex: 1 1 auto;\n", + "}\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + "/* KEYBOARD */\n", + ".ccoff {\n", + " background-color: inherit;\n", + "}\n", + ".ccon {\n", + " background-color: yellow ! important;\n", + "}\n", + ".ccon,.ccoff {\n", + " padding-right: 0.1rem;\n", + " padding-left: 0.1rem;\n", + "}\n", + ".ccline {\n", + " font-size: xx-large;\n", + " font-weight: bold;\n", + "}\n", + "/* TF header */\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:54Z
\n", - "
\n", + "summary {\n", + " /* needed to override the normalize.less\n", + " * in the classical jupyter notebook\n", + " */\n", + " display: list-item ! important;\n", + "}\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", + ".fcorpus {\n", + " display: flex;\n", + " flex-flow: column nowrap;\n", + " justify-content: flex-start;\n", + " align-items: flex-start;\n", + " align-content: flex-start;\n", + "}\n", + ".frow {\n", + " display: flex;\n", + " flex-flow: row nowrap;\n", + " justify-content: flex-start;\n", + " align-items: flex-start;\n", + " align-content: flex-start;\n", + "}\n", + ".fmeta {\n", + " display: flex;\n", + " flex-flow: column nowrap;\n", + " justify-content: flex-start;\n", + " align-items: flex-start;\n", + " align-content: flex-start;\n", + "}\n", + ".fmetarow {\n", + " display: flex;\n", + " flex-flow: row nowrap;\n", + " justify-content: flex-start;\n", + " align-items: flex-start;\n", + " align-content: flex-start;\n", + "}\n", + ".fmetakey {\n", + " min-width: 10rem;\n", + " font-family: monospace;\n", + "}\n", + ".fnamecat {\n", + " min-width: 10rem;\n", + "}\n", + ".fnamecat.edge {\n", + " font-weight: bold;\n", + " font-style: italic;\n", + "}\n", + ".fmono {\n", + " font-family: monospace;\n", + "}\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + ":root {\n", + "\t--node: hsla(120, 100%, 20%, 1.0 );\n", + "\t--label: hsla( 0, 100%, 20%, 1.0 );\n", + "\t--tfsechead: hsla( 0, 100%, 25%, 1.0 );\n", + "\t--structure: hsla(120, 100%, 25%, 1.0 );\n", + "\t--features: hsla( 0, 0%, 30%, 1.0 );\n", + " --text-color: hsla( 60, 80%, 10%, 1.0 );\n", + " --lex-color: hsla(220, 90%, 60%, 1.0 );\n", + " --meta-color: hsla( 0, 0%, 90%, 0.7 );\n", + " --meta-width: 3px;\n", + " --border-color-nul: hsla( 0, 0%, 90%, 0.5 );\n", + " --border-color0: hsla( 0, 0%, 90%, 0.9 );\n", + " --border-color1: hsla( 0, 0%, 80%, 0.9 );\n", + " --border-color2: hsla( 0, 0%, 70%, 0.9 );\n", + " --border-color3: hsla( 0, 0%, 80%, 0.8 );\n", + " --border-color4: hsla( 0, 0%, 60%, 0.9 );\n", + " --border-width-nul: 2px;\n", + " --border-width0: 2px;\n", + " --border-width1: 3px;\n", + " --border-width2: 4px;\n", + " --border-width3: 6px;\n", + " --border-width4: 5px;\n", + " --border-width-plain: 2px;\n", + "}\n", + ".hl {\n", + " background-color: var(--hl-strong);\n", + "}\n", + "span.hl {\n", + "\tbackground-color: var(--hl-strong);\n", + "\tborder-width: 0;\n", + "\tborder-radius: 2px;\n", + "\tborder-style: solid;\n", + "}\n", + "div.contnr.hl,div.lbl.hl {\n", + " background-color: var(--hl-strong);\n", + "}\n", + "div.contnr.hl {\n", + " border-color: var(--hl-rim) ! important;\n", + "\tborder-width: 4px ! important;\n", + "}\n", "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", + "span.hlbx {\n", + "\tborder-color: var(--hl-rim);\n", + "\tborder-width: 4px ! important;\n", + "\tborder-style: solid;\n", + "\tborder-radius: 6px;\n", + " padding: 4px;\n", + " margin: 4px;\n", + "}\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", + "span.plain {\n", + " display: inline-block;\n", + " white-space: pre-wrap;\n", + "}\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + ":root {\n", + "\t--hl-strong: hsla( 60, 100%, 70%, 0.9 );\n", + "\t--hl-rim: hsla( 55, 80%, 50%, 1.0 );\n", + "}\n", + "" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ "\n", - "
\n", - "
\n", - "
\n", + "\n" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "
Text-Fabric API: names N F E L T S C TF directly usable

" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + } + ], + "source": [ + "A = use('etcbc/bhsa', mod=\"etcbc/lingo/heads/tf,etcbc/valence/tf\", hoist=globals())" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "tags": [] + }, + "source": [ + "You see that the features from the **etcbc/valence/tf** and **etcbc/lingo/heads/tf** modules have been added to the mix.\n", + "\n", + "## ETCBC Valence\n", + "\n", + "Click the triangle before **etcbc/valence/tf** to see what features have been contributed.\n", + "\n", + "Note that edge features are in **_bold italic_**.\n", + "\n", + "Let's find out more about *sense*.\n", + "\n", + "You can start by clicking the triangle after \"sense str\" above.\n", + "It tells you where the feature comes from, and it shows you the context where it has been constructed.\n", + "You might go there to see additional documentation.\n", + "\n", + "But we can also dive directly into its data:" + ] + }, + { + "cell_type": "code", + "execution_count": 5, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "(('--', 17941),\n", + " ('d-', 9975),\n", + " ('-p', 6537),\n", + " ('-i', 3604),\n", + " ('-c', 3231),\n", + " ('dp', 1899),\n", + " ('dc', 1002),\n", + " ('di', 918),\n", + " ('l.', 876),\n", + " ('i.', 630),\n", + " ('n.', 532),\n", + " ('-b', 64),\n", + " ('db', 61),\n", + " ('c.', 57),\n", + " ('k.', 54))" + ] + }, + "execution_count": 5, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "F.sense.freqList()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Which nodes have a sense feature?" 
+ ] + }, + { + "cell_type": "code", + "execution_count": 6, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "{'word'}" + ] + }, + "execution_count": 6, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "{F.otype.v(n) for n in N.walk() if F.sense.v(n)}" + ] + }, + { + "cell_type": "code", + "execution_count": 7, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + " 0.14s 47381 results\n" + ] + } + ], + "source": [ + "results = A.search(\n", + " \"\"\"\n", + "word sense\n", + "\"\"\"\n", + ")" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Let's show some of the rarer sense values:" + ] + }, + { + "cell_type": "code", + "execution_count": 8, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + " 0.15s 54 results\n" + ] + } + ], + "source": [ + "results = A.search(\n", + " \"\"\"\n", + "word sense=k.\n", + "\"\"\"\n", + ")" + ] + }, + { + "cell_type": "code", + "execution_count": 9, + "metadata": {}, + "outputs": [ + { + "data": { + "text/html": [ + "\n", + "\n", + "\n", + "\n", + "\n", + "
npword
1Genesis 4:17יִּקְרָא֙
2Genesis 13:16שַׂמְתִּ֥י
3Genesis 32:13שַׂמְתִּ֤י
4Genesis 34:31יַעֲשֶׂ֖ה
5Genesis 48:20יְשִֽׂמְךָ֣
" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + } + ], + "source": [ + "A.table(results, end=5)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "If we do a pretty display, the `sense` feature shows up." + ] + }, + { + "cell_type": "code", + "execution_count": 10, + "metadata": {}, + "outputs": [ + { + "data": { + "text/html": [ + "

result 1

" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "
verse:1414485
sentence:1172591
clause:427946
phrase:652729
1943 וַ
phrase:652730
sense=d-
phrase:652731
phrase:652732
sentence:1172592
clause:427947
phrase:652733
1948 וַ
phrase:652734
sense=--
sentence:1172593
clause:427948
phrase:652735
1950 וַ
phrase:652736
sense=d-
phrase:652737
sentence:1172594
clause:427949
phrase:652738
1954 וַֽ
phrase:652739
sense=--
clause:427950
phrase:652740
sense=d-
phrase:652741
sentence:1172595
clause:427951
phrase:652742
1958 וַ
phrase:652743
sense=k.
phrase:652744
" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + } + ], + "source": [ + "A.show(results, start=1, end=1, withNodes=True)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "tags": [] + }, + "source": [ + "## Lingo heads\n", + "If you click the triangle before **etcbc/lingo/heads/tf** you see what features it contributes.\n", + "Unfortunately, the authors have not provided a description of this feature, but if you click\n", + "on the triangle after *heads* none, you see where the feature comes from and who has made it.\n", + "\n", + "Moreover, the fact that *heads* is in bold italics makes clear that it is an edge feature.\n", + "\n", + "Let's use it in a query.\n", + "Because `heads` is an edge feature, we cannot make it directly visible in pretty displays, but we can use it in queries.\n", + "\n", + "We also want to make the feature `sense` visible, so we mention the feature in the query, without restricting the results." + ] + }, + { + "cell_type": "code", + "execution_count": 11, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + " 0.40s 402 results\n" + ] + } + ], + "source": [ + "results = A.search(\n", + " \"\"\"\n", + "book book=Genesis\n", + " chapter chapter=1\n", + " clause\n", + " phrase\n", + " -heads> word sense*\n", + "\"\"\"\n", + ")" + ] + }, + { + "cell_type": "code", + "execution_count": 12, + "metadata": {}, + "outputs": [ + { + "data": { + "text/html": [ + "

result 1

" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "
book Genesis
book=Genesis
" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "
chapter Genesis 1
book=Genesischapter=1
" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "

result 2

" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "
book Genesis
book=Genesis
" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "
chapter Genesis 1
book=Genesischapter=1
" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + } + ], + "source": [ + "A.show(results, start=1, end=2)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Note how the words that are **_heads_** of their phrases are highlighted within their phrases." + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Participants\n", + "\n", + "Now we are going to add another promising module, provided by Christian Canu Højgaard, from this repo:\n", + "[participants](https://github.com/ch-jensen/participants).\n", + "\n", + "Let's do it in the straightforward way:" + ] + }, + { + "cell_type": "code", + "execution_count": 13, + "metadata": {}, + "outputs": [ + { + "data": { + "text/html": [ + "TF-app: ~/text-fabric-data/github/etcbc/bhsa/app" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/etcbc/bhsa/tf/2021" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/etcbc/lingo/heads/tf/2021" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/etcbc/valence/tf/2021" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "name": "stdout", + "output_type": "stream", + "text": [ + "The requested data is not available offline\n", + "\t~/text-fabric-data/github/ch-jensen/participants/actor/tf/2021 not found\n", + "No directory actor/tf/2021 in #9671910a329c069cfd3d366526ea816de57666dcWill try something else\n", + "\tFailed" + ] + }, + { + "name": "stderr", + "output_type": "stream", + "text": [ + "No directory 
actor/tf/2021 in #9671910a329c069cfd3d366526ea816de57666dc\tFailed" + ] + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/etcbc/phono/tf/2021" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/etcbc/parallels/tf/2021" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "name": "stderr", + "output_type": "stream", + "text": [ + "There was an error loading TF-app etcbc/bhsa from ~/text-fabric-data/github/etcbc/bhsa/app\n", + "AttributeError(\"'TfApp' object has no attribute 'TF'\")\n", + "Traceback (most recent call last):\n", + " File \"/Users/me/github/annotation/text-fabric/tf/advanced/app.py\", line 542, in findApp\n", + " app = appClass(\n", + " File \"/Users/me/text-fabric-data/github/etcbc/bhsa/app/app.py\", line 6, in __init__\n", + " super().__init__(*args, **kwargs)\n", + " File \"/Users/me/github/annotation/text-fabric/tf/advanced/app.py\", line 178, in __init__\n", + " volumesApi(self)\n", + " File \"/Users/me/github/annotation/text-fabric/tf/advanced/volumes.py\", line 39, in volumesApi\n", + " TF = app.TF\n", + "AttributeError: 'TfApp' object has no attribute 'TF'\n", + "Text-Fabric is not loaded\n" + ] + } + ], + "source": [ + "A = use(\n", + " 'etcbc/bhsa',\n", + " mod=(\n", + " \"etcbc/lingo/heads/tf\",\n", + " \"etcbc/valence/tf\",\n", + " \"ch-jensen/participants/actor/tf\"\n", + " ),\n", + " hoist=globals(),\n", + ")" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "The features are not there!\n", + "\n", + "If we look at this repo on GitHub, we see that\n", + "[actor/tf](https://github.com/ch-jensen/participants/tree/master/actor/tf)\n", + "contains only the directory `c`. Christian has produced his features against version `c` of the BHSA.\n", + "\n", + "So we go back and run our command for version `c`." 
+ ] + }, + { + "cell_type": "code", + "execution_count": 14, + "metadata": {}, + "outputs": [ + { + "data": { + "text/html": [ + "TF-app: ~/text-fabric-data/github/etcbc/bhsa/app" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/etcbc/bhsa/tf/c" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/etcbc/lingo/heads/tf/c" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/etcbc/valence/tf/c" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/ch-jensen/participants/actor/tf/c" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/etcbc/phono/tf/c" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "data: ~/text-fabric-data/github/etcbc/parallels/tf/c" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "Text-Fabric: Text-Fabric API 10.2.0, etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
\n", + "
Parallel Passages\n", + "
\n", "\n", "
\n", - "
\n", - "tab\n", + "
\n", + "crossref\n", "
\n", "
int
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:57Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", + " \n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + "
\n", + "
\n", "\n", - "
\n", - "
\n", - "
\n", + "
ch-jensen/participants/actor/tf\n", + "
\n", "\n", "
\n", "
\n", - "trailer\n", + "actor\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:27Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " Participant references for words, subphrases and phrases. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "trailer_utf8\n", + "prs_actor\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:28Z
\n", - "
\n", + " Participant references for pronominal suffixes. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "coref\n", "
\n", + "
none
\n", "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", + " Edges to co-referring actors on chapter-level. The references are adapted from Eep Talstra's work on participant tracking. http://doi.org/10.5281/zenodo.1479491\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + "
\n", + "
\n", "\n", - "
\n", - "
\n", - "
\n", + "
BHSA = Biblia Hebraica Stuttgartensia Amstelodamensis\n", + "
\n", "\n", "
\n", "
\n", - "txt\n", + "book\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:58Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "typ\n", + "book@ll\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:58Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "uvf\n", - "
\n", - "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:07:59Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "chapter\n", "
\n", + "
int
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " \n", "\n", - "
\n", - "
version:
\n", - "
c
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "code\n", "
\n", + "
int
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "vbe\n", + "det\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:08:00Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "vbs\n", + "domain\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:08:00Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "verse\n", + "freq_lex\n", "
\n", "
int
\n", - "
\n", - " \n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:08:01Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "voc_lex\n", + "function\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2019-01-31T17:40:54Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "voc_lex_utf8\n", + "g_cons\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2019-01-31T17:40:55Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "vs\n", + "g_cons_utf8\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", - "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:08:01Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", - "\n", - "
\n", - "
version:
\n", - "
c
\n", - "
\n", - "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", - "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "vt\n", + "g_lex\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", - "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", - "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:08:02Z
\n", - "
\n", - "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", - "
\n", - "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " \n", "\n", - "
\n", - "
version:
\n", - "
c
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "g_lex_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", - "
\n", - "mother\n", - "
\n", - "
none
\n", - "
\n", - " \n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", - "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", + "
\n", + "g_word\n", "
\n", + "
str
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", - "
\n", + " \n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:08:09Z
\n", "
\n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", + "
\n", + "
\n", + "g_word_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", - "
\n", + " \n", "\n", - "
\n", - "
version:
\n", - "
c
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "gloss\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", - "
\n", - "oslots\n", + "
\n", + "gn\n", "
\n", - "
none
\n", - "
\n", - " \n", - "
\n", + "
str
\n", "\n", - "
\n", - "
author:
\n", - "
Eep Talstra Centre for Bible and Computer
\n", - "
\n", + " \n", "\n", - "
\n", - "
dataset:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
datasetName:
\n", - "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", + "
\n", + "
\n", + "label\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:11:57Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
email:
\n", - "
shebanq@ancient-data.org
\n", "
\n", "\n", - "
\n", - "
encoders:
\n", - "
Constantijn Sikkel (QDF), and Dirk Roorda (TF)
\n", + "
\n", + "
\n", + "language\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
version:
\n", - "
c
\n", "
\n", "\n", - "
\n", - "
website:
\n", - "
https://shebanq.ancient-data.org
\n", + "
\n", + "
\n", + "lex\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "
\n", + "lex_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
\n", + " \n", "\n", - "
etcbc/lingo/heads/tf\n", - "
\n", + "
\n", "\n", "
\n", - "
\n", - "heads\n", + "
\n", + "ls\n", "
\n", - "
none
\n", - "
\n", - " \n", - "
\n", + "
str
\n", "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", + " \n", "\n", - "
\n", - "
coreVersion:
\n", - "
c
\n", "
\n", "\n", - "
\n", - "
created_by:
\n", - "
Cody Kingham
\n", + "
\n", + "
\n", + "nametype\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-11-06T14:47:00Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
source:
\n", - "
see the notebook at https://github.com/etcbc/lingo/heads
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "nme\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", - "
\n", - "noun_heads\n", + "
\n", + "nu\n", "
\n", - "
none
\n", - "
\n", - " \n", - "
\n", + "
str
\n", "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", + " \n", "\n", - "
\n", - "
coreVersion:
\n", - "
c
\n", "
\n", "\n", - "
\n", - "
created_by:
\n", - "
Cody Kingham
\n", + "
\n", + "
\n", + "number\n", "
\n", + "
int
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-11-06T14:47:01Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
source:
\n", - "
see the notebook at https://github.com/etcbc/lingo/heads
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "otype\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", - "
\n", - "prep_obj\n", + "
\n", + "pargr\n", "
\n", - "
none
\n", - "
\n", - " \n", - "
\n", + "
str
\n", "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", + " \n", "\n", - "
\n", - "
coreVersion:
\n", - "
c
\n", "
\n", "\n", - "
\n", - "
created_by:
\n", - "
Cody Kingham
\n", + "
\n", + "
\n", + "pdp\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-11-06T14:47:02Z
\n", "
\n", "\n", - "
\n", - "
source:
\n", - "
see the notebook at https://github.com/etcbc/lingo/heads
\n", + "
\n", + "
\n", + "pfm\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "
\n", + "prs\n", "
\n", + "
str
\n", "\n", - "
\n", - "
\n", + " \n", "\n", - "
Phonetic Transcriptions\n", - "
\n", + "
\n", "\n", "
\n", "
\n", - "phono\n", + "prs_gn\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
BHSA Data: Constantijn Sikkel; Phono Notebook: Dirk Roorda
\n", - "
\n", + " \n", "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
coreVersion:
\n", - "
_temp
\n", + "
\n", + "
\n", + "prs_nu\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:16:04Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
source:
\n", - "
Phono Notebook applied to BHSA Data
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "prs_ps\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "phono_trailer\n", + "ps\n", "
\n", "
str
\n", - "
\n", - " \n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
BHSA Data: Constantijn Sikkel; Phono Notebook: Dirk Roorda
\n", - "
\n", + " \n", "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
coreVersion:
\n", - "
_temp
\n", + "
\n", + "
\n", + "qere\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:16:04Z
\n", "
\n", "\n", - "
\n", - "
source:
\n", - "
Phono Notebook applied to BHSA Data
\n", + "
\n", + "
\n", + "qere_trailer\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", "
\n", "\n", - "
\n", - "
\n", + "
\n", + "
\n", + "qere_trailer_utf8\n", "
\n", + "
str
\n", "\n", - "
\n", - "
\n", + " \n", "\n", - "
etcbc/valence/tf\n", - "
\n", + "
\n", "\n", "
\n", "
\n", - "cfunction\n", + "qere_utf8\n", "
\n", "
str
\n", - "
\n", - " corrected phrase function, only present for phrases that were in a correction sheet\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
\n", - "
\n", + " \n", "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
coreVersion:
\n", - "
_temp
\n", + "
\n", + "
\n", + "rank_lex\n", "
\n", + "
int
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:17:06Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
method:
\n", - "
Generated blank correction and enrichment spreadsheets with selected clauses
\n", "
\n", "\n", - "
\n", - "
purpose:
\n", - "
Support the decision process of assigning valence to verbs
\n", + "
\n", + "
\n", + "rela\n", "
\n", + "
str
\n", "\n", - "
\n", - "
steps:
\n", - "
sheets filled out by researcher; read back in by program; generated new features based on contents
\n", - "
\n", + " \n", "\n", - "
\n", - "
title:
\n", - "
Correction and enrichment features
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "sp\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "f_correction\n", + "st\n", "
\n", "
str
\n", - "
\n", - " whether the phrase function has been manually corrected\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
\n", - "
\n", + " \n", "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
coreVersion:
\n", - "
_temp
\n", + "
\n", + "
\n", + "tab\n", "
\n", + "
int
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:17:06Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
method:
\n", - "
Generated blank correction and enrichment spreadsheets with selected clauses
\n", "
\n", "\n", - "
\n", - "
purpose:
\n", - "
Support the decision process of assigning valence to verbs
\n", + "
\n", + "
\n", + "trailer\n", "
\n", + "
str
\n", "\n", - "
\n", - "
steps:
\n", - "
sheets filled out by researcher; read back in by program; generated new features based on contents
\n", - "
\n", + " \n", "\n", - "
\n", - "
title:
\n", - "
Correction and enrichment features
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "trailer_utf8\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "grammatical\n", + "txt\n", "
\n", "
str
\n", - "
\n", - " constituent role main classification\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
\n", - "
\n", + " \n", "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
coreVersion:
\n", - "
_temp
\n", + "
\n", + "
\n", + "typ\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:17:07Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
method:
\n", - "
Generated blank correction and enrichment spreadsheets with selected clauses
\n", "
\n", "\n", - "
\n", - "
purpose:
\n", - "
Support the decision process of assigning valence to verbs
\n", + "
\n", + "
\n", + "uvf\n", "
\n", + "
str
\n", "\n", - "
\n", - "
steps:
\n", - "
sheets filled out by researcher; read back in by program; generated new features based on contents
\n", - "
\n", + " \n", "\n", - "
\n", - "
title:
\n", - "
Correction and enrichment features
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "vbe\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "lexical\n", + "vbs\n", "
\n", "
str
\n", - "
\n", - " additional lexical characteristics\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
\n", - "
\n", + " \n", "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
coreVersion:
\n", - "
_temp
\n", + "
\n", + "
\n", + "verse\n", "
\n", + "
int
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:17:07Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
method:
\n", - "
Generated blank correction and enrichment spreadsheets with selected clauses
\n", "
\n", "\n", - "
\n", - "
purpose:
\n", - "
Support the decision process of assigning valence to verbs
\n", + "
\n", + "
\n", + "voc_lex\n", "
\n", + "
str
\n", "\n", - "
\n", - "
steps:
\n", - "
sheets filled out by researcher; read back in by program; generated new features based on contents
\n", - "
\n", + " \n", "\n", - "
\n", - "
title:
\n", - "
Correction and enrichment features
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "voc_lex_utf8\n", "
\n", + "
str
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "original\n", + "vs\n", "
\n", "
str
\n", - "
\n", - " default value before enrichment logic has been applied\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
\n", - "
\n", + " \n", "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
coreVersion:
\n", - "
_temp
\n", + "
\n", + "
\n", + "vt\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:17:08Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
method:
\n", - "
Generated blank correction and enrichment spreadsheets with selected clauses
\n", "
\n", "\n", - "
\n", - "
purpose:
\n", - "
Support the decision process of assigning valence to verbs
\n", + "
\n", + "
\n", + "mother\n", "
\n", + "
none
\n", "\n", - "
\n", - "
steps:
\n", - "
sheets filled out by researcher; read back in by program; generated new features based on contents
\n", - "
\n", + " \n", "\n", - "
\n", - "
title:
\n", - "
Correction and enrichment features
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "oslots\n", "
\n", + "
none
\n", + "\n", + " \n", "\n", - "
\n", - "
\n", "
\n", "\n", + "
\n", + "
\n", + "\n", + "
etcbc/lingo/heads/tf\n", + "
\n", + "\n", "
\n", - "
\n", - "predication\n", + "
\n", + "heads\n", "
\n", - "
str
\n", - "
\n", - " verbal function main classification\n", - "
\n", + "
none
\n", "\n", - "
\n", - "
author:
\n", - "
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
\n", - "
\n", + " \n", "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
coreVersion:
\n", - "
_temp
\n", + "
\n", + "
\n", + "noun_heads\n", "
\n", + "
none
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:17:08Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
method:
\n", - "
Generated blank correction and enrichment spreadsheets with selected clauses
\n", "
\n", "\n", - "
\n", - "
purpose:
\n", - "
Support the decision process of assigning valence to verbs
\n", + "
\n", + "
\n", + "prep_obj\n", "
\n", + "
none
\n", "\n", - "
\n", - "
steps:
\n", - "
sheets filled out by researcher; read back in by program; generated new features based on contents
\n", - "
\n", + " \n", "\n", - "
\n", - "
title:
\n", - "
Correction and enrichment features
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + "
\n", + "
\n", "\n", - "
\n", - "
\n", - "\n", + "
Phonetic Transcriptions\n", + "
\n", "\n", "
\n", "
\n", - "s_manual\n", + "phono\n", "
\n", "
str
\n", - "
\n", - " whether the generated enrichment features have been manually changed\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
\n", - "
\n", + " \n", "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
coreVersion:
\n", - "
_temp
\n", + "
\n", + "
\n", + "phono_trailer\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:17:09Z
\n", - "
\n", + " \n", "\n", - "
\n", - "
method:
\n", - "
Generated blank correction and enrichment spreadsheets with selected clauses
\n", "
\n", "\n", - "
\n", - "
purpose:
\n", - "
Support the decision process of assigning valence to verbs
\n", - "
\n", + "
\n", + "
\n", "\n", - "
\n", - "
steps:
\n", - "
sheets filled out by researcher; read back in by program; generated new features based on contents
\n", - "
\n", + "
etcbc/valence/tf\n", + "
\n", "\n", - "
\n", - "
title:
\n", - "
Correction and enrichment features
\n", + "
\n", + "
\n", + "cfunction\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " corrected phrase function, only present for phrases that were in a correction sheet\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "semantic\n", + "f_correction\n", "
\n", "
str
\n", - "
\n", - " additional semantic characteristics\n", - "
\n", - "\n", - "
\n", - "
author:
\n", - "
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
\n", - "
\n", - "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", "\n", - "
\n", - "
coreVersion:
\n", - "
_temp
\n", - "
\n", + " whether the phrase function has been manually corrected\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:17:09Z
\n", "
\n", "\n", - "
\n", - "
method:
\n", - "
Generated blank correction and enrichment spreadsheets with selected clauses
\n", + "
\n", + "
\n", + "grammatical\n", "
\n", + "
str
\n", "\n", - "
\n", - "
purpose:
\n", - "
Support the decision process of assigning valence to verbs
\n", - "
\n", + " constituent role main classification\n", "\n", - "
\n", - "
steps:
\n", - "
sheets filled out by researcher; read back in by program; generated new features based on contents
\n", "
\n", "\n", - "
\n", - "
title:
\n", - "
Correction and enrichment features
\n", + "
\n", + "
\n", + "lexical\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " additional lexical characteristics\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "sense\n", + "original\n", "
\n", "
str
\n", - "
\n", - " sense label verb occurrences, computed by the flowchart algorithm, see https://github.com/ETCBC/valence/wiki/Legend\n", - "
\n", "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", - "
\n", + " default value before enrichment logic has been applied\n", "\n", - "
\n", - "
coreVersion:
\n", - "
_temp
\n", "
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:17:54Z
\n", + "
\n", + "
\n", + "predication\n", "
\n", + "
str
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", - "
\n", + " verbal function main classification\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", "
\n", - "valence\n", + "s_manual\n", "
\n", "
str
\n", - "
\n", - " verbal valence main classification\n", - "
\n", "\n", - "
\n", - "
author:
\n", - "
The content and nature of the features are by Janet Dyk, the workflow is by Dirk Roorda
\n", - "
\n", + " whether the generated enrichment features have been manually changed\n", "\n", - "
\n", - "
coreData:
\n", - "
BHSA
\n", "
\n", "\n", - "
\n", - "
coreVersion:
\n", - "
_temp
\n", + "
\n", + "
\n", + "semantic\n", "
\n", + "
str
\n", "\n", - "
\n", - "
dateWritten:
\n", - "
2018-10-08T15:17:09Z
\n", - "
\n", + " additional semantic characteristics\n", "\n", - "
\n", - "
method:
\n", - "
Generated blank correction and enrichment spreadsheets with selected clauses
\n", "
\n", "\n", - "
\n", - "
purpose:
\n", - "
Support the decision process of assigning valence to verbs
\n", + "
\n", + "
\n", + "sense\n", "
\n", + "
str
\n", "\n", - "
\n", - "
steps:
\n", - "
sheets filled out by researcher; read back in by program; generated new features based on contents
\n", - "
\n", + " sense label verb occurrences, computed by the flowchart algorithm, see https://github.com/ETCBC/valence/wiki/Legend\n", "\n", - "
\n", - "
title:
\n", - "
Correction and enrichment features
\n", "
\n", "\n", - "
\n", - "
writtenBy:
\n", - "
Text-Fabric
\n", + "
\n", + "
\n", + "valence\n", "
\n", + "
str
\n", + "\n", + " verbal valence main classification\n", "\n", - "
\n", - "
\n", "
\n", "\n", "
\n", @@ -10452,6 +3396,14 @@ ".ccon {\n", " background-color: yellow ! important;\n", "}\n", + ".ccon,.ccoff {\n", + " padding-right: 0.1rem;\n", + " padding-left: 0.1rem;\n", + "}\n", + ".ccline {\n", + " font-size: xx-large;\n", + " font-weight: bold;\n", + "}\n", "/* TF header */\n", "\n", "summary {\n", @@ -10678,7 +3630,7 @@ }, { "cell_type": "code", - "execution_count": 18, + "execution_count": 15, "metadata": {}, "outputs": [ { @@ -10687,7 +3639,7 @@ "415" ] }, - "execution_count": 18, + "execution_count": 15, "metadata": {}, "output_type": "execute_result" } @@ -10699,7 +3651,7 @@ }, { "cell_type": "code", - "execution_count": 19, + "execution_count": 16, "metadata": {}, "outputs": [ { @@ -10717,7 +3669,7 @@ " ('KHN', 33))" ] }, - "execution_count": 19, + "execution_count": 16, "metadata": {}, "output_type": "execute_result" } @@ -10735,7 +3687,7 @@ }, { "cell_type": "code", - "execution_count": 20, + "execution_count": 17, "metadata": {}, "outputs": [ { @@ -10744,7 +3696,7 @@ "{'phrase_atom', 'subphrase'}" ] }, - "execution_count": 20, + "execution_count": 17, "metadata": {}, "output_type": "execute_result" } @@ -10755,14 +3707,14 @@ }, { "cell_type": "code", - "execution_count": 21, + "execution_count": 18, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ - " 0.12s 2062 results\n" + " 0.08s 2062 results\n" ] } ], @@ -10783,14 +3735,14 @@ }, { "cell_type": "code", - "execution_count": 22, + "execution_count": 19, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ - " 0.17s 30 results\n" + " 0.10s 30 results\n" ] } ], @@ -10804,7 +3756,7 @@ }, { "cell_type": "code", - "execution_count": 23, + "execution_count": 20, "metadata": {}, "outputs": [ { @@ -10856,7 +3808,7 @@ }, { "cell_type": "code", - "execution_count": 24, + "execution_count": 21, "metadata": {}, "outputs": [ { @@ -10898,7 +3850,7 @@ }, { "cell_type": "code", - "execution_count": 25, + "execution_count": 22, "metadata": {}, 
"outputs": [], "source": [ @@ -10914,7 +3866,7 @@ }, { "cell_type": "code", - "execution_count": 26, + "execution_count": 23, "metadata": {}, "outputs": [ { @@ -10955,7 +3907,7 @@ }, { "cell_type": "code", - "execution_count": 27, + "execution_count": 24, "metadata": {}, "outputs": [ { @@ -11046,14 +3998,14 @@ }, { "cell_type": "code", - "execution_count": 28, + "execution_count": 25, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ - " 0.80s 30 results\n" + " 0.39s 30 results\n" ] } ], @@ -11070,7 +4022,7 @@ }, { "cell_type": "code", - "execution_count": 29, + "execution_count": 26, "metadata": {}, "outputs": [ { @@ -11158,7 +4110,7 @@ "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", - "version": "3.10.2" + "version": "3.10.4" }, "widgets": { "application/vnd.jupyter.widget-state+json": { diff --git a/tutorial/start.ipynb b/tutorial/start.ipynb index 791ad2e9..99cad8ca 100644 --- a/tutorial/start.ipynb +++ b/tutorial/start.ipynb @@ -161,7 +161,7 @@ "name": "stdout", "output_type": "stream", "text": [ - "This is Text-Fabric 9.3.2\n", + "This is Text-Fabric 9.5.2\n", "Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html\n", "\n", "122 features found and 0 ignored\n" @@ -170,7 +170,7 @@ { "data": { "text/html": [ - "Text-Fabric: Text-Fabric API 9.3.2, etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
\n", + "Text-Fabric: Text-Fabric API 9.5.2, etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
\n", "
Parallel Passages\n", "
\n", "\n", @@ -4514,7 +4514,7 @@ "output_type": "stream", "text": [ " 0.00s Counting nodes ...\n", - " 0.12s 1446831 nodes\n" + " 0.09s 1446831 nodes\n" ] } ], @@ -4741,12 +4741,12 @@ " | 0.00s 63717 sentences\n", " | 0.00s 64514 sentence_atoms\n", " | 0.01s 88131 clauses\n", - " | 0.01s 90704 clause_atoms\n", - " | 0.02s 253203 phrases\n", - " | 0.02s 267532 phrase_atoms\n", + " | 0.00s 90704 clause_atoms\n", + " | 0.01s 253203 phrases\n", + " | 0.01s 267532 phrase_atoms\n", " | 0.01s 113850 subphrases\n", - " | 0.03s 426590 words\n", - " 0.11s Done\n" + " | 0.02s 426590 words\n", + " 0.08s Done\n" ] } ],