Where can I find documentation of the file formats used? Unfortunately, I can find documentation neither for the `.gr` files in the repo nor for the file formats generated by `WriteGrammarToTextFile`, such as `.grammar` (as described in the README).
Though I can guess most of what's in the `.grammar` files, I'm still a bit puzzled by the content of `arb_sm5.grammar`.
Some transitions starting at ROOT_0 are duplicated, e.g.:
ROOT_0 -> ROOT_0 1.0
ROOT_0 -> ROOT_0 1.0
What does this mean? Can these duplicates be ignored, or do the two weights of 1.0 add up, so that the transitions above are equivalent to a single transition with weight 2.0?
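To make the two readings concrete, here is a minimal Python sketch of how a reader of such a file might detect duplicated transitions and compute what the summed weights would be. It assumes each line has the layout `LHS -> RHS... WEIGHT`, which is only inferred from the two lines quoted above; `parse_rule` and `summarize` are hypothetical helper names and not part of the parser's code.

```python
from collections import Counter, defaultdict


def parse_rule(line):
    """Split 'LHS -> RHS... WEIGHT' into ((lhs, rhs_tuple), weight).

    The layout is an assumption based on the sample lines only.
    """
    lhs, rest = line.split("->", 1)
    *rhs, weight = rest.split()
    return (lhs.strip(), tuple(rhs)), float(weight)


def summarize(lines):
    """Return (rules occurring more than once, weight summed per rule)."""
    counts = Counter()
    summed = defaultdict(float)
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        rule, weight = parse_rule(line)
        counts[rule] += 1
        summed[rule] += weight
    duplicates = {rule: n for rule, n in counts.items() if n > 1}
    return duplicates, dict(summed)


sample = ["ROOT_0 -> ROOT_0 1.0", "ROOT_0 -> ROOT_0 1.0"]
duplicates, summed = summarize(sample)
print(duplicates)
print(summed)
```

Under the "ignore duplicates" reading only `duplicates` matters and one copy of each rule is kept; under the "weights add up" reading `summed` would be the effective grammar. Which of the two is intended is exactly what I'd like to know.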
Regarding the content of `arb_sm5.grammar`, I'm also wondering:
- Does `@` have a special meaning, or is it just an ordinary character in names? Is there a difference between `@..` names and non-`@` names?
- Does the `$_1`/`$_0` suffix have a special meaning?

(I also couldn't find any notes on the file format in the COLING-ACL 2006 and HLT-NAACL 2007 publications mentioned in the README.)
The reason I am asking is that I'm considering supporting `.gr` or `.grammar` input files in my own project, CoPaR.