Package schemax incorporates a powerful RFC 4512 parser, wrapped with convenient, reflective features for creating and interrogating directory schemas.
Forked from Jesse Coretta's go-schemax package.
Requires Go version 1.22 or higher.
The schemax package is available under the terms of the MIT license. For further details, see the LICENSE file within the root of the repository.
Two (2) releases are available for end-users:
Version | Notes |
---|---|
1.1.6 | Legacy, custom parser |
>= 1.5.0 | Current, ANTLR parser |
The goal of schemax has always been to provide a reliable parsing subsystem for directory schema definitions that allows transformation into usable Go objects.
The original design of schemax (version < 1.5.0) involved a custom-made parser. While this design performed remarkably well for years, it was not without its shortcomings.
The newly released build of schemax involves the import of an ANTLR4-based RFC 4512 lexer/parser solution. This is made possible using a newly released "sister" package -- go-antlr4512 -- which handles all of the low-level ANTLR actions such as tokenization.
Therefore, the new build of schemax is of a simpler fundamental design thanks to offloading the bulk of the parser to another package. This also keeps all code-grading penalties (due to ANTLR's characteristically high cyclomatic factors) confined elsewhere, and allows schemax to focus on extending the slick features users have come to expect.
Users who are only interested in tokenization and do not require the advanced features of this package should consider use of go-antlr4512 exclusively.
The (ANTLR) parsing subsystem imported by the aforementioned sister package is flexible in terms of the following:
- Presence of header, footer and line-terminating Bash comments surrounding a given definition is acceptable
  - Note that comments are entirely discarded by ANTLR
- Support for (escaped!) ' and \ characters within quoted strings ('this isn\'t a bad example')
- Support for linebreaks within definitions
- Definition prefixing allows variations of the standard RFC 4512 "labels" during file and directory parsing
  - "attributeTypes", "attributeType" and other variations are permitted for AttributeType definitions
- Definition delimitation -- using colon (:), equals (=) or whitespace (\t) of any sensible combination -- is permitted for the purpose of separating a definition prefix (label) from its definition statement
  - "attributeTypes: ...", "attributeType=...", "attributeType ..." are valid expressions
- Multiple files are joined using an ASCII #10 during directory parsing
  - Users need not worry about adding a trailing newline to each file to be read; schemax will do this for you if needed
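To make the label and delimiter rules above more concrete, here is a minimal, stdlib-only sketch. It is not how go-antlr4512 works internally (that is an ANTLR grammar); the `splitLabel` helper and its regular expression are hypothetical, and only the "attributeType"/"attributeTypes" labels are covered.

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

// labelRE is a hypothetical pattern (not taken from schemax or
// go-antlr4512). It matches an "attributeType" or "attributeTypes"
// label followed by any mix of colon, equals, space or tab.
var labelRE = regexp.MustCompile(`^\s*attributeTypes?[:= \t]+`)

// splitLabel strips a recognized label and its delimiter from line and
// returns the remaining definition statement. ok is false when no label
// was found, in which case the trimmed line is returned unchanged.
func splitLabel(line string) (def string, ok bool) {
	loc := labelRE.FindStringIndex(line)
	if loc == nil {
		return strings.TrimSpace(line), false
	}
	return strings.TrimSpace(line[loc[1]:]), true
}

func main() {
	for _, line := range []string{
		"attributeTypes: ( 2.5.4.3 NAME 'cn' )",
		"attributeType=( 2.5.4.3 NAME 'cn' )",
		"attributeType ( 2.5.4.3 NAME 'cn' )",
	} {
		def, ok := splitLabel(line)
		fmt.Println(ok, def)
	}
}
```

All three input lines reduce to the same definition statement, which is the behavior the list above describes for prefixed definitions.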
The legacy release branches of schemax did not offer a robust file and directory parsing solution. Instead, they focused on the byte representations of a given definition and the tokens derived therein, leaving it to the end-user to devise a delivery method.
The new (>=1.5.0) release branches introduce proper ParseRaw, ParseFile and ParseDirectory methods that greatly simplify use of this package in the midst of an established schema "library". For example:

    func main() {
        r := NewSchema()

        // Let's parse a directory into our
        // receiver instance of Schema (r).
        schemaDir := "/home/you/ds/schema"
        if err := r.ParseDirectory(schemaDir); err != nil {
            fmt.Println(err)
            return
        }

        // Check our definition counters
        fmt.Printf("%s", r.Counters())
        // Output:
        // LS: 67
        // MR: 44
        // AT: 131
        // MU: 29
        // OC: 39
        // DC: 1
        // NF: 1
        // DS: 1
    }

The ParseFile method operates identically to the above-demonstrated ParseDirectory method. In both cases, it is important to order the respective files and directories according to any applicable dependencies. In other words, if "fileB.schema" requires definitions from "fileA.schema", "fileA.schema" must be parsed first.
Sub-directories encountered shall be traversed indefinitely. The effective name of a given directory is not significant.
Files encountered through directory traversal shall only be read and parsed IF the extension is ".schema". This prevents other files -- such as text or README.md files -- from interfering with the parsing process needlessly.
An eligible schema file may contain one definition, or many. The effective name of an eligible schema file is significant, unlike directories. Each schema file must be named in a manner that fosters the correct ordering of dependent definitions -- whether or not subdirectories are involved. To offer a real-world example, the 389DS/Netscape schema directory deployed during a typical installation is defined and governed in a similar manner.
The general rule of thumb is that if the ls -l Bash command consistently lists the indicated schema files in the correct order, and assuming those files contain properly ordered and well-formed definitions, the parsing process should work nicely.
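The traversal rules described above (recursive descent, ".schema" files only, name-based ordering, newline joining) can be approximated with the standard library alone. The sketch below is an illustration under stated assumptions, not the schemax implementation: `isSchemaFile`, `joinSchemaData` and `readSchemaDir` are hypothetical helpers, and the lexical ordering of `filepath.WalkDir` may not match `ls` exactly under every locale.

```go
package main

import (
	"fmt"
	"io/fs"
	"os"
	"path/filepath"
	"strings"
)

// isSchemaFile reports whether name carries the ".schema" extension.
func isSchemaFile(name string) bool {
	return filepath.Ext(name) == ".schema"
}

// joinSchemaData concatenates file contents, making sure each chunk ends
// with a newline (ASCII #10) so adjacent definitions never run together.
func joinSchemaData(chunks []string) string {
	var b strings.Builder
	for _, c := range chunks {
		b.WriteString(c)
		if !strings.HasSuffix(c, "\n") {
			b.WriteByte('\n')
		}
	}
	return b.String()
}

// readSchemaDir walks root recursively. filepath.WalkDir visits entries
// in lexical order, so file names decide the order of the joined content.
func readSchemaDir(root string) (string, error) {
	var chunks []string
	err := filepath.WalkDir(root, func(path string, d fs.DirEntry, err error) error {
		if err != nil {
			return err
		}
		if d.IsDir() || !isSchemaFile(d.Name()) {
			return nil // descend into directories, skip non-schema files
		}
		data, err := os.ReadFile(path)
		if err != nil {
			return err
		}
		chunks = append(chunks, string(data))
		return nil
	})
	return joinSchemaData(chunks), err
}

func main() {
	// Placeholder path; substitute a real schema directory.
	joined, err := readSchemaDir("/home/you/ds/schema")
	if err != nil {
		fmt.Println(err)
		return
	}
	fmt.Printf("read %d bytes of schema data\n", len(joined))
}
```

A numeric file name prefix (for example "00-core.schema" before "10-app.schema") is one common way to make the lexical order match the dependency order.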
Alternatively, the ParseRaw method is ideal for parsing []byte instances that have already been read from the filesystem in some manner, or written "in-line" such as for unit testing.
The Schema type defined within this package is a stackage.Stack derivative type. An instance of a Schema can manifest in any of the following manners:
- As an empty (unpopulated) Schema, initialized by way of the NewEmptySchema function
- As a basic (minimally populated) Schema, initialized by way of the NewBasicSchema function
- As a complete (fully populated) Schema, initialized by way of the NewSchema function
There are certain scenarios which call for one of the above initialization procedures:
- An empty Schema is ideal for LDAP professionals, and allows for the creation of a Schema of particularly narrow focus for R&D, testing or product development
- A basic Schema resembles the foundational (starting) Schema context observed in most directory server products, in that it comes "pre-loaded" with official LDAPSyntax and MatchingRule definitions -- but few to no AttributeTypes -- making it a most suitable empty canvas upon which a new Schema may be devised from scratch
- A full Schema is the most obvious choice for "Quick Start" scenarios, in that a Schema is produced containing a very large portion of the standard AttributeType and ObjectClass definitions used in the wild by most (if not all) directory products
Regardless of the content present, a given Schema is capable of storing definitions from all eight (8) RFC 4512 "categories". These are known as "collections", and are stored in nested stackage.Stack derivative types, accessed using any of the following methods:
- Schema.LDAPSyntaxes
- Schema.MatchingRules
- Schema.AttributeTypes
- Schema.MatchingRuleUses
- Schema.ObjectClasses
- Schema.DITContentRules
- Schema.NameForms
- Schema.DITStructureRules
Definition instances produced by way of parsing -- namely using one of the Schema.Parse<Type> methods -- will automatically gain internal access to the Schema instance in which they are stored.
However, definitions produced manually by way of the various Set<Item> methods, or by way of a localized Parse method extended through types defined within this package, will require manual execution of the SetSchema method, using the intended Schema instance as the input argument. Ideally this should occur early in the definition composition.
In either case, this internal reference is used for seamless verification of any reference, such as an LDAPSyntax, when introduced to a given type instance. This ensures definition pointer references remain valid.
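The role of that internal reference can be illustrated with a small, self-contained sketch. Every type and method below is a hypothetical stand-in written for this example; none of it is the schemax API, and the real Set<Item> methods may behave differently (for instance, by not returning an error).

```go
package main

import (
	"errors"
	"fmt"
)

// schema is a hypothetical stand-in holding known syntaxes,
// keyed by numeric OID.
type schema struct {
	syntaxes map[string]string
}

// attrType is a hypothetical definition carrying a back-reference
// to the schema it belongs to.
type attrType struct {
	schema *schema
	syntax string
}

// SetSchema stores the back-reference used for later lookups.
func (a *attrType) SetSchema(s *schema) *attrType {
	a.schema = s
	return a
}

// SetSyntax accepts oid only if the associated schema knows about it.
func (a *attrType) SetSyntax(oid string) error {
	if a.schema == nil {
		return errors.New("no schema associated; call SetSchema first")
	}
	if _, found := a.schema.syntaxes[oid]; !found {
		return fmt.Errorf("unknown syntax %q", oid)
	}
	a.syntax = oid
	return nil
}

func main() {
	s := &schema{syntaxes: map[string]string{
		"1.3.6.1.4.1.1466.115.121.1.15": "Directory String",
	}}

	var at attrType
	// Fails: the definition has no schema to verify against yet.
	fmt.Println(at.SetSyntax("1.3.6.1.4.1.1466.115.121.1.15"))
	// Succeeds once the schema reference is in place.
	fmt.Println(at.SetSchema(s).SetSyntax("1.3.6.1.4.1.1466.115.121.1.15"))
}
```

This is why associating the schema early matters: any reference introduced before the association cannot be verified.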
This package is closure-friendly with regards to user-authored closure functions or methods meant to perform specific tasks:
- Assertion matching, by way of an instance of MatchingRule applicable to two assertion values within an AssertionMatcher closure (i.e.: is "value1" equal to "value2"?)
- Syntax qualification, by way of an instance of LDAPSyntax to be honored by a value within a SyntaxQualifier closure (i.e.: does value qualify for specified syntax?)
- General-use value qualification, by way of an instance of AttributeType to be analyzed in specialized scenarios within a ValueQualifier closure (i.e.: company/user-specific value processing)
- Definition string representation, through assignment of a custom Stringer closure to eligible definition instances
Understand that assertion, syntax and general-use qualifying closures are entirely user-defined; this package does not provide such predefined instances itself, leaving that to the user or another package which may be imported and used in a "pluggable" manner in this context.
See RFC 4517, et al, for some practical guidelines relating to certain syntax and assertion matching procedures that may guide users in creating such closures.
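As a rough idea of what such user-authored closures might look like, consider the sketch below. The closure type signatures are assumptions made for this example (the actual AssertionMatcher and SyntaxQualifier signatures should be taken from the package documentation), and the matching logic is deliberately simplified: a faithful caseIgnoreMatch would also apply the RFC 4518 string preparation steps.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// Hypothetical closure shapes; the real schemax signatures may differ.
type (
	assertionMatcher func(a, b any) error
	syntaxQualifier  func(v any) error
)

// caseIgnore is a simplified equality assertion loosely modeled on
// caseIgnoreMatch: it compares two strings without regard to letter
// case or surrounding whitespace.
var caseIgnore assertionMatcher = func(a, b any) error {
	s1, ok1 := a.(string)
	s2, ok2 := b.(string)
	if !ok1 || !ok2 {
		return errors.New("assertion values must be strings")
	}
	if !strings.EqualFold(strings.TrimSpace(s1), strings.TrimSpace(s2)) {
		return errors.New("values do not match")
	}
	return nil
}

// booleanSyntax accepts only the two values permitted by the
// RFC 4517 Boolean syntax: "TRUE" and "FALSE".
var booleanSyntax syntaxQualifier = func(v any) error {
	s, ok := v.(string)
	if !ok || (s != "TRUE" && s != "FALSE") {
		return fmt.Errorf("%v is not a valid Boolean", v)
	}
	return nil
}

func main() {
	fmt.Println(caseIgnore("Jesse", "jesse") == nil)
	fmt.Println(booleanSyntax("TRUE") == nil)
	fmt.Println(booleanSyntax("yes") == nil)
}
```

Returning an error rather than a bool is a design choice made here so that a failed match can explain itself; a closure of this kind would then be assigned to the relevant definition instance.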
This package does, however, include a default Stringer, which can be invoked for an instance simply by running the instance's SetStringer method in niladic form.
This package extends fluent methods that are write-based in nature. Typically these methods are prefaced with Set or Push. This means such methods may be "chained" together using the standard Go "." delimiter.
Fluency does not extend to methods that are interrogative in nature, in that they return bool, string or error values. Fluency also precludes use of the Registration interface due to unique return signatures.
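The distinction between fluent (write-based) and interrogative methods can be sketched with a toy type. The `attrType` type and its methods below are hypothetical and exist only to show the chaining pattern; they are not the schemax AttributeType.

```go
package main

import "fmt"

// attrType is a hypothetical stand-in for a definition type.
type attrType struct {
	oid   string
	names []string
	desc  string
}

// Write-based methods return the receiver so calls can be chained.
func (a *attrType) SetNumericOID(oid string) *attrType {
	a.oid = oid
	return a
}

func (a *attrType) SetName(n ...string) *attrType {
	a.names = append(a.names, n...)
	return a
}

func (a *attrType) SetDescription(d string) *attrType {
	a.desc = d
	return a
}

// IsZero is interrogative: it returns a bool, which ends any chain.
func (a *attrType) IsZero() bool {
	return a == nil || a.oid == ""
}

func main() {
	at := new(attrType).
		SetNumericOID("2.5.4.3").
		SetName("cn", "commonName").
		SetDescription("common name")

	fmt.Println(at.IsZero(), at.names, at.desc)
}
```

Because each Set method hands back the same pointer, a definition can be composed in a single expression, while a call such as IsZero can only appear at the end.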
The following table describes the contents and coverage of the so-called "built-in" schema definitions, all of which are sourced from recognized RFCs only. These can be imported en masse by users, or in piece-meal fashion.
DOCUMENT | LS | MR | AT | OC | DC | NF | DS |
---|---|---|---|---|---|---|---|
✅ | ✅ | ✅ | ✅ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | |
ⁿ/ₐ | ⁿ/ₐ | ✅ | ✅ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | |
ⁿ/ₐ | ⁿ/ₐ | ✅ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | |
ⁿ/ₐ | ⁿ/ₐ | ✅ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | |
ⁿ/ₐ | ⁿ/ₐ | ✅ | ✅ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | |
ⁿ/ₐ | ⁿ/ₐ | ✅ | ✅ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | |
ⁿ/ₐ | ⁿ/ₐ | ✅ | ✅ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | |
✅ | ✅ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | |
ⁿ/ₐ | ⁿ/ₐ | ✅ | ✅ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | |
✅ | ✅ | ✅ | ✅ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | |
ⁿ/ₐ | ⁿ/ₐ | ✅ | ✅ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | |
✅ | ✅ | ✅ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | |
ⁿ/ₐ | ⁿ/ₐ | ✅ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ | ⁿ/ₐ |