Choose a configuration approach for Aglyph¶
Release: | 3.0.0.post1 |
---|
Aglyph explicitly supports two methods of configuration:
- Declarative XML configuration conforming to the
Aglyph context DTD
- Programmatic configuration via The Aglyph Context fluent API
Opinions vary widely on the merits of XML (particularly for configuration), but in fairness there’s also plenty of debate over the merits of “code as configuration” (or “configuration as code” if you prefer) as well. Aglyph strives to not have an opinion one way or the other by supporting either approach.
However, both approaches to Aglyph configuration have strengths and weaknesses that you should understand before choosing one over the other.
Declarative XML configuration¶
An Aglyph context can be defined in an XML document that conforms to the
Aglyph context DTD
.
The XML context document is parsed by aglyph.context.XMLContext
, which
is just a subclass of aglyph.context.Context
that knows how to
populate itself from the parsed XML document. Once populated, this context
can then be used to create an aglyph.assembler.Assembler
:
>>> from aglyph.assembler import Assembler
>>> from aglyph.context import XMLContext
>>> my_app_context = XMLContext("my-application-context.xml")
>>> assembler = Assembler(my_app_context)
Note
The
Aglyph context DTD
is
provided primarily as a reference. The aglyph.context.XMLContext
class uses a non-validating parser by default.
Developers are encouraged to explicitly validate an application’s context XML document during testing.
See also
- Use a custom XML parser for XMLContext
- This recipe could also be used to force Aglyph to use a validating XML parser.
XMLContext configures mutable builtin objects safely¶
Consider the following example:
<?xml version="1.0" encoding="utf-8"?>
<context id="cookbook">
<component id="cookbook.Example">
<attributes>
<attribute name="mutable">
<list>
<int>1</int>
<int>2</int>
<int>3</int>
</list>
</attribute>
</attributes>
</component>
</context>
Because builtin list
objects are mutable, Aglyph will automatically turn
the “mutable” attribute above into an aglyph.component.Evaluator
(which is very similar to a functools.partial
). Whenever the
“cookbook.Example” component is assembled, the Evaluator
for the
“mutable” attribute is called, which will produce a new list
object.
Why is this important? Consider a corresponding programmatic configuration for the same component:
context.prototype("cookbook.Example").set(mutable=[1, 2, 3]).register()
This configuration leads to a (likely) logic error: all objects of the “cookbook.Example” component will share a reference to a single list object. An example illustrates the problem:
>>> example1 = assembler.assemble("cookbook.Example")
>>> example1.mutable
[1, 2, 3]
>>> example1.mutable.append(4)
>>> example2 = assembler.assemble("cookbook.Example")
>>> example2.mutable
[1, 2, 3, 4]
Uh-oh! That’s almost certainly not what we intended. To guard against this behavior, we would need to modify the binding:
from functools import partial
context.prototype("cookbook.Example").set(mutable=partial(list, [1, 2, 3])).register()
Now we will get a “fresh” list every time the component is assembled, so modifying the list on one instance will not affect the lists of any other instances.
(And what if we were actually specifying a list-of-list, or a tuple-of-list, or a list-of-dict? Now we would need to account for mutability of each member!)
This is an easy thing to forget, and can lead to a great deal of (programmatic)
configuration code, which is why
aglyph.context.XMLContext
handles it automatically for any <list>
,
<tuple>
, and <dict>
declared in the XML context document.
XMLContext is Unicode-aware and supports automatic character set conversion¶
Aglyph properly handles Unicode text and encoded byte data in XML context documents, regardless of Python version.
Aglyph can also provide your application components with byte data encoded to a user-specified character set.
Consider the following example:
<?xml version="1.0" encoding="utf-8"?>
<context id="cookbook">
<component id="cookbook.TextAndData">
<attributes>
<attribute name="text">
<unicode>ΑΦΔ</unicode>
</attribute>
<attribute name="data1">
<bytes>ΑΦΔ</bytes>
</attribute>
<attribute name="data2">
<bytes encoding="iso-8859-7">ΑΦΔ</bytes>
</attribute>
</attributes>
</component>
</context>
The first thing to notice is that <bytes>ΑΦΔ</bytes>
is missing a character
encoding. This can be problematic on Python 2, because the default string
encoding used by the Unicode implementation is typically ASCII:
$ python2.7
Python 2.7.9 (default, Dec 13 2014, 15:13:49)
[GCC 4.2.1 Compatible Apple LLVM 6.0 (clang-600.0.56)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> import sys
>>> sys.getdefaultencoding()
'ascii'
>>> from aglyph.context import XMLContext
>>> context = XMLContext("cookbook-context.xml")
Traceback (most recent call last):
...
UnicodeEncodeError: 'ascii' codec can't encode characters in position 0-2: ordinal not in range(128)
One solution would be to add the encoding=
attribute. Alternatively, you
can instruct XMLContext
to use a different default encoding (it uses the
value of sys.getdefaultencoding()
by default):
$ python2.7
Python 2.7.9 (default, Dec 13 2014, 15:13:49)
[GCC 4.2.1 Compatible Apple LLVM 6.0 (clang-600.0.56)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> from aglyph.assembler import Assembler
>>> from aglyph.context import XMLContext
>>> context = XMLContext("cookbook-context.xml", default_encoding="UTF-8")
>>> assembler = Assembler(context)
>>> text_and_data = assembler.assemble("cookbook.TextAndData")
>>> text_and_data.text
u'\u0391\u03a6\u0394'
>>> text_and_data.data1
'\xce\x91\xce\xa6\xce\x94'
>>> text_and_data.data2
'\xc1\xd6\xc4'
If we run the same example under Python 3 (which uses “UTF-8” as the default
encoding), we still get correct results, but without the need to explicitly set
the default encoding on the XMLContext
:
$ python3.4
Python 3.4.2 (default, Nov 12 2014, 18:23:59)
[GCC 4.2.1 Compatible Apple LLVM 6.0 (clang-600.0.54)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> from aglyph.assembler import Assembler
>>> from aglyph.context import XMLContext
>>> context = XMLContext("cookbook-context.xml")
>>> assembler = Assembler(context)
>>> text_and_data = assembler.assemble("cookbook.TextAndData")
>>> text_and_data.text
'ΑΦΔ'
>>> text_and_data.data1
b'\xce\x91\xce\xa6\xce\x94'
>>> text_and_data.data2
b'\xc1\xd6\xc4'
One important thing to notice is the difference in the types of the Unicode and byte strings, dependent upon which version of Python is used.
Unicode and character encoding differences between Python 2 and Python 3¶
The builtin str
type has changed significantly between Python 2 and
Python 3 (see Text Vs. Data Instead Of Unicode Vs. 8-bit).
In short: str
represented encoded byte data up to and including
Python 2, but representes Unicode text as of Python 3.0:
Version | Unicode text | Encoded byte data |
---|---|---|
Python 2 | unicode |
str |
Python 3 | str |
bytes |
The Aglyph context DTD
defines <bytes>
, <str>
,
and <unicode>
elements that correspond to the types in the table above, but
treats the element content differently depending on the version of Python under
which Aglyph is running:
Version | <unicode> content |
<str> content |
<bytes> content |
---|---|---|---|
Python 2 | unicode |
str |
str |
Python 3 | str |
str |
bytes |
To summarize the above:
<unicode>
is interpreted as aunicode
type in Python 2 and astr
type in Python 3<str>
is always interpreted as astr
type<bytes>
is interpreted as astr
type in Python 2 and abytes
type in Python 3
Note
For clarity in XML context documents, it is always safe to use <bytes>
for encoded byte data and <unicode>
for Unicode text (regardless of
Python version), avoiding entirely the ambiguity of <str>
.
Warning
Althoug the DTD permits an encoding=”…” attribute on <str>
elements,
the attribute is ignored in Python 3 (a WARNING-level log message is
emitted to the aglyph.context.XMLContext channel if it is present).
Programmatic configuration using the Context fluent API¶
New in version 3.0.0.
Objects of aglyph.context.Context
now support a chained call “fluent” API.
Please refer to The Aglyph Context fluent API for details.
Custom configuration using Context¶
Do neither declarative XML nor fluent configuration suit your fancy?
An aglyph.context.Context
is just a dict
that maps component ID
strings (i.e. aglyph.component.Component.component_id
) to
aglyph.component.Component
instances, so embrace the open source
philosophy and “roll your own” configuration mechanism!
Note
Look at aglyph.context.XMLContext
for a starting point - it’s just a
aglyph.context.Context
subclass that populates itself from a parsed
XML document.