It's all about memory. So far, so good, and so nice and simple. But the memory for those tags has to come from somewhere, so we use malloc, and malloc, internally, is not exactly simple. That turned out to be our performance bottleneck.
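
To make the pattern concrete, here is a minimal sketch in C of allocating tags one at a time with malloc. The struct layout and the names `tag_new`, `tag_list_build`, `tag_list_len`, and `tag_list_free` are hypothetical, since the talk does not show the actual code; the only point is that every tag costs a separate call into the allocator.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical tag layout; the real fields are not given in the talk. */
struct tag {
    char name[16];
    struct tag *next;
};

/* Allocate a single tag with its own malloc call. Returns NULL on failure. */
static struct tag *tag_new(const char *name, struct tag *next)
{
    assert(name != NULL);
    struct tag *t = malloc(sizeof *t);
    if (t == NULL)
        return NULL;
    /* Copy at most 15 characters and always terminate the string. */
    strncpy(t->name, name, sizeof t->name - 1);
    t->name[sizeof t->name - 1] = '\0';
    t->next = next;
    return t;
}

/* Build a linked list of n tags: n separate trips into the allocator. */
static struct tag *tag_list_build(size_t n)
{
    struct tag *head = NULL;
    for (size_t i = 0; i < n; i++) {
        struct tag *t = tag_new("tag", head);
        if (t == NULL)
            break; /* out of memory: return what was built so far */
        head = t;
    }
    return head;
}

/* Count the tags in a list. */
static size_t tag_list_len(const struct tag *t)
{
    size_t n = 0;
    for (; t != NULL; t = t->next)
        n++;
    return n;
}

/* Release every tag: one free call per earlier malloc call. */
static void tag_list_free(struct tag *head)
{
    while (head != NULL) {
        struct tag *next = head->next;
        free(head);
        head = next;
    }
}
```

Building a list of n tags this way means n calls to malloc and, later, n calls to free, so whatever bookkeeping the allocator does internally is paid once per tag.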