performance improvements

- revision of the memory subsystem initialization to avoid false sharing
- implementing the spinlock with an exponential backoff
- improving the performance of basic functions (e.g. memcpy)
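
As a rough illustration of the first two items, the following is a minimal sketch of a spinlock with exponential backoff that is also aligned to a cache line, so the lock word does not share a line with neighbouring data. It is not the code from this tag: the names `spinlock_t`, `cpu_relax`, `CACHE_LINE_SIZE` and `BACKOFF_MAX` are hypothetical, the 64-byte line size and the backoff limit are assumptions, and the inline assembly assumes GCC or Clang.

```c
#include <assert.h>
#include <stdalign.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

/* Assumed cache-line size; 64 bytes is typical for x86-64 but not universal. */
#define CACHE_LINE_SIZE 64
/* Hypothetical upper bound on the number of pause iterations per retry. */
#define BACKOFF_MAX 1024u

/* Aligning the lock to a cache line keeps unrelated data off its line,
 * which is one common way to avoid false sharing. */
typedef struct {
    alignas(CACHE_LINE_SIZE) atomic_flag locked;
} spinlock_t;

static_assert(alignof(spinlock_t) == CACHE_LINE_SIZE,
              "lock should be aligned to one cache line");

/* Hint to the CPU that we are busy-waiting (GCC/Clang syntax assumed). */
static inline void cpu_relax(void)
{
#if defined(__x86_64__) || defined(__i386__)
    __asm__ volatile("pause" ::: "memory");
#else
    __asm__ volatile("" ::: "memory");
#endif
}

static inline void spinlock_init(spinlock_t *lock)
{
    atomic_flag_clear(&lock->locked);
}

/* Returns true if the lock was acquired, false if it is already held. */
static inline bool spinlock_trylock(spinlock_t *lock)
{
    return !atomic_flag_test_and_set_explicit(&lock->locked,
                                              memory_order_acquire);
}

static inline void spinlock_lock(spinlock_t *lock)
{
    uint32_t delay = 1;

    while (!spinlock_trylock(lock)) {
        /* Wait for `delay` pause iterations before touching the lock again. */
        for (uint32_t i = 0; i < delay; i++)
            cpu_relax();
        /* Double the waiting time after each failed attempt, up to a cap. */
        if (delay < BACKOFF_MAX)
            delay *= 2;
    }
}

static inline void spinlock_unlock(spinlock_t *lock)
{
    atomic_flag_clear_explicit(&lock->locked, memory_order_release);
}
```

The doubling delay reduces the number of atomic operations that contending cores issue on the same cache line, which is the usual motivation for a backoff. A suitable cap depends on the hardware and the workload.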