I'm personally working on a similar project, I mentioned it breifly in another thread. A purely functional stack based language with static typeing shallow binding arrays[1] and primitives for chunking data structures into peices based on preproccessor commands like @cache-line-length, @simd-lanes, @logical-cores, etc. Also explicit inlining and cache prefeching @prefetch @inline, I've got some ideas for syntax but I don't want to get into it. Needless to say from the other thread ergonomics is a priority as is homoiconism.
I'm also looking into "goal-directed" control structures[2], lazy futures to allow for infinit datastructures and optimisation, aswell as some sort of concurrency model, maybe even a partitioned global address space[3], although I don't know enough to pich one let alone implement it. Whole things based primarily off of Joy[4], and Kitten[5].
It's going to be a little while before I can even really develope very much of it, but the plan is to start out with a simple metacircular compiler bootstraped off scheme when I finish SICP and then head off to the dragon book filling out the feature set along the way.
[1] http://home.pipeline.com/~hbaker1/ShallowArrays.html
[2] https://en.wikipedia.org/wiki/Icon_programming_language#Goal-directed_execution
[3] https://en.wikipedia.org/wiki/Partitioned_global_address_space
[4] http://www.kevinalbrecht.com/code/joy-mirror/joy.html
[5] http://kittenlang.org