Hey, I know I'm (extremely) late to this party but had a similar idea after seeing SolReSol and Moss (
http://www.thelanguageofmoss.com/) and wondering if both could be combined, and I feel that this is a great idea!
What I think could make it even better would be an entrance tone to clarify where the pitch center is (just a "nucleus" tone, like a C) and then a way to signify the end/beginning of a word. Possibly could be done with articulation of the final note considering that there is never successive consonants/vowels. (EDIT: One could also end with a simple half-step slurred down from the final letter, signifying the end of a sentence, and a half-step slurred up, signifying the natural vocal inflections one would have for a question.)
If anybody is interested in screwing around with this, I'd be glad to talk with any of you!
EDIT: Also, to clarify further the understanding of vowel/consonant placement, you could slur all consonant-vowel movements, which would start to group together letter pairs (wa-, te-, etc) together in the listener's ear.