Hey! I work at Xmartlabs and we have built Bender: a library for building and running neural networks on iOS.
As it uses Metal under the hood, it takes advantage of the device GPU, so it runs fast and supports real-time applications.
It also lets you run TensorFlow models, so it should be easy to run your existing model.
If you guys could provide feedback, it would be cool!