Hard-coding neural networks is a hefty task for ML engineers, especially when they have to train a myriad of networks in search of the best hyperparameters. We have greatly simplified this task by allowing the user to rapidly prototype and train networks using plain English.

What it does

In short, NeuNet is a Python utility that lets the user construct neural networks using their voice or plain text, which speeds up rapid prototyping and hyperparameter optimization. Simply describe the parameters of your model in plain English and NeuNet will build and train the network for you.

Example: “Make me a UNet with input size of 192 pixels and 4 output classes. Use ReLU nonlinearities and train the model on 25 epochs of the buildings dataset.”

How we built it

NeuNet uses the Google Cloud Speech-to-Text API to transcribe the user's voice input into a string. The model and training parameters are then extracted from that string using natural language processing techniques. NeuNet uses these parameters to dynamically build and train a corresponding neural network in PyTorch, and displays the model and training metrics in the terminal window.
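To make the extraction step concrete, here is a minimal sketch of how parameters might be pulled out of a request string. It is not NeuNet's actual parser: the function name `parse_request`, the keyword tables, and the regular expressions are all assumptions chosen to fit the example request above, and it uses only Python's standard library.

```python
import re

# Hypothetical keyword tables; the names NeuNet actually recognizes are not
# listed in this writeup.
ARCHITECTURES = ("unet", "resnet", "mlp")
ACTIVATIONS = ("relu", "sigmoid", "tanh")


def parse_request(text):
    """Extract model and training parameters from a plain-English request."""
    lowered = text.lower()
    params = {}

    # Take the first known architecture / activation keyword found as a whole word.
    for key, vocabulary in (("architecture", ARCHITECTURES), ("activation", ACTIVATIONS)):
        for word in vocabulary:
            if re.search(rf"\b{word}\b", lowered):
                params[key] = word
                break

    # Numeric fields: each (assumed) pattern captures a single integer.
    numeric_patterns = {
        "input_size": r"input size of (\d+)",
        "num_classes": r"(\d+) output classes",
        "epochs": r"(\d+) epochs",
    }
    for key, pattern in numeric_patterns.items():
        match = re.search(pattern, lowered)
        if match:
            params[key] = int(match.group(1))

    # Dataset name: the word immediately before "dataset".
    match = re.search(r"(\w+) dataset", lowered)
    if match:
        params["dataset"] = match.group(1)

    return params
```

Calling `parse_request` on the example request yields a dictionary with the architecture, activation, input size, class count, epoch count, and dataset name, which a model builder could then consume. Fixed patterns like these are brittle, so a real parser would likely need to tolerate more varied phrasing.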

Challenges we ran into

The Google Cloud Speech-to-Text API is not perfect at speech recognition. For example, the number 4 is pronounced the same as “for”, so we had to account for homophones like this when interpreting the transcript.
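One way to handle this kind of homophone is sketched below. This is an illustrative heuristic, not the correction NeuNet actually applies: the `HOMOPHONES` table, the `UNIT_WORDS` set, and the function name `fix_number_homophones` are all assumptions. The idea is to replace a number-sounding word only when the next word suggests a quantity is expected.

```python
# Hypothetical homophone table; the corrections NeuNet applies are not
# described in this writeup.
HOMOPHONES = {"for": "4", "four": "4", "to": "2", "too": "2", "two": "2", "one": "1"}

# Assumed vocabulary: words that suggest the preceding token should be a number.
UNIT_WORDS = {"epoch", "epochs", "class", "classes", "layer", "layers", "pixels", "output"}


def fix_number_homophones(transcript):
    """Replace number homophones with digits when a unit word follows them."""
    tokens = transcript.split()
    fixed = []
    for i, token in enumerate(tokens):
        # Look one word ahead, ignoring case and trailing punctuation.
        next_word = tokens[i + 1].lower().strip(".,") if i + 1 < len(tokens) else ""
        if token.lower() in HOMOPHONES and next_word in UNIT_WORDS:
            fixed.append(HOMOPHONES[token.lower()])
        else:
            fixed.append(token)
    return " ".join(fixed)
```

With this sketch, “a UNet with for output classes” becomes “a UNet with 4 output classes”, while the “for” in “train for 25 epochs” is left alone because it is followed by a digit rather than a unit word. A one-word lookahead can still misfire (for instance on “to output”), so it is only a starting point.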

Accomplishments that we're proud of

We’re proud of integrating Google’s Speech-to-Text API and of successfully building neural networks from very high-level, plain-English descriptions.

What we learned

We learned how to make good use of online tutorials and Stack Overflow when debugging. We also learned basic HTML, CSS, and JavaScript to implement the design of our website’s frontend.

What's next for NeuNet

We’re aiming to make NeuNet’s speech recognition more accurate. We also hope to one day expand the frontend into a full-fledged website with captivating design and dynamic user interaction.
