We designed a fully integrated speech-to-STL model generator with an interactive GUI, along with a custom natural language processor that parses useful information from the transcribed text. To generate the 3D objects, we make calls to the Fusion 360 API.
Our algorithm uses a four-step process to construct these 3D objects from speech:
- Using our user-friendly GUI, you can record audio in natural language and watch our algorithm parse it for information in real time. If the algorithm mishears you, or the information it extracts is incorrect, you can edit the transcription in the Text element and the system will automatically reparse the new text (a minimal GUI sketch appears after this list).
- We used the speech_recognition and pyttsx3 modules to handle audio: speech_recognition converts the raw recording into English text, while pyttsx3 provides spoken feedback. You can record from your microphone or upload a pre-recorded .WAV file for the system to transcribe. This was extremely helpful during testing, since the system always returns the same text for a given .WAV (see the transcription sketch below).
- We custom-built a natural language parser to extract the necessary information from the English text. It performs intelligent feature extraction by comparing the user's input against a corpus of keywords and trigger phrases. The parser can detect multiple objects, and consistently determines which information is present and which it still needs to request (see the keyword-matching sketch below).
- Using the key information extracted from the audio, we call the Fusion 360 API to create a CAD model that suits the user's needs. Because the models are generated directly from user input, the system can easily create models it has never seen before (see the Fusion 360 sketch below).
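To make the live-reparse behavior concrete, here is a minimal GUI sketch assuming tkinter and a hypothetical parse_text() stand-in for our parser; the real interface may use different widgets and layout.

```python
import tkinter as tk

def parse_text(text):
    """Hypothetical stand-in for the project's natural language parser."""
    return f"Parsed {len(text.split())} word(s)"

root = tk.Tk()
root.title("Speech-to-STL")

# Editable transcription box: the user can correct misheard words here.
transcript = tk.Text(root, height=6, width=60)
transcript.pack(padx=10, pady=5)

# Label that displays the parser's output in real time.
result = tk.Label(root, text="", anchor="w", justify="left")
result.pack(fill="x", padx=10, pady=5)

def reparse(event=None):
    # Re-run the parser every time the transcription changes.
    result.config(text=parse_text(transcript.get("1.0", "end-1c")))

# Reparse automatically on every keystroke in the Text element.
transcript.bind("<KeyRelease>", reparse)

root.mainloop()
```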
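The transcription step can be sketched as follows, assuming the speech_recognition package with Google's free web recognizer and pyttsx3 for spoken feedback; the file name sample.wav is a placeholder.

```python
import speech_recognition as sr
import pyttsx3

recognizer = sr.Recognizer()

def transcribe_wav(path):
    # Deterministic path used during testing: the same .WAV always yields the same text.
    with sr.AudioFile(path) as source:
        audio = recognizer.record(source)
    return recognizer.recognize_google(audio)

def transcribe_microphone():
    # Live path: record from the default microphone until the speaker pauses.
    with sr.Microphone() as source:
        recognizer.adjust_for_ambient_noise(source)
        audio = recognizer.listen(source)
    return recognizer.recognize_google(audio)

def speak(text):
    # pyttsx3 provides spoken feedback, e.g. asking for a missing dimension.
    engine = pyttsx3.init()
    engine.say(text)
    engine.runAndWait()

if __name__ == "__main__":
    print(transcribe_wav("sample.wav"))  # placeholder test file
```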
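The parser itself is custom-built, so the sketch below only illustrates the keyword and trigger-phrase matching idea with a made-up corpus of shape names and dimension triggers, not our actual vocabulary.

```python
import re

# Hypothetical corpus of shape keywords and dimension trigger phrases.
SHAPES = {"cube", "box", "sphere", "ball", "cylinder", "cone"}
DIMENSION_TRIGGERS = {
    "radius": r"radius (?:of )?(\d+(?:\.\d+)?)",
    "height": r"height (?:of )?(\d+(?:\.\d+)?)",
    "width":  r"width (?:of )?(\d+(?:\.\d+)?)",
}

def parse(text):
    """Return detected shapes, the dimensions found, and those still missing."""
    words = set(re.findall(r"[a-z]+", text.lower()))
    shapes = sorted(words & SHAPES)
    dimensions = {}
    for name, pattern in DIMENSION_TRIGGERS.items():
        match = re.search(pattern, text.lower())
        if match:
            dimensions[name] = float(match.group(1))
    missing = [n for n in DIMENSION_TRIGGERS if n not in dimensions]
    return {"shapes": shapes, "dimensions": dimensions, "missing": missing}

print(parse("Make me a cylinder with a radius of 3 and a height of 10"))
# {'shapes': ['cylinder'], 'dimensions': {'radius': 3.0, 'height': 10.0}, 'missing': ['width']}
```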
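Finally, a minimal Fusion 360 script in the style of the resources linked below: it extrudes a circle into a cylinder and exports an STL. The dimensions and export path are placeholders; in the full system they come from the parsed speech.

```python
import adsk.core, adsk.fusion, traceback

def run(context):
    ui = None
    try:
        app = adsk.core.Application.get()
        ui = app.userInterface
        design = adsk.fusion.Design.cast(app.activeProduct)
        root = design.rootComponent

        # These values would normally come from the natural language parser.
        radius_cm, height_cm = 1.5, 4.0  # Fusion's API works in centimeters

        # Sketch a circle on the XY construction plane.
        sketch = root.sketches.add(root.xYConstructionPlane)
        sketch.sketchCurves.sketchCircles.addByCenterRadius(
            adsk.core.Point3D.create(0, 0, 0), radius_cm)

        # Extrude the circle's profile into a new cylindrical body.
        profile = sketch.profiles.item(0)
        extrudes = root.features.extrudeFeatures
        ext_input = extrudes.createInput(
            profile, adsk.fusion.FeatureOperations.NewBodyFeatureOperation)
        ext_input.setDistanceExtent(False, adsk.core.ValueInput.createByReal(height_cm))
        extrudes.add(ext_input)

        # Export the result as an STL (placeholder path).
        export_mgr = design.exportManager
        stl_options = export_mgr.createSTLExportOptions(root, '/tmp/model.stl')
        export_mgr.execute(stl_options)
    except:
        if ui:
            ui.messageBox('Failed:\n{}'.format(traceback.format_exc()))
```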
Developed by Akash Pamal, Jack Blair, and Rohit Prasanna for HackNEHS 2021 (4/4/21). Released under the MIT License.
Resources:
- https://forums.autodesk.com/t5/fusion-360-api-and-scripts/simple-python-example-request/td-p/5428202
- https://github.com/brysontyrrell/Python-GUI-Example/blob/master/Python-GUI-Example.py
- https://www.autodesk.com/autodesk-university/class/Getting-Started-Fusion-360-API-2020#handout