Speech to 3D Scene Generation

Manthan Turakhia*, Umang Nandu**, Prayesh Shah***, Siddharth Sharma****, Sagar Korde*****
*_*****Department of Information Technology, K. J. Somaiya College of Engineering, Mumbai, India.
Periodicity:April - June'2019
DOI : https://doi.org/10.26634/jse.13.4.15903


3D scenes and graphics are widely used in the creative industry. However, the entire task of imagination and then depicting the same as 3D graphics is done manually today, which consumes a lot of time, not to mention the inability to depict the scene precisely as imagined. We aim to reduce human efforts for the same by generating 3D scenes described by the user with precision.

On the other hand, some industries currently lack the use of appropriate technology to make their tasks easier and more captivating, such as the education industry. We intend to replace the existing methods of teaching and learning by using speech to 3D scene generation to depict exactly what the professor is trying to explain.


Speech to Scene, Linguistic Analysis, Spatial Relationship, Natural language processing.

How to Cite this Article?

Turakhia, M., Nandu, U., Shah, P., Sharma, S., & Korde, S. (2019). Speech to 3D Scene Generation. i-manager's Journal on Software Engineering, 13(4),16-23. https://doi.org/10.26634/jse.13.4.15903


[1]. Chang, A. X. (2015). Text to 3D Scene Generation (Doctoral Dissertation, Stanford University, USA).
[2]. Chang, A., Monroe, W., Savva, M., Potts, C., & Manning, C. D. (2015). Text to 3D scene generation with rich lexical grounding. arXiv preprint arXiv:1505.06289 (pp. 53-62).
[3]. Cheng, Y., Sun, Z., Bi, S., Li, C., & Xi, N. (2017, December). A supervisory hierarchical control approach for text to 2D scene generation. In 2017 IEEE International Conference on Robotics and Biomimetics (ROBIO) (pp. 2261-2266). IEEE.
[4]. Coyne, B., & Sproat, R. (2001, August). WordsEye: An automatic text-to-scene conversion system. In Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques (pp. 487-496). ACM.
[5]. Hossain, M. S., & Salam, A. (2017). Text-to-3D Scene Generation using Semantic Parsing and Spatial Knowledge with Rule Based System. International Journal of Computer Science Issues (IJCSI), 14(5), 37-41.
[6]. Johansson, R., Berglund, A., Danielsson, M., & Nugues, P. (2005, July). Automatic text-to-scene conversion in the traffic accident domain. In IJCAI, 5, 1073-1078.
[7]. Monroe, W. (2008). 3D Scene Retrieval from Text with Semantic Parsing. Stanford University Computer Science Department Stanford, USA.
[8]. Panda3D, (n. d). The Open Source Framework for 3D Rendering and Games. Retrieved from https://www.panda3d.org/
[9]. Seversky, L. M., & Yin, L. (2006, October). Real-time automatic 3D scene generation from natural language th voice and text descriptions. In Proceedings of the 14 ACM International Conference on Multimedia (pp. 61-64). ACM.
[10]. Sneha, N., Dessai, & Dhanaraj, R. (2016). Text to 3D scene generation. In International Journal of Latest Trends in Engineering and Technology (IJLTET), 6(3).

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
Pdf 35 35 200 20
Online 35 35 200 15
Pdf & Online 35 35 400 25

If you have access to this article please login to view the article or kindly login to purchase the article
Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.