This intensive 10-day summer school, organised under the ACM India Summer Schools programme, covers both foundational and practical aspects of natural language processing — spanning text and speech — with a focused emphasis on Indian language processing.
Through a carefully balanced mix of theory lectures and hands-on lab sessions, the school aims to train and motivate young minds to take up careers in language technology and make meaningful contributions to this rapidly growing field.
The Centre for Linguistic Science & Technology (CLST) at IIT Guwahati is one of India's foremost interdisciplinary research centres at the intersection of linguistics, computer science, and language technology. Established with a vision of advancing human language technology for India's diverse linguistic landscape, CLST aspires to become an internationally recognised centre for research in natural language processing and speech processing.
The centre focuses on developing computational resources, tools, and datasets for Indian languages — with special emphasis on low-resource and Northeast Indian languages like Assamese, Bodo, Meitei (Manipuri), and tribal languages of the region. Research spans speech synthesis, recognition, machine translation, multilingual NLP, corpus development, and sign language processing.
| Day | Date | 9:30 – 11:00 AM | 11:30 AM – 1:00 PM | 2:30 – 3:45 PM | 4:00 – 5:00 PM |
|---|---|---|---|---|---|
| Day 1 | 3 Jun | Machine Learning & Deep Learning for Language Processing (Sanasam Ranbir Singh) | Machine Learning & Deep Learning for Language Processing (Sanasam Ranbir Singh) | Practical Session on Basics of Python for Language Processing | Practical Session on Basics of Python for Language Processing |
| Day 2 | 4 Jun | Machine Learning & Deep Learning for Language Processing (Suresh Sundaram) | Machine Learning & Deep Learning for Language Processing (Suresh Sundaram) | Practical Session on Basics of Python for Language Processing | Practical Session on Basics of Python for Language Processing |
| Day 3 | 5 Jun | Speech Data Collection & Curation (Priyankoo Sarmah) | Speech Data Collection & Curation (Priyankoo Sarmah) | Practical: ESPNet & Tacotron 2 for TTS | Practical Session on ESPNet Toolkit & Tacotron 2 for TTS |
| Day 4 | 6 Jun | Recent Trends in TTS & The Indian Scenario (K S R Murthy) | Recent Trends in TTS & The Indian Scenario (K S R Murthy) | Recent Trends in TTS & The Indian Scenario (K S R Murthy) | Practical Session on ESPNet Toolkit & Tacotron 2 for TTS |
| | 7 Jun | ☀️ Sunday: No Classes | | | |
| Day 5 | 8 Jun | State-of-the-Art of ASR (Rohit Sinha) | State-of-the-Art of ASR (Rohit Sinha) | State-of-the-Art of ASR (Rohit Sinha) | Practical: ESPNet & Tacotron 2 |
| Day 6 | 9 Jun | Speech Data Evaluation & Benchmarking (K Samudravijaya) | Speech Data Evaluation & Benchmarking (K Samudravijaya) | Speech Data Evaluation & Benchmarking (K Samudravijaya) | Practical: ESPNet & Tacotron 2 |
| Day 7 | 10 Jun | Text Data Collection & Curation for NLP (Samit Bhattacharya) | Text Data Collection & Curation for NLP (Samit Bhattacharya) | Practical Session on ESPNet Toolkit & Tacotron 2 for TTS | Practical Session on ESPNet Toolkit & Tacotron 2 for TTS |
| Day 8 | 11 Jun | Multilingual Natural Language Processing (Asif Ekbal) | Multilingual Natural Language Processing (Asif Ekbal) | Hands-on Session on NLP & MT | Hands-on Session on NLP & MT |
| Day 9 | 12 Jun | Basics of MT Systems (Asif Ekbal) | Basics of MT Systems (Asif Ekbal) | Hands-on Session on NLP & MT | Hands-on Session on NLP & MT |
| Day 10 | 13 Jun | Basics of MT Systems (Asif Ekbal) | Basics of MT Systems (Asif Ekbal) | Hands-on Session on NLP & MT | 🏅 Valedictory, 4:30 PM |

A Faculty member in the Department of Computer Science and Engineering at IIT Guwahati and Head of the Centre for Linguistic Science and Technology (CLST). His research focuses on extended reality (virtual, augmented, and mixed reality), human-computer interaction, and user-centric computing. His work also spans affective and ubiquitous systems, mobile and wearable technologies, and ICT applications in education, agriculture, and healthcare.
A Faculty member in the Department of Computer Science and Engineering and Centre for Linguistic Science and Technology (CLST) at IIT Guwahati whose research focuses on open source intelligence, natural language processing, information retrieval, and social media analytics. His work spans artificial intelligence, machine learning, and data mining, with a focus on extracting actionable insights from large-scale online data. He has contributed to building language resources and NLP tools for Meitei (Manipuri), Assamese, and other Northeast Indian languages.
A Faculty member in the Department of Humanities and Social Sciences and Centre for Linguistic Science and Technology (CLST) at IIT Guwahati. He previously served at Hankuk University of Foreign Studies. His research interests include phonetics, speech analysis, speech perception, and speech technology, with a particular focus on tone production and perception in languages such as Bodo, Dimasa, Mizo, and Rabha, as well as sociophonetic studies of Assamese and Angami.
A Faculty member in the Department of Electronics and Electrical Engineering and Centre for Linguistic Science and Technology (CLST) at IIT Guwahati. His research focuses on speech and signal processing, including acoustic segmentation, speech enhancement, and robust automatic speech recognition. His work also includes speaker recognition, emotion classification, language recognition, image enhancement, and spectrum sensing for cognitive radio systems.
A Faculty member in the Department of Electronics and Electrical Engineering and Centre for Linguistic Science and Technology (CLST) at IIT Guwahati. His research focuses on machine learning, neural networks, and pattern recognition, with applications in signal processing and data analysis.
A Faculty member in the Department of Computer Science and Engineering at IIT Patna whose research focuses on artificial intelligence, natural language processing, and applied machine learning. His work includes multilingual NLP, information extraction, and machine translation for Indian languages. He has contributed extensively to research and has served in key roles in major international conferences and editorial boards of leading journals.
A Faculty member in the Department of Electrical Engineering at IIT Hyderabad, also affiliated with the Department of Artificial Intelligence. His research interests include signal processing; speech analysis, recognition, and synthesis; phase processing and modelling; and pattern recognition and deep learning, with applications in modern speech and audio systems.
A veteran speech scientist and Faculty member at KL University, with over four decades of pioneering work in speech technology for Indian languages. He has made significant contributions to the development of speech databases and multilingual speech corpora, particularly for under-resourced Indian languages. His work has played an important role in advancing speech recognition and language technologies in the Indian context.
For accommodation, logistics, or any queries, contact the local administrator or coordinators directly.
samit@iitg.ac.in ranbir@iitg.ac.in priyankoo@iitg.ac.in 
Contact for Accommodation
s.chinglemba@iitg.ac.in s.maisang@iitg.ac.in
For more information
ACM Summer School ↗