🏠 Accommodation: Hostel accommodation will be provided from 2nd June to 13th June 2026 · 🔔 Updates: Further details and updates will be provided soon!
IIT Guwahati · CLST · 3–13 June 2026

Natural Language Models for Language Technology Development

Data Curation · Modelling · Application
📅 Dates: 3 – 13 June 2026
📍 Venue: IIT Guwahati, Assam
🗓️ Duration: 10 Days
NLP · ASR · TTS · MT · DL

Training the Next Generation of Indian NLP Researchers

This intensive 10-day summer school, organised under the ACM India Summer Schools programme, covers both foundational and practical aspects of natural language processing — spanning text and speech — with a focused emphasis on Indian language processing.

Through a carefully balanced mix of theory lectures and hands-on lab sessions, the school aims to train and motivate young minds to take up careers in language technology and make meaningful contributions to this rapidly growing field.

10 Days
7 Speakers
6+ Topics
2 Lab Tracks
🏛️ Host Institution
Centre for Linguistic Science & Technology (CLST)
IIT Guwahati, Assam – 781039
🎓 Academic Coordinator
Samit Bhattacharya
samit@iitg.ac.in
👤 Local Coordinator 1
Sanasam Ranbir Singh
ranbir@iitg.ac.in
👤 Local Coordinator 2
Priyankoo Sarmah
priyankoo@iitg.ac.in
IIT Guwahati Centre for Linguistic Science & Technology
www.iitg.ac.in/clst

About CLST

The Centre for Linguistic Science & Technology (CLST) at IIT Guwahati is one of India's foremost interdisciplinary research centres at the intersection of linguistics, computer science, and language technology. Founded to advance human language technology for India's diverse linguistic landscape, CLST aims to become an internationally recognised centre for research in natural language processing and speech processing.

The centre focuses on developing computational resources, tools, and datasets for Indian languages, with special emphasis on low-resource and Northeast Indian languages such as Assamese, Bodo, Meitei (Manipuri), and the tribal languages of the region. Research spans speech synthesis and recognition, machine translation, multilingual NLP, corpus development, and sign language processing.

🗣️ Speech Technology
🌏 Multilingual NLP
📚 Corpus Development
🤖 Machine Translation
🧬 Low-Resource Languages
🖐️ Sign Language Processing

Topics Covered

Theory
  • 01 Machine Learning & Deep Learning for Language Processing
  • 02 Multilingual Natural Language Processing
  • 03 Data Collection & Curation for Low-Resource Languages
  • 04 Basics of Machine Translation
  • 05 Recent Trends in Text-to-Speech Synthesis & the Indian Scenario
  • 06 State of the Art in Automatic Speech Recognition (ASR)
  • 07 Processing & Modelling Code-Switched Speech
  • 08 Speech Data Evaluation & Benchmarking
Hands-on / Practical
  • Basics of Python for Language Processing
  • ESPNet Toolkit & Tacotron 2 for TTS
  • Hands-on Sessions on NLP & Machine Translation
Recommended Background
  • Programming Concepts
  • Algorithms
  • Signal Processing
  • Linear Algebra

Day-wise Schedule

🎉 Inauguration: Day 1 (3 June), 9:00–9:30 AM | 🏅 Valedictory: Day 10 (13 June), 4:30–5:00 PM

Day | Date | 9:30 – 11:00 AM | 11:30 AM – 1:00 PM | 2:30 – 3:45 PM | 4:00 – 5:00 PM
Day 1 | 3 Jun | Machine Learning & Deep Learning for Language Processing (Sanasam Ranbir Singh) | Machine Learning & Deep Learning for Language Processing (Sanasam Ranbir Singh) | Practical Session on Basics of Python for Language Processing | Practical Session on Basics of Python for Language Processing
Day 2 | 4 Jun | Machine Learning & Deep Learning for Language Processing (Suresh Sundaram) | Machine Learning & Deep Learning for Language Processing (Suresh Sundaram) | Practical Session on Basics of Python for Language Processing | Practical Session on Basics of Python for Language Processing
Day 3 | 5 Jun | Speech Data Collection & Curation (Priyankoo Sarmah) | Speech Data Collection & Curation (Priyankoo Sarmah) | Practical: ESPNet & Tacotron 2 for TTS | Practical Session on ESPNet Toolkit & Tacotron 2 for TTS
Day 4 | 6 Jun | Recent Trends in TTS & The Indian Scenario (K S R Murthy) | Recent Trends in TTS & The Indian Scenario (K S R Murthy) | Recent Trends in TTS & The Indian Scenario (K S R Murthy) | Practical Session on ESPNet Toolkit & Tacotron 2 for TTS
Sunday | 7 Jun | ☀️ No Classes
Day 5 | 8 Jun | State-of-the-Art of ASR (Rohit Sinha) | State-of-the-Art of ASR (Rohit Sinha) | State-of-the-Art of ASR (Rohit Sinha) | Practical: ESPNet & Tacotron 2
Day 6 | 9 Jun | Speech Data Evaluation & Benchmarking (K Samudravijaya) | Speech Data Evaluation & Benchmarking (K Samudravijaya) | Speech Data Evaluation & Benchmarking (K Samudravijaya) | Practical: ESPNet & Tacotron 2
Day 7 | 10 Jun | Text Data Collection & Curation for NLP (Samit Bhattacharya) | Text Data Collection & Curation for NLP (Samit Bhattacharya) | Practical Session on ESPNet Toolkit & Tacotron 2 for TTS | Practical Session on ESPNet Toolkit & Tacotron 2 for TTS
Day 8 | 11 Jun | Multilingual Natural Language Processing (Asif Ekbal) | Multilingual Natural Language Processing (Asif Ekbal) | Hands-on Session on NLP & MT | Hands-on Session on NLP & MT
Day 9 | 12 Jun | Basics of MT Systems (Asif Ekbal) | Basics of MT Systems (Asif Ekbal) | Hands-on Session on NLP & MT | Hands-on Session on NLP & MT
Day 10 | 13 Jun | Basics of MT Systems (Asif Ekbal) | Basics of MT Systems (Asif Ekbal) | Hands-on Session on NLP & MT | 🏅 Valedictory, 4:30 PM

Tea Break: 11:00–11:30 AM & 3:45–4:00 PM  |  Lunch Break: 1:00–2:30 PM

Distinguished Speakers

Academic Coordinator

Samit Bhattacharya

IIT Guwahati · Dept. of CSE · CLST
Text Data Collection & Curation for NLP

A Faculty member in the Department of Computer Science and Engineering at IIT Guwahati and Head of the Centre for Linguistic Science and Technology (CLST). His research focuses on extended reality (virtual, augmented, and mixed reality), human-computer interaction, and user-centric computing. His work also spans affective and ubiquitous systems, mobile and wearable technologies, and ICT applications in education, agriculture, and healthcare.

Local Coordinator 1

Sanasam Ranbir Singh

IIT Guwahati · Dept. of CSE · CLST
Machine Learning & Deep Learning for Language Processing

A Faculty member in the Department of Computer Science and Engineering and Centre for Linguistic Science and Technology (CLST) at IIT Guwahati whose research focuses on open source intelligence, natural language processing, information retrieval, and social media analytics. His work spans artificial intelligence, machine learning, and data mining, with a focus on extracting actionable insights from large-scale online data. He has contributed to building language resources and NLP tools for Meitei (Manipuri), Assamese, and other Northeast Indian languages.

Local Coordinator 2

Priyankoo Sarmah

IIT Guwahati · Dept. of HSS · CLST
Speech Data Collection & Curation

A Faculty member in the Department of Humanities and Social Sciences and Centre for Linguistic Science and Technology (CLST) at IIT Guwahati. He previously served at Hankuk University of Foreign Studies. His research interests include phonetics, speech analysis, speech perception, and speech technology, with a particular focus on tone production and perception in languages such as Bodo, Dimasa, Mizo, and Rabha, as well as sociophonetic studies of Assamese and Angami.


Rohit Sinha

IIT Guwahati · Dept. of EEE · CLST
State-of-the-Art of ASR

A Faculty member in the Department of Electronics and Electrical Engineering and Centre for Linguistic Science and Technology (CLST) at IIT Guwahati. His research focuses on speech and signal processing, including acoustic segmentation, speech enhancement, and robust automatic speech recognition. His work also includes speaker recognition, emotion classification, language recognition, image enhancement, and spectrum sensing for cognitive radio systems.


Suresh Sundaram

IIT Guwahati · Dept. of EEE · CLST
Machine Learning & Deep Learning for Language Processing

A Faculty member in the Department of Electronics and Electrical Engineering and Centre for Linguistic Science and Technology (CLST) at IIT Guwahati. His research focuses on machine learning, neural networks, and pattern recognition, with applications in signal processing and data analysis.

External · IIT Patna

Asif Ekbal

IIT Patna · Dept. of CSE
Multilingual NLP & Basics of Machine Translation

A Faculty member in the Department of Computer Science and Engineering at IIT Patna whose research focuses on artificial intelligence, natural language processing, and applied machine learning. His work includes multilingual NLP, information extraction, and machine translation for Indian languages. He has contributed extensively to research and has served in key roles in major international conferences and editorial boards of leading journals.

External · IIT Hyderabad

Sri Rama Murty Kodukula

IIT Hyderabad · Dept. of EE
Recent Trends in TTS & The Indian Scenario

A Faculty member in the Department of Electrical Engineering at IIT Hyderabad, also affiliated with the Department of Artificial Intelligence. His research interests include signal processing, speech analysis, recognition and synthesis, phase processing and modelling, pattern recognition, and deep learning, with applications in modern speech and audio systems.

External · KL University

K Samudravijaya

KL University
Speech Data Evaluation & Benchmarking

A veteran speech scientist and Faculty member at KL University, with over four decades of pioneering work in speech technology for Indian languages. He has made significant contributions to the development of speech databases and multilingual speech corpora, particularly for under-resourced Indian languages. His work has played an important role in advancing speech recognition and language technologies in the Indian context.

Practical Information

🏠

Accommodations & Food

  • Institute Guest House for speakers
  • Hostel accommodation for students
  • Accommodation details for selected students will be provided later
📚

Venue

  • Conference Hall 2
📬

Contact & Queries

For accommodation, logistics, or any queries, contact the local administrator or coordinators directly.

samit@iitg.ac.in ranbir@iitg.ac.in priyankoo@iitg.ac.in

 

Contact for Accommodation

s.chinglemba@iitg.ac.in s.maisang@iitg.ac.in

For more information

ACM Summer School ↗