Comment period is currently open for the National Clean Air Programme (NCAP, see blog entry here), and Shahzad Gani conducted a text analytics assessment of the document to identify words and themes. Some interesting patterns emerged from this analysis. The words ‘air’ and ‘pollution’ were mentioned more than 100 times in the NCAP document. Here is a doodle depicting the most used words:
The top ten most common words were:
air (161) | pollution (106)| quality (76)| monitoring (65)| cities (43)| national (40)| health (36)| standards (35)| ambient (34)| data (29)
Other words that were mentioned more than ten times include: pollutants (29); control (24); various (24); aqi (23); stations (23); source(22); sources (22); emission (21); m3 (20); action (19); technology (19); areas (18); including (18); india (18); dust (17); measures (17); studies (17); time (17); μ (17); based (16); concentration (16); environment (16); government (16); hr (16); implementation (16); index (16); management (16); major (15); network (15); people (15); pm10 (15); annual (14); burning(14); city (14); cpcb (14); nil (14); non (14); pm2.5 (14); rural (14); country (13); hours (13); iv (13); level (13); need (13); number (13); poor (13); apportionment (12); attainment (12); delhi (12).
Note: A word version of the NCAP document was kindly provided by Dr. Shoibal Chakravarty (profile) is available for download (NCAP). The analysis was conducted using an online text analytics tool (link).
Comments on NCAP are being accepted until May 17, 2018.