-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Graded Assignment -4 (Jan Term 2023):- Redesigning The Hindu Data Point Stories #8
Comments
Post-COVID-19 math skills of students in southern and Western States dipped the mostShagun Dwivedi Here's the article! The Story Authors Narrate: Data as Encoded by the Authors:
Collected Data from ASER, also included data related to students of III grade, and region-coded it. What it does better:
What it does better:
Thanks. |
Terror Attacks in Pakistan: a data StoryName : Chaitanya KumariaRoll No. : 21f1000479Article Link: Terror Attacks in Pakistan : A data StoryContextThe Article is part of the the series Week in 5 Charts, I have taken Story number 4 Peshawar Bombings for the assignment. Here the author starts with the news of Peshawar Mosque Bombings which killed over a 100 people and left 200+ others injured. Author then delves into the history of Terror attacks and Suicide Bombings in the country of Pakistan over past two decades and discusses the reason for same Data Charts Provided by the AuthorChart 1
Issues
Chart 2
Issues
ApproachFor my analysis, I have used raw data from the South Asian Terrorism Portal Website Chart 1
Chart 2
Chart 3
Final VersionThis basically combines all the visualizations and adds required labels to better help understand the whole story in one shot. |
Personal DetailsName: Arya Bhattacharyya Article ChosenFriction over revenue sharing formula: Why some States get more money from Centre (https://www.thehindu.com/data/data-friction-over-revenue-sharing-formula-why-some-states-get-more-money-from-centre/article66625863.ece) The NarrativeThe article discusses the recent point of contention between the centre and various states. Overall, it tries to present the viewpoint of various stakeholder from different government institutions (at the state and central level) and uses data visualizations to drive across specific points as mentioned in the corresponding charts. Charts By The Author v/s My VisualizationsChart 1 (By the Author)Here the author is trying to show the returns that various states receive for every rupee that they contribute to the centre's tax collection. From the overall theme of the article, there is a contention between less returns for southern states, while higher returns for the northern states. Data Cons in Encoding
Chart 1 (By me)(Interactive Version: https://datawrapper.dwcdn.net/wHHNK/6/) Approach
Chart 2 (By the Author)Here the author tries to illustrate the idea that the share of the southern states over the divisible pool of taxes has "consistently" declined over the years. Here again, the focus is on the geographical location of the states and how it correlates with the share of the taxes that they have received. Data Pros
Cons in Encoding
Chart 2 (By me)Approach
Chart 3 (By the Author)Here the author tries to explain how the states with higher shares (like Bihar and UP as seen earlier) have higher fertility rates and hence effectively the center is not able to reward states with lower fertility values. This is presented to support one of the opinions stated by Tamil Nadu's finance minister (written in the article above the figure). Data Cons in Encoding
Chart 3 (By me)Approach
Chart 4 (By the Author)Here the author tries to illustrate the point that was made by Tamil Nadu's finance minister regarding the lack of development to the poorer states (which as per my understanding refer to states which receive higher returns for their contribution to the tax, effectively meaning the northern states). As an indicator for development, the author uses the HDI values over the years to show the change and the actual divide again between the northern and southern states in terms of their HDI values. Data Pros
Cons in encoding
Chart 4(By me)Approach
Chart 5(By the Author)Here the author is using the NSDP metric to illustrate the same point as in the previous chart which was to compare the growth of the states with their shares of taxes. The point they are trying to make is that growth sedates for the states with higher returns while growth is much better for states with lower returns. Data Pros
Cons
Chart 5 (By me)Approach
Final Output |
Working onRussia's Invasion of Ukraine The author is trying to narrate the impacts of the Russian invasion of Ukraine from February 2022 to February 2023 and how it has changed the political, social and economic landscape for the people of Ukraine and people of countries around the world. The author looks as the following aspects of the invasion with regards to visualizations:
Redesigning the Current Situation of the InvasionThe author wants to talk about the areas under capture or under active war in Ukraine till 20th February 2023, as shown below. By the author: Here is how I redesigned it. Removed irrelevant news pieces to the map.
Redesigning Military CasualtiesThe author draws a comparison between military losses/casualties between Russia and Ukraine. How I redesigned it: Introduced category-wise pie-charts to represent the comparisons clearly.
Redesigning the Refugee SituationThe author lists out the countries which the Ukraininans saw as refuge. By the author: How I redesigned it: Decided to use a bar chart instead of a map, to show only the relevant countries, which are a handful in number in comparison to all the countries shown on the map.
Redesigning the Financial Aid VisualizationThe author aims to provide an idea of which countries provided how much aid to Ukraine in the past year after the invasion. By the author: How I redesigned the chart: Decided to utilize the Treemap hierarchical structure for optimal space utilization |
Data | Layoffs by IT firms in the U.S. will greatly impact H-1B workers from India
Here's the article! What is the story the author is trying to tell? The story "Layoffs by IT firms in the U.S. will greatly impact H-1B workers from India" focuses on the impact of layoffs in the IT sector in the United States on workers from India with H-1B visas. The article uses data from the U.S. Department of Labor's Office of Foreign Labor Certification (OFLC) and other sources to provide insights into the number of H-1B visa holders affected by these layoffs and the potential impact on the Indian IT workforce. What data is the author using to tell the story? The author uses data from the OFLC to show that there were 415,920 H-1B visa holders in the U.S. as of September 2021, with over 70% of them being from India. The article also mentions that layoffs in the IT sector have been happening for several years, and that the COVID-19 pandemic has exacerbated the situation, leading to more job losses. The author highlights the concerns of Indian IT workers who fear losing their jobs and being forced to return to India. The article also discusses the potential impact on the Indian IT industry, which has been heavily reliant on the H-1B program to send workers to the U.S. The story relies heavily on data from the OFLC and other sources to support its claims about the number of H-1B visa holders in the U.S., the percentage of Indian workers, and the impact of layoffs on the Indian IT workforce. However, the data is not visually encoded, and there is no visualization provided in the article to help readers understand the data better. According to the article, IT firms in the US are facing a challenging business environment due to the COVID-19 pandemic and increasing automation, which has led to job cuts. Many of these job cuts have affected H-1B workers from India, who may face difficulties in finding new employment due to the restrictive nature of the visa program. The article also highlights the fact that the H-1B visa program has been the subject of much debate in the US, with some critics arguing that it takes away jobs from American workers. As a result, the Trump administration had tightened the rules around the H-1B visa program, making it more difficult for companies to hire foreign workers. How is it encoded, what problems are with it, and how have you attempted to improve it? To improve the visual encoding of this story, I would suggest creating a chart or graph to display the number of H-1B visa holders by country of origin. This would make it easier for readers to understand the dominance of Indian workers in the program. Additionally, a bar chart or line graph could be used to show the trend in H-1B visa approvals and rejections over time, highlighting any changes in policy or trends in the industry. Another useful visualization could be a map of the U.S. showing the states with the highest numbers of H-1B visa holders and the industries where they are employed. This could help readers understand which states and industries are most affected by layoffs and potential job losses. Overall, while the story provides valuable insights into the impact of layoffs on H-1B visa holders from India, there is room for improvement in the visual encoding of the data to make it easier for readers to understand and interpret the information.the article paints a bleak picture for H-1B workers from India who may be impacted by layoffs in the US IT sector. It underscores the need for better policies to protect the interests of foreign workers and ensure that they are not unfairly targeted in times of economic hardship. we could consider adding some data and visualizations. Here are some suggestions:
|
Europe picks up more arms even as global weapon imports drop
ROLL NUMBER - 21F1003953NAME - PARAM CHORDIYA
CONTEXT -The article is dated March 18, 2023 and mentions about the data associated with arms supply to European countries. The article highlights the rise in arms imports by European countries, particularly those in the NATO alliance, due to increased tensions with Russia after its invasion of Ukraine in 2014. Despite a global decline in arms transfers, Europe's share has increased significantly, with many countries importing the highest volume of arms in the latest five-year period. Ukraine's arms imports have also seen a sharp increase, making it the world's third-largest importer of arms in 2022. The US dominates global arms exports, with a 14% increase in exports between 2013-2017 and 2018-2022, while Russia's arms exports have fallen by 31% in the same period due to the invasion of Ukraine and trade sanctions. SIPRI's Trend Indicator Values measure the volume of international transfers of major arms and provide a common unit to measure trends in the flow of arms to particular countries and regions over time. Data Charts Provided by the Author -Chart 1 : Shows arms imports of select European nations using SIPRI’s Trend Indicator Values (TIVs) expressed in millions. It shows the import data for five time periods: 1998-2002, 2003-2007, 2008-2012, 2013-2017 and 2018-2022. Issues -
Chart 2 : Shows the region-wise share of arms imports in the last two five-year periods. Issues -
Chart 3 : Country wise domination of arms exporters Issues -
My Visualizations -Chart 1: Chart 2: Percent share of arms imports in 2018-2022 - Percent share of arms imports in 2013-2017 - Chart 3: Conclusion -I have tried to redesign the charts provided in the article by the author in a way that tries to cover the flaws in the original charts. |
Topic: Relatively fewer tobacco users in the southern StatesName: Mukesh Kumar Singh Artical link: - https://www.thehindu.com/data/data-relatively-fewer-tobacco-users-in-the-southern-states/article66568119.ece In this article author is trying to compare the tobacco uses across different status in India and comparing how southern States consume less tobacco compare to other states. Based on data author find that north-eastern States of India, consumption among men in both smokable and chewable forms was higher than rest of India in 2019-21. If only the smokable forms were considered, the share was higher in the northern States of Himachal Pradesh, Uttarakhand, Haryana, J&K U.T. and the eastern State of West Bengal. If only the chewable forms were considered, the share was higher in the east — Jharkhand, Bihar and Odisha — and in Uttar Pradesh, Madhya Pradesh and Gujarat. Authors has created different map to compare data. Map-1 shows the percentage of all men aged 15-49 who smoked cigarettes and/or bidis and/or cigars and/or pipe and/or hookah in 2019-21. The share should be read with caution as those who smoked cigarettes could also be bidi smokers, which means they were counted twice. The share was much higher among some northern and all north-eastern States except Assam. While the share of smokers was low in the south, it was even lower in the western States of Gujarat and Maharashtra. My observation: - Authors has used choropleth map to show the uses of tobacco in different state on India. Red Colours shaded are used Most high uses of tobacco where light blue for lesser uses. But colour coding used do not present the data well like Mizoram has highest 84.7 whereas Manipur as 40.2 but by colour schema it will difficult to understand so I used different colour scheme to present same graph. Now it is clear visible that Mizoram is highest compare to other including Manipur. (Map-1 graph- Reproduce in different colour schema as choropleth Graph- My Graph) Addition to that author is comparing only men tobacco uses in his data but as per topic tobacco users I feels that we need to consider women in our data analysis. So I find some data from “https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7942198/” and plotted 2D cluster column data to compare between men and women for different state. From Graph it is clear visible that Mizoram is highest for men as well as women whereas Kerala, Punjab women users of tobacco is very less compare to men. In this graph we can compare by state but it cannot explain the difference among geography position of India like northern states consumes more or less compare to north- eastern states. Other observation with author explains If only cigarette smokers were considered, Mizoram (62.4% smoke cigarettes), Meghalaya (49.6%), Manipur (36.2%) and Arunachal Pradesh (31.7%) were the top four. In West Bengal, 24.3% smoked bidis, the highest share in India. In Haryana, 9.9% smoked hookah, the highest share by a high margin. Map 2 shows the percentage of all men aged 15-49 who chewed gutka with tobacco and/or paan masala with tobacco and/or paan with tobacco and/or khaini and/or other forms of tobacco in 2019-21. The share was much higher in the northeastern, eastern, and some central, western and northern States. All the southern States and some northern States have a relatively low share.. The usage of khaini was over 35% in Bihar and Jharkhand. These two States led by a wide margin. In Gujarat, over 33% men chewed gutka/paan masala with tobacco, the second highest share, followed by Odisha (31%), M.P. (29.6%) and U.P. (27.6%). My observation: - Authors has used choropleth map to show the uses of percentage of all men aged 15-49 who chewed gutka with tobacco and/or paan masala with tobacco and/or paan with tobacco and/or khaini and/or other forms of tobacco in 2019-21 in different state on India. Red Colours shaded are used Most high uses of chewed gutka with tobacco where light blue for lesser uses. But colour coding used do not present the data well like Manipur has highest 92.9 whereas Tripura as 45.2 but by colour schema it will difficult to understand so I used different colour scheme to present same graph. Now it is clear visible that Manipur is highest compare to other including Tripura. (Map-2 graph- Reproduce in different colour schema as choropleth Graph- My Graph) Map 3 shows the share of male smokers who smoked more than five sticks a day in 2019-21 in India. The share in all the southern States, some northern States and some north-eastern States was higher than the rest of India. Map 3: My Observation- Authors shows the share of male smokers who smoked more than five sticks a day in 2019-21 in India looks not correct or description is not correct. When we compare the Tobacco users Percentage for “Andhra Pradesh” in Map-1 is 19.1 whereas in Map-3 Percentage users for “Andhra Pradesh” is 44.9 so issue is, this 44.9% is from total population or from the smoker population that is not being clearly understandable. Table Data:
My Observation: I tried to display using 2d line graph and it is clearly visible that Smoked Tobacco users are declined for both Urban and Rural whereas Chewed Tobacco for Rural are not decline however Urban Chewed Tobacco are declined. Also, among smokers, the share of those who smoked more than nine sticks a day reduced significantly and those who smoked less than five has increased (Chart 5) . |
Imaad Ansari - 21f1004808Shortfall of surgeons, gynaecologists and paediatricians in rural India was 80% in 2022
Charts By Author:Chart 1 Problems:
Chart 2 Problems:
Chart 3 Problems:
Chart 4 Problems:
My Approach:Chart 1 What it does better:
Chart 2 & Chart 3 What it does better:
Chart 4 What it does better:
Thanks! |
Name: Jemma Mariya George Data | Law to raise marital age is not enough as child marriages rarely get reportedOver 60% Indian women aged 25-29 in 2019-21 married before their 21st birthday
What is the story the author is trying to tell? What data he/she is using to tell the story?
How is it encoded, what problems are with it? State wise distribution of percentage of women in the age 20-24, married before the legal age of 18:
State wise distribution of percentage of women in the age 25-29, married before the age of 21:
Share of women aged 20-24 and 45-49 in 2019-21 who married before their 18th birthday: How have you attempted to improve it?
|
Disclaimer: The story uses data from NFHS-5 and NFHS-5. In order to come up with my suggestions of how data could be better visualized in this particular story, I have scraped the data from the original web page itself, since searching for the original data and preprocessing it was far too time-consuming. Women from Tamil Nadu are at a high risk of anaemiaThe original story points out that the number of women consuming dark green, leafy vegetables has drastically declined in Tamil Nadu. They further point out that this decline is indeed significant, compared to that in the other states. The lack of an iron-rich diet puts any demographic under a severe risk for anaemia. While this is bad generally for any population, the story does an exceptional job of pointing out that the deficiency is highly localized to southern states, especially Tamil Nadu. In other words, the situation is generally alarming, but given that a particular region or a state fares very poorly, points to a bigger underlying problem. Data and ChartsThe story uses NFHS-4 and NFHS-5 as their primary data sources (both surveys had independent questionnaires for men and women). Using this data, they have come up with four basic metrics for each state:
Thus, there are essentially two metrics, for each gender. Accordingly, the story features four charts, each of which describes these metrics across all Indian states. Here is a screenshot of the chart representing the first metric: As can be seen, Tamil Nadu is at the leftmost position along the X-axis. This alone does meet the purpose of convincing the reader that the situation in Tamil Nadu is indeed dire. However, the original chart does not include the popup around the marker which represents Tamil Nadu - one has to click each circular marker to see which state it belongs to. Thus, there is no way to pre-attentively understand the point of the graphic. Additionally, the colors used for the markers do not have any special semantics - they exist simply to differentiate between geographical zones (i.e. there is no particular reason to color central Indian states in black and eastern states in grey). Finally, the axis labels are too dim. The only visual scale that actually matters in this chart is the X-axis. It is clear that a state to the left is worse than a state to the right. Every other visual encoding - the vertical grouping of states into zones, the color encodings for the zone, and even the size of the circular markers - is completely arbitrary. These decisions do not help us exploit the natural, innate conventions that humans understand. For example, see the second chart: Since Tamil Nadu is at the leftmost position again, all we understand is that it is in the worst possible position relative to the other states. The other visual encodings contribute little or nothing to better understanding of that data. ImprovementsIt is reasonable to assume that anyone who is interested enough to read the original story would know a little about the geography of India. Therefore, I propose that this data is better represented with choropleths. Choropleths also have some additional advantages:
Here, then, is the dataset represented as four choropleths, one for each metric: As can be seen, each choropleth makes the point of the original article perfectly:
Thus, we have used colors, orientation, and the viewer's (presumably sufficient) knowledge of Indian geography a lot more effectively to improve the narrative. |
AISHE higher education surveyName: Chirag Goel Article Chhosen: DataThe article tells recently released All-India Survey on Higher Education (AISHE) 2020-21 report had revised the Gross Enrolment Ratio (GER) in higher education retrospectively for the previous four years, by recalculating it based on population projections as per the 2011 Census. Previous reports had used projections based on the 2001 Census. Charts in DataIssues:
Issues:
(Chart by me) |
Name:Harshitha Srikanth Title : Indians are spending more but eating less in the post-pandemic period Story author is trying to tell: The data has been presented and visualized as tables itself with colour encodings added. 3)Change in consumption patterns: Data encodings: Problems in encoding: Solving the issues: Visualizations: Visual 2: Visual 3: Problems I've tried to address: |
Risky diet: Which Indian States top fried food and aerated drink consumption |
Original Story: "30 crore missing voters in India: mostly young, urban, or migrants" Central Message: Data Used in the Story:
Encoding in the Original Story and Proposed Improvements:
Adjustments to Dataset and Scope: By redesigning the visualizations, the story becomes more engaging and informative, helping users grasp the magnitude of missing voters in India and the efforts being made by the ECI to increase voter turnout. |
Name: Kevin Mathew Varghese Original Story: "Where does your State stand on the India Innovation Index?" What is the story the author is trying to tell?The author of the article is discussing India's position in the Global Innovation Index 2019 and NITI Aayog's India Innovation Index report 2019. The author highlights that India ranks 52nd out of 129 countries in the Global Innovation Index, but it has consistently improved its position in recent years. The article also compares India's position with other BRICS nations in the Global Innovation Index, where India ranks third but is far behind China, which is in the top position. The author further highlights that the India Innovation Index report classifies States into three categories, major States, hill & northeast States, and Union Territories & smaller States, and provides the innovation index of each State. According to the report, three of the five major States with the highest innovation index were in the south of India, while Sikkim topped the northeastern States. The article also mentions that the innovation framework of the India Innovation Index report has two dimensions, enablers (human capital, investment, business environment) and performance (knowledge output, knowledge diffusion), and provides examples of indicators for each dimension. Finally, the author discusses the input-outcome gap in the India Innovation Index report, where the scores of the enabler metrics are higher than the performance scores in 29 out of 36 States and UTs, indicating an outcome gap relative to inputs. The author highlights that the gap is highest in Punjab and Gujarat among major States. What data he/she is using to tell the story? Describe its details -- type of data, extent of the data, dimensions of the data, gaps in the data, what data is essential and what is irrelevant.The author of the article is using two main sources of data to tell the story:
The data used in the article includes both quantitative and qualitative information. The quantitative information includes scores and rankings, while the qualitative information includes descriptions of the enablers and performance indicators, the classification of States, and examples of the indicators. The extent of the data used in the article is limited to India and other BRICS nations, as the focus is on India's position in the global innovation landscape and the innovation index of each State in India. The article does not provide information about other countries outside of the BRICS nations or compare India's position with them. The dimensions of the data used in the article include India's position in the global innovation landscape, the innovation index of each State in India, and the enablers and performance indicators used in the India Innovation Index report. There are some gaps in the data used in the article. For example, the article does not provide information about the methodology used in the Global Innovation Index or the India Innovation Index report. The article also does not provide information about the sample sizes used in the indices. Additionally, the article does not discuss any limitations or weaknesses in the data used. The data that is essential for the story includes India's ranking in the Global Innovation Index, the innovation index of each State in India, the enablers and performance indicators used in the India Innovation Index report, and the input-outcome gap in the report. The data that is irrelevant to the story includes any information that does not relate to India's position in the global innovation landscape or the innovation index of each State in India. How is it encoded, what problems are with it, and how have you attempted to improve it?The Author's Chart: What to improve: My Improvement: In the above chart, we take an aggregated score of 3 different innovation measures as chosen and outlined by the Global Innovation Index. This picture makes it clearer what goes behind the final ranking. |
Name: Sanjeeb Dey Article: Younger diabetes patients on the rise in most Indian States The story the authors are trying to convey: In more than 20 states out of 29 analysed, the percentage of young people with high glucose levels have increased over the last 5 years. The Data: The data used by authors is from the National Family Health Survey-5(NFHS-5) which was conducted between 2019-2021 and the National Family Health Survey-4 (NFHS-4) which was conducted between 2015-2016. Data Charts provided in the articleChart 1a
Chart 1b
Chart 2a
Chart 2b
Issues:
As it both the graphs of male and female have been placed side by side with their scales adjusted, it is much easier to visually compare the two. Having a vertical line at 0 on the X axis makes it easier to differentiate states with positive vs negative growth. |
Another government survey debunks Swachh Bharat’s 100% ODF claim, count increases to fourFaiz Ali21f1006793STORYThe story that the author is trying to tell is that the Indian government's claims of achieving 100% open defecation-free (ODF) status in various regions across the country through its Swachh Bharat Abhiyan program are misleading and inaccurate. The article cites data from the National Annual Rural Sanitation Survey (NARSS) conducted by the government itself, which indicates that only four out of 28 states and union territories in India can claim to be ODF. What data is he/she using to tell the story?The author is using data from the National Annual Rural Sanitation Survey (NARSS) conducted by the Ministry of Jal Shakti. The survey collects data from rural areas in India to determine the ODF status of different states. The survey data covers the time period between November 2019 and February 2020. The data contains information on the ODF status of different states, including the number of villages and households surveyed, the number of toilets present, and the percentage of households with access to toilets. The data also includes information on the discrepancies found in the ODF status claimed by the government. The data is essential in determining the accuracy of the government's claim of achieving 100% ODF status in various states. However, there are gaps in the data since the survey was conducted before the COVID-19 pandemic, which may have affected the progress made in sanitation. Furthermore The author used data from two more surveys - How is it encoded, what problems are with it, and what improvements could be done?The gaps in the data are not explicitly mentioned in the article, but it is likely that there are some limitations to the surveys in terms of sample size, representativeness, and accuracy of data collection. The data that is essential to the story is the comparison between the claimed ODF status and the actual availability and usage of sanitation facilities in rural areas, as this is the main point of contention. In total , 4 charts ( geo maps/ choropleths) has been used in the article Chart 1COLOR CONTRAST To address this issue, one possible solution would be to use a color scheme that has a higher contrast and is easier to distinguish between different states. For example, instead of using shades of blue and green for different levels of ODF status, we could use a color scheme that uses distinct colors for each level, such as green for ODF, yellow for partially ODF, and red for not ODF. Another option would be to use patterns or textures to differentiate between different levels of ODF status. For example, we could use diagonal lines for ODF, horizontal lines for partially ODF, and no pattern for not ODF. This would make it easier to distinguish between different states, even for readers who are colorblind . Finally, we could also consider using a combination of color and texture to create a more effective and accessible visualization. By using a color scheme with high contrast and patterns or textures to differentiate between different levels of ODF status, we could create a visualization that is both informative and easy to read. chart 2 and 3
I would suggest combine Chart 2 and Chart 3 into a single chart to make it easier for readers to compare the changes in ODF status over time.One possible solution could be to use a single chloropleth map that shows the ODF status for each state in both 2022 and 2023. The map could use different colors or patterns to differentiate between the two time periods, and it could also include a legend or a caption to explain the changes over time. Another option could be to use a small multiples approach, where multiple smaller maps are used to show the ODF status for each state in both years side by side. This would allow readers to compare the changes more easily without having to scroll down the page. Regardless of the approach chosen, it's important to make sure that the chart is clearly labeled and easy to read, with a clear title and axes, and a well-designed legend or key. This will help readers to quickly understand the information presented and to draw meaningful insights from the data. chart 4quoted from the article ** "While the SBMG dashboard does not track toilet access separately, the Swachh Survekshan Grameen survey (December 2021-April 2022) lists the percentage of households with access to toilets (chart 4). It concluded that in 28 States, the share of such households crossed 90% and India’s average was 95%, wildly different from the figures of the MIS survey completed six months earlier."** To improve the clarity of the story and better convey the difference between the MIS survey and the Swachh Survekshan Grameen survey, one possible approach would be to redesign Chart 4 to show a clear comparison between the two data sources. For example, instead of just showing the percentage of households with access to toilets for each state, the chart could include two separate bars or columns for each state, one showing the percentage from the Swachh Survekshan Grameen survey and the other showing the percentage from the MIS survey. This would allow readers to easily see the difference between the two sources and understand the reasons behind the disparity in the results. Overall, the goal of the redesign should be to help readers better understand the nuances and complexities of the data behind the story, and to provide a clear and accurate representation of the differences between the various surveys and data sources. AddddditionallyyyyyAchieving ODF status is not just a matter of building more toilets. In addition to access to toilets, several other factors can affect the overall success of a sanitation program, such as access to clean water, proper drainage systems, and public awareness campaigns to promote good hygiene practices. For example, even if toilets are built, they may not be used if there is not enough water to flush them( and i've seen this situation in some of the villages in Rajasthan) , or if the waste is not properly managed and disposed of. Similarly, if people are not aware of the importance of using toilets and practicing good hygiene, they may continue to defecate in the open or use shared toilets, which can contribute to the spread of diseases. Therefore, achieving and maintaining ODF status requires a multi-pronged approach that addresses not only the physical infrastructure but also the underlying social, cultural, and economic factors that affect people's behavior and attitudes towards sanitation. Some possible strategies could include: Promoting awareness and education campaigns to increase understanding of the benefits of using toilets and practicing good hygiene. Building and maintaining proper sanitation infrastructure, including toilets, drainage systems, and waste management facilities. Encouraging community participation and ownership of sanitation programs, which can help increase their sustainability and effectiveness. Addressing issues related to water scarcity and access, which can affect the usage and maintenance of toilets. By taking a comprehensive approach that addresses these various factors, it may be possible to achieve and maintain ODF status in a sustainable and effective way. |
Name: Rudraraj Dasgupta Article: The religious states of America, in 22 maps What is the story the author is trying to tell? What data he/she is using to tell the story? Describe its details -- type of data, extent of the data, dimensions of the data, gaps in the data, what data is essential and what is irrelevant. From the following visualization we can conclude that the southern states are more religious compared to the northern states. The data was gathered via interviews on attendance of weekly religious services. Catholics are the largest religious group in America. The author segregates the Catholics in the categories as per their ethnicities. Catholic Americans were present in small numbers early in United States history, both in Maryland and in the former French and Spanish colonies that were eventually absorbed into the United States, the vast majority of Catholics in the United States today derive from unprecedented waves of immigration from primarily Catholic countries and regions (Ireland was still part of the United Kingdom until 1921 and German unification didn't officially occur until 1871)link during the mid-to-late 19th and 20th century. We can observe that the Catholics are fairly distributed throughout the country barring exceptions of some states predominately some south eastern states. Catholics by ethnicity Unaffiliated is the second largest group. This group does to belong to any religions. The concentration of the unaffiliated group or irreligious group seem to have high concentrations in the west coast. Irreligion is the active rejection of religion in general, or any of its more specific organized forms, as distinct from absence of religion. Evangelicals are the third largest group in the survey. In the United States, evangelicalism is a movement among Protestant Christians who believe in the necessity of being born again, emphasize the importance of evangelism, and affirm traditional Protestant teachings on the authority as well as the historicity of the Bible. Evangelicals Protestants Mainline Protestants are the forth largest group in the survey. Protestantism emphasizes the Christian believer's justification by God in faith alone (sola fide) rather than by a combination of faith with good works as in Catholicism. They are segregated into various ethnicities just like the Catholics. The author has also provided visualizations for other religious groups such as Mormons, Jews, Jehovah's Witness, Orthodox Christians, Muslims, Buddhists and Hindus. How is it encoded, what problems are with it, and how have you attempted to improve it? While the author has provided visualizations for the religious(including ethnicity) groups. There is no context to understand the relative population of the states. Without the context of the population, conclusions cannot be drawn. Some states have higher population that other, some might have higher population densities. Also there is no timeline to understand the religion distribution among the states throughout history. United States of America has is highly diverse when it comes to religion and ethnicity. We are given the current distribution, but with history we have no context. We cannot draw any conclusions on the trends and the patterns that might occur if such data was provided. Although the author talks about the percentage of population for some religious groups, the data is not consistent throughout the article. Wkipedia Article Religion in the United States provides more clarity. The above line graph show us the trends when it comes to religious groups in the US. The stacked bar graph shows us the trends, for example the increase in unaffiliated group and the decrease in Protestantism. |
Data | Latest AISHE higher education survey has many discrepancies Name: Nino Leenus Article Link: The Story Authors Narrate: Data: Design Process: • Chart 2: This chart shows the GER for Tamil Nadu in the AISHE 2020-21 report based on the 2011 Census and the AISHE 2019-20 report based on the 2001 Census. The problem with this chart is that it is challenging to compare the two reports visually as they use different Censuses. • Table 3: This table calculates the GER using the population projections and enrolment figures taken from the AISHE 2020-21 report for Tamil Nadu. The problem with this table is that the calculated figures do not match with the ones stated in the report. • Chart 4A and 4B: These charts show the difference in population numbers and GER calculations between the latest AISHE figures and the CoI’s numbers for different states. The problem with these charts is that it is challenging to compare the two sets of data visually. Improving the Encoding To improve the encoding, I would suggest the following: Clarify the significance of changes in GER: While the story mentions that the GER has gone up or down in most states, it does not provide an explanation of why this matters. The story could benefit from providing context on the importance of GER as a metric for assessing educational development and the potential impact of changes in GER on various stakeholders. Include a comparison of state-wise GER rankings: The story talks about changes in GER for individual states, but it would be useful to also provide a ranking of states based on their GER in the latest AISHE report. This would help readers understand where each state stands in terms of educational development and how it compares to other states. Provide more information on the methodology: While the story briefly explains how GER is calculated, it could benefit from providing more details on the methodology used in the AISHE report. This would help readers understand how the GER figures were arrived at and the limitations of the methodology. Include quotes from experts: To provide more depth to the story, it could benefit from including quotes from experts in the education sector. These experts could provide insights on the significance of the changes in GER and the implications for policy-making. Use interactive visualizations: While the scatterplot and radar graph used in the story are useful in showing changes in GER over time and across states, interactive visualizations could be more engaging for readers. For example, an interactive map that allows readers to compare GER across different states or a tool that enables readers to explore trends in GER over time could be more effective in conveying the story's key message. In a radar chart we can observe changes in 2D: Year and Population. The difference in projected population for each state can be visualized using data slicer which helps in interactive visualization beyond a static representation. |
Chosen Article: Data | Justice Chandrachud to begin longest tenure for a CJI in a while The article broadly discusses the appointment of Justice DY Chandrachud as the Chief Justice of India. The authors try to provide a background surrounding his appointment in numbers, and compare how his tenure may be similar or different from his predecessors. What are the authors trying to convey?The author discusses how Justice DY Chandrachud will have one of the longest tenures as the CJI in recent history. The length of the tenure may have implications on how much and what impact can the CJI have in the Judicial system, and broadly in the country. Unlike other Justices who had much shorter tenures (especially in the last decade), Justice Chandrchud may have the opportunity to make systemic, and much more meaningful changes to the judicial system. What do the authors miss?On Length of TenureThe authors do not seem to provide many graphical visualizations to support their narrative, and instead simply presents a table listing the CJIs and their tenure in decreasing order. Some legal context Given this context, it is important to be mindful that certain Justices at the SC never get to be CJs (i.e. if an SC Judge turns 65 and retires before they become the senior most). This largely depends on how young the Judge was when they were elevated to the SC. The authors do not take the above into consideration, and simply compare the tenure of DY Chandrachud with the absolute length of tenures of his predecessors. While it is true that Justice Chandrachud's tenure would be the second longest in the past decade, there may be more to why the length of his tenure is something we may not see that often. A better visualization for length of tenuresIn order to account for the increase in strength of judges at the SC, data was obtained from an open-source legal data and legal tech initiative Agami. Specifically, data was obtained from JusticeHub.in, wherein the dataset was partially developed by my alma matter. Although the dataset includes data only up to 2019, details of 3 subsequent CJI appointed thereform, and details of 6 other Judges who may be appointed as the CJI after Justice Chandrachud's retirement have been appended manually. The above boxplots effectively show the distribution of the length of tenure of all past CJIs, with each box plots being indicative of the approved strength at the SC. As shown, there is a decreasing trend in the average length of the tenure as the strength of the Supreme Court increased; which was exactly as we had expected. The only exception to this would be the boxplot representing the lengths of tenures of past CJs who were appointed when the SC strength was 18. Note that this period was relatively short, and the data only comprises tenures of two judges, including of Justice YV Chandrachud who holds the honour of serving as the CJI for the longest duration. Justice YV Chandrachud also happens to be the father of Justice DY Chandrachud. Analysis of proposed visualizationWhile a more coherent graph appropriate to the message that needs to be conveyed with suitably provided pop-outs have been used, the visualization could further be improved on following aspects:
On Distribution of Birth StateThe authors go on to provide the above treemap to indicate the distribution of the birth states and universities that the appointed CJIs belong. The former has come into discussion in recent years with questions generally being raised with respect to the diversity of judicial appointments in the country. While the appointment of CJ is based on seniority, appointment of Judges to the SC are determined by the collegium (which comprises 5 of the senior most judges of the SC). Such subjective system for appointment of Judges may subsequently reflected by the birth state distribution of the Judges. In order to better visualize the diversity, or lack thereof, of the birth states of past CJIs, it would be more effective to provide a choropleth map instead of a treemap. A choropleth map would allow for a more intuitive and geographically accurate representation of the birth state distribution of the CJIs, which would enable a more meaningful analysis of the diversity in judicial appointments. A better visualization for birth statesThe following chart is proposed for representing the same: It is clear that Maharashtra, West Bengal and Uttar Pradesh consume a significant proportion of the birth state distribution, thereby indicating that the CJI appointments have not been diverse. One possible reason for the same could be due to West Bengal and Maharashtra having two of the oldest common law courts in the country. Since these states have some of the oldest institutions for legal practice, it is likely that their residents have had more opportunities to pursue a legal career and thus are more likely to enter the judiciary and be appointed as judges, including as CJIs. Moreover, some of the oldest and most traditional institutions have been established in these states, thereby allowing their residents to more comfortably obtain a quality legal education. Analysis of the proposed visualization
|
Name: Swapnadeep Pradhan What is the story the author is trying to tell?The authors of the article is discussing how education plays a more important role than wealth in determining when women get married in India. The article cites data from the National Family Health Survey (NFHS) that show that women who have completed more than 11 years of schooling tend to marry later than those who have less than five years of schooling. This trend has been consistent for decades, regardless of the current age group of women. The article also reveals that wealth has only recently become a relevant factor in influencing women's marital age. Among older generations, even women from richer households married at a younger age than those from poorer households. However, among younger generations, women from wealthier households tend to marry later than those from poorer households. The article further explores how caste and location also affect women's marital age. Women from SC/ST/OBC communities tend to marry earlier than those from non-SC/ST/OBC communities, even among younger generations. Similarly, women from rural areas tend to marry earlier than those from urban areas. What data he/she is using to tell the story? Describe its details -- type of data, extent of the data, dimensions of the data, gaps in the data, what data is essential and what is irrelevant.The article uses data from the National Family Health Survey (NFHS) 2019-21 to analyze how education, wealth, caste and location affect the age at which women get married in India.The NFHS is a large-scale survey conducted by the Ministry of Health and Family Welfare that covers various aspects of health and well-being of women, men and children. The survey collects information from a nationally representative sample of households using face-to-face interviews. The article focuses on one indicator from NFHS-5: the median age at first marriage among women aged 25-49 years. This indicator reflects the prevalence of child marriage and early marriage among women, which have implications for their health, education, empowerment and rights. The article compares the median age at first marriage across different groups of women based on their wealth quintiles (from poorest to richest), years of schooling completed (from none to more than 11 years), caste categories (SC/ST/OBC/other) and location (rural/urban). The article uses tables and charts to present the data. It also provides some context and explanation for the patterns observed in the data. Some possible sources of error or bias could be:
The data used in the article is essential for understanding how social factors affect women's choices and outcomes regarding marriage. However, some data that could be irrelevant or less important for this analysis are:
How is it encoded, what problems are with it, and how have you attempted to improve it?This article does not have any graphs. Only tables are shown viz. It would have been better served if this table was supplemented by some graphs viz. Another table given is this This also could have been supplemented by line charts viz. Based on these charts, we can get a clearer picture of the societal factors which affect the age of marriage in women. Among these, the factor of education has the greatest correlation and this has been true across multiple generations. Thus increasing access to education for women must be a priority to discourage girl-child marriage. Apart from these this article also has two charts elucidating the same data for men. This is irrelevant as the article purports to focus on women's marital age. |
Title : Data | From 5% to 15%, China’s share in India’s imports tripled in last two decadesArticle by JASMIN NIHALANI StoryThe author uses India's trade data(import-export by value) from 2002 to 2022 to highlight the increase in imports from China to India over two decades. The author also highlights the decline in India's exports to China from 2020 (6.9%) to 2022 (3.4%). However during the same period (2002 to 2022) India's exports to China do not show a consistent trend as in the case of imports. The author uses two charts to show the trade deficit between the two countries widening between 2002 and 2022. The story also looks at the change in imports between China and 9 other countries including India between 2011 and 2021. A slope chart is used to illustrate this difference. The chart indicates and increase in imports from China for 8 countries while US remains constant. The story also explores the types of commodities imported by India from China and exported from India to China. While non-value added raw materials constitute most of India's exports to China, India's imports primarily constitute finished electronic goods and machinery with a high degree of value addition. The author uses 2 tree-map charts to represent this. Line charts have also been used to capture the dependency of India on China for 5 types of goods/commodities from 2012 to 2021 Data Sources used in the storyThe original data used for the story was downloaded from the above 2 sources. India's net import / export values for the years 2002 - 2022 were also taken. This is not represented in the original story. Type of data : International Trade data from 2002 to 2022 Visualisation 1Imports from China to India
The original visualisation uses import data (China to India) in $ billion from 2002 to 2022 to display a bar-chart. The line in the chart represents the change in percentage of imports from China from net imports year on year. The axis for this line is on the right side of the chart and it ranges from 4 to 17. Spanning this range across the entire vertical extant of the chart causes small changes in percentages to be amplified. The axis is also cut-off at the horizontal axis. In the absence of net import data in the visualisation it is had to perceive linkage between the change in percentage of Chinese imports relative to net imports in the given time period. To modify this visualisation net import data was taken from the provided source and added to the visualisation. This is represented as a grey coloured bar. The red bar (semantically encoded) in the grey bar indicates the portion of Chinese imports. The blue line in the chart represents the percentage of net imports that constitute imports from China. The blue line spans 0-20% and occupies the lower half of the chart giving a clearer picture of reality without amplifying small changes in percentages. Visualisation 2Exports from India to China
The original visualisation uses export data (India to China) in $ billion from 2002 to 2022 to display a bar-chart. The line in the chart represents the change in percentage of exports to China relative to net exports year on year. The axis for this line is on the right side of the chart and it ranges from 3 to 8. Spanning this range across the entire vertical extant of the chart causes small changes in percentages to be amplified. The axis is also cut-off at the horizontal axis. In the absence of net export data in the visualisation it is had to perceive linkage between the change in percentage of exports to China relative to net exports in the given time period. To modify this visualisation net export data was taken from the provided source and added to the visualisation. This is represented as a grey coloured bar. The red bar (semantically encoded) in the grey bar indicates the portion of exports to China. The blue line in the chart represents the percentage of net exports that constitute exports to China. The blue line spans 0-10% and occupies the lower half of the chart giving a clearer picture of reality without amplifying small changes in percentages. Visualisation 3Imports from China to nine other countries including India and some of its neighbours
The original visualisation uses data of nine countries - their total imports from China in 2011 and 2021. The data is visualised using a slope chart which highlights the general increase in imports from China. The United States seems to have maintained stability by constraining imports from China to ~18.4% of it's total imports over the 10 year period. This is represented by the nearly straight line parallel to the horizontal axis for USA. This chart could be quite hard to understand for someone who has not seen slope charts before and in the original visualisation the names of 2 countries are missing. The visualisation has been modified to represent the Chinese import data of the same countries on a radar plot. This has been complemented with 2 Choropleth maps (2011 and 2021) to capture the change in percentage of imports from China. Semantic Encoding - A deeper shade of red indicates a higher percentage of imports from China and a lighter shade indicates a lower percentage. Both the Choropleth maps have been colour adjusted so that USA has the same colour in both maps since its percentage remains the same. Other Visualisations Present in the Story
The three visualisations above represent and depict the data quite well in line with the narrative presented in the story and were not modified. |
Rajan Kumar Title : A year of Russia’s invasion of Ukraine Article by THE HINDU BUREAU Story The article is a collection of charts and data that illustrate the events surrounding Russia's invasion of Ukraine over the past year. It covers topics such as the number of people affected by the conflict, the amount of military spending by Ukraine and Russia, and the impact of economic sanctions on Russia. The charts provide a visual representation of the conflict and the article provides a concise summary of the key points. On February 24, 2022, Russia launched an attack on Ukraine, following weeks of military build-up along the border of its neighbour. The conflict dates back to 2014, when Russia annexed Crimea. As of February 21, 2023, deaths of Russian and Ukrainian military personnel amounted to 180,000 and 100,000, respectively, while there were 16,150 civilian casualties in Ukraine, since February 24, 2022. Nearly one-third of the population of Ukraine remains forcibly displaced from their homes, making it one of the largest displacement crises in the world today, according to the United Nations High Commissioner for Refugees. The conflict has had a devastating effect on the lives of many. Data Sources used in the story
Visualisation 1: There is huge difference in data value and there are many parameters on which this comparison has been done. So decided to go with doubnut chart After Modification: After Modification Problems with the visualisation and my attempt to improve it |
Kumar Chandan : 21F1004845 Title : Data | Food( Fish, Chicken, Egg) habit of Indians state-wise Story by Author: The author of the article uses several graphs to illustrate the data on meat consumption in India. This article provides statistical information on the percentage of Indians who consume meat. Since the data used in graph not mentioned clearly that from where he got these data so i scrap the data from graph and put it in CSV file to analyse it better before going to redraw this graph again. i also added one extra column for population to see total number of consumption state-wise, which is given below. Even the author try to explain the the data in 4 different graph but again there are many informations missing:
I tried to solve the missing informations which was missing in given graph
Interesting Fact: In graph we can see clearly that only 55% people eat non-veg from UP compare to Tamil Nadu but consumption of meat/fish/egg is much larger in UP IMP Points:Since the chart and the graph generated using programatically so its highly dynamic in nature. Technology and data :
|
Lolla Ajay Kumar : 21f1000200 The News article that was chosen by me is a Analysis report on the Indian Economy and various parameters that are effecting it in the post-pandemic world. The writer of the article has considered 4 major parameters that effect the GDP growth rate into consideration. In this article of THE HINDU, 5 charts have been used to depict various trends. CHART-1 NEGATIVE POINTS-
Keeping in mind all the above drawbacks, I had come up with a new approach. MY APPROACH- CHART-1 CHART-2
MY APPROACH- CHART-2 NEGATIVE POINTS- MY APPROACH -CHART-3 I have introduced a new line that depicts the lag between the REPO rate and FED rate. It clearly shows which happens first and which happens later. |
For this assignment, we'll use data stories from The Hindu Data Point. Use what you have learned in Week 4 & Week 5 for doing this assignment.
Select a story that you like, study it carefully, and redesign it. Specifically, we want you to focus on understanding the data that powers the story, and how it is visually encoded to tell the intended story. Document your design process, capturing the following:
You may choose to expand or curtail the scope of the data used in the story or add an additional dataset to tell the story better. But do not deviate from the main intent of the original story. In other words, it is a redesign exercise, and hence I do not want you to tell a different, unrelated story.
While you should provide a link to the original story, it might be useful to capture and display inline, appropriate parts of the original visualization, and your own design iterations to produce coherent documentation.
For reference, take a look at what the previous batches (2019,2020,2021, 2022 )did with this assignment.
The text was updated successfully, but these errors were encountered: