
World Development Indicators

Team: Echo
Rohit Naik Jarupla
Afzal Shama Soudagar
March 15, 2017

1 Project Description
Representing development across the world is an onerous task that requires careful analysis of
various social and economic factors. Development indicators, tracked across countries and over time, serve as
tools for systematic analysis. We intend to take this large body of data and present it to viewers in a
comprehensible format, with features such as sorting, filtering and graphical
representation.

Our project is a website that presents the World Development Indicators: more than a thousand annual
indicators of development for hundreds of countries over the years 1960-2015, along
with the source of each indicator. The indicators are categorized into several levels, and information
such as description, periodicity and source is shown for each one. We also show a detailed account of each
country, including its economic status and latest census.

We compare and represent the selected indicators for the selected countries and years using either tables
or charts (line graph, bar graph, etc.). Our team, Echo, echoes the development.

2 Data Sourcing and Statistics


Our source of data is Kaggle DataSets
Link: https://www.kaggle.com/worldbank/world-development-indicators

The following tables from the data set are used: Country, Series and Indicators. We refined the data
into the following tables:
• Country : It is split into 3 smaller tables - country_info, country_economy and
country_census. Only the attributes country_code and country_name are used frequently throughout
the project, so we divided the table to avoid reading all the other attributes on every
access to a record.
• Series : All columns other than the ones listed in Table 1 are removed, since
they are not relevant to our project.
• Indicators : We removed the columns country_name and indicator_name from the table, since
country_code and indicator_code respectively are sufficient.
• Category : A table created from the series_code and category columns of the Series table.
The category attribute is split into 3 levels (columns), delimited by ':'. This lets us
organize the indicators into a hierarchy of levels for applying filters.
The category column is split using the Python code in Figure 1:

Figure 1: Python Code
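
A minimal sketch of how such a split might look, not necessarily the code in Figure 1. It assumes the category value is a string with up to three ':'-separated parts; the function name split_category and the sample values are hypothetical. Missing levels come back as None so that they can be loaded as NULL.

```python
from typing import List, Optional


def split_category(category: str, levels: int = 3) -> List[Optional[str]]:
    """Split a ':'-delimited category string into a fixed number of levels."""
    # Split at most `levels - 1` times so any extra ':' stays in the last level.
    parts = [p.strip() for p in category.split(":", levels - 1)]
    # Drop empty fragments, then pad with None so every row has `levels` columns.
    parts = [p for p in parts if p]
    return parts + [None] * (levels - len(parts))


# Hypothetical sample values, for illustration only.
print(split_category("Health: Mortality"))
print(split_category("Environment: Emissions: CO2"))
```

Padding with None keeps every row at exactly three columns, which matches the NULL / NOT NULL test the Indicators tab relies on when deciding whether a deeper level exists.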

Table 1: List of Entities and Attributes


Entity            Attributes
country_info      country_code, short_name, long_name, alpha2code, currency_unit, region
country_economy   country_code, income_grp, sys_accnts, sys_trade, govt_accnt
country_census    country_code, population, house_survey, src_ie, agriculture, industrial, trade, water_wdrl
series            series_code, indicator_name, description, periodicity, source
indicators        country_code, indicator_code, year, value
category          series_code, level1, level2, level3

3 Functionality and Working


3.1 User’s View of the Website
a) Indicators Tab:
The Indicators tab is where the user can view information about the various indicators. The indicators are
categorized into levels (at most 3) based on their type.
The first level corresponds to the broadest category (e.g. Health, Environment).
Clicking on a list item does one of two things, depending on whether the next level:
Exists (NOT NULL): The user is taken to the next level, and the list is populated with the sub-levels of the chosen item.
Doesn't exist (NULL): The list of indicators under the chosen hierarchy is shown. The user can view
the information about an indicator by clicking on it.
The information contains:
Indicator Name
Indicator Code
Description/ Definition of the indicator
Periodicity - how frequently the indicator is updated
Source for the data of the indicator
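
The drill-down described above can be sketched as follows. This is an illustration under assumptions, not the website's actual code: it uses Python's built-in sqlite3 in place of the project's database, the function name drill_down is hypothetical, and the sample rows are invented.

```python
import sqlite3

LEVELS = ("level1", "level2", "level3")


def drill_down(conn, path):
    """Given the 1 to 3 level names chosen so far, return the next list to show.

    Returns ("levels", names) if a deeper level exists (NOT NULL),
    otherwise ("indicators", series_codes).
    """
    # Column names come from the fixed LEVELS tuple, never from user input.
    where = " AND ".join(f"{col} = ?" for col in LEVELS[: len(path)])
    if len(path) < len(LEVELS):
        next_col = LEVELS[len(path)]
        rows = conn.execute(
            f"SELECT DISTINCT {next_col} FROM category "
            f"WHERE {where} AND {next_col} IS NOT NULL ORDER BY {next_col}",
            path,
        ).fetchall()
        if rows:
            return "levels", [r[0] for r in rows]
    # No deeper level: list the indicators that sit at this point of the hierarchy.
    rows = conn.execute(
        f"SELECT series_code FROM category WHERE {where} ORDER BY series_code", path
    ).fetchall()
    return "indicators", [r[0] for r in rows]


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE category (series_code TEXT, level1 TEXT, level2 TEXT, level3 TEXT)")
# Hypothetical rows, for illustration only.
conn.executemany(
    "INSERT INTO category VALUES (?, ?, ?, ?)",
    [
        ("X.1", "Health", "Mortality", None),
        ("X.2", "Health", "Nutrition", None),
        ("X.3", "Gender", None, None),
    ],
)
print(drill_down(conn, ["Health"]))
print(drill_down(conn, ["Gender"]))
```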

b) Countries Tab:
A list of countries is shown (247 countries). The user can click on a country of choice to view its
information.
Three kinds of information about the country are shown: its general info, economic data and latest census.
About the country: Country Code, Short Name, Long Name, Alpha2Code, Currency Unit, Region

Table 2: Data Statistics
Table             No. of Tuples   Time to Load    Raw Dataset Size   Cleaned Dataset Size
country_info      247             19.925 ms       132 KB *           19 KB
country_economy   247             20.681 ms       132 KB *           28 KB
country_census    247             18.514 ms       132 KB *           21 KB
series            1,345           114.376 ms      3.3 MB             0.811 MB
indicators        5,656,458       10452.334 ms    574.3 MB           185.3 MB
category          1,345           67.206 ms       –                  86 KB
* Raw size of the single Country table, before it was split into three.

Economic Data: Income Group, System of National Accounts, System of Trade, Government Accounting
Concept
Latest Census: Latest Population Census, Latest Household Survey, Source of Most Recent Income
and Expenditure Data, Latest Agricultural Census, Latest Industrial Data, Latest Trade
Data, Latest Water Withdrawal Data

c) Comparison/Representation Tab:
• Tabular Form:
The user can choose the countries, indicators and years of interest from the corresponding lists. Clicking
on the "Show" button lists the values of those indicators for those countries in those years.
The columns of the table are Country, Indicator, Year and Value.
The values can be sorted by any column. They can also be filtered (countries, indicators and
years can be selected and removed from the side panel).
• Graphical Form: The main objective of the graphical form is comparison.
The values of an indicator for the chosen countries and years are plotted as a line graph (year on
the x-axis, indicator value on the y-axis, one curve per country in a different color) or as a bar diagram.
If more than one indicator is selected, each is drawn on a separate graph.
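
One way to prepare query results for the graphical form is to group them into one graph per indicator and one curve per country. The sketch below is only an illustration of that grouping; the function name group_for_plotting and the sample rows are hypothetical, and the actual drawing is left to whatever charting library the site uses.

```python
from collections import defaultdict


def group_for_plotting(rows):
    """Group (country_code, indicator_code, year, value) rows into one graph per indicator."""
    graphs = defaultdict(lambda: defaultdict(list))
    for country, indicator, year, value in rows:
        graphs[indicator][country].append((year, value))
    # Sort each curve by year so points are drawn left to right on the x-axis.
    return {
        indicator: {country: sorted(points) for country, points in curves.items()}
        for indicator, curves in graphs.items()
    }


# Hypothetical rows, for illustration only.
rows = [
    ("AAA", "IND.1", 2001, 2.0),
    ("AAA", "IND.1", 2000, 1.0),
    ("BBB", "IND.1", 2000, 3.0),
    ("AAA", "IND.2", 2000, 9.0),
]
graphs = group_for_plotting(rows)
print(graphs["IND.1"]["AAA"])
```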

3.2 Special Functionality


a) Indexes:
Indexes are used for faster search. In our data set, the indicators table contains more than 5 million
tuples. Hence we created B-Tree indexes on indicator_code and country_code for faster
lookups.
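
As an illustration of the idea, the sketch below creates two such indexes and asks the query planner whether a typical lookup would use one. It is a sketch under assumptions: Python's built-in sqlite3 stands in for the project's database (the queries in Section 3.3 appear to be written for PostgreSQL), and the index names are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE indicators (country_code TEXT, indicator_code TEXT, year INTEGER, value REAL)"
)
# Index names are hypothetical; the report does not give them.
conn.execute("CREATE INDEX idx_indicators_indicator ON indicators (indicator_code)")
conn.execute("CREATE INDEX idx_indicators_country ON indicators (country_code)")

# Ask the planner how it would run a typical lookup.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT value FROM indicators "
    "WHERE country_code = ? AND indicator_code = ?",
    ("AAA", "IND.1"),
).fetchall()
for row in plan:
    print(row[-1])  # the human-readable plan step
```

The last column of each plan row names the index the planner chose, which is a quick way to confirm that a lookup is not falling back to a full table scan.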

b) Constraints:
Since data is available only for the years 1960 to 2015, the year attribute is constrained as follows:

year >= 1960 AND year <= 2015 (1)
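
Constraint (1) can be expressed as a CHECK on the year column. The sketch below shows the effect using Python's built-in sqlite3 as a stand-in for the project's database; the helper try_insert and the sample values are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE indicators (
        country_code   TEXT,
        indicator_code TEXT,
        year           INTEGER CHECK (year >= 1960 AND year <= 2015),
        value          REAL
    )
    """
)


def try_insert(year):
    """Return True if the row is accepted, False if the CHECK rejects it."""
    try:
        conn.execute("INSERT INTO indicators VALUES ('AAA', 'IND.1', ?, 1.0)", (year,))
        return True
    except sqlite3.IntegrityError:
        return False


print(try_insert(2015))
print(try_insert(2016))
```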

c) Referential Constraints:
There are referential constraints between the columns of the following tables:
(indicator_code in indicators, series_code in series)
(country_code in indicators, country_code in country_info)
(series_code in category, series_code in series)
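
The three referential constraints above can be sketched as foreign keys. This is a minimal illustration using Python's built-in sqlite3 rather than the project's database; only the key columns are modeled, and the helper references_exist and the sample codes are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite only enforces foreign keys when asked to
conn.executescript(
    """
    CREATE TABLE country_info (country_code TEXT PRIMARY KEY);
    CREATE TABLE series (series_code TEXT PRIMARY KEY);
    CREATE TABLE category (
        series_code TEXT REFERENCES series (series_code),
        level1 TEXT, level2 TEXT, level3 TEXT
    );
    CREATE TABLE indicators (
        country_code   TEXT REFERENCES country_info (country_code),
        indicator_code TEXT REFERENCES series (series_code),
        year INTEGER, value REAL
    );
    INSERT INTO country_info VALUES ('AAA');
    INSERT INTO series VALUES ('IND.1');
    """
)


def references_exist(country_code, indicator_code):
    """Return True if an indicators row with these codes is accepted."""
    try:
        conn.execute(
            "INSERT INTO indicators VALUES (?, ?, 2000, 1.0)",
            (country_code, indicator_code),
        )
        return True
    except sqlite3.IntegrityError:
        return False


print(references_exist("AAA", "IND.1"))
print(references_exist("ZZZ", "IND.1"))
```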

d) Privileges:
User Mode: Has read-only privileges; can only view the data.
Admin Mode: Has unlimited access privileges to the database. The admin can view, add, update and
delete any record.

3.3 List of Queries
• Indicators Tab:
– SELECT DISTINCT level1 FROM category;
– SELECT DISTINCT level2 FROM category WHERE BTRIM(level1)=$1;
– SELECT DISTINCT level3 FROM category WHERE BTRIM(level1)=$1 AND BTRIM(level2)=$2;
– SELECT series_code FROM category WHERE BTRIM(level1)=$1 AND BTRIM(level2)=$2
AND BTRIM(level3)=$3;
– SELECT * FROM series WHERE BTRIM(series_code)=$1;
• Countries Tab:
– SELECT country_code, short_name FROM country_info;
– SELECT * FROM country_info, country_economy, country_census WHERE country_info.country_code=countr
AND country_economy.country_code=country_census.country_code AND country_info.country_code=$1;

• Comparison/Representation Tab:
– Tabular Form
∗ SELECT * FROM indicators WHERE country_code=$1 AND indicator_code=$2
AND year=$3;
∗ SELECT * FROM indicators WHERE country_code=$1 AND indicator_code=$2
AND year=(eq 1);
∗ SELECT * FROM indicators WHERE (country_code=$1 OR country_code=$2)
AND indicator_code=$3 AND year=(eq 1);
∗ SELECT * FROM indicators WHERE (country_code=$1 OR country_code=$2)
AND (indicator_code=$3 OR indicator_code=$4) AND year=(eq 1);
∗ SELECT * FROM indicators WHERE (country_code=$1 OR country_code=$2 OR
country_code=$3) AND (indicator_code=$4 OR indicator_code=$5 OR indicator_code=$6)
AND year=(eq 1) ORDER BY country_code, indicator_code, year;
– Graphical Form
∗ SELECT * FROM indicators WHERE country_code=$1 AND indicator_code=$2
AND year=(eq 2);
∗ SELECT * FROM indicators WHERE country_code=$1 AND indicator_code=$2
AND year=(eq 3);
∗ SELECT * FROM indicators WHERE (country_code=$1 OR country_code=$2) AND
indicator_code=$3 AND year=(eq 3);
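
The comparison queries above differ only in how many countries and indicators are selected, so a single parameterized query can cover all of them. The sketch below builds such a query; it is an illustration under assumptions rather than the site's actual code. It uses Python's built-in sqlite3 (with ? placeholders instead of the $1 style above), the function name build_indicator_query is hypothetical, and an inclusive year range is assumed for the year condition.

```python
import sqlite3


def build_indicator_query(countries, indicators, year_from, year_to):
    """Build one parameterized SELECT for any number of countries and indicators."""
    if not countries or not indicators:
        raise ValueError("select at least one country and one indicator")
    # One placeholder per selected value; the values themselves are bound separately.
    country_marks = ", ".join("?" for _ in countries)
    indicator_marks = ", ".join("?" for _ in indicators)
    sql = (
        "SELECT country_code, indicator_code, year, value FROM indicators "
        f"WHERE country_code IN ({country_marks}) "
        f"AND indicator_code IN ({indicator_marks}) "
        "AND year BETWEEN ? AND ? "
        "ORDER BY country_code, indicator_code, year"
    )
    return sql, [*countries, *indicators, year_from, year_to]


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE indicators (country_code TEXT, indicator_code TEXT, year INTEGER, value REAL)"
)
# Hypothetical rows, for illustration only.
conn.executemany(
    "INSERT INTO indicators VALUES (?, ?, ?, ?)",
    [
        ("BBB", "IND.1", 2000, 3.0),
        ("AAA", "IND.1", 2001, 2.0),
        ("AAA", "IND.1", 2000, 1.0),
        ("CCC", "IND.1", 2000, 5.0),
    ],
)
sql, params = build_indicator_query(["AAA", "BBB"], ["IND.1"], 2000, 2001)
result = conn.execute(sql, params).fetchall()
print(result)
```

Using IN lists with bound parameters keeps the query text independent of the user's input while avoiding a separate hand-written OR chain for every selection size.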

4 ER Diagram

Figure 2: ER Diagram

Table 3: Average Running Time of the Queries (numbered in the order listed in Section 3.3)
Query Number   Average Running Time (ms)
1.1            11.639
1.2            3.135
1.3            33.779
1.4            2.098
1.5            17.152
2.1            20.996
2.2            1.636
3.1.1          0.594
3.1.2          0.799
3.1.3          58.921
3.1.4          3372.370
3.1.5          2118.127
3.2.1          3.731
3.2.2          13.732
3.2.3          70.964
