Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 119808 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Total size in memory | 4.6 MiB |
Average record size in memory | 40.0 B |
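The two memory figures above are consistent with each other. The sketch below reproduces them from the row and column counts, assuming every cell occupies 8 bytes (a 64-bit float or an object pointer). That width is an assumption, not something the report states, and the function names are illustrative.

```python
# Figures copied from the dataset statistics table.
N_ROWS = 119_808
N_COLS = 5

# Assumption: each cell is 8 bytes wide (64-bit float or object pointer).
BYTES_PER_CELL = 8


def record_size_bytes(n_cols: int, bytes_per_cell: int = BYTES_PER_CELL) -> int:
    """Size of one row when every column stores a fixed-width cell."""
    return n_cols * bytes_per_cell


def total_size_mib(n_rows: int, n_cols: int) -> float:
    """Total size of all rows in MiB (1 MiB = 2**20 bytes)."""
    return n_rows * record_size_bytes(n_cols) / 2**20


print(record_size_bytes(N_COLS))
print(f"{total_size_mib(N_ROWS, N_COLS):.1f} MiB")
```

Under this assumption the size counts only the fixed-width cells, not the contents of the strings they may point to.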
Variable types
Text | 3 |
---|---|
Numeric | 2 |
Variable descriptions
ald_business_unit | sub-sector of ald_sector |
---|---|
capacity_factor | ratio by which a capacity is converted into production. |
scenario | name of the scenario |
scenario_geography | regional geography of a scenario |
year | year |
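As an illustration of the capacity_factor description, the sketch below converts an installed capacity into annual production. The units (MW and MWh), the 8760-hour year, and the function name are assumptions made for the example; the report does not state how the dataset's consumers apply the factor.

```python
# Assumption: capacity is in MW, production in MWh, and a year has 8760 hours.
HOURS_PER_YEAR = 8760


def annual_production_mwh(capacity_mw: float, capacity_factor: float,
                          hours: float = HOURS_PER_YEAR) -> float:
    """Production = capacity x capacity factor x hours in the period."""
    if not 0.0 <= capacity_factor <= 1.0:
        # The report shows capacity_factor ranging from 0 to 1.
        raise ValueError("capacity_factor must lie in [0, 1]")
    return capacity_mw * capacity_factor * hours


# A hypothetical 100 MW plant running at a capacity factor of 0.5.
print(annual_production_mwh(100, 0.5))
```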
Alerts
capacity_factor has 8888 (7.4%) zeros | Zeros |
Reproduction
Analysis started | 2024-04-15 12:05:21.593548 |
---|---|
Analysis finished | 2024-04-15 12:05:21.732322 |
Duration | 0.14 seconds |
Software version | ydata-profiling v4.7.0 |
Download configuration | config.json |
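The reported duration can be recomputed from the two timestamps. This is a minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the reproduction table.
started = datetime.fromisoformat("2024-04-15 12:05:21.593548")
finished = datetime.fromisoformat("2024-04-15 12:05:21.732322")

# Elapsed time as a float number of seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```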
scenario
Text
name of the scenario
Distinct | 34 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 936.1 KiB |
Length
Max length | 22 |
---|---|
Median length | 20 |
Mean length | 16.58698918 |
Min length | 8 |
Characters and Unicode
Total characters | 1987254 |
---|---|
Distinct characters | 37 |
Distinct categories | 1 |
Distinct scripts | 1 |
Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
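To illustrate what a character property looks like, the sketch below counts Unicode general categories for one sample value using Python's standard `unicodedata` module. It is not necessarily how the profiling tool derives its figures, and the helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count the Unicode general category of every character in text."""
    return Counter(unicodedata.category(ch) for ch in text)


# Sample value taken from the scenario column.
counts = category_counts("WEO2021_SDS")
for category, count in counts.most_common():
    print(category, count)
```

Here `Lu` is an uppercase letter, `Nd` a decimal digit, and `Pc` connector punctuation such as the underscore.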
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | WEO2021_SDS |
---|---|
2nd row | WEO2021_SDS |
3rd row | WEO2021_SDS |
4th row | WEO2021_SDS |
5th row | WEO2021_SDS |
Value | Count | Frequency (%) |
ipr2023_baseline | 6090 | 5.1% |
ipr2023_fps | 6090 | 5.1% |
ngfs2023gcam_nz2050 | 5688 | 4.7% |
ngfs2023gcam_cp | 5688 | 4.7% |
ngfs2023gcam_fw | 5688 | 4.7% |
ngfs2023gcam_ndc | 5688 | 4.7% |
ngfs2023gcam_dt | 5688 | 4.7% |
ngfs2023gcam_ld | 5688 | 4.7% |
ngfs2023gcam_b2ds | 5688 | 4.7% |
ngfs2023remind_dt | 4740 | 4.0% |
Other values (24) | 63072 | 52.6% |
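The frequency column in the table above is each count divided by the number of observations. A minimal sketch, with `frequency_pct` as an illustrative helper name:

```python
# Number of observations, from the dataset statistics.
N_OBSERVATIONS = 119_808


def frequency_pct(count: int, total: int = N_OBSERVATIONS) -> str:
    """Format a count as a percentage of the total, to one decimal place."""
    return f"{100 * count / total:.1f}%"


# Counts copied from the scenario value table.
for label, count in [("ipr2023_baseline", 6090),
                     ("ngfs2023gcam_nz2050", 5688),
                     ("ngfs2023remind_dt", 4740),
                     ("Other values (24)", 63072)]:
    print(label, count, frequency_pct(count))
```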
Most occurring characters
Value | Count | Frequency (%) |
2 | 268956 | 13.5% |
S | 175362 | 8.8% |
N | 159642 | 8.0% |
G | 159264 | 8.0% |
0 | 152076 | 7.7% |
_ | 122376 | 6.2% |
F | 116058 | 5.8% |
3 | 108924 | 5.5% |
M | 96222 | 4.8% |
E | 94866 | 4.8% |
Other values (27) | 533508 | 26.8% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1987254 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1987254 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1987254 | 100.0% |
scenario_geography
Text
regional geography of a scenario
Distinct | 41 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 936.1 KiB |
Length
Max length | 22 |
---|---|
Median length | 18 |
Mean length | 9.341897035 |
Min length | 2 |
Characters and Unicode
Total characters | 1119234 |
---|---|
Distinct characters | 39 |
Distinct categories | 1 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | AdvancedEconomies |
---|---|
2nd row | AdvancedEconomies |
3rd row | AdvancedEconomies |
4th row | AdvancedEconomies |
5th row | AdvancedEconomies |
Value | Count | Frequency (%) |
global | 12820 | 10.7% |
china | 10410 | 8.7% |
reformingeconomies | 9954 | 8.3% |
oecdandeu | 9954 | 8.3% |
middleeastandafrica | 9954 | 8.3% |
latinamerica | 9954 | 8.3% |
asia | 9954 | 8.3% |
japan | 7672 | 6.4% |
india | 7672 | 6.4% |
unitedstates | 7672 | 6.4% |
Other values (31) | 23792 | 19.9% |
Most occurring characters
Value | Count | Frequency (%) |
a | 127898 | 11.4% |
i | 112264 | 10.0% |
n | 95552 | 8.5% |
e | 77802 | 7.0% |
d | 71292 | 6.4% |
A | 61376 | 5.5% |
o | 57566 | 5.1% |
t | 55072 | 4.9% |
s | 53912 | 4.8% |
c | 44500 | 4.0% |
Other values (29) | 362000 | 32.3% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1119234 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1119234 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1119234 | 100.0% |
ald_business_unit
Text
sub-sector of ald_sector
Distinct | 16 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 936.1 KiB |
Length
Max length | 13 |
---|---|
Median length | 9.5 |
Mean length | 8.365100828 |
Min length | 6 |
Characters and Unicode
Total characters | 1002206 |
---|---|
Distinct characters | 33 |
Distinct categories | 1 |
Distinct scripts | 1 |
Distinct blocks | 1 |
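For each text variable, the mean length should equal the total character count divided by the number of observations. The sketch below checks that relationship against the reported figures; the dictionary layout and tolerance are illustrative choices.

```python
import math

# Number of observations, from the dataset statistics.
N_OBSERVATIONS = 119_808

# (total characters, reported mean length) per text variable, from the report.
TEXT_STATS = {
    "scenario": (1_987_254, 16.58698918),
    "scenario_geography": (1_119_234, 9.341897035),
    "ald_business_unit": (1_002_206, 8.365100828),
}


def mean_length(total_characters: int, n_rows: int = N_OBSERVATIONS) -> float:
    """Average string length when total_characters are spread over n_rows."""
    return total_characters / n_rows


for name, (total, reported) in TEXT_STATS.items():
    computed = mean_length(total)
    # The tolerance is loose enough to absorb the report's rounding.
    print(name, computed, math.isclose(computed, reported, abs_tol=1e-6))
```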
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | CoalCap |
---|---|
2nd row | CoalCap |
3rd row | CoalCap |
4th row | CoalCap |
5th row | CoalCap |
Value | Count | Frequency (%) |
coalcap | 19096 | 15.9% |
hydrocap | 19096 | 15.9% |
gascap | 19096 | 15.9% |
nuclearcap | 19096 | 15.9% |
oilcap | 19096 | 15.9% |
renewablescap | 19096 | 15.9% |
biomasscap | 1218 | 1.0% |
solarcap | 1218 | 1.0% |
onwindcap | 1218 | 1.0% |
offwindcap | 1218 | 1.0% |
Other values (6) | 360 | 0.3% |
Most occurring characters
Value | Count | Frequency (%) |
a | 198268 | 19.8% |
C | 138544 | 13.8% |
p | 119448 | 11.9% |
l | 77602 | 7.7% |
e | 76384 | 7.6% |
s | 40628 | 4.1% |
o | 40628 | 4.1% |
r | 39410 | 3.9% |
n | 22750 | 2.3% |
i | 22750 | 2.3% |
Other values (23) | 225794 | 22.5% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1002206 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1002206 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1002206 | 100.0% |
year
Real number (ℝ)
year
Distinct | 80 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2055.87505 |
Minimum | 2021 |
---|---|
Maximum | 2100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 936.1 KiB |
Quantile statistics
Minimum | 2021 |
---|---|
5-th percentile | 2024 |
Q1 | 2035 |
median | 2052 |
Q3 | 2076 |
95-th percentile | 2096 |
Maximum | 2100 |
Range | 79 |
Interquartile range (IQR) | 41 |
Descriptive statistics
Standard deviation | 23.35979019 |
---|---|
Coefficient of variation (CV) | 0.01136245619 |
Kurtosis | -1.194863645 |
Mean | 2055.87505 |
Median Absolute Deviation (MAD) | 19 |
Skewness | 0.3036264903 |
Sum | 246310278 |
Variance | 545.6797975 |
Monotonicity | Not monotonic |
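Several of the statistics above are derived from one another: the mean is the sum over the number of observations, the variance is the squared standard deviation, the coefficient of variation is the standard deviation over the mean, and the range and IQR are differences of quantiles. The sketch below checks those identities on the reported `year` figures. The dictionary keys and the tolerance are illustrative choices.

```python
import math

# Summary figures for `year`, copied from the report.
YEAR = {
    "n": 119_808, "sum": 246_310_278, "mean": 2055.87505,
    "std": 23.35979019, "variance": 545.6797975, "cv": 0.01136245619,
    "min": 2021, "max": 2100, "range": 79,
    "q1": 2035, "q3": 2076, "iqr": 41,
}


def check_summary(s: dict, rel_tol: float = 1e-6) -> dict:
    """Return, per identity, whether the reported figures agree."""
    return {
        "mean": math.isclose(s["sum"] / s["n"], s["mean"], rel_tol=rel_tol),
        "variance": math.isclose(s["std"] ** 2, s["variance"], rel_tol=rel_tol),
        "cv": math.isclose(s["std"] / s["mean"], s["cv"], rel_tol=rel_tol),
        "range": s["max"] - s["min"] == s["range"],
        "iqr": s["q3"] - s["q1"] == s["iqr"],
    }


print(check_summary(YEAR))
```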
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
2022 | 2184 | 1.8% |
2032 | 2184 | 1.8% |
2023 | 2184 | 1.8% |
2040 | 2184 | 1.8% |
2039 | 2184 | 1.8% |
2038 | 2184 | 1.8% |
2037 | 2184 | 1.8% |
2036 | 2184 | 1.8% |
2035 | 2184 | 1.8% |
2034 | 2184 | 1.8% |
Other values (70) | 97968 | 81.8% |
Value | Count | Frequency (%) |
2021 | 12 | < 0.1% |
2022 | 2184 | 1.8% |
2023 | 2184 | 1.8% |
2024 | 2184 | 1.8% |
2025 | 2184 | 1.8% |
Value | Count | Frequency (%) |
2100 | 1230 | 1.0% |
2099 | 1230 | 1.0% |
2098 | 1230 | 1.0% |
2097 | 1230 | 1.0% |
2096 | 1230 | 1.0% |
capacity_factor
Real number (ℝ)
ZEROS
ratio by which a capacity is converted into production.
Distinct | 81591 |
---|---|
Distinct (%) | 68.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.4399129151 |
Minimum | 0 |
---|---|
Maximum | 1 |
Zeros | 8888 |
Zeros (%) | 7.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 936.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0.2503956826 |
median | 0.3849102617 |
Q3 | 0.6665467365 |
95-th percentile | 0.8483013396 |
Maximum | 1 |
Range | 1 |
Interquartile range (IQR) | 0.416151054 |
Descriptive statistics
Standard deviation | 0.2657513697 |
---|---|
Coefficient of variation (CV) | 0.6040999493 |
Kurtosis | -1.008876789 |
Mean | 0.4399129151 |
Median Absolute Deviation (MAD) | 0.1760115187 |
Skewness | 0.1887513051 |
Sum | 52705.08653 |
Variance | 0.07062379051 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
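A fixed-size-bin histogram splits the value range into equally wide intervals and counts the values in each. The sketch below shows how a capacity_factor value would map to one of 50 bins over [0, 1]. It is a plain-Python illustration under that assumed range, not the profiling tool's own implementation, and `bin_index` is a hypothetical helper.

```python
def bin_index(x: float, lo: float = 0.0, hi: float = 1.0, bins: int = 50) -> int:
    """Index of the equal-width bin containing x, with hi in the last bin."""
    if not lo <= x <= hi:
        raise ValueError("x is outside the histogram range")
    # Scale x to [0, bins] and truncate; clamp so x == hi lands in the last bin.
    return min(int((x - lo) / (hi - lo) * bins), bins - 1)


# A few values copied from the capacity_factor tables.
for value in (0.0, 0.2535047025, 0.636920856, 1.0):
    print(value, bin_index(value))
```

With 50 bins over [0, 1], each bin is 0.02 wide, so the 8888 zeros all fall into bin 0.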
Value | Count | Frequency (%) |
0 | 8888 | 7.4% |
0.2535047025 | 2744 | 2.3% |
0.2535047025 | 784 | 0.7% |
0.3407007705 | 553 | 0.5% |
0.2508640285 | 392 | 0.3% |
1 | 299 | 0.2% |
0.636920856 | 180 | 0.2% |
0.4477501457 | 147 | 0.1% |
0.461395189 | 120 | 0.1% |
0.4277891854 | 107 | 0.1% |
Other values (81581) | 105594 | 88.1% |
Value | Count | Frequency (%) |
0 | 8888 | 7.4% |
1.431037907 × 10⁻⁷ | 1 | < 0.1% |
2.854079446 × 10⁻⁷ | 1 | < 0.1% |
4.269191454 × 10⁻⁷ | 1 | < 0.1% |
5.676440024 × 10⁻⁷ | 1 | < 0.1% |
Value | Count | Frequency (%) |
1 | 299 | 0.2% |
0.9981747661 | 2 | < 0.1% |
0.9961213632 | 1 | < 0.1% |
0.9956474099 | 1 | < 0.1% |
0.9951028753 | 1 | < 0.1% |