Overview
Brought to you by YData
Dataset statistics
Number of variables | 13 |
---|---|
Number of observations | 54808 |
Missing cells | 6533 |
Missing cells (%) | 0.9% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 18.1 MiB |
Average record size in memory | 346.9 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 8 |
Variable descriptions
employee_id | ID único del empleado |
---|---|
department | Departamento |
region | Región de empleo |
education | Nivel educativo |
gender | Género (f, m) |
recruitment_channel | Canal de reclutamiento |
no_of_trainings | Nº de capacitaciones año previo |
age | Edad |
previous_year_rating | Calificación año previo |
length_of_service | Años de servicio |
awards_won? | ¿Ganó premio año previo? (0/1) |
avg_training_score | Promedio evaluación de formación |
is_promoted | ¿Promovido? (0/1) |
age is highly overall correlated with length_of_service | High correlation |
avg_training_score is highly overall correlated with department | High correlation |
department is highly overall correlated with avg_training_score | High correlation |
length_of_service is highly overall correlated with age | High correlation |
awards_won? is highly imbalanced (84.1%) | Imbalance |
is_promoted is highly imbalanced (58.0%) | Imbalance |
education has 2409 (4.4%) missing values | Missing |
previous_year_rating has 4124 (7.5%) missing values | Missing |
employee_id has unique values | Unique |
Reproduction
Analysis started | 2025-09-25 03:16:04.248784 |
---|---|
Analysis finished | 2025-09-25 03:16:20.443421 |
Duration | 16.19 seconds |
Software version | ydata-profiling vv4.17.0 |
Download configuration | config.json |
Variables
Distinct | 54808 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 39195.831 |
Minimum | 1 |
---|---|
Maximum | 78298 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 428.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3916.35 |
Q1 | 19669.75 |
median | 39225.5 |
Q3 | 58730.5 |
95-th percentile | 74415.3 |
Maximum | 78298 |
Range | 78297 |
Interquartile range (IQR) | 39060.75 |
Descriptive statistics
Standard deviation | 22586.581 |
---|---|
Coefficient of variation (CV) | 0.57624959 |
Kurtosis | -1.1964792 |
Mean | 39195.831 |
Median Absolute Deviation (MAD) | 19531.5 |
Skewness | -0.0031279472 |
Sum | 2.1482451 × 109 |
Variance | 5.1015366 × 108 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
65438 | 1 | < 0.1% |
16223 | 1 | < 0.1% |
38250 | 1 | < 0.1% |
68086 | 1 | < 0.1% |
78080 | 1 | < 0.1% |
52654 | 1 | < 0.1% |
31009 | 1 | < 0.1% |
8086 | 1 | < 0.1% |
24976 | 1 | < 0.1% |
60468 | 1 | < 0.1% |
Other values (54798) | 54798 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
4 | 1 | |
5 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 | |
12 | 1 | |
14 | 1 |
Value | Count | Frequency (%) |
78298 | 1 | |
78297 | 1 | |
78296 | 1 | |
78294 | 1 | |
78292 | 1 | |
78291 | 1 | |
78290 | 1 | |
78289 | 1 | |
78288 | 1 | |
78287 | 1 |
Distinct | 9 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.2 MiB |
Sales & Marketing | |
---|---|
Operations | |
Technology | |
Procurement | |
Analytics | |
Other values (4) |
Length
Max length | 17 |
---|---|
Median length | 11 |
Mean length | 11.469238 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Sales & Marketing |
---|---|
2nd row | Operations |
3rd row | Sales & Marketing |
4th row | Sales & Marketing |
5th row | Technology |
Common Values
Value | Count | Frequency (%) |
Sales & Marketing | 16840 | |
Operations | 11348 | |
Technology | 7138 | |
Procurement | 7138 | |
Analytics | 5352 | 9.8% |
Finance | 2536 | 4.6% |
HR | 2418 | 4.4% |
Legal | 1039 | 1.9% |
R&D | 999 | 1.8% |
Length
Histogram of lengths of the category
Common Values (Plot)
Value | Count | Frequency (%) |
sales | 16840 | |
16840 | ||
marketing | 16840 | |
operations | 11348 | |
technology | 7138 | |
procurement | 7138 | |
analytics | 5352 | 6.0% |
finance | 2536 | 2.9% |
hr | 2418 | 2.7% |
legal | 1039 | 1.2% |
Most occurring characters
Value | Count | Frequency (%) |
e | 70017 | 11.1% |
a | 53955 | 8.6% |
n | 52888 | 8.4% |
r | 42464 | 6.8% |
t | 40678 | 6.5% |
i | 36076 | 5.7% |
33680 | 5.4% | |
s | 33540 | 5.3% |
o | 32762 | 5.2% |
l | 30369 | 4.8% |
Other values (20) | 202177 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 628606 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 70017 | 11.1% |
a | 53955 | 8.6% |
n | 52888 | 8.4% |
r | 42464 | 6.8% |
t | 40678 | 6.5% |
i | 36076 | 5.7% |
33680 | 5.4% | |
s | 33540 | 5.3% |
o | 32762 | 5.2% |
l | 30369 | 4.8% |
Other values (20) | 202177 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 628606 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 70017 | 11.1% |
a | 53955 | 8.6% |
n | 52888 | 8.4% |
r | 42464 | 6.8% |
t | 40678 | 6.5% |
i | 36076 | 5.7% |
33680 | 5.4% | |
s | 33540 | 5.3% |
o | 32762 | 5.2% |
l | 30369 | 4.8% |
Other values (20) | 202177 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 628606 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 70017 | 11.1% |
a | 53955 | 8.6% |
n | 52888 | 8.4% |
r | 42464 | 6.8% |
t | 40678 | 6.5% |
i | 36076 | 5.7% |
33680 | 5.4% | |
s | 33540 | 5.3% |
o | 32762 | 5.2% |
l | 30369 | 4.8% |
Other values (20) | 202177 |
region
Categorical
Región de empleo
Distinct | 34 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.0 MiB |
region_2 | |
---|---|
region_22 | |
region_7 | |
region_15 | |
region_13 | |
Other values (29) |
Length
Max length | 9 |
---|---|
Median length | 9 |
Mean length | 8.5917384 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | region_7 |
---|---|
2nd row | region_22 |
3rd row | region_19 |
4th row | region_23 |
5th row | region_26 |
Common Values
Value | Count | Frequency (%) |
region_2 | 12343 | |
region_22 | 6428 | 11.7% |
region_7 | 4843 | 8.8% |
region_15 | 2808 | 5.1% |
region_13 | 2648 | 4.8% |
region_26 | 2260 | 4.1% |
region_31 | 1935 | 3.5% |
region_4 | 1703 | 3.1% |
region_27 | 1659 | 3.0% |
region_16 | 1465 | 2.7% |
Other values (24) | 16716 |
Length
Histogram of lengths of the category
Value | Count | Frequency (%) |
region_2 | 12343 | |
region_22 | 6428 | 11.7% |
region_7 | 4843 | 8.8% |
region_15 | 2808 | 5.1% |
region_13 | 2648 | 4.8% |
region_26 | 2260 | 4.1% |
region_31 | 1935 | 3.5% |
region_4 | 1703 | 3.1% |
region_27 | 1659 | 3.0% |
region_16 | 1465 | 2.7% |
Other values (24) | 16716 |
Most occurring characters
Value | Count | Frequency (%) |
r | 54808 | |
g | 54808 | |
i | 54808 | |
o | 54808 | |
n | 54808 | |
_ | 54808 | |
e | 54808 | |
2 | 36638 | |
1 | 16183 | 3.4% |
3 | 8536 | 1.8% |
Other values (7) | 25883 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 470896 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
r | 54808 | |
g | 54808 | |
i | 54808 | |
o | 54808 | |
n | 54808 | |
_ | 54808 | |
e | 54808 | |
2 | 36638 | |
1 | 16183 | 3.4% |
3 | 8536 | 1.8% |
Other values (7) | 25883 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 470896 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
r | 54808 | |
g | 54808 | |
i | 54808 | |
o | 54808 | |
n | 54808 | |
_ | 54808 | |
e | 54808 | |
2 | 36638 | |
1 | 16183 | 3.4% |
3 | 8536 | 1.8% |
Other values (7) | 25883 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 470896 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
r | 54808 | |
g | 54808 | |
i | 54808 | |
o | 54808 | |
n | 54808 | |
_ | 54808 | |
e | 54808 | |
2 | 36638 | |
1 | 16183 | 3.4% |
3 | 8536 | 1.8% |
Other values (7) | 25883 |
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 2409 |
Missing (%) | 4.4% |
Memory size | 3.2 MiB |
Bachelor's | |
---|---|
Master's & above | |
Below Secondary | 805 |
Length
Max length | 16 |
---|---|
Median length | 10 |
Mean length | 11.785817 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Master's & above |
---|---|
2nd row | Bachelor's |
3rd row | Bachelor's |
4th row | Bachelor's |
5th row | Bachelor's |
Common Values
Value | Count | Frequency (%) |
Bachelor's | 36669 | |
Master's & above | 14925 | |
Below Secondary | 805 | 1.5% |
(Missing) | 2409 | 4.4% |
Length
Histogram of lengths of the category
Common Values (Plot)
Value | Count | Frequency (%) |
bachelor's | 36669 | |
master's | 14925 | |
14925 | ||
above | 14925 | |
below | 805 | 1.0% |
secondary | 805 | 1.0% |
Most occurring characters
Value | Count | Frequency (%) |
e | 68129 | |
a | 67324 | |
s | 66519 | |
o | 53204 | |
r | 52399 | |
' | 51594 | |
B | 37474 | 6.1% |
c | 37474 | 6.1% |
l | 37474 | 6.1% |
h | 36669 | 5.9% |
Other values (11) | 109305 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 617565 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 68129 | |
a | 67324 | |
s | 66519 | |
o | 53204 | |
r | 52399 | |
' | 51594 | |
B | 37474 | 6.1% |
c | 37474 | 6.1% |
l | 37474 | 6.1% |
h | 36669 | 5.9% |
Other values (11) | 109305 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 617565 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 68129 | |
a | 67324 | |
s | 66519 | |
o | 53204 | |
r | 52399 | |
' | 51594 | |
B | 37474 | 6.1% |
c | 37474 | 6.1% |
l | 37474 | 6.1% |
h | 36669 | 5.9% |
Other values (11) | 109305 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 617565 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 68129 | |
a | 67324 | |
s | 66519 | |
o | 53204 | |
r | 52399 | |
' | 51594 | |
B | 37474 | 6.1% |
c | 37474 | 6.1% |
l | 37474 | 6.1% |
h | 36669 | 5.9% |
Other values (11) | 109305 |
gender
Categorical
Género (f, m)
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.6 MiB |
m | |
---|---|
f |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | f |
---|---|
2nd row | m |
3rd row | m |
4th row | m |
5th row | m |
Common Values
Value | Count | Frequency (%) |
m | 38496 | |
f | 16312 |
Length
Histogram of lengths of the category
Common Values (Plot)
Value | Count | Frequency (%) |
m | 38496 | |
f | 16312 |
Most occurring characters
Value | Count | Frequency (%) |
m | 38496 | |
f | 16312 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 54808 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
m | 38496 | |
f | 16312 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 54808 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
m | 38496 | |
f | 16312 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 54808 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
m | 38496 | |
f | 16312 |
recruitment_channel
Categorical
Canal de reclutamiento
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.9 MiB |
other | |
---|---|
sourcing | |
referred | 1142 |
Length
Max length | 8 |
---|---|
Median length | 5 |
Mean length | 6.3334915 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | sourcing |
---|---|
2nd row | other |
3rd row | sourcing |
4th row | other |
5th row | other |
Common Values
Value | Count | Frequency (%) |
other | 30446 | |
sourcing | 23220 | |
referred | 1142 | 2.1% |
Length
Histogram of lengths of the category
Common Values (Plot)
Value | Count | Frequency (%) |
other | 30446 | |
sourcing | 23220 | |
referred | 1142 | 2.1% |
Most occurring characters
Value | Count | Frequency (%) |
r | 57092 | |
o | 53666 | |
e | 33872 | |
t | 30446 | |
h | 30446 | |
s | 23220 | |
u | 23220 | |
c | 23220 | |
i | 23220 | |
n | 23220 | |
Other values (3) | 25504 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 347126 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
r | 57092 | |
o | 53666 | |
e | 33872 | |
t | 30446 | |
h | 30446 | |
s | 23220 | |
u | 23220 | |
c | 23220 | |
i | 23220 | |
n | 23220 | |
Other values (3) | 25504 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 347126 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
r | 57092 | |
o | 53666 | |
e | 33872 | |
t | 30446 | |
h | 30446 | |
s | 23220 | |
u | 23220 | |
c | 23220 | |
i | 23220 | |
n | 23220 | |
Other values (3) | 25504 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 347126 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
r | 57092 | |
o | 53666 | |
e | 33872 | |
t | 30446 | |
h | 30446 | |
s | 23220 | |
u | 23220 | |
c | 23220 | |
i | 23220 | |
n | 23220 | |
Other values (3) | 25504 |
no_of_trainings
Real number (ℝ)
Nº de capacitaciones año previo
Distinct | 10 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.2530105 |
Minimum | 1 |
---|---|
Maximum | 10 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 428.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 1 |
95-th percentile | 2 |
Maximum | 10 |
Range | 9 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.60926402 |
---|---|
Coefficient of variation (CV) | 0.48624015 |
Kurtosis | 18.740082 |
Mean | 1.2530105 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.4454339 |
Sum | 68675 |
Variance | 0.37120264 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=10)
Value | Count | Frequency (%) |
1 | 44378 | |
2 | 7987 | 14.6% |
3 | 1776 | 3.2% |
4 | 468 | 0.9% |
5 | 128 | 0.2% |
6 | 44 | 0.1% |
7 | 12 | < 0.1% |
8 | 5 | < 0.1% |
10 | 5 | < 0.1% |
9 | 5 | < 0.1% |
Value | Count | Frequency (%) |
1 | 44378 | |
2 | 7987 | 14.6% |
3 | 1776 | 3.2% |
4 | 468 | 0.9% |
5 | 128 | 0.2% |
6 | 44 | 0.1% |
7 | 12 | < 0.1% |
8 | 5 | < 0.1% |
9 | 5 | < 0.1% |
10 | 5 | < 0.1% |
Value | Count | Frequency (%) |
10 | 5 | < 0.1% |
9 | 5 | < 0.1% |
8 | 5 | < 0.1% |
7 | 12 | < 0.1% |
6 | 44 | 0.1% |
5 | 128 | 0.2% |
4 | 468 | 0.9% |
3 | 1776 | 3.2% |
2 | 7987 | 14.6% |
1 | 44378 |
Distinct | 41 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 34.803915 |
Minimum | 20 |
---|---|
Maximum | 60 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 428.3 KiB |
Quantile statistics
Minimum | 20 |
---|---|
5-th percentile | 25 |
Q1 | 29 |
median | 33 |
Q3 | 39 |
95-th percentile | 51 |
Maximum | 60 |
Range | 40 |
Interquartile range (IQR) | 10 |
Descriptive statistics
Standard deviation | 7.6601692 |
---|---|
Coefficient of variation (CV) | 0.22009504 |
Kurtosis | 0.79235337 |
Mean | 34.803915 |
Median Absolute Deviation (MAD) | 4 |
Skewness | 1.0074318 |
Sum | 1907533 |
Variance | 58.678192 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=41)
Value | Count | Frequency (%) |
30 | 3665 | 6.7% |
32 | 3534 | 6.4% |
31 | 3534 | 6.4% |
29 | 3405 | 6.2% |
33 | 3210 | 5.9% |
28 | 3147 | 5.7% |
34 | 3076 | 5.6% |
27 | 2827 | 5.2% |
35 | 2711 | 4.9% |
36 | 2517 | 4.6% |
Other values (31) | 23182 |
Value | Count | Frequency (%) |
20 | 113 | 0.2% |
21 | 98 | 0.2% |
22 | 231 | 0.4% |
23 | 428 | 0.8% |
24 | 845 | 1.5% |
25 | 1299 | 2.4% |
26 | 2060 | |
27 | 2827 | |
28 | 3147 | |
29 | 3405 |
Value | Count | Frequency (%) |
60 | 217 | |
59 | 209 | |
58 | 213 | |
57 | 238 | |
56 | 264 | |
55 | 294 | |
54 | 313 | |
53 | 364 | |
52 | 351 | |
51 | 389 |
Distinct | 5 |
---|---|
Distinct (%) | < 0.1% |
Missing | 4124 |
Missing (%) | 7.5% |
Memory size | 2.7 MiB |
3.0 | |
---|---|
5.0 | |
4.0 | |
1.0 | |
2.0 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 5.0 |
---|---|
2nd row | 5.0 |
3rd row | 3.0 |
4th row | 1.0 |
5th row | 3.0 |
Common Values
Value | Count | Frequency (%) |
3.0 | 18618 | |
5.0 | 11741 | |
4.0 | 9877 | |
1.0 | 6223 | 11.4% |
2.0 | 4225 | 7.7% |
(Missing) | 4124 | 7.5% |
Length
Histogram of lengths of the category
Common Values (Plot)
Value | Count | Frequency (%) |
3.0 | 18618 | |
5.0 | 11741 | |
4.0 | 9877 | |
1.0 | 6223 | 12.3% |
2.0 | 4225 | 8.3% |
Most occurring characters
Value | Count | Frequency (%) |
. | 50684 | |
0 | 50684 | |
3 | 18618 | 12.2% |
5 | 11741 | 7.7% |
4 | 9877 | 6.5% |
1 | 6223 | 4.1% |
2 | 4225 | 2.8% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 152052 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
. | 50684 | |
0 | 50684 | |
3 | 18618 | 12.2% |
5 | 11741 | 7.7% |
4 | 9877 | 6.5% |
1 | 6223 | 4.1% |
2 | 4225 | 2.8% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 152052 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
. | 50684 | |
0 | 50684 | |
3 | 18618 | 12.2% |
5 | 11741 | 7.7% |
4 | 9877 | 6.5% |
1 | 6223 | 4.1% |
2 | 4225 | 2.8% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 152052 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
. | 50684 | |
0 | 50684 | |
3 | 18618 | 12.2% |
5 | 11741 | 7.7% |
4 | 9877 | 6.5% |
1 | 6223 | 4.1% |
2 | 4225 | 2.8% |
Distinct | 35 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.8655123 |
Minimum | 1 |
---|---|
Maximum | 37 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 428.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5 |
Q3 | 7 |
95-th percentile | 15 |
Maximum | 37 |
Range | 36 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 4.2650942 |
---|---|
Coefficient of variation (CV) | 0.72714776 |
Kurtosis | 4.4140314 |
Mean | 5.8655123 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 1.7380615 |
Sum | 321477 |
Variance | 18.191028 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=35)
Value | Count | Frequency (%) |
3 | 7033 | |
4 | 6836 | |
2 | 6684 | |
5 | 5832 | |
7 | 5551 | |
6 | 4734 | |
1 | 4547 | |
8 | 2883 | |
9 | 2629 | 4.8% |
10 | 2193 | 4.0% |
Other values (25) | 5886 |
Value | Count | Frequency (%) |
1 | 4547 | |
2 | 6684 | |
3 | 7033 | |
4 | 6836 | |
5 | 5832 | |
6 | 4734 | |
7 | 5551 | |
8 | 2883 | |
9 | 2629 | 4.8% |
10 | 2193 | 4.0% |
Value | Count | Frequency (%) |
37 | 1 | < 0.1% |
34 | 4 | < 0.1% |
33 | 9 | < 0.1% |
32 | 10 | < 0.1% |
31 | 20 | |
30 | 12 | < 0.1% |
29 | 30 | |
28 | 30 | |
27 | 36 | |
26 | 41 |
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.6 MiB |
0 | |
---|---|
1 | 1270 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 53538 | |
1 | 1270 | 2.3% |
Length
Histogram of lengths of the category
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 53538 | |
1 | 1270 | 2.3% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 53538 | |
1 | 1270 | 2.3% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 54808 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
0 | 53538 | |
1 | 1270 | 2.3% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 54808 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
0 | 53538 | |
1 | 1270 | 2.3% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 54808 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
0 | 53538 | |
1 | 1270 | 2.3% |
Distinct | 61 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 63.38675 |
Minimum | 39 |
---|---|
Maximum | 99 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 428.3 KiB |
Quantile statistics
Minimum | 39 |
---|---|
5-th percentile | 47 |
Q1 | 51 |
median | 60 |
Q3 | 76 |
95-th percentile | 86 |
Maximum | 99 |
Range | 60 |
Interquartile range (IQR) | 25 |
Descriptive statistics
Standard deviation | 13.371559 |
---|---|
Coefficient of variation (CV) | 0.21095197 |
Kurtosis | -1.0496493 |
Mean | 63.38675 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 0.45190809 |
Sum | 3474101 |
Variance | 178.7986 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
50 | 2716 | 5.0% |
49 | 2681 | 4.9% |
48 | 2437 | 4.4% |
51 | 2347 | 4.3% |
60 | 2155 | 3.9% |
59 | 2064 | 3.8% |
58 | 1898 | 3.5% |
61 | 1879 | 3.4% |
52 | 1856 | 3.4% |
47 | 1746 | 3.2% |
Other values (51) | 33029 |
Value | Count | Frequency (%) |
39 | 2 | < 0.1% |
40 | 5 | < 0.1% |
41 | 26 | < 0.1% |
42 | 62 | 0.1% |
43 | 176 | 0.3% |
44 | 335 | 0.6% |
45 | 681 | 1.2% |
46 | 1136 | |
47 | 1746 | |
48 | 2437 |
Value | Count | Frequency (%) |
99 | 35 | 0.1% |
98 | 37 | 0.1% |
97 | 49 | 0.1% |
96 | 48 | 0.1% |
95 | 45 | 0.1% |
94 | 65 | 0.1% |
93 | 84 | |
92 | 99 | |
91 | 117 | |
90 | 185 |
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.6 MiB |
0 | |
---|---|
1 | 4668 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 50140 | |
1 | 4668 | 8.5% |
Length
Histogram of lengths of the category
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 50140 | |
1 | 4668 | 8.5% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 50140 | |
1 | 4668 | 8.5% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 54808 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
0 | 50140 | |
1 | 4668 | 8.5% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 54808 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
0 | 50140 | |
1 | 4668 | 8.5% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 54808 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
0 | 50140 | |
1 | 4668 | 8.5% |
Interactions
Correlations
age | avg_training_score | awards_won? | department | education | employee_id | gender | is_promoted | length_of_service | no_of_trainings | previous_year_rating | recruitment_channel | region | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
age | 1.000 | -0.041 | 0.000 | 0.075 | 0.463 | -0.000 | 0.041 | 0.029 | 0.644 | -0.086 | 0.015 | 0.030 | 0.147 |
avg_training_score | -0.041 | 1.000 | 0.163 | 0.574 | 0.063 | -0.000 | 0.195 | 0.303 | -0.029 | 0.053 | 0.091 | 0.040 | 0.080 |
awards_won? | 0.000 | 0.163 | 1.000 | 0.006 | 0.000 | 0.005 | 0.000 | 0.196 | 0.043 | 0.000 | 0.029 | 0.002 | 0.017 |
department | 0.075 | 0.574 | 0.006 | 1.000 | 0.124 | 0.000 | 0.286 | 0.051 | 0.046 | 0.057 | 0.109 | 0.062 | 0.132 |
education | 0.463 | 0.063 | 0.000 | 0.124 | 1.000 | 0.012 | 0.027 | 0.026 | 0.189 | 0.027 | 0.021 | 0.027 | 0.181 |
employee_id | -0.000 | -0.000 | 0.005 | 0.000 | 0.012 | 1.000 | 0.006 | 0.000 | 0.001 | -0.003 | 0.008 | 0.000 | 0.002 |
gender | 0.041 | 0.195 | 0.000 | 0.286 | 0.027 | 0.006 | 1.000 | 0.010 | 0.028 | 0.087 | 0.027 | 0.008 | 0.161 |
is_promoted | 0.029 | 0.303 | 0.196 | 0.051 | 0.026 | 0.000 | 0.010 | 1.000 | 0.016 | 0.022 | 0.170 | 0.018 | 0.090 |
length_of_service | 0.644 | -0.029 | 0.043 | 0.046 | 0.189 | 0.001 | 0.028 | 0.016 | 1.000 | -0.057 | 0.006 | 0.018 | 0.083 |
no_of_trainings | -0.086 | 0.053 | 0.000 | 0.057 | 0.027 | -0.003 | 0.087 | 0.022 | -0.057 | 1.000 | 0.040 | 0.013 | 0.040 |
previous_year_rating | 0.015 | 0.091 | 0.029 | 0.109 | 0.021 | 0.008 | 0.027 | 0.170 | 0.006 | 0.040 | 1.000 | 0.050 | 0.051 |
recruitment_channel | 0.030 | 0.040 | 0.002 | 0.062 | 0.027 | 0.000 | 0.008 | 0.018 | 0.018 | 0.013 | 0.050 | 1.000 | 0.110 |
region | 0.147 | 0.080 | 0.017 | 0.132 | 0.181 | 0.002 | 0.161 | 0.090 | 0.083 | 0.040 | 0.051 | 0.110 | 1.000 |
Missing values
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
Sample
employee_id | department | region | education | gender | recruitment_channel | no_of_trainings | age | previous_year_rating | length_of_service | awards_won? | avg_training_score | is_promoted | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 65438 | Sales & Marketing | region_7 | Master's & above | f | sourcing | 1 | 35 | 5.0 | 8 | 0 | 49 | 0 |
1 | 65141 | Operations | region_22 | Bachelor's | m | other | 1 | 30 | 5.0 | 4 | 0 | 60 | 0 |
2 | 7513 | Sales & Marketing | region_19 | Bachelor's | m | sourcing | 1 | 34 | 3.0 | 7 | 0 | 50 | 0 |
3 | 2542 | Sales & Marketing | region_23 | Bachelor's | m | other | 2 | 39 | 1.0 | 10 | 0 | 50 | 0 |
4 | 48945 | Technology | region_26 | Bachelor's | m | other | 1 | 45 | 3.0 | 2 | 0 | 73 | 0 |
5 | 58896 | Analytics | region_2 | Bachelor's | m | sourcing | 2 | 31 | 3.0 | 7 | 0 | 85 | 0 |
6 | 20379 | Operations | region_20 | Bachelor's | f | other | 1 | 31 | 3.0 | 5 | 0 | 59 | 0 |
7 | 16290 | Operations | region_34 | Master's & above | m | sourcing | 1 | 33 | 3.0 | 6 | 0 | 63 | 0 |
8 | 73202 | Analytics | region_20 | Bachelor's | m | other | 1 | 28 | 4.0 | 5 | 0 | 83 | 0 |
9 | 28911 | Sales & Marketing | region_1 | Master's & above | m | sourcing | 1 | 32 | 5.0 | 5 | 0 | 54 | 0 |
employee_id | department | region | education | gender | recruitment_channel | no_of_trainings | age | previous_year_rating | length_of_service | awards_won? | avg_training_score | is_promoted | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
54798 | 40257 | Sales & Marketing | region_2 | Master's & above | f | other | 2 | 40 | 5.0 | 4 | 0 | 51 | 0 |
54799 | 68093 | Procurement | region_2 | Master's & above | f | other | 1 | 50 | 5.0 | 6 | 1 | 67 | 0 |
54800 | 39227 | HR | region_11 | Bachelor's | m | other | 2 | 34 | 5.0 | 3 | 0 | 52 | 0 |
54801 | 12431 | Technology | region_26 | Bachelor's | f | sourcing | 1 | 31 | NaN | 1 | 0 | 78 | 0 |
54802 | 6915 | Sales & Marketing | region_14 | Bachelor's | m | other | 2 | 31 | 1.0 | 2 | 0 | 49 | 0 |
54803 | 3030 | Technology | region_14 | Bachelor's | m | sourcing | 1 | 48 | 3.0 | 17 | 0 | 78 | 0 |
54804 | 74592 | Operations | region_27 | Master's & above | f | other | 1 | 37 | 2.0 | 6 | 0 | 56 | 0 |
54805 | 13918 | Analytics | region_1 | Bachelor's | m | other | 1 | 27 | 5.0 | 3 | 0 | 79 | 0 |
54806 | 13614 | Sales & Marketing | region_9 | NaN | m | sourcing | 1 | 29 | 1.0 | 2 | 0 | 45 | 0 |
54807 | 51526 | HR | region_22 | Bachelor's | m | other | 1 | 27 | 1.0 | 5 | 0 | 49 | 0 |