Skip to content

Commit 112e170

Browse files
committed
start updating model readme.md's
1 parent 7cbf9a7 commit 112e170

File tree

1 file changed

+47
-1
lines changed

1 file changed

+47
-1
lines changed

model/README.md

Lines changed: 47 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,45 @@
11
# Model
22

3-
This folder contains the color naming models and the translation loss which we referred in our paper and the scripts creating them.
3+
4+
This folder contains the calculations and models we made about colors and color names.
5+
6+
## Color Calculations (without color names)
7+
The "color_info_pre_naming" has data color spaces and color bins that will be used in later steps once we have color names.
8+
9+
Created by:
10+
- The scripts in: 00_pre-processing-colors
11+
12+
## Cleaned Color Names
13+
"cleaned_color_names.csv" contains the cleaned version of the raw dataset. Besides removing data, we:
14+
- update the "name" field to include a simplified matching name (e.g., fixing typos, removing diacritic marks)
15+
- the "entered_name" field is the name as it was entered
16+
- the "standardized_entered_name" has some standardization steps run on it (e.g., all lowercase, trimmed white space)
17+
18+
Created by:
19+
- processing_scripts/01_data_cleaning/01_dataCleaning.js
20+
21+
## Removed Color Data
22+
"removed_color_data.csv" is the raw data that was excluded in our data cleaning process
23+
24+
Created by:
25+
- processing_scripts/01_data_cleaning/01_dataCleaning.js
26+
27+
## Language Info
28+
29+
TODO
30+
31+
## Full Colors Info
32+
33+
34+
TODO
35+
36+
## Binned Hue Colors
37+
38+
TODO
39+
40+
## Binned Full Colors
41+
42+
TODO:
443

544
Each color naming model is a JSON array of color-name pairs. Each pair has the below properties:
645

@@ -12,8 +51,15 @@ Each color naming model is a JSON array of color-name pairs. Each pair has the b
1251
- pTC : Probability of a term (t) given a color (c) (P(t|c))
1352
- schema : (for scheme_color_model only) Schema
1453

54+
## Translation loss
55+
56+
TODO
1557

1658
`translation_loss` is also an array having the translation losses between the top 100 English and Korean color name for full colors. 'dist' property indicate the distance (loss) between the English term (enTerm) and the Korean term (koTerm).
1759

1860

61+
## Scheme Color Data
62+
63+
TODO
64+
1965
Note: We represent the color labels provided by the participants in our study, which includes whatever racial biases they have (e.g., the color "skin"). This is not meant to be a prescriptive definition of what colors fit what labels.

0 commit comments

Comments
 (0)