JudiLing supports the use of multiple threads. Simply start up Julia in your terminal as follows:
@@ -526,6 +538,16 @@ You can download and try out this script [here](https://osf.io/sa89x/download).

 We implemented a high-level wrapper function that aims to provide quick and preliminary studies on multiple datasets with different parameter settings. For a sophisticated study, we suggest building a script step by step.

+In general, the `test_combo` function performs the following operations:
+
+- prepare datasets
+- make cue matrix object
+- make semantic matrix
+- learn transform mappings F and G
+- perform path-finding algorithms for both `learn_paths` and `build_paths` in training and validation datasets
+- evaluate results
+- save outputs
+
 ### Split mode
 The `test_combo` function provides four split modes. `:train_only` makes it possible to evaluate the model with only the training data or part of the training data. `data_path` is the path to the CSV file and `data_output_dir` is the directory for storing the training and validation datasets for future analysis.

@@ -567,7 +589,7 @@ JudiLing.test_combo(
 )
 ```

-`:random_split` will randomly split data into training and validation datasets. In this case, it is high likely that unseen n-grams and features are in the validation datasets. Therefore, `if_combined` should be turned on. `data_path` is the path to the directory containing CSV files and `data_output_dir` is the directory for store training and validation datasets for future analysis.
+`:random_split` will randomly split data into training and validation datasets. In this case, it is highly likely that unseen n-grams and features occur in the validation datasets. Therefore, you should set `if_combined` to true. `data_path` is the path to the directory containing CSV files and `data_output_dir` is the directory for storing the training and validation datasets for future analysis.

 ```julia
 JudiLing.test_combo(
@@ -594,7 +616,7 @@ JudiLing.test_combo(
 )
 ```

-`:careful_split` will carefully split data into training and validation datasets where there will be no unseen n-grams and features in the validation datasets. Therefore, `if_combined`shall be truned off. `data_path` is the path to the directory containing CSV files and `data_output_dir` is the directory for store training and validation datasets for future analysis. `n_features_columns` gives names of feature columns and target column.
+`:careful_split` will carefully split data into training and validation datasets so that there are no unseen n-grams or features in the validation datasets. Therefore, you should set `if_combined` to false. `data_path` is the path to the directory containing CSV files and `data_output_dir` is the directory for storing the training and validation datasets for future analysis. `n_features_columns` gives the names of the feature columns and the target column.
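
The difference between the two split modes comes down to whether the validation data contains n-grams that never occur in the training data. The following is a minimal sketch of that check in plain Julia. It does not use JudiLing, and it is not how `test_combo` performs the split: the helpers `ngrams` and `unseen_ngrams`, the `#` boundary marker, and the example words are all assumptions made for illustration.

```julia
# Hypothetical helper, not part of JudiLing: the set of letter n-grams of a word,
# padded with "#" as a word-boundary marker (an assumption of this sketch).
function ngrams(word::AbstractString, n::Int=3)
    chars = collect("#" * word * "#")
    return Set{String}(join(chars[i:i+n-1]) for i in 1:(length(chars) - n + 1))
end

# Hypothetical helper: n-grams that occur in the validation words
# but in none of the training words.
function unseen_ngrams(train_words, val_words, n::Int=3)
    seen = Set{String}()
    for w in train_words
        union!(seen, ngrams(w, n))
    end
    found = Set{String}()
    for w in val_words
        union!(found, ngrams(w, n))
    end
    return setdiff(found, seen)
end

train_words = ["vocat", "vocas", "clamat", "amas"]

# Every trigram of "clamas" already occurs in the training words,
# so this validation word would be acceptable for a careful split.
println(isempty(unseen_ngrams(train_words, ["clamas"])))  # true

# "videt" introduces trigrams such as "#vi" that were never seen in training,
# which is the situation a random split is likely to produce.
println(isempty(unseen_ngrams(train_words, ["videt"])))   # false
```

When a check like this returns a non-empty set, the data is in the situation where the text above recommends setting `if_combined` to true.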