num_sent_processes: 4 [0] Creating shuffled datasets in /scratch/gz5hp/tfbs_region2vec_models/expr_universe_tile25k/50W100D-0.0250r/shuffled_datasets [1] Creating shuffled datasets in /scratch/gz5hp/tfbs_region2vec_models/expr_universe_tile25k/50W100D-0.0250r/shuffled_datasets [2] Creating shuffled datasets in /scratch/gz5hp/tfbs_region2vec_models/expr_universe_tile25k/50W100D-0.0250r/shuffled_datasets [3] Creating shuffled datasets in /scratch/gz5hp/tfbs_region2vec_models/expr_universe_tile25k/50W100D-0.0250r/shuffled_datasets {'num_shuffle': 100, 'embed_dim': 100, 'context_len': 50, 'nworkers': 6, 'save_freq': -1, 'save_dir': '/scratch/gz5hp/tfbs_region2vec_models/expr_universe_tile25k/50W100D-0.0250r', 'resume': '', 'train_alg': 'cbow', 'min_count': 5, 'neg_samples': 5, 'init_lr': 0.025, 'min_lr': 0.0001, 'lr_mode': 'linear', 'milestones': [], 'hier_softmax': False, 'update_vocab': 'once', 'seed': 0} Using cbow, negative sampling with 5 negative samples [03/12/24-17:43:44] Start training [03/12/24-17:43:44] Building vocabulary [03/12/24-17:43:59] Vocabulary size is 100053 [Shuffling 1] loss 4700822.0000 lr 0.0250 vocab_size 100053 (8.87s/15.02m) [Shuffling 2] loss 4113788.0000 lr 0.0248 vocab_size 100053 (8.97s/15.11m) [Shuffling 3] loss 4159221.7500 lr 0.0245 vocab_size 100053 (8.96s/15.14m) [Shuffling 4] loss 4036802.7500 lr 0.0243 vocab_size 100053 (8.94s/15.14m) [Shuffling 5] loss 3770384.0000 lr 0.0240 vocab_size 100053 (9.04s/15.18m) [Shuffling 6] loss 3887794.0000 lr 0.0238 vocab_size 100053 (8.79s/15.13m) [Shuffling 7] loss 3864272.2500 lr 0.0235 vocab_size 100053 (8.82s/15.11m) [Shuffling 8] loss 3834027.0000 lr 0.0233 vocab_size 100053 (8.81s/15.08m) [Shuffling 9] loss 3724485.7500 lr 0.0230 vocab_size 100053 (9.01s/15.11m) [Shuffling 10] loss 3854009.2500 lr 0.0228 vocab_size 100053 (8.84s/15.09m) [Shuffling 11] loss 3866404.2500 lr 0.0225 vocab_size 100053 (8.94s/15.10m) [Shuffling 12] loss 3900528.0000 lr 0.0223 vocab_size 100053 (8.92s/15.10m) [Shuffling 13] loss 3651875.5000 lr 0.0220 vocab_size 100053 (8.95s/15.11m) [Shuffling 14] loss 3836569.0000 lr 0.0218 vocab_size 100053 (8.88s/15.10m) [Shuffling 15] loss 3795630.7500 lr 0.0215 vocab_size 100053 (9.06s/15.12m) [Shuffling 16] loss 3833278.2500 lr 0.0213 vocab_size 100053 (9.00s/15.13m) [Shuffling 17] loss 3850579.7500 lr 0.0210 vocab_size 100053 (8.91s/15.13m) [Shuffling 18] loss 3690693.7500 lr 0.0208 vocab_size 100053 (9.16s/15.15m) [Shuffling 19] loss 3588258.0000 lr 0.0205 vocab_size 100053 (9.05s/15.16m) [Shuffling 20] loss 3749305.2500 lr 0.0203 vocab_size 100053 (8.78s/15.14m) [Shuffling 21] loss 3795088.2500 lr 0.0200 vocab_size 100053 (8.81s/15.13m) [Shuffling 22] loss 3794094.7500 lr 0.0198 vocab_size 100053 (8.88s/15.13m) [Shuffling 23] loss 3690510.5000 lr 0.0195 vocab_size 100053 (8.87s/15.13m) [Shuffling 24] loss 3814325.5000 lr 0.0193 vocab_size 100053 (8.75s/15.11m) [Shuffling 25] loss 3640215.5000 lr 0.0190 vocab_size 100053 (8.75s/15.10m) [Shuffling 26] loss 3710800.2500 lr 0.0188 vocab_size 100053 (8.78s/15.09m) [Shuffling 27] loss 3724266.7500 lr 0.0185 vocab_size 100053 (9.07s/15.10m) [Shuffling 28] loss 3694968.0000 lr 0.0183 vocab_size 100053 (8.75s/15.10m) [Shuffling 29] loss 3723281.2500 lr 0.0180 vocab_size 100053 (8.76s/15.09m) [Shuffling 30] loss 3534921.7500 lr 0.0178 vocab_size 100053 (8.90s/15.09m) [Shuffling 31] loss 3641326.7500 lr 0.0175 vocab_size 100053 (9.06s/15.10m) [Shuffling 32] loss 3610140.2500 lr 0.0173 vocab_size 100053 (8.93s/15.10m) [Shuffling 33] loss 3685697.0000 lr 0.0170 vocab_size 100053 (8.66s/15.08m) [Shuffling 34] loss 3582560.2500 lr 0.0168 vocab_size 100053 (9.05s/15.09m) [Shuffling 35] loss 3703905.5000 lr 0.0165 vocab_size 100053 (9.10s/15.10m) [Shuffling 36] loss 3571254.2500 lr 0.0163 vocab_size 100053 (9.18s/15.11m) [Shuffling 37] loss 3743966.7500 lr 0.0160 vocab_size 100053 (8.89s/15.11m) [Shuffling 38] loss 3515318.0000 lr 0.0158 vocab_size 100053 (9.00s/15.12m) [Shuffling 39] loss 3723539.0000 lr 0.0155 vocab_size 100053 (8.85s/15.11m) [Shuffling 40] loss 3671185.7500 lr 0.0153 vocab_size 100053 (8.76s/15.11m) [Shuffling 41] loss 3703281.2500 lr 0.0150 vocab_size 100053 (8.78s/15.10m) [Shuffling 42] loss 3674235.2500 lr 0.0148 vocab_size 100053 (8.89s/15.10m) [Shuffling 43] loss 3807498.7500 lr 0.0145 vocab_size 100053 (8.85s/15.10m) [Shuffling 44] loss 3708105.0000 lr 0.0143 vocab_size 100053 (8.94s/15.10m) [Shuffling 45] loss 3643518.7500 lr 0.0140 vocab_size 100053 (8.84s/15.10m) [Shuffling 46] loss 3732983.0000 lr 0.0138 vocab_size 100053 (8.99s/15.10m) [Shuffling 47] loss 3653742.7500 lr 0.0135 vocab_size 100053 (8.83s/15.10m) [Shuffling 48] loss 3529980.2500 lr 0.0133 vocab_size 100053 (9.05s/15.10m) [Shuffling 49] loss 3457859.5000 lr 0.0130 vocab_size 100053 (9.37s/15.12m) [Shuffling 50] loss 3744170.2500 lr 0.0128 vocab_size 100053 (8.70s/15.11m) [Shuffling 51] loss 3695032.2500 lr 0.0126 vocab_size 100053 (8.96s/15.11m) [Shuffling 52] loss 3701025.5000 lr 0.0123 vocab_size 100053 (8.78s/15.11m) [Shuffling 53] loss 3625830.0000 lr 0.0121 vocab_size 100053 (8.86s/15.10m) [Shuffling 54] loss 3679879.7500 lr 0.0118 vocab_size 100053 (8.77s/15.10m) [Shuffling 55] loss 3650833.5000 lr 0.0116 vocab_size 100053 (8.77s/15.10m) [Shuffling 56] loss 3720262.0000 lr 0.0113 vocab_size 100053 (8.97s/15.10m) [Shuffling 57] loss 3490888.5000 lr 0.0111 vocab_size 100053 (8.86s/15.10m) [Shuffling 58] loss 3637301.0000 lr 0.0108 vocab_size 100053 (9.12s/15.10m) [Shuffling 59] loss 3671565.5000 lr 0.0106 vocab_size 100053 (8.83s/15.10m) [Shuffling 60] loss 3701144.7500 lr 0.0103 vocab_size 100053 (8.90s/15.10m) [Shuffling 61] loss 3695603.2500 lr 0.0101 vocab_size 100053 (8.91s/15.10m) [Shuffling 62] loss 3644617.7500 lr 0.0098 vocab_size 100053 (8.79s/15.10m) [Shuffling 63] loss 3709498.2500 lr 0.0096 vocab_size 100053 (8.90s/15.10m) [Shuffling 64] loss 3517235.5000 lr 0.0093 vocab_size 100053 (8.97s/15.10m) [Shuffling 65] loss 3693029.7500 lr 0.0091 vocab_size 100053 (8.91s/15.10m) [Shuffling 66] loss 3651631.2500 lr 0.0088 vocab_size 100053 (8.84s/15.10m) [Shuffling 67] loss 3645936.7500 lr 0.0086 vocab_size 100053 (8.86s/15.10m) [Shuffling 68] loss 3659113.2500 lr 0.0083 vocab_size 100053 (8.74s/15.09m) [Shuffling 69] loss 3623937.5000 lr 0.0081 vocab_size 100053 (8.92s/15.09m) [Shuffling 70] loss 3616708.0000 lr 0.0078 vocab_size 100053 (8.75s/15.09m) [Shuffling 71] loss 3635136.0000 lr 0.0076 vocab_size 100053 (8.91s/15.09m) [Shuffling 72] loss 3669018.5000 lr 0.0073 vocab_size 100053 (8.83s/15.09m) [Shuffling 73] loss 3683391.5000 lr 0.0071 vocab_size 100053 (8.96s/15.09m) [Shuffling 74] loss 3654634.7500 lr 0.0068 vocab_size 100053 (8.77s/15.08m) [Shuffling 75] loss 3641897.0000 lr 0.0066 vocab_size 100053 (8.76s/15.08m) [Shuffling 76] loss 3649723.2500 lr 0.0063 vocab_size 100053 (8.85s/15.08m) [Shuffling 77] loss 3598295.5000 lr 0.0061 vocab_size 100053 (8.88s/15.08m) [Shuffling 78] loss 3629253.2500 lr 0.0058 vocab_size 100053 (8.83s/15.08m) [Shuffling 79] loss 3603654.5000 lr 0.0056 vocab_size 100053 (8.72s/15.07m) [Shuffling 80] loss 3683187.0000 lr 0.0053 vocab_size 100053 (8.82s/15.07m) [Shuffling 81] loss 3589290.2500 lr 0.0051 vocab_size 100053 (8.92s/15.07m) [Shuffling 82] loss 3712678.7500 lr 0.0048 vocab_size 100053 (9.05s/15.08m) [Shuffling 83] loss 3710622.7500 lr 0.0046 vocab_size 100053 (8.85s/15.08m) [Shuffling 84] loss 3598147.2500 lr 0.0043 vocab_size 100053 (8.68s/15.07m) [Shuffling 85] loss 3663035.7500 lr 0.0041 vocab_size 100053 (8.81s/15.07m) [Shuffling 86] loss 3630492.0000 lr 0.0038 vocab_size 100053 (8.75s/15.07m) [Shuffling 87] loss 3562548.5000 lr 0.0036 vocab_size 100053 (8.72s/15.06m) [Shuffling 88] loss 3510472.5000 lr 0.0033 vocab_size 100053 (8.78s/15.06m) [Shuffling 89] loss 3636457.7500 lr 0.0031 vocab_size 100053 (8.87s/15.06m) [Shuffling 90] loss 3564556.5000 lr 0.0028 vocab_size 100053 (8.79s/15.06m) [Shuffling 91] loss 3601905.5000 lr 0.0026 vocab_size 100053 (8.75s/15.06m) [Shuffling 92] loss 3624191.7500 lr 0.0023 vocab_size 100053 (8.83s/15.06m) [Shuffling 93] loss 3643651.5000 lr 0.0021 vocab_size 100053 (8.74s/15.05m) [Shuffling 94] loss 3553992.0000 lr 0.0018 vocab_size 100053 (8.79s/15.05m) [Shuffling 95] loss 3673688.0000 lr 0.0016 vocab_size 100053 (8.84s/15.05m) [Shuffling 96] loss 3576782.5000 lr 0.0013 vocab_size 100053 (8.74s/15.05m) [Shuffling 97] loss 3579045.7500 lr 0.0011 vocab_size 100053 (8.75s/15.05m) [Shuffling 98] loss 3574242.0000 lr 0.0008 vocab_size 100053 (8.64s/15.04m) [Shuffling 99] loss 3646146.5000 lr 0.0006 vocab_size 100053 (8.85s/15.04m) [Shuffling 100] loss 3603949.5000 lr 0.0003 vocab_size 100053 (10.72s/15.07m) [03/12/24-17:58:49] Training finished, training Time 15.07m