num_sent_processes: 4
[0] Creating shuffled datasets in /scratch/gz5hp/tfbs_region2vec_models/expr_universe_Large/5W10D-0.0250r/shuffled_datasets
[1] Creating shuffled datasets in /scratch/gz5hp/tfbs_region2vec_models/expr_universe_Large/5W10D-0.0250r/shuffled_datasets
[2] Creating shuffled datasets in /scratch/gz5hp/tfbs_region2vec_models/expr_universe_Large/5W10D-0.0250r/shuffled_datasets
[3] Creating shuffled datasets in /scratch/gz5hp/tfbs_region2vec_models/expr_universe_Large/5W10D-0.0250r/shuffled_datasets
{'num_shuffle': 100, 'embed_dim': 10, 'context_len': 5, 'nworkers': 6, 'save_freq': -1, 'save_dir': '/scratch/gz5hp/tfbs_region2vec_models/expr_universe_Large/5W10D-0.0250r', 'resume': '', 'train_alg': 'cbow', 'min_count': 5, 'neg_samples': 5, 'init_lr': 0.025, 'min_lr': 0.0001, 'lr_mode': 'linear', 'milestones': [], 'hier_softmax': False, 'update_vocab': 'once', 'seed': 0}
Using cbow, negative sampling with 5 negative samples
[03/12/24-16:55:48] Start training
[03/12/24-16:55:48] Building vocabulary
[03/12/24-16:56:12] Vocabulary size is 287225
[Shuffling 1] loss 7215591.0000 lr 0.0250 vocab_size 287225 (12.64s/21.46m)
[Shuffling 2] loss 5218942.0000 lr 0.0248 vocab_size 287225 (13.02s/21.78m)
[Shuffling 3] loss 5264791.0000 lr 0.0245 vocab_size 287225 (12.88s/21.82m)
[Shuffling 4] loss 4755721.0000 lr 0.0243 vocab_size 287225 (12.95s/21.86m)
[Shuffling 5] loss 4723434.0000 lr 0.0240 vocab_size 287225 (13.05s/21.92m)
[Shuffling 6] loss 4413869.0000 lr 0.0238 vocab_size 287225 (13.04s/21.96m)
[Shuffling 7] loss 4771415.5000 lr 0.0235 vocab_size 287225 (12.99s/21.97m)
[Shuffling 8] loss 4608160.5000 lr 0.0233 vocab_size 287225 (13.12s/22.01m)
[Shuffling 9] loss 4415953.5000 lr 0.0230 vocab_size 287225 (12.86s/21.99m)
[Shuffling 10] loss 4412073.0000 lr 0.0228 vocab_size 287225 (13.03s/22.00m)
[Shuffling 11] loss 4406727.0000 lr 0.0225 vocab_size 287225 (13.11s/22.03m)
[Shuffling 12] loss 4448283.0000 lr 0.0223 vocab_size 287225 (12.98s/22.03m)
[Shuffling 13] loss 4232773.5000 lr 0.0220 vocab_size 287225 (13.06s/22.04m)
[Shuffling 14] loss 4179004.0000 lr 0.0218 vocab_size 287225 (13.07s/22.05m)
[Shuffling 15] loss 4214172.0000 lr 0.0215 vocab_size 287225 (13.00s/22.05m)
[Shuffling 16] loss 4348330.0000 lr 0.0213 vocab_size 287225 (12.77s/22.03m)
[Shuffling 17] loss 4330461.5000 lr 0.0210 vocab_size 287225 (12.93s/22.02m)
[Shuffling 18] loss 4128843.2500 lr 0.0208 vocab_size 287225 (13.08s/22.03m)
[Shuffling 19] loss 4255397.5000 lr 0.0205 vocab_size 287225 (13.03s/22.04m)
[Shuffling 20] loss 4184479.2500 lr 0.0203 vocab_size 287225 (12.92s/22.03m)
[Shuffling 21] loss 4329456.5000 lr 0.0200 vocab_size 287225 (12.95s/22.03m)
[Shuffling 22] loss 4277193.5000 lr 0.0198 vocab_size 287225 (12.88s/22.02m)
[Shuffling 23] loss 4212633.5000 lr 0.0195 vocab_size 287225 (13.08s/22.03m)
[Shuffling 24] loss 4032725.0000 lr 0.0193 vocab_size 287225 (13.26s/22.05m)
[Shuffling 25] loss 4154318.2500 lr 0.0190 vocab_size 287225 (13.01s/22.05m)
[Shuffling 26] loss 4182207.7500 lr 0.0188 vocab_size 287225 (13.03s/22.06m)
[Shuffling 27] loss 4048836.5000 lr 0.0185 vocab_size 287225 (13.06s/22.06m)
[Shuffling 28] loss 4017848.5000 lr 0.0183 vocab_size 287225 (13.31s/22.08m)
[Shuffling 29] loss 3928186.7500 lr 0.0180 vocab_size 287225 (13.18s/22.09m)
[Shuffling 30] loss 4172099.0000 lr 0.0178 vocab_size 287225 (12.97s/22.09m)
[Shuffling 31] loss 4009301.5000 lr 0.0175 vocab_size 287225 (12.96s/22.08m)
[Shuffling 32] loss 4137934.0000 lr 0.0173 vocab_size 287225 (13.08s/22.09m)
[Shuffling 33] loss 4134665.2500 lr 0.0170 vocab_size 287225 (12.96s/22.08m)
[Shuffling 34] loss 4067655.7500 lr 0.0168 vocab_size 287225 (13.10s/22.09m)
[Shuffling 35] loss 4029137.7500 lr 0.0165 vocab_size 287225 (12.84s/22.08m)
[Shuffling 36] loss 4183093.7500 lr 0.0163 vocab_size 287225 (12.70s/22.07m)
[Shuffling 37] loss 3917885.0000 lr 0.0160 vocab_size 287225 (12.86s/22.06m)
[Shuffling 38] loss 4132147.0000 lr 0.0158 vocab_size 287225 (12.82s/22.05m)
[Shuffling 39] loss 4119006.2500 lr 0.0155 vocab_size 287225 (12.79s/22.04m)
[Shuffling 40] loss 4077470.2500 lr 0.0153 vocab_size 287225 (12.91s/22.04m)
[Shuffling 41] loss 3827812.0000 lr 0.0150 vocab_size 287225 (13.02s/22.04m)
[Shuffling 42] loss 3834446.7500 lr 0.0148 vocab_size 287225 (12.90s/22.04m)
[Shuffling 43] loss 3831836.7500 lr 0.0145 vocab_size 287225 (13.19s/22.05m)
[Shuffling 44] loss 3981570.0000 lr 0.0143 vocab_size 287225 (12.86s/22.04m)
[Shuffling 45] loss 4038462.5000 lr 0.0140 vocab_size 287225 (13.16s/22.05m)
[Shuffling 46] loss 4175197.7500 lr 0.0138 vocab_size 287225 (13.09s/22.05m)
[Shuffling 47] loss 3889647.5000 lr 0.0135 vocab_size 287225 (12.86s/22.05m)
[Shuffling 48] loss 3966156.2500 lr 0.0133 vocab_size 287225 (13.01s/22.05m)
[Shuffling 49] loss 4155949.5000 lr 0.0130 vocab_size 287225 (12.99s/22.05m)
[Shuffling 50] loss 3909042.5000 lr 0.0128 vocab_size 287225 (12.92s/22.05m)
[Shuffling 51] loss 3904464.5000 lr 0.0126 vocab_size 287225 (12.76s/22.04m)
[Shuffling 52] loss 3950972.7500 lr 0.0123 vocab_size 287225 (12.76s/22.03m)
[Shuffling 53] loss 4171775.7500 lr 0.0121 vocab_size 287225 (12.90s/22.03m)
[Shuffling 54] loss 3863055.0000 lr 0.0118 vocab_size 287225 (12.85s/22.03m)
[Shuffling 55] loss 3895689.7500 lr 0.0116 vocab_size 287225 (13.24s/22.04m)
[Shuffling 56] loss 4018428.0000 lr 0.0113 vocab_size 287225 (13.00s/22.04m)
[Shuffling 57] loss 3901767.5000 lr 0.0111 vocab_size 287225 (12.95s/22.04m)
[Shuffling 58] loss 3905969.0000 lr 0.0108 vocab_size 287225 (12.99s/22.04m)
[Shuffling 59] loss 3996837.2500 lr 0.0106 vocab_size 287225 (12.78s/22.03m)
[Shuffling 60] loss 3958144.5000 lr 0.0103 vocab_size 287225 (12.96s/22.03m)
[Shuffling 61] loss 3972815.2500 lr 0.0101 vocab_size 287225 (12.84s/22.03m)
[Shuffling 62] loss 3722119.2500 lr 0.0098 vocab_size 287225 (12.99s/22.03m)
[Shuffling 63] loss 4101116.0000 lr 0.0096 vocab_size 287225 (12.91s/22.02m)
[Shuffling 64] loss 3922862.7500 lr 0.0093 vocab_size 287225 (13.11s/22.03m)
[Shuffling 65] loss 3958352.0000 lr 0.0091 vocab_size 287225 (12.88s/22.03m)
[Shuffling 66] loss 3965758.0000 lr 0.0088 vocab_size 287225 (12.91s/22.02m)
[Shuffling 67] loss 4013139.5000 lr 0.0086 vocab_size 287225 (13.05s/22.03m)
[Shuffling 68] loss 3955197.0000 lr 0.0083 vocab_size 287225 (13.01s/22.03m)
[Shuffling 69] loss 3875008.0000 lr 0.0081 vocab_size 287225 (12.97s/22.03m)
[Shuffling 70] loss 4015311.2500 lr 0.0078 vocab_size 287225 (13.04s/22.03m)
[Shuffling 71] loss 3889854.2500 lr 0.0076 vocab_size 287225 (13.04s/22.03m)
[Shuffling 72] loss 4176637.5000 lr 0.0073 vocab_size 287225 (13.27s/22.04m)
[Shuffling 73] loss 3910239.5000 lr 0.0071 vocab_size 287225 (13.05s/22.04m)
[Shuffling 74] loss 3965903.7500 lr 0.0068 vocab_size 287225 (13.01s/22.04m)
[Shuffling 75] loss 4010205.5000 lr 0.0066 vocab_size 287225 (12.94s/22.04m)
[Shuffling 76] loss 3860862.0000 lr 0.0063 vocab_size 287225 (13.00s/22.04m)
[Shuffling 77] loss 3969214.7500 lr 0.0061 vocab_size 287225 (13.10s/22.04m)
[Shuffling 78] loss 3923828.2500 lr 0.0058 vocab_size 287225 (12.95s/22.04m)
[Shuffling 79] loss 3912791.5000 lr 0.0056 vocab_size 287225 (12.88s/22.04m)
[Shuffling 80] loss 3739751.2500 lr 0.0053 vocab_size 287225 (13.06s/22.04m)
[Shuffling 81] loss 3730581.0000 lr 0.0051 vocab_size 287225 (13.03s/22.04m)
[Shuffling 82] loss 3954861.7500 lr 0.0048 vocab_size 287225 (12.99s/22.04m)
[Shuffling 83] loss 3843222.7500 lr 0.0046 vocab_size 287225 (12.88s/22.04m)
[Shuffling 84] loss 3690789.5000 lr 0.0043 vocab_size 287225 (12.86s/22.04m)
[Shuffling 85] loss 4034139.5000 lr 0.0041 vocab_size 287225 (12.91s/22.04m)
[Shuffling 86] loss 3896485.7500 lr 0.0038 vocab_size 287225 (13.24s/22.04m)
[Shuffling 87] loss 3936361.5000 lr 0.0036 vocab_size 287225 (12.99s/22.04m)
[Shuffling 88] loss 4057893.0000 lr 0.0033 vocab_size 287225 (12.79s/22.04m)
[Shuffling 89] loss 3788126.2500 lr 0.0031 vocab_size 287225 (12.87s/22.04m)
[Shuffling 90] loss 3844503.2500 lr 0.0028 vocab_size 287225 (13.12s/22.04m)
[Shuffling 91] loss 3813162.5000 lr 0.0026 vocab_size 287225 (12.97s/22.04m)
[Shuffling 92] loss 3860861.2500 lr 0.0023 vocab_size 287225 (13.07s/22.04m)
[Shuffling 93] loss 3839917.7500 lr 0.0021 vocab_size 287225 (12.93s/22.04m)
[Shuffling 94] loss 3977155.5000 lr 0.0018 vocab_size 287225 (12.84s/22.04m)
[Shuffling 95] loss 3924614.7500 lr 0.0016 vocab_size 287225 (25.88s/22.26m)
[Shuffling 96] loss 3826873.7500 lr 0.0013 vocab_size 287225 (25.85s/22.48m)
[Shuffling 97] loss 3839462.7500 lr 0.0011 vocab_size 287225 (26.09s/22.70m)
[Shuffling 98] loss 3827278.7500 lr 0.0008 vocab_size 287225 (25.58s/22.91m)
[Shuffling 99] loss 4052207.5000 lr 0.0006 vocab_size 287225 (26.08s/23.12m)
[Shuffling 100] loss 3867009.5000 lr 0.0003 vocab_size 287225 (25.95s/23.33m)
[03/12/24-17:19:08] Training finished, training Time 23.33m