num_sent_processes: 4 [0] Creating shuffled datasets in /scratch/gz5hp/tfbs_region2vec_models/expr_universe_tile25k/5W10D-0.1000r/shuffled_datasets [1] Creating shuffled datasets in /scratch/gz5hp/tfbs_region2vec_models/expr_universe_tile25k/5W10D-0.1000r/shuffled_datasets [2] Creating shuffled datasets in /scratch/gz5hp/tfbs_region2vec_models/expr_universe_tile25k/5W10D-0.1000r/shuffled_datasets [3] Creating shuffled datasets in /scratch/gz5hp/tfbs_region2vec_models/expr_universe_tile25k/5W10D-0.1000r/shuffled_datasets {'num_shuffle': 100, 'embed_dim': 10, 'context_len': 5, 'nworkers': 6, 'save_freq': -1, 'save_dir': '/scratch/gz5hp/tfbs_region2vec_models/expr_universe_tile25k/5W10D-0.1000r', 'resume': '', 'train_alg': 'cbow', 'min_count': 5, 'neg_samples': 5, 'init_lr': 0.1, 'min_lr': 0.0001, 'lr_mode': 'linear', 'milestones': [], 'hier_softmax': False, 'update_vocab': 'once', 'seed': 0} Using cbow, negative sampling with 5 negative samples [03/12/24-17:58:56] Start training [03/12/24-17:58:56] Building vocabulary [03/12/24-17:59:11] Vocabulary size is 100053 [Shuffling 1] loss 4715292.0000 lr 0.1000 vocab_size 100053 (5.68s/9.71m) [Shuffling 2] loss 4190582.0000 lr 0.0990 vocab_size 100053 (6.01s/9.98m) [Shuffling 3] loss 4119116.5000 lr 0.0980 vocab_size 100053 (6.04s/10.09m) [Shuffling 4] loss 4034807.2500 lr 0.0970 vocab_size 100053 (6.10s/10.17m) [Shuffling 5] loss 4166621.5000 lr 0.0960 vocab_size 100053 (6.05s/10.20m) [Shuffling 6] loss 4003852.5000 lr 0.0950 vocab_size 100053 (6.08s/10.23m) [Shuffling 7] loss 4064791.2500 lr 0.0940 vocab_size 100053 (6.18s/10.27m) [Shuffling 8] loss 4043195.5000 lr 0.0930 vocab_size 100053 (6.05s/10.28m) [Shuffling 9] loss 4033335.5000 lr 0.0920 vocab_size 100053 (6.05s/10.29m) [Shuffling 10] loss 3970018.2500 lr 0.0910 vocab_size 100053 (6.08s/10.30m) [Shuffling 11] loss 3956892.0000 lr 0.0900 vocab_size 100053 (6.11s/10.31m) [Shuffling 12] loss 3987299.0000 lr 0.0890 vocab_size 100053 (6.07s/10.31m) [Shuffling 13] loss 4076600.0000 lr 0.0880 vocab_size 100053 (6.06s/10.31m) [Shuffling 14] loss 4027275.7500 lr 0.0870 vocab_size 100053 (6.05s/10.32m) [Shuffling 15] loss 4358144.0000 lr 0.0860 vocab_size 100053 (6.08s/10.32m) [Shuffling 16] loss 4235102.0000 lr 0.0850 vocab_size 100053 (6.12s/10.33m) [Shuffling 17] loss 4369522.5000 lr 0.0840 vocab_size 100053 (6.10s/10.33m) [Shuffling 18] loss 4280547.0000 lr 0.0830 vocab_size 100053 (6.13s/10.34m) [Shuffling 19] loss 4295878.0000 lr 0.0820 vocab_size 100053 (6.08s/10.34m) [Shuffling 20] loss 4021054.7500 lr 0.0810 vocab_size 100053 (6.09s/10.34m) [Shuffling 21] loss 4117636.7500 lr 0.0800 vocab_size 100053 (6.08s/10.35m) [Shuffling 22] loss 4036473.0000 lr 0.0790 vocab_size 100053 (6.03s/10.34m) [Shuffling 23] loss 4027451.5000 lr 0.0780 vocab_size 100053 (6.09s/10.35m) [Shuffling 24] loss 4130976.7500 lr 0.0770 vocab_size 100053 (6.10s/10.35m) [Shuffling 25] loss 4004813.2500 lr 0.0760 vocab_size 100053 (6.08s/10.35m) [Shuffling 26] loss 4007510.0000 lr 0.0750 vocab_size 100053 (6.09s/10.35m) [Shuffling 27] loss 4102923.5000 lr 0.0740 vocab_size 100053 (6.13s/10.36m) [Shuffling 28] loss 4004077.5000 lr 0.0730 vocab_size 100053 (6.09s/10.36m) [Shuffling 29] loss 3917611.5000 lr 0.0720 vocab_size 100053 (6.11s/10.36m) [Shuffling 30] loss 3892405.0000 lr 0.0710 vocab_size 100053 (6.15s/10.36m) [Shuffling 31] loss 4039113.7500 lr 0.0700 vocab_size 100053 (6.11s/10.37m) [Shuffling 32] loss 3947205.7500 lr 0.0690 vocab_size 100053 (6.11s/10.37m) [Shuffling 33] loss 3969095.5000 lr 0.0680 vocab_size 100053 (6.13s/10.37m) [Shuffling 34] loss 3976562.2500 lr 0.0670 vocab_size 100053 (6.09s/10.37m) [Shuffling 35] loss 4025479.2500 lr 0.0660 vocab_size 100053 (6.12s/10.37m) [Shuffling 36] loss 3825551.7500 lr 0.0650 vocab_size 100053 (6.14s/10.38m) [Shuffling 37] loss 3842831.7500 lr 0.0640 vocab_size 100053 (6.07s/10.38m) [Shuffling 38] loss 4022957.0000 lr 0.0630 vocab_size 100053 (6.09s/10.38m) [Shuffling 39] loss 3917015.2500 lr 0.0620 vocab_size 100053 (6.06s/10.38m) [Shuffling 40] loss 3849643.7500 lr 0.0610 vocab_size 100053 (6.08s/10.38m) [Shuffling 41] loss 3983182.0000 lr 0.0600 vocab_size 100053 (6.07s/10.38m) [Shuffling 42] loss 3916145.2500 lr 0.0590 vocab_size 100053 (6.08s/10.38m) [Shuffling 43] loss 3961468.7500 lr 0.0580 vocab_size 100053 (6.09s/10.38m) [Shuffling 44] loss 4150521.2500 lr 0.0570 vocab_size 100053 (6.08s/10.38m) [Shuffling 45] loss 3987536.2500 lr 0.0560 vocab_size 100053 (6.06s/10.38m) [Shuffling 46] loss 4044998.2500 lr 0.0550 vocab_size 100053 (6.07s/10.37m) [Shuffling 47] loss 3988579.5000 lr 0.0540 vocab_size 100053 (6.05s/10.37m) [Shuffling 48] loss 3957478.0000 lr 0.0530 vocab_size 100053 (6.10s/10.37m) [Shuffling 49] loss 3850998.5000 lr 0.0520 vocab_size 100053 (6.13s/10.38m) [Shuffling 50] loss 4080349.7500 lr 0.0510 vocab_size 100053 (6.08s/10.38m) [Shuffling 51] loss 3971823.7500 lr 0.0500 vocab_size 100053 (6.08s/10.38m) [Shuffling 52] loss 3942982.5000 lr 0.0491 vocab_size 100053 (6.09s/10.38m) [Shuffling 53] loss 3850798.5000 lr 0.0481 vocab_size 100053 (6.05s/10.38m) [Shuffling 54] loss 4055363.7500 lr 0.0471 vocab_size 100053 (6.06s/10.37m) [Shuffling 55] loss 4072431.7500 lr 0.0461 vocab_size 100053 (6.08s/10.37m) [Shuffling 56] loss 4071473.0000 lr 0.0451 vocab_size 100053 (6.09s/10.37m) [Shuffling 57] loss 3952577.0000 lr 0.0441 vocab_size 100053 (6.06s/10.37m) [Shuffling 58] loss 4108501.5000 lr 0.0431 vocab_size 100053 (6.07s/10.37m) [Shuffling 59] loss 3902463.0000 lr 0.0421 vocab_size 100053 (6.06s/10.37m) [Shuffling 60] loss 3874066.0000 lr 0.0411 vocab_size 100053 (6.07s/10.37m) [Shuffling 61] loss 3834600.0000 lr 0.0401 vocab_size 100053 (6.13s/10.37m) [Shuffling 62] loss 3953849.7500 lr 0.0391 vocab_size 100053 (6.08s/10.37m) [Shuffling 63] loss 3970887.0000 lr 0.0381 vocab_size 100053 (6.09s/10.38m) [Shuffling 64] loss 3877531.5000 lr 0.0371 vocab_size 100053 (6.08s/10.38m) [Shuffling 65] loss 3967415.2500 lr 0.0361 vocab_size 100053 (6.07s/10.37m) [Shuffling 66] loss 3958652.5000 lr 0.0351 vocab_size 100053 (6.11s/10.38m) [Shuffling 67] loss 4057001.2500 lr 0.0341 vocab_size 100053 (6.13s/10.38m) [Shuffling 68] loss 3939370.0000 lr 0.0331 vocab_size 100053 (6.16s/10.38m) [Shuffling 69] loss 3950719.5000 lr 0.0321 vocab_size 100053 (6.08s/10.38m) [Shuffling 70] loss 4127679.5000 lr 0.0311 vocab_size 100053 (6.12s/10.38m) [Shuffling 71] loss 3937242.0000 lr 0.0301 vocab_size 100053 (6.13s/10.38m) [Shuffling 72] loss 3977541.2500 lr 0.0291 vocab_size 100053 (6.15s/10.38m) [Shuffling 73] loss 3873719.0000 lr 0.0281 vocab_size 100053 (6.12s/10.38m) [Shuffling 74] loss 3997338.0000 lr 0.0271 vocab_size 100053 (6.09s/10.38m) [Shuffling 75] loss 3887667.7500 lr 0.0261 vocab_size 100053 (6.14s/10.38m) [Shuffling 76] loss 3798465.0000 lr 0.0251 vocab_size 100053 (6.15s/10.39m) [Shuffling 77] loss 3906372.5000 lr 0.0241 vocab_size 100053 (6.13s/10.39m) [Shuffling 78] loss 4038051.7500 lr 0.0231 vocab_size 100053 (6.12s/10.39m) [Shuffling 79] loss 3936903.0000 lr 0.0221 vocab_size 100053 (6.16s/10.39m) [Shuffling 80] loss 3915004.0000 lr 0.0211 vocab_size 100053 (6.09s/10.39m) [Shuffling 81] loss 3850958.7500 lr 0.0201 vocab_size 100053 (6.27s/10.39m) [Shuffling 82] loss 3990176.5000 lr 0.0191 vocab_size 100053 (6.10s/10.39m) [Shuffling 83] loss 3926304.0000 lr 0.0181 vocab_size 100053 (6.09s/10.39m) [Shuffling 84] loss 3896768.0000 lr 0.0171 vocab_size 100053 (6.08s/10.39m) [Shuffling 85] loss 3981100.0000 lr 0.0161 vocab_size 100053 (6.06s/10.39m) [Shuffling 86] loss 3916631.0000 lr 0.0151 vocab_size 100053 (6.11s/10.39m) [Shuffling 87] loss 3985048.0000 lr 0.0141 vocab_size 100053 (6.09s/10.39m) [Shuffling 88] loss 3857218.0000 lr 0.0131 vocab_size 100053 (5.97s/10.39m) [Shuffling 89] loss 3879335.7500 lr 0.0121 vocab_size 100053 (5.99s/10.39m) [Shuffling 90] loss 3837171.5000 lr 0.0111 vocab_size 100053 (6.09s/10.39m) [Shuffling 91] loss 3903120.0000 lr 0.0101 vocab_size 100053 (6.14s/10.39m) [Shuffling 92] loss 4111173.2500 lr 0.0091 vocab_size 100053 (6.08s/10.39m) [Shuffling 93] loss 3953385.7500 lr 0.0081 vocab_size 100053 (6.10s/10.39m) [Shuffling 94] loss 3884191.2500 lr 0.0071 vocab_size 100053 (6.13s/10.39m) [Shuffling 95] loss 3727897.5000 lr 0.0061 vocab_size 100053 (6.08s/10.39m) [Shuffling 96] loss 3872739.7500 lr 0.0051 vocab_size 100053 (5.99s/10.39m) [Shuffling 97] loss 3806798.2500 lr 0.0041 vocab_size 100053 (6.29s/10.39m) [Shuffling 98] loss 3844399.7500 lr 0.0031 vocab_size 100053 (11.84s/10.49m) [Shuffling 99] loss 3819431.7500 lr 0.0021 vocab_size 100053 (15.86s/10.65m) [Shuffling 100] loss 3936491.7500 lr 0.0011 vocab_size 100053 (16.87s/10.83m) [03/12/24-18:09:46] Training finished, training Time 10.83m